BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780980|ref|YP_003065393.1| hypothetical protein CLIBASIA_04405 [Candidatus Liberibacter asiaticus str. psy62] (121 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254780980|ref|YP_003065393.1| hypothetical protein CLIBASIA_04405 [Candidatus Liberibacter asiaticus str. psy62] gi|254040657|gb|ACT57453.1| hypothetical protein CLIBASIA_04405 [Candidatus Liberibacter asiaticus str. psy62] Length = 121 Score = 247 bits (630), Expect = 5e-64, Method: Compositional matrix adjust. Identities = 121/121 (100%), Positives = 121/121 (100%) Query: 1 MKAKILMTSAFSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQER 60 MKAKILMTSAFSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQER Sbjct: 1 MKAKILMTSAFSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQER 60 Query: 61 VEAAEKRVKEVEERATASRKLSVDELANAFWDLSDEDKNAFTGNVKQEVCKVKKITVPPS 120 VEAAEKRVKEVEERATASRKLSVDELANAFWDLSDEDKNAFTGNVKQEVCKVKKITVPPS Sbjct: 61 VEAAEKRVKEVEERATASRKLSVDELANAFWDLSDEDKNAFTGNVKQEVCKVKKITVPPS 120 Query: 121 N 121 N Sbjct: 121 N 121 >gi|254780556|ref|YP_003064969.1| hypothetical protein CLIBASIA_02215 [Candidatus Liberibacter asiaticus str. psy62] gi|254040233|gb|ACT57029.1| hypothetical protein CLIBASIA_02215 [Candidatus Liberibacter asiaticus str. psy62] Length = 120 Score = 67.4 bits (163), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 41/108 (37%), Positives = 66/108 (61%), Gaps = 10/108 (9%) Query: 1 MKAKILMTSAFSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQER 60 M+AK L+ S TT +TI GC +V ED N+++ S +M+KEA ++ E H+LA+E Sbjct: 1 MRAKTLLASTLVTTAITIIGCSLV----ED--NRIE--SLRMVKEAKMEVLEAHKLAKEY 52 Query: 61 VEAAEKRVKEVEERATAS--RKLSVDELANAFWDLSDEDKNAFTGNVK 106 VE A +RVKE EE++ A + L +D+L F +L +++ F ++ Sbjct: 53 VEQANQRVKEAEEQSNARLLKGLGMDDLVRYFMNLDSQNQAFFIDTIQ 100 >gi|327271882|ref|XP_003220716.1| PREDICTED: laminin subunit alpha-5-like [Anolis carolinensis] Length = 3663 Score = 38.1 bits (87), Expect = 0.45, Method: Composition-based stats. Identities = 27/94 (28%), Positives = 48/94 (51%), Gaps = 7/94 (7%) Query: 27 RTEDLLNKLQ--KNSTQMIKEANFKISETHRLAQERVEAAEKRVKEVEERATASRKLSVD 84 + EDLLN+LQ K++ M+++ + A++ A + V +EE+ +K ++D Sbjct: 2571 KKEDLLNQLQSVKSTLDMVQDGTAGAIQN---AKDAASQANETVARMEEKLNNMQK-NLD 2626 Query: 85 ELANAFWDLSDEDKNAFTGNVKQEVCKVKKITVP 118 E + DL +ED N G K V ++ T+P Sbjct: 2627 EWKQQYGDLKNEDLNQAVGEAKDTVSSLES-TLP 2659 >gi|58259447|ref|XP_567136.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] gi|134107537|ref|XP_777653.1| hypothetical protein CNBA7730 [Cryptococcus neoformans var. neoformans B-3501A] gi|50260347|gb|EAL23006.1| hypothetical protein CNBA7730 [Cryptococcus neoformans var. neoformans B-3501A] gi|57223273|gb|AAW41317.1| conserved hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] Length = 1184 Score = 36.6 bits (83), Expect = 1.3, Method: Compositional matrix adjust. Identities = 29/89 (32%), Positives = 46/89 (51%), Gaps = 11/89 (12%) Query: 5 ILMTSAFSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQERVEAA 64 I TS + T + CD+ ++DL K Q ++A + + AQ+RV+AA Sbjct: 254 IRHTSTYVTLRMNSALCDVAADVSKDLSVK------QRQRDAEVRKAGATNAAQKRVKAA 307 Query: 65 EKRVKEVEERATASRKLSVDELANAFWDL 93 E RVKEV+E RK +++EL +D+ Sbjct: 308 EDRVKEVQE-----RKQTLEELMQEIFDV 331 >gi|91788850|ref|YP_549802.1| multi-sensor signal transduction histidine kinase [Polaromonas sp. JS666] gi|91698075|gb|ABE44904.1| multi-sensor signal transduction histidine kinase [Polaromonas sp. JS666] Length = 1253 Score = 36.2 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 33/113 (29%), Positives = 54/113 (47%), Gaps = 15/113 (13%) Query: 12 STTLLTIGGCDIVIG----RTEDLLN--KLQKNSTQMIKEAN---FKISETH-RLAQERV 61 +T L+ GC + + R E L + KL ++ Q+I E F+ ++ +LA R+ Sbjct: 33 ATALIVFSGCAVAVALFHLRNEALRSGEKLTQSLAQVIAEQTSRTFQAADQRLQLAASRL 92 Query: 62 EA--AEKRVKEVEERATASRKLSVDELANAFWDLSDEDKNAF---TGNVKQEV 109 +A AEKR+ E RA +L A A W L + + F TGN+ ++ Sbjct: 93 QALDAEKRLTEESVRALLREQLRGLPFARAIWVLDRQGRTVFDSDTGNIGTQL 145 >gi|321250385|ref|XP_003191789.1| hypothetical protein CGB_A9310C [Cryptococcus gattii WM276] gi|317458256|gb|ADV20002.1| Conserved hypothetical protein [Cryptococcus gattii WM276] Length = 1309 Score = 35.8 bits (81), Expect = 1.8, Method: Compositional matrix adjust. Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 11/89 (12%) Query: 5 ILMTSAFSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQERVEAA 64 I TS + T + CD+ ++DL K Q ++A + + AQ+R++AA Sbjct: 254 IRHTSTYMTLKINSALCDVAADVSKDLSVK------QRQRDAEVRKAGATNAAQKRMQAA 307 Query: 65 EKRVKEVEERATASRKLSVDELANAFWDL 93 E RVKEV+E RK +++EL +D+ Sbjct: 308 EDRVKEVQE-----RKQTLEELMQEIFDV 331 >gi|302682220|ref|XP_003030791.1| hypothetical protein SCHCODRAFT_257312 [Schizophyllum commune H4-8] gi|300104483|gb|EFI95888.1| hypothetical protein SCHCODRAFT_257312 [Schizophyllum commune H4-8] Length = 764 Score = 35.4 bits (80), Expect = 2.6, Method: Compositional matrix adjust. Identities = 22/53 (41%), Positives = 36/53 (67%), Gaps = 1/53 (1%) Query: 26 GRTEDLLNKLQKNSTQMIKEANFKISETHRLAQERVEAAEKRVKEVEERATAS 78 GR E + +L++ S Q ++E+N ++ +T RL Q+ V AAE+RV + E RA A+ Sbjct: 519 GRREVAIRQLEE-SNQRLEESNHQLEDTVRLLQQSVAAAEERVVKAETRALAA 570 >gi|120602938|ref|YP_967338.1| multi-sensor signal transduction histidine kinase [Desulfovibrio vulgaris DP4] gi|120563167|gb|ABM28911.1| multi-sensor signal transduction histidine kinase [Desulfovibrio vulgaris DP4] Length = 1085 Score = 35.0 bits (79), Expect = 3.6, Method: Composition-based stats. Identities = 16/49 (32%), Positives = 29/49 (59%) Query: 55 RLAQERVEAAEKRVKEVEERATASRKLSVDELANAFWDLSDEDKNAFTG 103 +LA+E++ +E+R+KE+ E + +L +EL +F + DK A G Sbjct: 786 KLAEEKLRHSEERLKELNESLESQVRLRTEELERSFESVRQADKMASLG 834 >gi|46579568|ref|YP_010376.1| sensory box histidine kinase [Desulfovibrio vulgaris str. Hildenborough] gi|46448983|gb|AAS95635.1| sensory box histidine kinase [Desulfovibrio vulgaris str. Hildenborough] gi|311233379|gb|ADP86233.1| multi-sensor signal transduction histidine kinase [Desulfovibrio vulgaris RCH1] Length = 1085 Score = 35.0 bits (79), Expect = 3.6, Method: Composition-based stats. Identities = 16/49 (32%), Positives = 29/49 (59%) Query: 55 RLAQERVEAAEKRVKEVEERATASRKLSVDELANAFWDLSDEDKNAFTG 103 +LA+E++ +E+R+KE+ E + +L +EL +F + DK A G Sbjct: 786 KLAEEKLRHSEERLKELNESLESQVRLRTEELERSFESVRQADKMASLG 834 >gi|227824806|ref|ZP_03989638.1| formiminotetrahydrofolate cyclodeaminase [Acidaminococcus sp. D21] gi|226905305|gb|EEH91223.1| formiminotetrahydrofolate cyclodeaminase [Acidaminococcus sp. D21] Length = 206 Score = 33.9 bits (76), Expect = 8.1, Method: Compositional matrix adjust. Identities = 20/46 (43%), Positives = 25/46 (54%), Gaps = 8/46 (17%) Query: 56 LAQERVEAAEKRVKEVEERATASRKLSVDELANAFWDLSDEDKNAF 101 L +E+ A EK +KEV E+A + L N F L DED NAF Sbjct: 47 LGKEKYAAVEKEMKEVAEKAAS--------LKNQFLALIDEDSNAF 84 >gi|159486149|ref|XP_001701106.1| hypothetical protein CHLREDRAFT_194202 [Chlamydomonas reinhardtii] gi|158272000|gb|EDO97808.1| predicted protein [Chlamydomonas reinhardtii] Length = 1430 Score = 33.5 bits (75), Expect = 8.9, Method: Composition-based stats. Identities = 17/50 (34%), Positives = 30/50 (60%) Query: 31 LLNKLQKNSTQMIKEANFKISETHRLAQERVEAAEKRVKEVEERATASRK 80 +L KL K + ++EA + + A++RVE AE+RV+E E+R + + Sbjct: 67 ILQKLVKAVKEDVEEAQQEAQQRVEKAEQRVEKAEQRVEEAEQRVEKAEQ 116 Searching..................................................done Results from round 2 CONVERGED! >gi|254780980|ref|YP_003065393.1| hypothetical protein CLIBASIA_04405 [Candidatus Liberibacter asiaticus str. psy62] gi|254040657|gb|ACT57453.1| hypothetical protein CLIBASIA_04405 [Candidatus Liberibacter asiaticus str. psy62] Length = 121 Score = 174 bits (442), Expect = 2e-42, Method: Composition-based stats. Identities = 121/121 (100%), Positives = 121/121 (100%) Query: 1 MKAKILMTSAFSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQER 60 MKAKILMTSAFSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQER Sbjct: 1 MKAKILMTSAFSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQER 60 Query: 61 VEAAEKRVKEVEERATASRKLSVDELANAFWDLSDEDKNAFTGNVKQEVCKVKKITVPPS 120 VEAAEKRVKEVEERATASRKLSVDELANAFWDLSDEDKNAFTGNVKQEVCKVKKITVPPS Sbjct: 61 VEAAEKRVKEVEERATASRKLSVDELANAFWDLSDEDKNAFTGNVKQEVCKVKKITVPPS 120 Query: 121 N 121 N Sbjct: 121 N 121 >gi|254780556|ref|YP_003064969.1| hypothetical protein CLIBASIA_02215 [Candidatus Liberibacter asiaticus str. psy62] gi|254040233|gb|ACT57029.1| hypothetical protein CLIBASIA_02215 [Candidatus Liberibacter asiaticus str. psy62] Length = 120 Score = 120 bits (302), Expect = 5e-26, Method: Composition-based stats. Identities = 41/110 (37%), Positives = 67/110 (60%), Gaps = 10/110 (9%) Query: 1 MKAKILMTSAFSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQER 60 M+AK L+ S TT +TI GC +V ED N+++ S +M+KEA ++ E H+LA+E Sbjct: 1 MRAKTLLASTLVTTAITIIGCSLV----ED--NRIE--SLRMVKEAKMEVLEAHKLAKEY 52 Query: 61 VEAAEKRVKEVEERATAS--RKLSVDELANAFWDLSDEDKNAFTGNVKQE 108 VE A +RVKE EE++ A + L +D+L F +L +++ F ++ + Sbjct: 53 VEQANQRVKEAEEQSNARLLKGLGMDDLVRYFMNLDSQNQAFFIDTIQNK 102 >gi|75911009|ref|YP_325305.1| WD-40 repeat-containing protein [Anabaena variabilis ATCC 29413] gi|75704734|gb|ABA24410.1| WD-40 repeat-containing protein [Anabaena variabilis ATCC 29413] Length = 1477 Score = 36.6 bits (83), Expect = 1.0, Method: Composition-based stats. Identities = 19/68 (27%), Positives = 32/68 (47%), Gaps = 7/68 (10%) Query: 43 IKEANFKISETHRLAQERVEAAEKRVKEV-------EERATASRKLSVDELANAFWDLSD 95 + +AN K++ T + A+E + A +RVKE +E A K + D++ A +L Sbjct: 496 VNDANVKVANTQKEAKELTDKANQRVKEANIQVANTKEEAKNRTKKADDDVRLALENLGK 555 Query: 96 EDKNAFTG 103 K A Sbjct: 556 ITKQAKID 563 Score = 34.3 bits (77), Expect = 5.1, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 9/91 (9%) Query: 27 RTEDLLNKLQKNSTQMIKEANFKISETHRLAQERVEAAEKRVKEVEERATASRKLS---- 82 R D ++ + S ++ K+ + E RLA+E E A + + E E+ A ++ Sbjct: 419 RLSDEDDRFLRESREVEKQDIDRKLEAERLARETAEQANQILLEAEQTAKGKVRVGSIIL 478 Query: 83 -----VDELANAFWDLSDEDKNAFTGNVKQE 108 V +A+ + +S D N N ++E Sbjct: 479 AGTLVVAGVASVWAGISVNDANVKVANTQKE 509 >gi|159486149|ref|XP_001701106.1| hypothetical protein CHLREDRAFT_194202 [Chlamydomonas reinhardtii] gi|158272000|gb|EDO97808.1| predicted protein [Chlamydomonas reinhardtii] Length = 1430 Score = 36.6 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 23/34 (67%) Query: 41 QMIKEANFKISETHRLAQERVEAAEKRVKEVEER 74 +++K + E + AQ+RVE AE+RV++ E+R Sbjct: 70 KLVKAVKEDVEEAQQEAQQRVEKAEQRVEKAEQR 103 Score = 34.7 bits (78), Expect = 3.9, Method: Composition-based stats. Identities = 17/44 (38%), Positives = 28/44 (63%) Query: 31 LLNKLQKNSTQMIKEANFKISETHRLAQERVEAAEKRVKEVEER 74 +L KL K + ++EA + + A++RVE AE+RV+E E+R Sbjct: 67 ILQKLVKAVKEDVEEAQQEAQQRVEKAEQRVEKAEQRVEEAEQR 110 >gi|288922780|ref|ZP_06416949.1| hypothetical protein FrEUN1fDRAFT_6647 [Frankia sp. EUN1f] gi|288345893|gb|EFC80253.1| hypothetical protein FrEUN1fDRAFT_6647 [Frankia sp. EUN1f] Length = 674 Score = 36.2 bits (82), Expect = 1.4, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 40/87 (45%), Gaps = 5/87 (5%) Query: 14 TLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQERVEAAEKRVKEVEE 73 T + G D V+ E ++Q+++ ++ E + R A VE AE+ ++ E Sbjct: 267 TAKGVDGSDEVVAAQE----RVQESAAAIV-ETQLDGQRSVRDALHAVEQAERSLQSARE 321 Query: 74 RATASRKLSVDELANAFWDLSDEDKNA 100 ++ AS VD A+A DL + + Sbjct: 322 QSAASAAGGVDAYADALKDLPAKTQAF 348 >gi|255943494|ref|XP_002562515.1| Pc19g00220 [Penicillium chrysogenum Wisconsin 54-1255] gi|211587249|emb|CAP79438.1| Pc19g00220 [Penicillium chrysogenum Wisconsin 54-1255] Length = 566 Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 32/78 (41%), Gaps = 5/78 (6%) Query: 23 IVIGRTEDLLNKLQKNST---QMIKEANFKI--SETHRLAQERVEAAEKRVKEVEERATA 77 I+ G ++ L+ + NS + KE I H ++ A + +KE EE + Sbjct: 484 IIKGASKSPLSSIAGNSRVHRRCTKETKLDIGNDSAHARIRDYWLRANQALKEKEEASGG 543 Query: 78 SRKLSVDELANAFWDLSD 95 L VD+ + ++ Sbjct: 544 RDYLDVDDEVFKYVNIDS 561 >gi|58259447|ref|XP_567136.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] gi|134107537|ref|XP_777653.1| hypothetical protein CNBA7730 [Cryptococcus neoformans var. neoformans B-3501A] gi|50260347|gb|EAL23006.1| hypothetical protein CNBA7730 [Cryptococcus neoformans var. neoformans B-3501A] gi|57223273|gb|AAW41317.1| conserved hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] Length = 1184 Score = 35.1 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 25/74 (33%), Positives = 38/74 (51%), Gaps = 6/74 (8%) Query: 8 TSAFSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQERVEAAEKR 67 TS + T + CD+ ++DL K Q ++A + + AQ+RV+AAE R Sbjct: 257 TSTYVTLRMNSALCDVAADVSKDLSVK------QRQRDAEVRKAGATNAAQKRVKAAEDR 310 Query: 68 VKEVEERATASRKL 81 VKEV+ER +L Sbjct: 311 VKEVQERKQTLEEL 324 >gi|72389040|ref|XP_844815.1| 65 kDa invariant surface glycoprotein [Trypanosoma brucei TREU927] gi|62176336|gb|AAX70448.1| 65 kDa invariant surface glycoprotein, putative [Trypanosoma brucei] gi|70801349|gb|AAZ11256.1| 65 kDa invariant surface glycoprotein, putative [Trypanosoma brucei brucei strain 927/4 GUTat10.1] Length = 435 Score = 34.7 bits (78), Expect = 4.6, Method: Composition-based stats. Identities = 21/62 (33%), Positives = 32/62 (51%), Gaps = 4/62 (6%) Query: 28 TEDLLNKLQKNSTQMIKE----ANFKISETHRLAQERVEAAEKRVKEVEERATASRKLSV 83 + + +KL T+ +KE A K+SE A+E E A KR +EV E A +R + Sbjct: 84 SNNGYSKLSDADTKKVKEIYEKAKGKVSEQLPKAKEFGEEAGKRHQEVTEAAKRARGWGL 143 Query: 84 DE 85 D+ Sbjct: 144 DD 145 >gi|52080071|ref|YP_078862.1| inositol monophosphatase SuhB [Bacillus licheniformis ATCC 14580] gi|52785445|ref|YP_091274.1| YktC [Bacillus licheniformis ATCC 14580] gi|319646154|ref|ZP_08000384.1| YktC protein [Bacillus sp. BT1B_CT2] gi|52003282|gb|AAU23224.1| Inositol monophosphatase SuhB [Bacillus licheniformis ATCC 14580] gi|52347947|gb|AAU40581.1| YktC [Bacillus licheniformis ATCC 14580] gi|317391904|gb|EFV72701.1| YktC protein [Bacillus sp. BT1B_CT2] Length = 264 Score = 34.3 bits (77), Expect = 5.1, Method: Composition-based stats. Identities = 14/56 (25%), Positives = 29/56 (51%), Gaps = 4/56 (7%) Query: 52 ETHRLAQERVEAAEKRVKEV-EERATASRKLSVDELANAFWDLSDEDKNAFTGNVK 106 E RLA+ V+ A +R+K+ +E+ T K + ++L ++ E + F ++ Sbjct: 6 EIDRLAKSWVKEAGQRIKQSMKEKMTIETKSNPNDLVT---NIDKETERFFIEKIQ 58 >gi|261328538|emb|CBH11515.1| 64 kDa invariant surface glycoprotein, putative [Trypanosoma brucei gambiense DAL972] Length = 435 Score = 34.3 bits (77), Expect = 5.6, Method: Composition-based stats. Identities = 21/62 (33%), Positives = 32/62 (51%), Gaps = 4/62 (6%) Query: 28 TEDLLNKLQKNSTQMIKE----ANFKISETHRLAQERVEAAEKRVKEVEERATASRKLSV 83 + + +KL T+ +KE A K+SE A+E E A KR +EV E A +R + Sbjct: 84 SNNGYSKLSDADTKKVKEIYEKAKGKVSEQLPKAKEFGEEAGKRHQEVTEAAKRARGWGL 143 Query: 84 DE 85 D+ Sbjct: 144 DD 145 >gi|321250385|ref|XP_003191789.1| hypothetical protein CGB_A9310C [Cryptococcus gattii WM276] gi|317458256|gb|ADV20002.1| Conserved hypothetical protein [Cryptococcus gattii WM276] Length = 1309 Score = 34.3 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 6/74 (8%) Query: 8 TSAFSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQERVEAAEKR 67 TS + T + CD+ ++DL K Q ++A + + AQ+R++AAE R Sbjct: 257 TSTYMTLKINSALCDVAADVSKDLSVK------QRQRDAEVRKAGATNAAQKRMQAAEDR 310 Query: 68 VKEVEERATASRKL 81 VKEV+ER +L Sbjct: 311 VKEVQERKQTLEEL 324 >gi|261328521|emb|CBH11498.1| 65 kDa invariant surface glycoprotein, putative [Trypanosoma brucei gambiense DAL972] Length = 431 Score = 34.3 bits (77), Expect = 6.1, Method: Composition-based stats. Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 4/62 (6%) Query: 28 TEDLLNKLQKNSTQMIKE----ANFKISETHRLAQERVEAAEKRVKEVEERATASRKLSV 83 + D +KL T+ +K+ A K+SE A+E E A KR + V E A +R + Sbjct: 84 SNDGYSKLSDADTKKVKDIYEKAKGKVSEQLPKAKEFGEEAGKRCQSVTEAAKKARGWGL 143 Query: 84 DE 85 D+ Sbjct: 144 DD 145 >gi|224075467|ref|XP_002304646.1| predicted protein [Populus trichocarpa] gi|222842078|gb|EEE79625.1| predicted protein [Populus trichocarpa] Length = 555 Score = 34.3 bits (77), Expect = 6.2, Method: Composition-based stats. Identities = 18/60 (30%), Positives = 36/60 (60%), Gaps = 4/60 (6%) Query: 22 DIVIGRTEDLLNKLQKNSTQ----MIKEANFKISETHRLAQERVEAAEKRVKEVEERATA 77 ++V ++D ++ + ++ + M++E N I E RL +ER + A+ RVKE+E++ A Sbjct: 194 EVVNFNSKDTGDQHEASALRDELDMLQEENGNILEKLRLEEERCKEADARVKELEKQVAA 253 >gi|261328080|emb|CBH11057.1| 65 kDa invariant surface glycoprotein, putative [Trypanosoma brucei gambiense DAL972] Length = 432 Score = 33.9 bits (76), Expect = 6.4, Method: Composition-based stats. Identities = 21/62 (33%), Positives = 32/62 (51%), Gaps = 4/62 (6%) Query: 28 TEDLLNKLQKNSTQMIKE----ANFKISETHRLAQERVEAAEKRVKEVEERATASRKLSV 83 + + +KL T+ +KE A K+SE A+E E A KR +EV E A +R + Sbjct: 84 SNNGYSKLSDADTKKVKEIYEKAKGKVSEQLPKAKEFGEEAGKRHQEVTEAAKRARGWGL 143 Query: 84 DE 85 D+ Sbjct: 144 DD 145 >gi|270009401|gb|EFA05849.1| hypothetical protein TcasGA2_TC008640 [Tribolium castaneum] Length = 4573 Score = 33.9 bits (76), Expect = 7.1, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 56/111 (50%), Gaps = 6/111 (5%) Query: 11 FSTTLLTIGGCDIVIGRTEDLLNKLQKNSTQMIKEANFKISETHRLAQERVEAAEK--RV 68 ++T + G V+ E LN+ + + ++I + +E ++A+E+ A+E+ +V Sbjct: 3163 LASTSEEVDGLKEVLAVQEVELNEKNEAAGKLIAVLS---AENEKVAKEQAIASEEEAKV 3219 Query: 69 KEVEERATASRKLSVDELANAFWDLSDEDKNAFTGNVKQEVCKVKKITVPP 119 K +EE + +K+ D+LA A L + T N K + ++K T PP Sbjct: 3220 KLIEEDVSVKQKICADDLAKAEPALIAAQQALNTLN-KNNLTELKSFTQPP 3269 >gi|255948250|ref|XP_002564892.1| Pc22g08800 [Penicillium chrysogenum Wisconsin 54-1255] gi|211591909|emb|CAP98168.1| Pc22g08800 [Penicillium chrysogenum Wisconsin 54-1255] Length = 481 Score = 33.9 bits (76), Expect = 7.9, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 32/78 (41%), Gaps = 5/78 (6%) Query: 23 IVIGRTEDLLNKLQKNST---QMIKEANFKI--SETHRLAQERVEAAEKRVKEVEERATA 77 I+ G ++ L+ + NS + KE I H ++ A + +KE EE + Sbjct: 399 IIKGASKSPLSSIAGNSRVHRRCTKETKLDIGNESAHARIRDYWLRANQALKEKEEASGG 458 Query: 78 SRKLSVDELANAFWDLSD 95 L V++ + ++ Sbjct: 459 RDYLDVEDEVFKYVNIDS 476 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.316 0.129 0.322 Lambda K H 0.267 0.0392 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,653,615,044 Number of Sequences: 14124377 Number of extensions: 49592727 Number of successful extensions: 293169 Number of sequences better than 10.0: 124 Number of HSP's better than 10.0 without gapping: 95 Number of HSP's successfully gapped in prelim test: 289 Number of HSP's that attempted gapping in prelim test: 292829 Number of HSP's gapped (non-prelim): 659 length of query: 121 length of database: 4,842,793,630 effective HSP length: 88 effective length of query: 33 effective length of database: 3,599,848,454 effective search space: 118794998982 effective search space used: 118794998982 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (22.1 bits) S2: 75 (33.6 bits)