BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780682|ref|YP_003065095.1| hypothetical protein CLIBASIA_02845 [Candidatus Liberibacter asiaticus str. psy62] (199 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254780682|ref|YP_003065095.1| hypothetical protein CLIBASIA_02845 [Candidatus Liberibacter asiaticus str. psy62] gi|254040359|gb|ACT57155.1| hypothetical protein CLIBASIA_02845 [Candidatus Liberibacter asiaticus str. psy62] Length = 199 Score = 340 bits (872), Expect = 6e-92, Method: Composition-based stats. Identities = 199/199 (100%), Positives = 199/199 (100%) Query: 1 MRSTILFLIKHKLKLHKNIMSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIILILSP 60 MRSTILFLIKHKLKLHKNIMSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIILILSP Sbjct: 1 MRSTILFLIKHKLKLHKNIMSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIILILSP 60 Query: 61 TDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIA 120 TDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIA Sbjct: 61 TDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIA 120 Query: 121 LLKERLRTDINTFDNTNLETKIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLKIKN 180 LLKERLRTDINTFDNTNLETKIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLKIKN Sbjct: 121 LLKERLRTDINTFDNTNLETKIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLKIKN 180 Query: 181 NQEIKNIHHKKNKPRLHCQ 199 NQEIKNIHHKKNKPRLHCQ Sbjct: 181 NQEIKNIHHKKNKPRLHCQ 199 >gi|315122207|ref|YP_004062696.1| Peptidase M23 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495609|gb|ADR52208.1| Peptidase M23 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 344 Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats. Identities = 61/171 (35%), Positives = 84/171 (49%), Gaps = 30/171 (17%) Query: 55 ILILSPTDCTANTANILIPSRKIIDYH-RLLEQKKNHNLQYSLINIPSQNKQESPKNANN 113 +LIL T N NIL+ KI +YH RL+EQK N Q +I IP KQE P N NN Sbjct: 23 LLILVSTSDKTNAENILVLPSKITNYHNRLVEQKNNPEKQ--VIPIPLPTKQEYPSNTNN 80 Query: 114 NILDHIALLKERLRTDINT-FDNTNLETKIPLP----------NNLKPNVCVKEKKLIPP 162 +D LL ++ D N + + L KIPLP N+L N C++ K Sbjct: 81 KTIDQNTLLAKKGFVDKNNLYSSMYLSKKIPLPNKCLLFPPDNNSLHLNNCIENKSPNSS 140 Query: 163 RKNINNLKDTNHR--------------LKIKNNQEIKNIHHKKNK--PRLH 197 +KNI++ + K+KNNQ+IK+I +KKN P++H Sbjct: 141 KKNISHTRKIPKYKKNNPKKSGNIAPAFKVKNNQKIKHIQYKKNNYSPQIH 191 >gi|145523511|ref|XP_001447594.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124415105|emb|CAK80197.1| unnamed protein product [Paramecium tetraurelia] Length = 983 Score = 39.3 bits (90), Expect = 0.31, Method: Composition-based stats. Identities = 28/104 (26%), Positives = 51/104 (49%), Gaps = 17/104 (16%) Query: 86 QKKNHNLQYSLINIPSQNKQESPKNANNNILDHIALLKERLRTDINTFDNTNLETKIPLP 145 + KNH L+ L+N+ K E P NA ++ I LLK+ +D N ++++L + Sbjct: 815 RNKNHILEAELMNLKEAKKIEDPNNATKRLISEIQLLKQSQVSDNNNINDSSLRRE---R 871 Query: 146 NNLKPNVCVKEKKLIPPRKNINNLKDTN----HRLKIKNNQEIK 185 N+L+ R+ + LKD+N H+++ N + +K Sbjct: 872 NDLQEE----------NRRLVQLLKDSNKWDLHKMQQDNERLLK 905 >gi|325690047|gb|EGD32051.1| hypothetical protein HMPREF9382_1005 [Streptococcus sanguinis SK115] Length = 547 Score = 36.6 bits (83), Expect = 1.8, Method: Composition-based stats. Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 3/59 (5%) Query: 115 ILDHIALLKERLRTDINTFDNTNLETKIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTN 173 I+D++ ++E+LR DI +DN T + NL V K+KK I PR ++N + N Sbjct: 275 IIDNVNCIREKLRFDIEDYDNFIHYTSLKTLKNL---VYTKDKKNIYPRLRLSNARQMN 330 >gi|295702542|ref|YP_003595617.1| extracellular ribonuclease [Bacillus megaterium DSM 319] gi|294800201|gb|ADF37267.1| Extracellular ribonuclease [Bacillus megaterium DSM 319] Length = 384 Score = 36.6 bits (83), Expect = 2.0, Method: Composition-based stats. Identities = 29/97 (29%), Positives = 43/97 (44%), Gaps = 21/97 (21%) Query: 34 TLFPYLSHQIHKALMSISSGIILILSPTDCTANTANILIPSRKIIDYHRLLEQKKNHNLQ 93 TL PY SH K++ SIS+ T T + P+ + DY+R + +K ++L+ Sbjct: 120 TLTPYFSHPGIKSVTSIST-------ETGGTDPAPDPTEPTVPVEDYYRTVAEKTGNSLK 172 Query: 94 YSLINIPSQNKQ--------------ESPKNANNNIL 116 L NI + + E P NANN IL Sbjct: 173 TELHNIIDHHTELSYSAVWEALKKTDEDPANANNVIL 209 >gi|51490739|emb|CAH18713.1| extracellular ribonuclease [Bacillus megaterium] Length = 382 Score = 36.6 bits (83), Expect = 2.0, Method: Composition-based stats. Identities = 29/97 (29%), Positives = 43/97 (44%), Gaps = 21/97 (21%) Query: 34 TLFPYLSHQIHKALMSISSGIILILSPTDCTANTANILIPSRKIIDYHRLLEQKKNHNLQ 93 TL PY SH K++ SIS+ T T + P+ + DY+R + +K ++L+ Sbjct: 120 TLTPYFSHPGIKSVTSIST-------ETGGTDPAPDPTEPTVPVEDYYRTVAEKTGNSLK 172 Query: 94 YSLINIPSQNKQ--------------ESPKNANNNIL 116 L NI + + E P NANN IL Sbjct: 173 TELHNIIDHHTELSYSAVWEALKETDEDPANANNVIL 209 >gi|113476744|ref|YP_722805.1| WD-40 repeat-containing protein [Trichodesmium erythraeum IMS101] gi|110167792|gb|ABG52332.1| WD-40 repeat [Trichodesmium erythraeum IMS101] Length = 1304 Score = 36.6 bits (83), Expect = 2.2, Method: Composition-based stats. Identities = 44/156 (28%), Positives = 67/156 (42%), Gaps = 32/156 (20%) Query: 65 ANTANILIPSRKIIDYHRLL--EQK--------KNHNLQYSLI----NIPSQNKQESPKN 110 AN +N IP + H + EQK KN N Q + N Q++ KN Sbjct: 409 ANVSNFEIPENQKNYSHNEINSEQKLLVNPFKLKNFNNQVTNFQLGENFSGLESQDNQKN 468 Query: 111 ANNNILDHIALLKERLRTDINTFD----NTNLETKIP----------LPNNLKPNVCVKE 156 +NNI+D E L D+N FD N NLE I NNL + ++ Sbjct: 469 QSNNIIDPEQKQLENL-LDLNQFDSSVTNLNLEENISNLQLFENHQIQSNNL---IHPQQ 524 Query: 157 KKLIPPRKNINNLKDTNHRLKIKNNQEIKNIHHKKN 192 K+L P + I + K+ ++++ K +++ HK N Sbjct: 525 KELETPGEKITHQKNNKNQVRDKKAKKLAISQHKLN 560 >gi|296083309|emb|CBI22945.3| unnamed protein product [Vitis vinifera] Length = 628 Score = 36.2 bits (82), Expect = 2.3, Method: Composition-based stats. Identities = 30/109 (27%), Positives = 55/109 (50%), Gaps = 12/109 (11%) Query: 65 ANTANILIPSRKIIDYHRLLEQKKNHNLQY------SLINIPSQNKQESPKNANNNI--L 116 AN L+P KI D+H ++ +N N+ L+N+ SQNK K+ ++ Sbjct: 387 ANAEEALLPIPKIFDFHEKNKRYQNSNIDPDEIMADDLMNVRSQNKAVESKDGLESVEAS 446 Query: 117 DHIALLKERLRTDINTFDNTNLETKIPLPNNLK-PNVCVKEKKLIPPRK 164 + + L+E+LR+ + DN + T + N+ K + ++E +PP+K Sbjct: 447 NETSCLEEQLRS-LGLLDNKDDLTSNSMLNSRKFKGISLEEN--MPPKK 492 >gi|57096618|ref|XP_532571.1| PREDICTED: similar to SHC SH2-domain binding protein 1 [Canis familiaris] Length = 666 Score = 35.8 bits (81), Expect = 3.0, Method: Composition-based stats. Identities = 29/92 (31%), Positives = 43/92 (46%), Gaps = 14/92 (15%) Query: 72 IPSRKIIDYHRLLEQ-----KKNHNLQYSLINIPSQNKQESPKNANN-NILDHIALLKER 125 +PS I+DYH LL Q +K NL+ SL N S ++QE+ + I LK++ Sbjct: 233 VPSGLIVDYHNLLSQCEESYRKFLNLRSSLSNCNSDSEQENISMVEGLKLYSEIEQLKQK 292 Query: 126 LRTDINTF--------DNTNLETKIPLPNNLK 149 L+ N N+N++ K PN K Sbjct: 293 LKLIENPLLRYVFGYQKNSNIQAKGIRPNGQK 324 >gi|147853966|emb|CAN79552.1| hypothetical protein VITISV_025726 [Vitis vinifera] Length = 692 Score = 35.8 bits (81), Expect = 3.1, Method: Composition-based stats. Identities = 30/109 (27%), Positives = 55/109 (50%), Gaps = 12/109 (11%) Query: 65 ANTANILIPSRKIIDYHRLLEQKKNHNLQY------SLINIPSQNKQESPKNANNNI--L 116 AN L+P KI D+H ++ +N N+ L+N+ SQNK K+ ++ Sbjct: 451 ANAEEALLPIPKIFDFHEKNKRYQNSNIDPDEIMADDLMNVRSQNKAVESKDDLESVEAS 510 Query: 117 DHIALLKERLRTDINTFDNTNLETKIPLPNNLK-PNVCVKEKKLIPPRK 164 + + L+E+LR+ + DN + T + N+ K + ++E +PP+K Sbjct: 511 NETSCLEEQLRS-LGLLDNKDDLTSNSMLNSTKFKGISLEEN--MPPKK 556 >gi|74201453|dbj|BAE26159.1| unnamed protein product [Mus musculus] Length = 668 Score = 35.8 bits (81), Expect = 3.3, Method: Composition-based stats. Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 6/62 (9%) Query: 72 IPSRKIIDYHRLLEQ-----KKNHNLQYSLINIPSQNKQESPKNANN-NILDHIALLKER 125 +PS I+DYH LL Q +K NL+ SL N S ++QE+ N+ I LK++ Sbjct: 239 VPSGLIVDYHNLLSQCEESYRKFLNLRSSLSNCNSDSEQENISMVEGLNLYSEIEQLKQK 298 Query: 126 LR 127 L+ Sbjct: 299 LK 300 >gi|74146441|dbj|BAE28972.1| unnamed protein product [Mus musculus] Length = 357 Score = 35.8 bits (81), Expect = 3.3, Method: Composition-based stats. Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 6/62 (9%) Query: 72 IPSRKIIDYHRLLEQ-----KKNHNLQYSLINIPSQNKQESPKNANN-NILDHIALLKER 125 +PS I+DYH LL Q +K NL+ SL N S ++QE+ N+ I LK++ Sbjct: 239 VPSGLIVDYHNLLSQCEESYRKFLNLRSSLSNCNSDSEQENISMVEGLNLYSEIEQLKQK 298 Query: 126 LR 127 L+ Sbjct: 299 LK 300 >gi|6755508|ref|NP_035499.1| SHC SH2 domain-binding protein 1 [Mus musculus] gi|81882097|sp|Q9Z179|SHCBP_MOUSE RecName: Full=SHC SH2 domain-binding protein 1; AltName: Full=Protein expressed in activated lymphocytes; Short=mPAL; AltName: Full=SHC-binding protein gi|4102877|gb|AAD01613.1| Shc binding protein [Mus musculus] gi|47683039|gb|AAH70455.1| Shc SH2-domain binding protein 1 [Mus musculus] gi|148690064|gb|EDL22011.1| Shc SH2-domain binding protein 1, isoform CRA_b [Mus musculus] Length = 668 Score = 35.8 bits (81), Expect = 3.3, Method: Composition-based stats. Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 6/62 (9%) Query: 72 IPSRKIIDYHRLLEQ-----KKNHNLQYSLINIPSQNKQESPKNANN-NILDHIALLKER 125 +PS I+DYH LL Q +K NL+ SL N S ++QE+ N+ I LK++ Sbjct: 239 VPSGLIVDYHNLLSQCEESYRKFLNLRSSLSNCNSDSEQENISMVEGLNLYSEIEQLKQK 298 Query: 126 LR 127 L+ Sbjct: 299 LK 300 >gi|66805773|ref|XP_636608.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4] gi|60464996|gb|EAL63106.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4] Length = 560 Score = 35.8 bits (81), Expect = 3.4, Method: Composition-based stats. Identities = 42/180 (23%), Positives = 85/180 (47%), Gaps = 21/180 (11%) Query: 13 LKLHKNIMSKGKI--NESFW--YIRTLFPYLSHQIHKALMSISSGIILILSPTDC-TANT 67 L + +++ G I N + W + R + +++ I ++ S+S ++++P D N Sbjct: 338 LSMDGSLLISGSIDGNCNIWDTFSRQIVRSIANTIKGSISSLS----VLMNPIDSLNFNV 393 Query: 68 ANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIALLKERLR 127 ++K D E+ ++N++ + ++P + KQ + N NI ++ L+ R+ Sbjct: 394 NGTAENNKKASDPIAPFEKYSSNNIERT--SVPIKLKQ---IDDNGNIKITVSNLQSRIE 448 Query: 128 TDINTFDNTNLETKIPLPNN-----LKPNVCVKEKKLIPPRKNINNLKDTNHRLKIKNNQ 182 I N + L NN L + EK+++ +K I NL+D N+ LK+ N+Q Sbjct: 449 NTI--VSNEQITNHTNLENNNKIIELNKTIQEYEKEILQFKKQIQNLQDNNNTLKLTNDQ 506 >gi|198425856|ref|XP_002124004.1| PREDICTED: similar to Serine/threonine-protein kinase atr (Ataxia telangiectasia and Rad3-related protein) (Xatr) [Ciona intestinalis] Length = 2497 Score = 35.4 bits (80), Expect = 4.1, Method: Composition-based stats. Identities = 42/155 (27%), Positives = 65/155 (41%), Gaps = 36/155 (23%) Query: 10 KHKLKLHKNIMSKGKINESF----------WYIRTLFPYLSHQI-------HKALMSISS 52 K+ L L K + G ++ +F W ++TL YL QI + L SIS+ Sbjct: 738 KNNLALSKYLELVGAMSRTFVDVSTLSVLEWTLKTLVQYLVRQIPHCYSTSYLQLESISA 797 Query: 53 GII-----------LILSPTDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLIN--- 98 I LI+ C AN + PSRK +D L + N +L+Y + N Sbjct: 798 DISRRQEVFRHHCRLIIET--CVANLLDENNPSRKPLDVMEDLSKLFNSSLEYFITNNLM 855 Query: 99 ---IPSQNKQESPKNANNNILDHIALLKERLRTDI 130 P Q+S N +N+L ++ E+ R D+ Sbjct: 856 YLLPPIIVMQQSHNNKKDNLLTLLSENSEKSRIDM 890 >gi|30684987|ref|NP_564005.2| unknown protein [Arabidopsis thaliana] gi|22022516|gb|AAM83216.1| At1g16750/F19K19_26 [Arabidopsis thaliana] gi|24111407|gb|AAN46838.1| At1g16750/F19K19_26 [Arabidopsis thaliana] gi|332191371|gb|AEE29492.1| uncharacterized protein [Arabidopsis thaliana] Length = 529 Score = 35.4 bits (80), Expect = 4.2, Method: Composition-based stats. Identities = 38/151 (25%), Positives = 66/151 (43%), Gaps = 29/151 (19%) Query: 7 FLIKHKLKLHKNIMSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIILILSP--TDCT 64 F ++H +K KN + K +R L S Q HK +S +S + + T+ Sbjct: 45 FELEHDVKRLKNQLQKETA------MRALLLKASDQSHKIELSHASSLPRSVQELLTNIA 98 Query: 65 ANTANILIPSRKIIDYHRLLEQKKN------HNLQYSL--------INIPSQNKQESPKN 110 A A + ++I+ H LL Q++N +NL +SL + + +N+ PK Sbjct: 99 AMEATVSKLEQEIMSLHFLLIQERNERKLAEYNLTHSLSPPNALDLVRLSEKNESLRPK- 157 Query: 111 ANNNILDHIALLKERLRTDINTFDNTNLETK 141 DH A + ++ + +FDN N +K Sbjct: 158 ------DHKAQPRSKVAKSLQSFDNANELSK 182 >gi|328786694|ref|XP_397483.4| PREDICTED: hypothetical protein LOC414048 [Apis mellifera] Length = 1219 Score = 35.4 bits (80), Expect = 4.4, Method: Composition-based stats. Identities = 27/92 (29%), Positives = 49/92 (53%), Gaps = 11/92 (11%) Query: 101 SQNKQESPKNANNNILDHIALLKERLRTDINTFDNTNLETKIPLPNNLKPNVCVKEKKLI 160 SQN + KN++NNI + ++++KE DIN + + + E+ I + N +K + I Sbjct: 483 SQNVKRK-KNSSNNIENDVSIIKEN-SDDINIYHSLSKESDILINNKIKETLDQLNSDSI 540 Query: 161 PPRKNIN---------NLKDTNHRLKIKNNQE 183 +K IN + +D+ RL+I N++E Sbjct: 541 KIKKEINKEDESFDELSDEDSRDRLEIDNSEE 572 >gi|332519820|ref|ZP_08396284.1| efflux transporter, RND family, MFP subunit [Lacinutrix algicola 5H-3-7-4] gi|332044379|gb|EGI80573.1| efflux transporter, RND family, MFP subunit [Lacinutrix algicola 5H-3-7-4] Length = 378 Score = 35.4 bits (80), Expect = 4.9, Method: Composition-based stats. Identities = 25/108 (23%), Positives = 49/108 (45%), Gaps = 11/108 (10%) Query: 82 RLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIALLKERLRTD-----------I 130 +L + + +N +LI++ + S ++A +N+ + IA + LR D + Sbjct: 83 KLFKPGQEYNRGETLISLDAAEYYASVQSAKSNLYNSIAAIMPDLRLDFKDVYQKWQTYL 142 Query: 131 NTFDNTNLETKIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLKI 178 N FD K+P +N K N + + +I N+ NL+ + +I Sbjct: 143 NNFDLNKTTPKLPEVSNEKENYFITGRGIISAYYNVKNLEQRLAKYRI 190 >gi|328714558|ref|XP_001944844.2| PREDICTED: hypothetical protein LOC100168206 [Acyrthosiphon pisum] Length = 3806 Score = 35.0 bits (79), Expect = 5.3, Method: Composition-based stats. Identities = 15/43 (34%), Positives = 24/43 (55%) Query: 135 NTNLETKIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLK 177 +T+ E ++P P+ LK + +K KK+ P N LK+ N K Sbjct: 2858 DTDYEPQLPSPSQLKYKILIKNKKIAPLESEANRLKNNNSSTK 2900 >gi|326405796|gb|ADZ62867.1| Putative glycerophosphoryl diester phosphodiesterase [Lactococcus lactis subsp. lactis CV56] Length = 703 Score = 35.0 bits (79), Expect = 6.1, Method: Composition-based stats. Identities = 30/116 (25%), Positives = 52/116 (44%), Gaps = 10/116 (8%) Query: 85 EQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIALLKERLRTDINTFDNTNLET---- 140 +Q+ + N+ SL+ N + N+ D I+ E++ D N+FD+ N ET Sbjct: 49 DQEDSSNISLSLVQESKLNDEADSTLKTNSSTDQISN-DEKINQD-NSFDHGNSETGQDV 106 Query: 141 KIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLKIKNNQEIKNIHHKKNKPRL 196 K+P N++ KE+ +NL + L+ + QE NI + KP + Sbjct: 107 KVPESNSMDN----KEQNSTSYTTYDSNLTQDSKILESQTKQETNNISETEKKPEV 158 >gi|12045116|ref|NP_072927.1| DNA polymerase III, alpha subunit [Mycoplasma genitalium G37] gi|255660357|ref|ZP_05405766.1| DNA polymerase III, alpha subunit [Mycoplasma genitalium G37] gi|2494193|sp|Q49405|DPO3A_MYCGE RecName: Full=DNA polymerase III subunit alpha gi|1045954|gb|AAC71482.1| DNA polymerase III, alpha subunit [Mycoplasma genitalium G37] gi|166078627|gb|ABY79245.1| DNA polymerase III, alpha subunit [synthetic Mycoplasma genitalium JCVI-1.0] Length = 874 Score = 35.0 bits (79), Expect = 6.1, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 42/81 (51%), Gaps = 8/81 (9%) Query: 115 ILDHIALLKERLRTDINTFDNTNLETKIPLPNNLKPNVCVKEKKL---IPPRKNIN---- 167 +L+H K++ + I T + + +E PL N +PN ++ KK+ + K IN Sbjct: 724 LLNHFQSSKDKQKLIIRTLEKSGIEIYPPLLNKAQPNSVIENKKIYLGLNLIKGINDRYI 783 Query: 168 -NLKDTNHRLKIKNNQEIKNI 187 NL+ H ++ +NN ++ ++ Sbjct: 784 QNLQKVQHLIQTQNNLQLTDV 804 >gi|83319343|ref|YP_424108.1| oligopeptide ABC transporter, oligopeptide-binding protein, putative [Mycoplasma capricolum subsp. capricolum ATCC 27343] gi|83283229|gb|ABC01161.1| oligopeptide ABC transporter, oligopeptide-binding protein, putative [Mycoplasma capricolum subsp. capricolum ATCC 27343] Length = 984 Score = 34.7 bits (78), Expect = 6.6, Method: Composition-based stats. Identities = 24/67 (35%), Positives = 34/67 (50%), Gaps = 6/67 (8%) Query: 133 FDNTNLETKIPLPN--NLKPNVCVKEKKLIPPRKNINNLK----DTNHRLKIKNNQEIKN 186 FD ET I LPN L PN+ V+++KLIP RK + K D + + ++ +K Sbjct: 331 FDPNADETIIMLPNLERLNPNLSVEQRKLIPQRKAVKIKKYLFTDPRQKFSKEFDELLKK 390 Query: 187 IHHKKNK 193 KNK Sbjct: 391 SKELKNK 397 >gi|332826761|gb|EGJ99578.1| hypothetical protein HMPREF9455_04074 [Dysgonomonas gadei ATCC BAA-286] Length = 1476 Score = 34.7 bits (78), Expect = 7.2, Method: Composition-based stats. Identities = 37/126 (29%), Positives = 55/126 (43%), Gaps = 9/126 (7%) Query: 13 LKLHKNIMSKGKINESFWYIRTLF--PYLSHQIHKALMSISSGIILILSPTDCTANTANI 70 L L ++S KI S+ I F + + A MSI SG + I +P D N+ Sbjct: 637 LNLPVTVLSTSKIIGSYNNINDRFNLEIFAPSVKAAGMSIQSGYVAIRNPHDTIDAKVNV 696 Query: 71 LIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIALLK---ERLR 127 LI +K + K NL + I++ + Q++ N + + L L K E LR Sbjct: 697 LIAGKKNTINDVAINAKVKDNLINTNISLVNTGAQKAMGNFSISTL----LTKNEAEPLR 752 Query: 128 TDINTF 133 DINT Sbjct: 753 IDINTL 758 >gi|320168470|gb|EFW45369.1| predicted protein [Capsaspora owczarzaki ATCC 30864] Length = 546 Score = 34.7 bits (78), Expect = 7.2, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 44/91 (48%), Gaps = 6/91 (6%) Query: 59 SPTDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPS--QNKQESPKNANNNIL 116 +P+D + LI SRK D H +E+++ +N+ ++ + S N + PK + +IL Sbjct: 449 APSDYDPDDIQTLIKSRKKKDNHNAIERRRRYNINDRIVELGSLLPNAEIDPKASKGSIL 508 Query: 117 ----DHIALLKERLRTDINTFDNTNLETKIP 143 D+I L++ R+ T ++P Sbjct: 509 KRSVDYIKYLQDINRSLSEKLAQTGTVVELP 539 >gi|300870969|ref|YP_003785840.1| transporter [Brachyspira pilosicoli 95/1000] gi|300688668|gb|ADK31339.1| transporter [Brachyspira pilosicoli 95/1000] Length = 446 Score = 34.3 bits (77), Expect = 9.2, Method: Composition-based stats. Identities = 37/116 (31%), Positives = 52/116 (44%), Gaps = 23/116 (19%) Query: 88 KNHNLQYSLINIPSQNKQESPKNANNNILDHIALLKERLRTDINTFDNTNLETKIPLPNN 147 K+ L L +NKQ KN NNNI D ++K L I +++N E Sbjct: 191 KHSFLSILLFEQQDKNKQYISKNINNNIFD---VIKYFLSMSIYIKNSSNTE-------- 239 Query: 148 LKPNVCVKEKKLIPPRKN----INNLKDTNHRLKIKNN------QEIKNIHHKKNK 193 K + V +KK IP N IN + NH KI NN +IK +++KK + Sbjct: 240 -KGKISV-DKKFIPKMDNGSIKINEEEKLNHTEKILNNFFTSIYSDIKQVYYKKER 293 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.318 0.134 0.390 Lambda K H 0.267 0.0411 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,880,619,649 Number of Sequences: 14124377 Number of extensions: 79056716 Number of successful extensions: 258226 Number of sequences better than 10.0: 599 Number of HSP's better than 10.0 without gapping: 29 Number of HSP's successfully gapped in prelim test: 570 Number of HSP's that attempted gapping in prelim test: 256345 Number of HSP's gapped (non-prelim): 2351 length of query: 199 length of database: 4,842,793,630 effective HSP length: 132 effective length of query: 67 effective length of database: 2,978,375,866 effective search space: 199551183022 effective search space used: 199551183022 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 77 (34.3 bits)