BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780682|ref|YP_003065095.1| hypothetical protein CLIBASIA_02845 [Candidatus Liberibacter asiaticus str. psy62] (199 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254780682|ref|YP_003065095.1| hypothetical protein CLIBASIA_02845 [Candidatus Liberibacter asiaticus str. psy62] gi|254040359|gb|ACT57155.1| hypothetical protein CLIBASIA_02845 [Candidatus Liberibacter asiaticus str. psy62] Length = 199 Score = 392 bits (1006), Expect = e-107, Method: Compositional matrix adjust. Identities = 199/199 (100%), Positives = 199/199 (100%) Query: 1 MRSTILFLIKHKLKLHKNIMSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIILILSP 60 MRSTILFLIKHKLKLHKNIMSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIILILSP Sbjct: 1 MRSTILFLIKHKLKLHKNIMSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIILILSP 60 Query: 61 TDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIA 120 TDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIA Sbjct: 61 TDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIA 120 Query: 121 LLKERLRTDINTFDNTNLETKIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLKIKN 180 LLKERLRTDINTFDNTNLETKIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLKIKN Sbjct: 121 LLKERLRTDINTFDNTNLETKIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLKIKN 180 Query: 181 NQEIKNIHHKKNKPRLHCQ 199 NQEIKNIHHKKNKPRLHCQ Sbjct: 181 NQEIKNIHHKKNKPRLHCQ 199 >gi|315122207|ref|YP_004062696.1| Peptidase M23 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495609|gb|ADR52208.1| Peptidase M23 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 344 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 63/182 (34%), Positives = 88/182 (48%), Gaps = 30/182 (16%) Query: 44 HKALMSISSGIILILSPTDCTANTANILIPSRKIIDYH-RLLEQKKNHNLQYSLINIPSQ 102 +K L +LIL T N NIL+ KI +YH RL+EQK N Q +I IP Sbjct: 12 YKILAKTIFCFLLILVSTSDKTNAENILVLPSKITNYHNRLVEQKNNPEKQ--VIPIPLP 69 Query: 103 NKQESPKNANNNILDHIALLKERLRTDINT-FDNTNLETKIPLP----------NNLKPN 151 KQE P N NN +D LL ++ D N + + L KIPLP N+L N Sbjct: 70 TKQEYPSNTNNKTIDQNTLLAKKGFVDKNNLYSSMYLSKKIPLPNKCLLFPPDNNSLHLN 129 Query: 152 VCVKEKKLIPPRKNINNLK--------------DTNHRLKIKNNQEIKNIHHKKN--KPR 195 C++ K +KNI++ + + K+KNNQ+IK+I +KKN P+ Sbjct: 130 NCIENKSPNSSKKNISHTRKIPKYKKNNPKKSGNIAPAFKVKNNQKIKHIQYKKNNYSPQ 189 Query: 196 LH 197 +H Sbjct: 190 IH 191 >gi|57096618|ref|XP_532571.1| PREDICTED: similar to SHC SH2-domain binding protein 1 [Canis familiaris] Length = 666 Score = 35.8 bits (81), Expect = 3.6, Method: Composition-based stats. Identities = 29/92 (31%), Positives = 43/92 (46%), Gaps = 14/92 (15%) Query: 72 IPSRKIIDYHRLLEQ-----KKNHNLQYSLINIPSQNKQESPKNANN-NILDHIALLKER 125 +PS I+DYH LL Q +K NL+ SL N S ++QE+ + I LK++ Sbjct: 233 VPSGLIVDYHNLLSQCEESYRKFLNLRSSLSNCNSDSEQENISMVEGLKLYSEIEQLKQK 292 Query: 126 LRTDINTF--------DNTNLETKIPLPNNLK 149 L+ N N+N++ K PN K Sbjct: 293 LKLIENPLLRYVFGYQKNSNIQAKGIRPNGQK 324 >gi|147853966|emb|CAN79552.1| hypothetical protein VITISV_025726 [Vitis vinifera] Length = 692 Score = 35.4 bits (80), Expect = 4.2, Method: Composition-based stats. Identities = 30/109 (27%), Positives = 55/109 (50%), Gaps = 12/109 (11%) Query: 65 ANTANILIPSRKIIDYHRLLEQKKNHNLQY------SLINIPSQNKQESPKNANNNI--L 116 AN L+P KI D+H ++ +N N+ L+N+ SQNK K+ ++ Sbjct: 451 ANAEEALLPIPKIFDFHEKNKRYQNSNIDPDEIMADDLMNVRSQNKAVESKDDLESVEAS 510 Query: 117 DHIALLKERLRTDINTFDNTNLETKIPLPNNLK-PNVCVKEKKLIPPRK 164 + + L+E+LR+ + DN + T + N+ K + ++E +PP+K Sbjct: 511 NETSCLEEQLRS-LGLLDNKDDLTSNSMLNSTKFKGISLEEN--MPPKK 556 >gi|296083309|emb|CBI22945.3| unnamed protein product [Vitis vinifera] Length = 628 Score = 35.0 bits (79), Expect = 5.3, Method: Composition-based stats. Identities = 30/109 (27%), Positives = 55/109 (50%), Gaps = 12/109 (11%) Query: 65 ANTANILIPSRKIIDYHRLLEQKKNHNLQY------SLINIPSQNKQESPKNANNNI--L 116 AN L+P KI D+H ++ +N N+ L+N+ SQNK K+ ++ Sbjct: 387 ANAEEALLPIPKIFDFHEKNKRYQNSNIDPDEIMADDLMNVRSQNKAVESKDGLESVEAS 446 Query: 117 DHIALLKERLRTDINTFDNTNLETKIPLPNNLK-PNVCVKEKKLIPPRK 164 + + L+E+LR+ + DN + T + N+ K + ++E +PP+K Sbjct: 447 NETSCLEEQLRS-LGLLDNKDDLTSNSMLNSRKFKGISLEEN--MPPKK 492 >gi|51490739|emb|CAH18713.1| extracellular ribonuclease [Bacillus megaterium] Length = 382 Score = 35.0 bits (79), Expect = 5.4, Method: Compositional matrix adjust. Identities = 30/97 (30%), Positives = 41/97 (42%), Gaps = 21/97 (21%) Query: 34 TLFPYLSHQIHKALMSISSGIILILSPTDCTANTANILIPSRKIIDYHRLLEQKKN---- 89 TL PY SH K++ SIS+ T T + P+ + DY+R + +K Sbjct: 120 TLTPYFSHPGIKSVTSIST-------ETGGTDPAPDPTEPTVPVEDYYRTVAEKTGNSLK 172 Query: 90 ---HN-------LQYSLINIPSQNKQESPKNANNNIL 116 HN L YS + + E P NANN IL Sbjct: 173 TELHNIIDHHTELSYSAVWEALKETDEDPANANNVIL 209 >gi|295702542|ref|YP_003595617.1| extracellular ribonuclease [Bacillus megaterium DSM 319] gi|294800201|gb|ADF37267.1| Extracellular ribonuclease [Bacillus megaterium DSM 319] Length = 384 Score = 35.0 bits (79), Expect = 6.3, Method: Compositional matrix adjust. Identities = 30/97 (30%), Positives = 41/97 (42%), Gaps = 21/97 (21%) Query: 34 TLFPYLSHQIHKALMSISSGIILILSPTDCTANTANILIPSRKIIDYHRLLEQKKN---- 89 TL PY SH K++ SIS+ T T + P+ + DY+R + +K Sbjct: 120 TLTPYFSHPGIKSVTSIST-------ETGGTDPAPDPTEPTVPVEDYYRTVAEKTGNSLK 172 Query: 90 ---HN-------LQYSLINIPSQNKQESPKNANNNIL 116 HN L YS + + E P NANN IL Sbjct: 173 TELHNIIDHHTELSYSAVWEALKKTDEDPANANNVIL 209 >gi|6755508|ref|NP_035499.1| SHC SH2 domain-binding protein 1 [Mus musculus] gi|81882097|sp|Q9Z179|SHCBP_MOUSE RecName: Full=SHC SH2 domain-binding protein 1; AltName: Full=Protein expressed in activated lymphocytes; Short=mPAL; AltName: Full=SHC-binding protein gi|4102877|gb|AAD01613.1| Shc binding protein [Mus musculus] gi|47683039|gb|AAH70455.1| Shc SH2-domain binding protein 1 [Mus musculus] gi|148690064|gb|EDL22011.1| Shc SH2-domain binding protein 1, isoform CRA_b [Mus musculus] Length = 668 Score = 34.7 bits (78), Expect = 6.8, Method: Composition-based stats. Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 6/62 (9%) Query: 72 IPSRKIIDYHRLLEQ-----KKNHNLQYSLINIPSQNKQESPKNANN-NILDHIALLKER 125 +PS I+DYH LL Q +K NL+ SL N S ++QE+ N+ I LK++ Sbjct: 239 VPSGLIVDYHNLLSQCEESYRKFLNLRSSLSNCNSDSEQENISMVEGLNLYSEIEQLKQK 298 Query: 126 LR 127 L+ Sbjct: 299 LK 300 >gi|74201453|dbj|BAE26159.1| unnamed protein product [Mus musculus] Length = 668 Score = 34.7 bits (78), Expect = 7.0, Method: Composition-based stats. Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 6/62 (9%) Query: 72 IPSRKIIDYHRLLEQ-----KKNHNLQYSLINIPSQNKQESPKNANN-NILDHIALLKER 125 +PS I+DYH LL Q +K NL+ SL N S ++QE+ N+ I LK++ Sbjct: 239 VPSGLIVDYHNLLSQCEESYRKFLNLRSSLSNCNSDSEQENISMVEGLNLYSEIEQLKQK 298 Query: 126 LR 127 L+ Sbjct: 299 LK 300 Searching..................................................done Results from round 2 CONVERGED! >gi|254780682|ref|YP_003065095.1| hypothetical protein CLIBASIA_02845 [Candidatus Liberibacter asiaticus str. psy62] gi|254040359|gb|ACT57155.1| hypothetical protein CLIBASIA_02845 [Candidatus Liberibacter asiaticus str. psy62] Length = 199 Score = 272 bits (695), Expect = 2e-71, Method: Composition-based stats. Identities = 199/199 (100%), Positives = 199/199 (100%) Query: 1 MRSTILFLIKHKLKLHKNIMSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIILILSP 60 MRSTILFLIKHKLKLHKNIMSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIILILSP Sbjct: 1 MRSTILFLIKHKLKLHKNIMSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIILILSP 60 Query: 61 TDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIA 120 TDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIA Sbjct: 61 TDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIA 120 Query: 121 LLKERLRTDINTFDNTNLETKIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLKIKN 180 LLKERLRTDINTFDNTNLETKIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLKIKN Sbjct: 121 LLKERLRTDINTFDNTNLETKIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLKIKN 180 Query: 181 NQEIKNIHHKKNKPRLHCQ 199 NQEIKNIHHKKNKPRLHCQ Sbjct: 181 NQEIKNIHHKKNKPRLHCQ 199 >gi|315122207|ref|YP_004062696.1| Peptidase M23 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495609|gb|ADR52208.1| Peptidase M23 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 344 Score = 177 bits (448), Expect = 1e-42, Method: Composition-based stats. Identities = 63/185 (34%), Positives = 90/185 (48%), Gaps = 30/185 (16%) Query: 41 HQIHKALMSISSGIILILSPTDCTANTANILIPSRKIIDYH-RLLEQKKNHNLQYSLINI 99 ++ +K L +LIL T N NIL+ KI +YH RL+EQK N Q +I I Sbjct: 9 NKYYKILAKTIFCFLLILVSTSDKTNAENILVLPSKITNYHNRLVEQKNNPEKQ--VIPI 66 Query: 100 PSQNKQESPKNANNNILDHIALLKERLRTDINT-FDNTNLETKIPLP----------NNL 148 P KQE P N NN +D LL ++ D N + + L KIPLP N+L Sbjct: 67 PLPTKQEYPSNTNNKTIDQNTLLAKKGFVDKNNLYSSMYLSKKIPLPNKCLLFPPDNNSL 126 Query: 149 KPNVCVKEKKLIPPRKNINNLK--------------DTNHRLKIKNNQEIKNIHHKKN-- 192 N C++ K +KNI++ + + K+KNNQ+IK+I +KKN Sbjct: 127 HLNNCIENKSPNSSKKNISHTRKIPKYKKNNPKKSGNIAPAFKVKNNQKIKHIQYKKNNY 186 Query: 193 KPRLH 197 P++H Sbjct: 187 SPQIH 191 >gi|113476744|ref|YP_722805.1| WD-40 repeat-containing protein [Trichodesmium erythraeum IMS101] gi|110167792|gb|ABG52332.1| WD-40 repeat [Trichodesmium erythraeum IMS101] Length = 1304 Score = 36.6 bits (83), Expect = 2.1, Method: Composition-based stats. Identities = 35/136 (25%), Positives = 64/136 (47%), Gaps = 18/136 (13%) Query: 68 ANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIALLKERLR 127 +L+ K+ +++ Q N L + + SQ+ Q KN +NNI+D E L Sbjct: 432 QKLLVNPFKLKNFN---NQVTNFQLGENFSGLESQDNQ---KNQSNNIIDPEQKQLENL- 484 Query: 128 TDINTFD----NTNLETKIP-----LPNNLKPNVCV--KEKKLIPPRKNINNLKDTNHRL 176 D+N FD N NLE I + ++ N + ++K+L P + I + K+ +++ Sbjct: 485 LDLNQFDSSVTNLNLEENISNLQLFENHQIQSNNLIHPQQKELETPGEKITHQKNNKNQV 544 Query: 177 KIKNNQEIKNIHHKKN 192 + K +++ HK N Sbjct: 545 RDKKAKKLAISQHKLN 560 >gi|114330989|ref|YP_747211.1| RND family efflux transporter MFP subunit [Nitrosomonas eutropha C91] gi|114308003|gb|ABI59246.1| efflux transporter, RND family, MFP subunit [Nitrosomonas eutropha C91] Length = 363 Score = 36.2 bits (82), Expect = 2.5, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 34/68 (50%), Gaps = 5/68 (7%) Query: 134 DNTNLETKIPLPNNL----KPNVCVKEKKLIPPRKNINNLKDTNHRLKIKNNQEIKNIHH 189 N +L +P P ++ +PN+ V+ + I+++++ + + +N+ I + + Sbjct: 202 SNQHLRAHLPFPESIASYIRPNLEVRLSTPTSDTEIISHIRELKPAIGV-SNRAIDALVY 260 Query: 190 KKNKPRLH 197 +N+P H Sbjct: 261 IENQPGWH 268 >gi|194751423|ref|XP_001958026.1| GF23720 [Drosophila ananassae] gi|190625308|gb|EDV40832.1| GF23720 [Drosophila ananassae] Length = 1134 Score = 35.8 bits (81), Expect = 3.3, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 39/91 (42%), Gaps = 4/91 (4%) Query: 87 KKNHNLQYSLINIPSQNKQESP-KNAN---NNILDHIALLKERLRTDINTFDNTNLETKI 142 K + L NIP NK+ P N N ++LD LK +T + + T + Sbjct: 610 KHGGKADFDLNNIPMPNKEIDPLPNTNLSSTDVLDSSTPLKPSQQTTLISNLKPKSLTAL 669 Query: 143 PLPNNLKPNVCVKEKKLIPPRKNINNLKDTN 173 PLP + V E+ P +K ++ K+ N Sbjct: 670 PLPPGMNALDLVNERSPSPAQKKESDEKNIN 700 >gi|123407421|ref|XP_001303005.1| mRNA capping enzyme, large subunit family protein [Trichomonas vaginalis G3] gi|121884347|gb|EAX90075.1| mRNA capping enzyme, large subunit family protein [Trichomonas vaginalis G3] Length = 355 Score = 35.8 bits (81), Expect = 3.5, Method: Composition-based stats. Identities = 12/54 (22%), Positives = 23/54 (42%) Query: 90 HNLQYSLINIPSQNKQESPKNANNNILDHIALLKERLRTDINTFDNTNLETKIP 143 + Q + + P +K+ PK ++D +E + D + +D TK P Sbjct: 300 YQKQGTFVPPPKPDKKSYPKTWTMKLIDAKTGAEELVNVDHDDYDTPRGRTKKP 353 >gi|77556686|gb|ABA99482.1| transposon protein, putative, CACTA, En/Spm sub-class, expressed [Oryza sativa Japonica Group] Length = 1091 Score = 35.0 bits (79), Expect = 6.2, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 48/106 (45%), Gaps = 15/106 (14%) Query: 60 PTDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHI 119 P+D +++ +++ R++ L +K+ + P+ K SP ++ + L Sbjct: 506 PSDGGSSSEGLVVSPRRMPT---PLLPRKSPS------PPPAHCKSPSPPSSKHQSLASS 556 Query: 120 ALLKERLRTDINTF----DNTNLETKIPLPNNLKPNVCVKEKKLIP 161 LK+R R + + K+PLPN L P+ +KEKK + Sbjct: 557 IPLKKRGRLCKTSQMSEPPSLKKPKKVPLPNMLVPS--LKEKKEMS 600 >gi|329927657|ref|ZP_08281804.1| hypothetical protein HMPREF9412_2972 [Paenibacillus sp. HGF5] gi|328938355|gb|EGG34746.1| hypothetical protein HMPREF9412_2972 [Paenibacillus sp. HGF5] Length = 386 Score = 34.7 bits (78), Expect = 6.6, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 57/125 (45%), Gaps = 18/125 (14%) Query: 20 MSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIILILSPTDCTANTANILIPSRKIID 79 +S G+ E++ + ++ + + +++ I+ GI+++ + N N+++ Sbjct: 115 ISGGRFTETYALLASMIDEIDYVVYE----INYGIVVV------SDNDPNVVVYPS---- 160 Query: 80 YHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIALLKERLRTDINTFDNTNLE 139 L+E+ + L++ P +N++ P NA+N + H L K L D + Sbjct: 161 ---LVEKLGQPIPRSWLVDFPDKNRESLPSNAHNWVTSHF-LNKWALYHDRDAISYRLFM 216 Query: 140 TKIPL 144 T+ P+ Sbjct: 217 TRTPM 221 >gi|307170585|gb|EFN62779.1| Transformation/transcription domain-associated protein [Camponotus floridanus] Length = 3826 Score = 34.7 bits (78), Expect = 6.7, Method: Composition-based stats. Identities = 33/130 (25%), Positives = 63/130 (48%), Gaps = 8/130 (6%) Query: 8 LIKHKLKLHKNIMSKGK---INE-SFWYIRTLFPYLSHQI--HKALMSISSGIILILSPT 61 + H +K+ I+ +G+ I+E + +R L + H++ ++ L I+ ++S Sbjct: 52 FLDHMMKIFLKILQEGEPHFISEYNIQQVRKLILEMIHRLPSNEYLRPYVRQILTLMSKL 111 Query: 62 DCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIAL 121 T N N+L+ R II+ H+ + N +QY L + S E PKN + I + Sbjct: 112 LETDNEENVLVCLRIIIELHKQYKPTFNPEIQYFLQFVKS-VYSELPKNL-SKIFEPRPA 169 Query: 122 LKERLRTDIN 131 L+ + ++IN Sbjct: 170 LRVKDLSEIN 179 >gi|193210429|ref|NP_509769.2| hypothetical protein F59F5.7 [Caenorhabditis elegans] gi|145292013|emb|CAA90661.3| C. elegans protein F59F5.7, partially confirmed by transcript evidence [Caenorhabditis elegans] gi|145292089|emb|CAA93477.3| C. elegans protein F59F5.7, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 896 Score = 34.7 bits (78), Expect = 6.7, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 44/109 (40%), Gaps = 1/109 (0%) Query: 82 RLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHIALLKERLRTDINT-FDNTNLET 140 ++ E K+N N+ + IP K+E N L ++ + + R + + + ++ Sbjct: 362 KVFENKENKNIDPLALPIPEPEKKEKVSKEVENTLKLLSTMPDEQRKMLESAIKDGEIDA 421 Query: 141 KIPLPNNLKPNVCVKEKKLIPPRKNINNLKDTNHRLKIKNNQEIKNIHH 189 PL NL N +E K + I ++ K + KN+ + Sbjct: 422 DSPLIKNLVKNDVTEESKKEKANRLIEWIRSNRPSTKTPESGIPKNVPY 470 >gi|221118968|ref|XP_002160131.1| PREDICTED: similar to DIRAS family, GTP-binding RAS-like 1 [Hydra magnipapillata] Length = 223 Score = 34.7 bits (78), Expect = 7.7, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 40/101 (39%), Gaps = 4/101 (3%) Query: 80 YHRLLE-QKKNHNLQYSLINIPSQNKQESPKNANNNILDHIALLKERLRTDINTFDNTNL 138 Y RLL+ + +NH + S++ +Q+ +N N +D A +KE +T + L Sbjct: 118 YQRLLKIRGENHEVYMSVVGTKMDKQQQYDENTKNEAVD--AFIKEIGGRAKHTITSAKL 175 Query: 139 ETKIPLPNNLKPNVCVKEKKLIPPR-KNINNLKDTNHRLKI 178 + + V K ++ L TN + I Sbjct: 176 NINVVEAFEAALSNVVANMTPNEDAIKRLDKLMKTNEKKSI 216 >gi|241762403|ref|ZP_04760482.1| DNA polymerase III, subunits gamma and tau [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241373090|gb|EER62739.1| DNA polymerase III, subunits gamma and tau [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 628 Score = 34.7 bits (78), Expect = 7.8, Method: Composition-based stats. Identities = 36/167 (21%), Positives = 68/167 (40%), Gaps = 18/167 (10%) Query: 36 FPYLSHQIHKALMSISSGIILILSPTDCTANTANILIPSRKIIDYHRLLEQ-KKNHNLQY 94 FP L H++ + L+ S + + P + +I + ++ D ++ Q KK ++ Sbjct: 336 FPVL-HRLWQLLLKGHSEVQNAVLPLEAAEMALLRVIHASQLPDPATIIAQLKKQGSVSP 394 Query: 95 SLINIPSQNKQESPKNANNNILDHIALLKERLRTDINTFDNTNLE---TKIPLP---NNL 148 +L P N+ P ++ +I + NTF N K LP N+ Sbjct: 395 NLSPSPEDNRLALPTSSFQDI--------KAGDVKKNTFKPVNSSGSQEKEALPKNRNDF 446 Query: 149 KPNVCVKEKKLIPPRKNINNLKDTNHRLKIKNNQEIKNIHHKKNKPR 195 ++ EKK P + NN + ++ NQ+ + +N+PR Sbjct: 447 SKSLQSTEKKDFSPFDDDNNSRVAASGYPVEGNQKFSEL--PENQPR 491 >gi|297729277|ref|NP_001177002.1| Os12g0551900 [Oryza sativa Japonica Group] gi|255670383|dbj|BAH95730.1| Os12g0551900 [Oryza sativa Japonica Group] Length = 853 Score = 34.3 bits (77), Expect = 9.3, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 48/106 (45%), Gaps = 15/106 (14%) Query: 60 PTDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLINIPSQNKQESPKNANNNILDHI 119 P+D +++ +++ R++ L +K+ + P+ K SP ++ + L Sbjct: 325 PSDGGSSSEGLVVSPRRMPT---PLLPRKSPS------PPPAHCKSPSPPSSKHQSLASS 375 Query: 120 ALLKERLRTDINTF----DNTNLETKIPLPNNLKPNVCVKEKKLIP 161 LK+R R + + K+PLPN L P+ +KEKK + Sbjct: 376 IPLKKRGRLCKTSQMSEPPSLKKPKKVPLPNMLVPS--LKEKKEMS 419 >gi|134107189|ref|XP_777725.1| hypothetical protein CNBA6030 [Cryptococcus neoformans var. neoformans B-3501A] gi|50260421|gb|EAL23078.1| hypothetical protein CNBA6030 [Cryptococcus neoformans var. neoformans B-3501A] Length = 3400 Score = 34.3 bits (77), Expect = 9.8, Method: Composition-based stats. Identities = 17/66 (25%), Positives = 29/66 (43%), Gaps = 1/66 (1%) Query: 38 YLSHQIHKALMSISSGIILILSPTDCTANTANILIPSRKIIDYHRLLEQKKNHNLQYSLI 97 Y SH ++ +I++ L+ SPT N+ + +K D R+ E K L Sbjct: 278 YRSHTHPQSASAIANKYPLMSSPTKGATNSETLPATPKKASDLIRMFESKSTPTK-GELP 336 Query: 98 NIPSQN 103 +P Q+ Sbjct: 337 PVPQQS 342 >gi|145523511|ref|XP_001447594.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124415105|emb|CAK80197.1| unnamed protein product [Paramecium tetraurelia] Length = 983 Score = 34.3 bits (77), Expect = 9.9, Method: Composition-based stats. Identities = 18/56 (32%), Positives = 31/56 (55%) Query: 86 QKKNHNLQYSLINIPSQNKQESPKNANNNILDHIALLKERLRTDINTFDNTNLETK 141 + KNH L+ L+N+ K E P NA ++ I LLK+ +D N ++++L + Sbjct: 815 RNKNHILEAELMNLKEAKKIEDPNNATKRLISEIQLLKQSQVSDNNNINDSSLRRE 870 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.314 0.133 0.367 Lambda K H 0.267 0.0412 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 3,588,469,637 Number of Sequences: 14124377 Number of extensions: 145337369 Number of successful extensions: 497266 Number of sequences better than 10.0: 829 Number of HSP's better than 10.0 without gapping: 79 Number of HSP's successfully gapped in prelim test: 1356 Number of HSP's that attempted gapping in prelim test: 492257 Number of HSP's gapped (non-prelim): 5857 length of query: 199 length of database: 4,842,793,630 effective HSP length: 132 effective length of query: 67 effective length of database: 2,978,375,866 effective search space: 199551183022 effective search space used: 199551183022 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.5 bits) S2: 77 (34.3 bits)