BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781163|ref|YP_003065576.1| hypothetical protein CLIBASIA_05350 [Candidatus Liberibacter asiaticus str. psy62] (214 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781163|ref|YP_003065576.1| hypothetical protein CLIBASIA_05350 [Candidatus Liberibacter asiaticus str. psy62] gi|254040840|gb|ACT57636.1| hypothetical protein CLIBASIA_05350 [Candidatus Liberibacter asiaticus str. psy62] Length = 214 Score = 425 bits (1094), Expect = e-117, Method: Composition-based stats. Identities = 214/214 (100%), Positives = 214/214 (100%) Query: 1 MQDIEDSIPKITSSTGKCLKFIMRAFGLFDSAEQRKLWSLLRYWSVEFWELLCRTFLWLL 60 MQDIEDSIPKITSSTGKCLKFIMRAFGLFDSAEQRKLWSLLRYWSVEFWELLCRTFLWLL Sbjct: 1 MQDIEDSIPKITSSTGKCLKFIMRAFGLFDSAEQRKLWSLLRYWSVEFWELLCRTFLWLL 60 Query: 61 CLGFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFSFVIALLGNPNIEMAKDIAQLFTLID 120 CLGFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFSFVIALLGNPNIEMAKDIAQLFTLID Sbjct: 61 CLGFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFSFVIALLGNPNIEMAKDIAQLFTLID 120 Query: 121 NIENGYGQYLEKISQKNVEKDFFNSKFFTHFKEYIDEVDKILMRYGIEGIPLTDFDHEKI 180 NIENGYGQYLEKISQKNVEKDFFNSKFFTHFKEYIDEVDKILMRYGIEGIPLTDFDHEKI Sbjct: 121 NIENGYGQYLEKISQKNVEKDFFNSKFFTHFKEYIDEVDKILMRYGIEGIPLTDFDHEKI 180 Query: 181 KSYLTHIKIVRDCLKKDNIHYAKELRNDWIGSNR 214 KSYLTHIKIVRDCLKKDNIHYAKELRNDWIGSNR Sbjct: 181 KSYLTHIKIVRDCLKKDNIHYAKELRNDWIGSNR 214 >gi|50305897|ref|XP_452909.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140] gi|49642042|emb|CAH01760.1| KLLA0C15873p [Kluyveromyces lactis] Length = 568 Score = 43.0 bits (100), Expect = 0.030, Method: Composition-based stats. Identities = 25/66 (37%), Positives = 38/66 (57%), Gaps = 1/66 (1%) Query: 39 SLLRYWSVEFWELLCRTFLWLLCLGFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFSFVI 98 ++L YW +F + LL +G +LFA +GE ++ S KI+LAI + LF+F+ Sbjct: 159 TILHYWRDDFSPAIPLCVQVLLYIGI-SLFAVRCYGETEAFLASFKILLAIGLFLFTFIT 217 Query: 99 ALLGNP 104 L GNP Sbjct: 218 MLGGNP 223 >gi|170065173|ref|XP_001867830.1| sterol desaturase [Culex quinquefasciatus] gi|167882282|gb|EDS45665.1| sterol desaturase [Culex quinquefasciatus] Length = 340 Score = 39.6 bits (91), Expect = 0.29, Method: Composition-based stats. Identities = 26/89 (29%), Positives = 40/89 (44%), Gaps = 13/89 (14%) Query: 9 PKITSSTGKCLKFIMRAFGLFDSAEQRKLWSLLRYW--SVEFWELLCRTF---------- 56 P++ SS L + F + W L R+W S +FW+ F Sbjct: 76 PRLLSSLYNFLVIVSIFLVGFAAFRNTLTWHLQRFWGASGDFWQAHWDRFIDFTGEDPAT 135 Query: 57 LWLLCLGFFTLFAYWFFGEKFS-YDVSIK 84 LW+L FT+F YWFFG ++ +D++ K Sbjct: 136 LWVLSTTIFTIFVYWFFGGIYTVFDLTCK 164 >gi|157132002|ref|XP_001662401.1| sterol desaturase [Aedes aegypti] gi|108871316|gb|EAT35541.1| sterol desaturase [Aedes aegypti] Length = 360 Score = 39.6 bits (91), Expect = 0.31, Method: Composition-based stats. Identities = 21/60 (35%), Positives = 31/60 (51%), Gaps = 13/60 (21%) Query: 38 WSLLRYW--SVEFWELLCRTF----------LWLLCLGFFTLFAYWFFGEKFS-YDVSIK 84 W L R+W S +FW+ F LW+L FT+F YWFFG ++ +D++ K Sbjct: 74 WHLQRFWGASGDFWQAHWDRFIDFTGEDPATLWVLSTTLFTVFVYWFFGGIYTIFDLTCK 133 >gi|254580801|ref|XP_002496386.1| ZYRO0C17182p [Zygosaccharomyces rouxii] gi|238939277|emb|CAR27453.1| ZYRO0C17182p [Zygosaccharomyces rouxii] Length = 578 Score = 38.8 bits (89), Expect = 0.55, Method: Composition-based stats. Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 1/66 (1%) Query: 39 SLLRYWSVEFWELLCRTFLWLLCLGFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFSFVI 98 +++ YW ++ + L+ GF +L ++GE + S KIVLAI + F+FV Sbjct: 154 TIIHYWRDDY-NAAIPLVVQLVLYGFISLCTVRYYGEAEFWLASFKIVLAIGLYFFTFVT 212 Query: 99 ALLGNP 104 L GNP Sbjct: 213 MLGGNP 218 >gi|254781075|ref|YP_003065488.1| hypothetical protein CLIBASIA_04885 [Candidatus Liberibacter asiaticus str. psy62] gi|254040752|gb|ACT57548.1| hypothetical protein CLIBASIA_04885 [Candidatus Liberibacter asiaticus str. psy62] Length = 266 Score = 38.8 bits (89), Expect = 0.57, Method: Composition-based stats. Identities = 49/238 (20%), Positives = 90/238 (37%), Gaps = 66/238 (27%) Query: 34 QRKLWSLLRYWSVEFWELLCRTFLWLLCLGFFTLFAYWFFG-----------EKFSYDVS 82 R++W LLR +F RTF C FF Y FF + Y +S Sbjct: 32 HRRIWHLLRD---DFGYFTKRTFF---CFAFFITIIYVFFPIFINFLISYIYKNIGYKIS 85 Query: 83 IK-----------IVLAINVLLFSFVIALLGNPNIEMAKDIAQLFTLIDNIENGYGQYLE 131 + I + +L S ++ + + M +D ++ +I +I + QY E Sbjct: 86 SQYYSLTEYTWHDIETFLALLYMSCILVATDDRKLRMIEDGDRIIEIIQSIRRAFDQYKE 145 Query: 132 --KISQKNVEKDFFNSK-----FFTHFKEYI----------------------------- 155 + + ++++ N F + +EYI Sbjct: 146 EERKALSDIQEQLLNDASKKIVFSSDREEYIGLHQKQIDLIHQNIDRHLARMKNQLVAES 205 Query: 156 DEVDKILMRYGIEGIPLT--DFDHEKIKSYLTHIKIVRDCLKKDNIHYAKELRNDWIG 211 DE++ IL RY IE IPL + ++ Y+ H+ ++ + + I ++ R ++IG Sbjct: 206 DELEDILSRYSIESIPLDLPEMSQDEAIKYINHVYLLGKWISRGEIQRSQIARREYIG 263 >gi|302894925|ref|XP_003046343.1| hypothetical protein NECHADRAFT_32512 [Nectria haematococca mpVI 77-13-4] gi|256727270|gb|EEU40630.1| hypothetical protein NECHADRAFT_32512 [Nectria haematococca mpVI 77-13-4] Length = 548 Score = 38.4 bits (88), Expect = 0.75, Method: Composition-based stats. Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 1/81 (1%) Query: 25 AFGLFDSAEQRKLWSLLRYWSVEFWELLCRTFLWLLCLGFFTLFAYWFFGEKFSYDVSIK 84 A+ L AE + L YW+ + T ++ LG + A FFGE + SIK Sbjct: 129 AYALLIGAEASAVAILFDYWNTNVHSAVWLTIYLVVILGL-NIIAVSFFGEAEFWFASIK 187 Query: 85 IVLAINVLLFSFVIALLGNPN 105 ++ + ++L VI L G PN Sbjct: 188 LITIVGLILTGLVIMLGGAPN 208 >gi|67528244|ref|XP_661932.1| hypothetical protein AN4328.2 [Aspergillus nidulans FGSC A4] gi|40741299|gb|EAA60489.1| hypothetical protein AN4328.2 [Aspergillus nidulans FGSC A4] gi|259482865|tpe|CBF77751.1| TPA: amino acid transporter (Eurofung) [Aspergillus nidulans FGSC A4] Length = 581 Score = 38.0 bits (87), Expect = 0.76, Method: Composition-based stats. Identities = 18/49 (36%), Positives = 30/49 (61%) Query: 56 FLWLLCLGFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFSFVIALLGNP 104 F L L F +FA ++GE + S K++L++ ++LF+F+ L GNP Sbjct: 191 FFVLTSLRFLNVFAVKWYGESEFWLASSKVLLSVGLILFTFITMLGGNP 239 >gi|302306333|ref|NP_982579.2| AAR038Wp [Ashbya gossypii ATCC 10895] gi|299788467|gb|AAS50403.2| AAR038Wp [Ashbya gossypii ATCC 10895] Length = 566 Score = 38.0 bits (87), Expect = 0.80, Method: Composition-based stats. Identities = 22/69 (31%), Positives = 39/69 (56%), Gaps = 7/69 (10%) Query: 39 SLLRYWSVEF---WELLCRTFLWLLCLGFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFS 95 S++ YW ++ L+ + L+LL +LFA ++GE + S K++LA+ + F+ Sbjct: 159 SIIHYWRDDYSAAITLVVQVMLYLL----ISLFAVRYYGEIEFWLASFKVLLAVGLFCFT 214 Query: 96 FVIALLGNP 104 FV + GNP Sbjct: 215 FVTMVGGNP 223 >gi|302691234|ref|XP_003035296.1| hypothetical protein SCHCODRAFT_50914 [Schizophyllum commune H4-8] gi|300108992|gb|EFJ00394.1| hypothetical protein SCHCODRAFT_50914 [Schizophyllum commune H4-8] Length = 472 Score = 38.0 bits (87), Expect = 0.87, Method: Composition-based stats. Identities = 21/65 (32%), Positives = 36/65 (55%), Gaps = 1/65 (1%) Query: 40 LLRYWSVEFWELLCRTFLWLLCLGFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFSFVIA 99 +LR+W+ + F+ L+C FA ++GE + K++LA+ ++ F+FVI Sbjct: 95 VLRFWTDKIPVEAVFVFI-LVCYFLLNFFAVQWYGESEFWLAIGKVILAVGLICFTFVIM 153 Query: 100 LLGNP 104 L GNP Sbjct: 154 LGGNP 158 >gi|319902961|ref|YP_004162689.1| integrase family protein [Bacteroides helcogenes P 36-108] gi|319417992|gb|ADV45103.1| integrase family protein [Bacteroides helcogenes P 36-108] Length = 410 Score = 37.6 bits (86), Expect = 1.3, Method: Composition-based stats. Identities = 24/86 (27%), Positives = 46/86 (53%), Gaps = 12/86 (13%) Query: 138 VEKDF--FNSKFFTHFKEYIDEVDKILMRYGIEGIPLTDFDHEKIKSYLTHIKIVRDCLK 195 V KD+ + + + K+Y++EV I +Y + +PLT+ + E ++++ ++K R+C + Sbjct: 128 VGKDYALITVRRYDNCKKYLNEV--IRYKYAKDDLPLTEVNGELVRAFEFYLKTERECQQ 185 Query: 196 KDNIHYAKELR--------NDWIGSN 213 I Y K L+ N+WI N Sbjct: 186 NTVIRYMKCLKKIINLALSNEWITKN 211 >gi|289613558|emb|CBI59498.1| unnamed protein product [Sordaria macrospora] Length = 569 Score = 37.3 bits (85), Expect = 1.6, Method: Composition-based stats. Identities = 17/53 (32%), Positives = 33/53 (62%), Gaps = 1/53 (1%) Query: 55 TFLWLLCLGFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFSFVIALLGNPNIE 107 + +WL+ + F +FA +GE + S+KI+ + +L+ +F++ L GNP+ E Sbjct: 174 SIIWLVII-FLNVFAVSLYGEAEFWFASLKIITIVGLLIMAFIVDLGGNPHHE 225 >gi|310798862|gb|EFQ33755.1| amino acid permease [Glomerella graminicola M1.001] Length = 555 Score = 36.9 bits (84), Expect = 1.9, Method: Composition-based stats. Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 1/81 (1%) Query: 25 AFGLFDSAEQRKLWSLLRYWSVEFWELLCRTFLWLLCLGFFTLFAYWFFGEKFSYDVSIK 84 A+ + AE LL YW + T + L+ L +FA FGE + SIK Sbjct: 139 AYAMLVGAEASAGAILLDYWQSPVPPAVWITIILLVTLAL-NIFAVEIFGEAEFWFASIK 197 Query: 85 IVLAINVLLFSFVIALLGNPN 105 ++ + ++L SFVI L G P+ Sbjct: 198 LITILGLILVSFVIILGGAPD 218 >gi|145247622|ref|XP_001396060.1| proline permease PrnB [Aspergillus niger CBS 513.88] gi|134080799|emb|CAL00913.1| unnamed protein product [Aspergillus niger] Length = 538 Score = 36.5 bits (83), Expect = 2.2, Method: Composition-based stats. Identities = 26/83 (31%), Positives = 41/83 (49%), Gaps = 5/83 (6%) Query: 25 AFGLFDSAEQRKLWSLLRYWS--VEFWELLCRTFLWLLCLGFFTLFAYWFFGEKFSYDVS 82 +FG+ + E ++ YWS V + + +L L FF + Y GE + S Sbjct: 125 SFGILVAYETTAAALVINYWSNPVPLAVWITLMLIVVLALNFFPVKVY---GETEFWFAS 181 Query: 83 IKIVLAINVLLFSFVIALLGNPN 105 +K+ L I +L+ SFV+ L G PN Sbjct: 182 LKVFLIIGLLILSFVLFLGGGPN 204 >gi|154304107|ref|XP_001552459.1| hypothetical protein BC1G_09689 [Botryotinia fuckeliana B05.10] gi|150854324|gb|EDN29516.1| hypothetical protein BC1G_09689 [Botryotinia fuckeliana B05.10] Length = 462 Score = 36.5 bits (83), Expect = 2.3, Method: Composition-based stats. Identities = 16/42 (38%), Positives = 26/42 (61%) Query: 63 GFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFSFVIALLGNP 104 GF LFA ++GE Y K++L + +++F+F+ L GNP Sbjct: 188 GFINLFAVRWYGESEFYLAMGKVILIVGLIIFTFITMLGGNP 229 >gi|145498648|ref|XP_001435311.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124402442|emb|CAK67914.1| unnamed protein product [Paramecium tetraurelia] Length = 1263 Score = 36.5 bits (83), Expect = 2.4, Method: Composition-based stats. Identities = 34/102 (33%), Positives = 53/102 (51%), Gaps = 6/102 (5%) Query: 78 SYDVSIKIVLA---INVLLFSFVIALLGNPNIEMAKDIAQLFTLIDNIE-NGYGQYLEKI 133 ++D ++IV I +L+F ++ L I K I Q+F +DN + + Q LE + Sbjct: 864 AFDNQVQIVFYSFYIGILVFYTMLMLTLARIIYKIKQILQIFQRLDNRDIDDELQRLEFV 923 Query: 134 SQK-NVEKDFFNSKFFTHFKEYIDEVDKILMRYGIEGIPLTD 174 SQ +E+ +FNSKFF F E +V K + G + I L D Sbjct: 924 SQNVKIEEFWFNSKFFQSFYEMSQDVSKTQTQ-GSQNIKLDD 964 >gi|169603363|ref|XP_001795103.1| hypothetical protein SNOG_04691 [Phaeosphaeria nodorum SN15] gi|160706377|gb|EAT88451.2| hypothetical protein SNOG_04691 [Phaeosphaeria nodorum SN15] Length = 1406 Score = 36.5 bits (83), Expect = 2.4, Method: Composition-based stats. Identities = 24/81 (29%), Positives = 40/81 (49%), Gaps = 1/81 (1%) Query: 25 AFGLFDSAEQRKLWSLLRYWSVEFWELLCRTFLWLLCLGFFTLFAYWFFGEKFSYDVSIK 84 A+ + +AE LL YW + T ++L+ + F + A FGE + SIK Sbjct: 140 AYAILVAAEATAGAILLSYWETPVHNAVWIT-IFLVVVLFLNIVAVEVFGEAEFWFASIK 198 Query: 85 IVLAINVLLFSFVIALLGNPN 105 + + +++ FVI L G+PN Sbjct: 199 FITIMGLIILGFVIMLGGSPN 219 >gi|123479432|ref|XP_001322874.1| hypothetical protein [Trichomonas vaginalis G3] gi|121905728|gb|EAY10651.1| conserved hypothetical protein [Trichomonas vaginalis G3] Length = 545 Score = 36.5 bits (83), Expect = 2.5, Method: Composition-based stats. Identities = 18/60 (30%), Positives = 33/60 (55%), Gaps = 6/60 (10%) Query: 112 IAQLFTLIDNIENGYGQYLEKISQKNVEKDFFNSKFFTHFKEYIDEVDKILMRYGIEGIP 171 I LFT ++ ++ + +Y E++ +KN N KF + Y+ + +I +YGI G+P Sbjct: 18 IVVLFTRDEDKDDDFNKYFEELKKKN------NYKFTVFYVLYVQDAAEIFKQYGITGVP 71 >gi|227872719|ref|ZP_03991046.1| conserved hypothetical protein [Oribacterium sinus F0268] gi|227841444|gb|EEJ51747.1| conserved hypothetical protein [Oribacterium sinus F0268] Length = 400 Score = 36.5 bits (83), Expect = 2.7, Method: Composition-based stats. Identities = 32/127 (25%), Positives = 64/127 (50%), Gaps = 8/127 (6%) Query: 36 KLWSLLRYWSVEFWELLCRTFLWLLCLGFFTLFAYWFF-GEKFSYDVSIKIVLAINVLLF 94 KL+++L Y V F E+L FL+LL G + Y+F + + V I + +++L+F Sbjct: 157 KLFAIL-YKPVSFSEVLGMIFLFLLFFGSIPISIYYFLMPHRILFLVGIYL---LDILIF 212 Query: 95 SFVIALLGNPNIEMAKDIAQLFTLIDNIENGYGQYLEKISQKNVEKDFFNSKFFTHFKEY 154 + +GN + +D+ I N + ++++ +K + KD S + H +E+ Sbjct: 213 GGLYVFIGNRTVGKFRDVVNQGAEIRKKINRNKKQMKRL-KKEINKDSNESHY--HLEEF 269 Query: 155 IDEVDKI 161 DE+ ++ Sbjct: 270 DDELSRL 276 >gi|291514108|emb|CBK63318.1| Site-specific recombinase XerD [Alistipes shahii WAL 8301] Length = 409 Score = 36.5 bits (83), Expect = 2.8, Method: Composition-based stats. Identities = 20/70 (28%), Positives = 37/70 (52%), Gaps = 10/70 (14%) Query: 152 KEYIDEVDKILMRYGIEGIPLTDFDHEKIKSYLTHIKIVRDCLKKDNIHYAKELR----- 206 + Y+ E+ I RYG E +PLT+ + E ++++ ++K + C + I Y K L+ Sbjct: 143 RRYLAEL--IRQRYGKEDLPLTEVNGELVRAFAFYLKTEKGCQQNTVIRYMKCLKKITNL 200 Query: 207 ---NDWIGSN 213 NDW+ + Sbjct: 201 ACANDWMAKD 210 >gi|210609809|ref|ZP_03288116.1| hypothetical protein CLONEX_00300 [Clostridium nexile DSM 1787] gi|210152736|gb|EEA83742.1| hypothetical protein CLONEX_00300 [Clostridium nexile DSM 1787] Length = 488 Score = 36.5 bits (83), Expect = 2.8, Method: Composition-based stats. Identities = 29/114 (25%), Positives = 53/114 (46%), Gaps = 20/114 (17%) Query: 85 IVLAINVLLFSFVIALLGNPNIEMAKDIAQLFTLIDNIENGYGQYLEKISQKNVEKDFFN 144 I+L + VLL +++ L N +M K A++ I+++ GY + D+ Sbjct: 191 IILGMTVLLLVVIVSFLSN---KMVKPFARITQAIEDVTEGY------------DNDYLK 235 Query: 145 SKFFTHFKEYIDEVDKILMRYGIEGIPLTDF----DHEKIKSYLTHIKIVRDCL 194 +T K+ D +K+L R + +F HE +K+ LT +K++ D L Sbjct: 236 ENAYTETKQISDAFNKMLGRLKVLDDSRQEFVSNVSHE-LKTPLTSMKVLADSL 288 >gi|312215355|emb|CBX95307.1| similar to proline-specific permease [Leptosphaeria maculans] Length = 564 Score = 36.1 bits (82), Expect = 3.0, Method: Composition-based stats. Identities = 24/81 (29%), Positives = 40/81 (49%), Gaps = 1/81 (1%) Query: 25 AFGLFDSAEQRKLWSLLRYWSVEFWELLCRTFLWLLCLGFFTLFAYWFFGEKFSYDVSIK 84 A+ + +AE LL YW + T ++L+ + F + A FGE + SIK Sbjct: 145 AYAILVAAEATAGAILLDYWETPVHSAVWIT-IFLVVVLFLNIVAVEVFGEAEFWFASIK 203 Query: 85 IVLAINVLLFSFVIALLGNPN 105 + + +++ FVI L G+PN Sbjct: 204 FITIMGLIILGFVIMLGGSPN 224 >gi|154491554|ref|ZP_02031180.1| hypothetical protein PARMER_01165 [Parabacteroides merdae ATCC 43184] gi|154088355|gb|EDN87400.1| hypothetical protein PARMER_01165 [Parabacteroides merdae ATCC 43184] Length = 411 Score = 36.1 bits (82), Expect = 3.1, Method: Composition-based stats. Identities = 20/67 (29%), Positives = 38/67 (56%), Gaps = 10/67 (14%) Query: 152 KEYIDEVDKILMRYGIEGIPLTDFDHEKIKSYLTHIKIVRDCLKKDNIHYAKELR----- 206 K Y+ E+ I ++YG E +PL++ + E ++S+ ++K ++C + I Y K L+ Sbjct: 144 KRYLAEL--IKLKYGKEDLPLSEVNGELVRSFEFYLKTEKECQQNTVIRYMKCLKKITNL 201 Query: 207 ---NDWI 210 N+WI Sbjct: 202 ALANEWI 208 >gi|89898040|ref|YP_515150.1| insulinase family metalloproteinase [Chlamydophila felis Fe/C-56] gi|89331412|dbj|BAE81005.1| insulinase family metalloproteinase [Chlamydophila felis Fe/C-56] Length = 974 Score = 36.1 bits (82), Expect = 3.3, Method: Composition-based stats. Identities = 18/70 (25%), Positives = 39/70 (55%), Gaps = 1/70 (1%) Query: 131 EKISQKNVEKDFFNSKFFTHFKEYIDEVDKILMRYGIEGIPLTDFDHEKIKSYLTHIKIV 190 EK+S +N+EK SK ++ +++D+IL + ++ +P + + +K ++H +++ Sbjct: 482 EKLSPENIEKIRLTSKVLEEYQSQNEDLDRILPNFSLDKVPNSGKEFPLVKKNVSHGEVL 541 Query: 191 -RDCLKKDNI 199 DC D I Sbjct: 542 HHDCFTNDLI 551 >gi|296272538|ref|YP_003655169.1| hypothetical protein Arnit_0999 [Arcobacter nitrofigilis DSM 7299] gi|296096712|gb|ADG92662.1| hypothetical protein Arnit_0999 [Arcobacter nitrofigilis DSM 7299] Length = 254 Score = 36.1 bits (82), Expect = 3.7, Method: Composition-based stats. Identities = 20/62 (32%), Positives = 33/62 (53%), Gaps = 2/62 (3%) Query: 105 NIEMAKDIAQLFTLIDNIENGYGQYLEKISQKNVEKDFFNSKFFTHFKEYIDEVDKILMR 164 N MA+ I +LF I+NI++ + L + +N+ F+ F F+ +DE D + R Sbjct: 137 NKAMAEVIDELFVGIENIDDLENKILTSLETENILCSFYEKYIFKRFERNLDEYD--IQR 194 Query: 165 YG 166 YG Sbjct: 195 YG 196 >gi|319935792|ref|ZP_08010221.1| hypothetical protein HMPREF9488_01052 [Coprobacillus sp. 29_1] gi|319809227|gb|EFW05676.1| hypothetical protein HMPREF9488_01052 [Coprobacillus sp. 29_1] Length = 379 Score = 35.7 bits (81), Expect = 4.6, Method: Composition-based stats. Identities = 39/137 (28%), Positives = 62/137 (45%), Gaps = 17/137 (12%) Query: 75 EKFSYDVSIKIVLAINVLLFSFVIALLGNPNIEMAKDIAQLFTLIDNIENGYGQY----L 130 EK+++D +I+I+L + L + + G I +LF L D IEN +Y Sbjct: 224 EKYNFDDNIEIILNDDDLAYKSISN--GKKTISHEFKNYKLFDLDDYIENNPDEYEKAVT 281 Query: 131 EKISQKNVEKDFFNSKFFTHFKEYIDEVDKILMRYGIEGIPLTDFDHEKIKSYLTHIKIV 190 E++S K EK + HF I+E K YG G +F K+ + + I Sbjct: 282 ERLSWKRPEKSW-------HFDCIIEEF-KDFFYYGTLGYTENEF---KLSQMIRYNLIS 330 Query: 191 RDCLKKDNIHYAKELRN 207 RD K+ HY +++N Sbjct: 331 RDDALKEIYHYRNKMKN 347 >gi|307564627|ref|ZP_07627161.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A] gi|307346672|gb|EFN91975.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A] Length = 688 Score = 35.7 bits (81), Expect = 4.6, Method: Composition-based stats. Identities = 21/62 (33%), Positives = 33/62 (53%), Gaps = 5/62 (8%) Query: 110 KDIAQLFTLI----DNIENGYGQYLEKISQKNVE-KDFFNSKFFTHFKEYIDEVDKILMR 164 K++ +LF I DNIE Y Y E + +V+ +D+ N+ + FK IDE K+ R Sbjct: 587 KNVKRLFAWIMLKDDNIEKAYNVYTELLEMPSVKAEDYLNAGYSRWFKGDIDEASKLFKR 646 Query: 165 YG 166 + Sbjct: 647 WA 648 >gi|194246669|ref|YP_002004308.1| NAD-dependent DNA ligase [Candidatus Phytoplasma mali] gi|254781388|sp|B3QZS5|DNLJ_PHYMT RecName: Full=DNA ligase; AltName: Full=Polydeoxyribonucleotide synthase [NAD+] gi|193807026|emb|CAP18462.1| NAD-dependent DNA ligase [Candidatus Phytoplasma mali] Length = 659 Score = 35.3 bits (80), Expect = 4.9, Method: Composition-based stats. Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 7/81 (8%) Query: 73 FGEKFSYDVSIKIVLAINVLLFSFVIAL-LGNPNIEMAKDIAQLFTLIDNIENGYGQYLE 131 FG+K ++ I + + N L +F+I L + N I +AK +AQ F IDN++ + L Sbjct: 478 FGKKKFNNIIISLEKSKNKCLSNFLIGLGIKNVGIHLAKILAQKFENIDNLQKASIESLL 537 Query: 132 KI------SQKNVEKDFFNSK 146 KI S +N++ F NSK Sbjct: 538 KIDEIGIKSAQNIKNFFLNSK 558 >gi|157115589|ref|XP_001652623.1| hypothetical protein AaeL_AAEL007270 [Aedes aegypti] gi|108876843|gb|EAT41068.1| hypothetical protein AaeL_AAEL007270 [Aedes aegypti] Length = 1413 Score = 35.3 bits (80), Expect = 4.9, Method: Composition-based stats. Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 2/75 (2%) Query: 87 LAINVLLF--SFVIALLGNPNIEMAKDIAQLFTLIDNIENGYGQYLEKISQKNVEKDFFN 144 + ++VLLF ++ + +P + D+ Q+F LI + + +Y KI QK V+ FN Sbjct: 268 IKLDVLLFLKYILLHIFVDPEKSVLSDMHQVFQLIKYVFHLLKKYFIKIDQKLVQDFDFN 327 Query: 145 SKFFTHFKEYIDEVD 159 F KE++++ D Sbjct: 328 GIFTIKLKEFLEKYD 342 >gi|302907990|ref|XP_003049769.1| hypothetical protein NECHADRAFT_102931 [Nectria haematococca mpVI 77-13-4] gi|256730705|gb|EEU44056.1| hypothetical protein NECHADRAFT_102931 [Nectria haematococca mpVI 77-13-4] Length = 542 Score = 35.3 bits (80), Expect = 6.0, Method: Composition-based stats. Identities = 19/50 (38%), Positives = 29/50 (58%), Gaps = 2/50 (4%) Query: 56 FLWLLCLGFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFSFVIALLGNPN 105 FL ++C+ F FFGE + S K++ I ++LFS V+AL G P+ Sbjct: 141 FLVIICV--INYFGIRFFGELEFWLSSFKVITIIGIILFSLVVALGGAPD 188 >gi|254571937|ref|XP_002493078.1| High affinity polyamine permease [Pichia pastoris GS115] gi|238032876|emb|CAY70899.1| High affinity polyamine permease [Pichia pastoris GS115] gi|328352907|emb|CCA39305.1| High-affinity glutamine permease [Pichia pastoris CBS 7435] Length = 570 Score = 35.3 bits (80), Expect = 6.0, Method: Composition-based stats. Identities = 19/69 (27%), Positives = 37/69 (53%), Gaps = 7/69 (10%) Query: 39 SLLRYWSVEF---WELLCRTFLWLLCLGFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFS 95 +++ YW ++ + +T L++L F +F +FGE + K++L I +L F+ Sbjct: 158 TMIHYWRSDYSPAITFVVQTLLYVL----FNVFTVKWFGESEFWMSITKLILCIGLLFFT 213 Query: 96 FVIALLGNP 104 F+ + GNP Sbjct: 214 FITMVGGNP 222 >gi|221481712|gb|EEE20088.1| ankyrin repeat-containing protein, putative [Toxoplasma gondii GT1] gi|221502222|gb|EEE27960.1| ankyrin repeat and FYVE domain containing protein [Toxoplasma gondii VEG] Length = 714 Score = 34.9 bits (79), Expect = 6.5, Method: Composition-based stats. Identities = 20/55 (36%), Positives = 32/55 (58%), Gaps = 2/55 (3%) Query: 107 EMAKDIAQLFTLIDNIENGYGQYLEKISQKNVEKDFFNSKFFTHFKEYIDEVDKI 161 + KD+ Q+F + I N G ++ I KN++K F N T F++Y+D+ DKI Sbjct: 509 QFLKDVEQVFINCE-IYNQQGSWVWSIG-KNMQKFFTNQVMITRFRDYVDKYDKI 561 >gi|237832321|ref|XP_002365458.1| 26S proteasome non-ATPase regulatory subunit, putative [Toxoplasma gondii ME49] gi|211963122|gb|EEA98317.1| 26S proteasome non-ATPase regulatory subunit, putative [Toxoplasma gondii ME49] Length = 714 Score = 34.9 bits (79), Expect = 6.5, Method: Composition-based stats. Identities = 20/55 (36%), Positives = 32/55 (58%), Gaps = 2/55 (3%) Query: 107 EMAKDIAQLFTLIDNIENGYGQYLEKISQKNVEKDFFNSKFFTHFKEYIDEVDKI 161 + KD+ Q+F + I N G ++ I KN++K F N T F++Y+D+ DKI Sbjct: 509 QFLKDVEQVFINCE-IYNQQGSWVWSIG-KNMQKFFTNQVMITRFRDYVDKYDKI 561 >gi|222529318|ref|YP_002573200.1| hydroxymethylbutenyl pyrophosphate reductase [Caldicellulosiruptor bescii DSM 6725] gi|222456165|gb|ACM60427.1| hydroxymethylbutenyl pyrophosphate reductase [Caldicellulosiruptor bescii DSM 6725] Length = 663 Score = 34.9 bits (79), Expect = 7.0, Method: Composition-based stats. Identities = 27/116 (23%), Positives = 54/116 (46%), Gaps = 7/116 (6%) Query: 95 SFVIALLGNPNIEMAKDIAQLFTLIDNIENGYGQYLEKISQKNVEKDFFNSKFFTHFKEY 154 S+VI L + +++ +DI Q+ +G +E+ ++ D K + Y Sbjct: 44 SYVIDKLKDLGVKVVEDIEQIGKEDIIFIRSHGVSMEEYAEIEKRAD----KVYDFTCPY 99 Query: 155 IDEVDKILMRYGIEG---IPLTDFDHEKIKSYLTHIKIVRDCLKKDNIHYAKELRN 207 + ++ +I+M + + G I + D +H ++K + H+ R C D+I KE N Sbjct: 100 VKKIHEIVMEHSVNGYDIIVVGDMNHPEVKGIVGHVGNNRKCFVVDSIEKVKEAIN 155 >gi|329121331|ref|ZP_08249957.1| phosphoribosylglycinamide formyltransferase [Dialister micraerophilus DSM 19965] gi|327469740|gb|EGF15206.1| phosphoribosylglycinamide formyltransferase [Dialister micraerophilus DSM 19965] Length = 207 Score = 34.9 bits (79), Expect = 7.2, Method: Composition-based stats. Identities = 20/63 (31%), Positives = 35/63 (55%), Gaps = 2/63 (3%) Query: 113 AQLFTLIDNIENGYGQYLEKISQKNVEKDFFNSKFFTHFKEYIDEVDKILMRYGIEGIPL 172 A++ + N +N YG LE+ + N+E + N K F+ +EY +E+ K+L I+ I L Sbjct: 29 AKIKLVFSNKKNAYG--LERAKKANIETLYLNRKNFSKSEEYDEEILKVLKEKDIDLIVL 86 Query: 173 TDF 175 + Sbjct: 87 AGY 89 >gi|313892332|ref|ZP_07825924.1| phosphoribosylglycinamide formyltransferase [Dialister microaerophilus UPII 345-E] gi|313119191|gb|EFR42391.1| phosphoribosylglycinamide formyltransferase [Dialister microaerophilus UPII 345-E] Length = 207 Score = 34.9 bits (79), Expect = 7.2, Method: Composition-based stats. Identities = 20/63 (31%), Positives = 35/63 (55%), Gaps = 2/63 (3%) Query: 113 AQLFTLIDNIENGYGQYLEKISQKNVEKDFFNSKFFTHFKEYIDEVDKILMRYGIEGIPL 172 A++ + N +N YG LE+ + N+E + N K F+ +EY +E+ K+L I+ I L Sbjct: 29 AKIKLVFSNKKNAYG--LERAKKANIETLYLNRKNFSKSEEYDEEILKVLKEKDIDLIVL 86 Query: 173 TDF 175 + Sbjct: 87 AGY 89 >gi|255012270|ref|ZP_05284396.1| putative bacteriophage integrase [Bacteroides sp. 2_1_7] Length = 411 Score = 34.9 bits (79), Expect = 7.5, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 43/86 (50%), Gaps = 12/86 (13%) Query: 138 VEKDF--FNSKFFTHFKEYIDEVDKILMRYGIEGIPLTDFDHEKIKSYLTHIKIVRDCLK 195 V KDF + + K Y+ E+ KI +Y E +PL D + E I+++ ++K ++C + Sbjct: 128 VGKDFALVTVRRYESCKRYLAELIKI--KYDKEDLPLKDINGEFIRAFDFYLKTEKECSQ 185 Query: 196 KDNIHYAKELR--------NDWIGSN 213 I Y K L+ N+WI + Sbjct: 186 NTVIRYMKCLKKITNLSLANEWINKD 211 >gi|260890589|ref|ZP_05901852.1| potassium uptake protein, Trk family [Leptotrichia hofstadii F0254] gi|260859634|gb|EEX74134.1| potassium uptake protein, Trk family [Leptotrichia hofstadii F0254] Length = 452 Score = 34.9 bits (79), Expect = 8.1, Method: Composition-based stats. Identities = 23/68 (33%), Positives = 36/68 (52%), Gaps = 4/68 (5%) Query: 98 IALLGNPNIEMAKDIAQ--LFTLIDNIENGYGQYLEKISQKNVEKDFFNSKFFTHFKEYI 155 I+L+ NP E AK+IA +F + ++EN +GQ IS + + F N H + Sbjct: 116 ISLMLNPEFESAKNIANKLMFPVALSVENFFGQKANFISIRVEKHSFLNGTQLKHLE--F 173 Query: 156 DEVDKILM 163 D DKI++ Sbjct: 174 DPQDKIII 181 >gi|213406718|ref|XP_002174130.1| general amino acid permease AGP2 [Schizosaccharomyces japonicus yFS275] gi|212002177|gb|EEB07837.1| general amino acid permease AGP2 [Schizosaccharomyces japonicus yFS275] Length = 559 Score = 34.6 bits (78), Expect = 8.5, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 39/69 (56%), Gaps = 1/69 (1%) Query: 40 LLRYWSVEFWELLCRTFLWLLCLGFFTLFAYWFFGEKFSYDVSIKIVLAINVLLFSFVIA 99 ++ YW+ + + + L ++ L++ +FFGE Y K++LA+ ++LF+ ++ Sbjct: 154 VIEYWTDQIPKAAIISIL-IVVFAALNLYSVFFFGEGEFYLSLGKVILALGLILFTIIVM 212 Query: 100 LLGNPNIEM 108 GNP+ E+ Sbjct: 213 CGGNPHHEV 221 >gi|307707969|ref|ZP_07644444.1| IgA1 protease [Streptococcus mitis NCTC 12261] gi|307616034|gb|EFN95232.1| IgA1 protease [Streptococcus mitis NCTC 12261] Length = 1978 Score = 34.6 bits (78), Expect = 8.7, Method: Composition-based stats. Identities = 21/86 (24%), Positives = 41/86 (47%), Gaps = 8/86 (9%) Query: 119 IDNIENGYGQYLEKISQKNVEK--DFFNSKFFTHFKEYIDEVDKILMRYGIEGIPLTDFD 176 +D + GQ K++ N+EK F+N + H+ +D DK+ ++ +P+ D D Sbjct: 1121 VDYTQLSQGQASRKVAYHNIEKLMPFYNKELVVHYGNQVDPTDKLYTTELLDVVPMKDND 1180 Query: 177 HEKIKSYLTHIKIVRDCLKKDNIHYA 202 +T I+ + + K +H+A Sbjct: 1181 ------IITDIQANKVAINKLMLHFA 1200 >gi|78044363|ref|YP_361039.1| hypothetical protein CHY_2226 [Carboxydothermus hydrogenoformans Z-2901] gi|77996478|gb|ABB15377.1| putative membrane protein [Carboxydothermus hydrogenoformans Z-2901] Length = 308 Score = 34.6 bits (78), Expect = 8.8, Method: Composition-based stats. Identities = 19/69 (27%), Positives = 37/69 (53%), Gaps = 3/69 (4%) Query: 54 RTFLWLLCLGFFTLFAY---WFFGEKFSYDVSIKIVLAINVLLFSFVIALLGNPNIEMAK 110 + +L+L+ +GF +F Y +FFG KF+ +V+A+N ++ S + AL + K Sbjct: 67 KNYLYLVIMGFTGIFLYNLFFFFGVKFNPASDSSLVIAVNPIVVSLLAALFLKEKLTFEK 126 Query: 111 DIAQLFTLI 119 + + + I Sbjct: 127 ILGLIISFI 135 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.327 0.143 0.450 Lambda K H 0.267 0.0462 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,780,682,132 Number of Sequences: 14124377 Number of extensions: 144053003 Number of successful extensions: 484396 Number of sequences better than 10.0: 116 Number of HSP's better than 10.0 without gapping: 18 Number of HSP's successfully gapped in prelim test: 98 Number of HSP's that attempted gapping in prelim test: 484357 Number of HSP's gapped (non-prelim): 126 length of query: 214 length of database: 4,842,793,630 effective HSP length: 133 effective length of query: 81 effective length of database: 2,964,251,489 effective search space: 240104370609 effective search space used: 240104370609 T: 11 A: 40 X1: 16 ( 7.6 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.7 bits) S2: 79 (34.9 bits)