RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254781033|ref|YP_003065446.1| HemY domain-containing protein [Candidatus Liberibacter asiaticus str. psy62] (492 letters) >gnl|CDD|33686 COG3898, COG3898, Uncharacterized membrane-bound protein [Function unknown]. Length = 531 Score = 286 bits (733), Expect = 1e-77 Identities = 153/481 (31%), Positives = 245/481 (50%), Gaps = 25/481 (5%) Query: 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFA 60 M+R++ + +++ + F ++ P ++S+ W + Y S V SI+ + A +LL+ Sbjct: 1 MIRVLFFLILVAALGFGFAWLADRPGELSLIWQGQQYEMSLMVAASIIVALVAAVLLLWW 60 Query: 61 VSRFFLSCPAMLFHMLHKRNYDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEY 120 + R P +R D+GY+AL TGL++ A + LARKM + S+ + E Sbjct: 61 LVRSIWESPYTARRYFRERKRDRGYQALSTGLIAAGAGDASLARKMTARASKLLSSDQEP 120 Query: 121 LVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKA 180 L++LLE Q AL E Y A +K E ML P TR + LY E+ R+G +A+ YA +A Sbjct: 121 LIHLLEAQAALLEGDYEDARKKFEAMLDDPETRLLGLRGLYLEAQRLGAREAARHYAERA 180 Query: 181 LDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKK----NAKEWNRNRAILLIARSLEN 236 + +P PW A ++ A +W A+ ++ ++ R+RA+LL A+++ Sbjct: 181 AEKAPQLPWAARATLEARCAAGDWDGALKLVDAQRAAKVIEKDVAERSRAVLLTAKAMSL 240 Query: 237 ADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIA 296 D D ++ DA+E+ KL + + A++ AA++L RK ILE WK PHP+IA Sbjct: 241 LD-ADPASARDDALEANKLAPDLVPAAVVAARALFRDGNLRKGSKILETAWKAEPHPDIA 299 Query: 297 NIYTHLLSENTV-GKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIA 355 +Y S +T +LKRA +LE + + ES + V++ AL+ G A AKA A + A Sbjct: 300 LLYVRARSGDTALDRLKRAKKLESLKPNNAESSLAVAEAALDAGEFSAARAKAEAAAREA 359 Query: 356 PRKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPDPLWISDDGYLSSVWLPLSPISKTL 415 PR+ +LLLA IE+A + + K+ W A+ A DP W + DG +S W P+SP++ L Sbjct: 360 PRESAYLLLADIEEAETGDQGKVRQWLAQAVKAPRDPAW-TADGVVSEAWAPVSPVTGRL 418 Query: 416 CYFEWKIPTKSPEYISSENINFSLEMAYPADDLQSMLNNGKKNHLPSIKKVSSFEDSTIH 475 FEWK PT ++A+P +D LP S+ E+ Sbjct: 419 DAFEWKAPTG--------------QLAHPIEDRD----EAIMAPLPEAPAKSAIEEPADE 460 Query: 476 P 476 Sbjct: 461 L 461 >gnl|CDD|32885 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]. Length = 400 Score = 50.7 bits (121), Expect = 8e-07 Identities = 60/325 (18%), Positives = 117/325 (36%), Gaps = 19/325 (5%) Query: 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFA 60 M R++ F ++ I + ++ V I N S ++ L L LL Sbjct: 1 MRRVLLLFVLLLAGIGVGLAIAGQQGYVLIQTDNYNIEMSLTTLVIFLIIALVVLYLLEW 60 Query: 61 VSRFFLSCPAMLFHMLHKRNYDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEY 120 + R L PA +R + KAL GL+ + + A K+ ++ Sbjct: 61 LLRRVLRTPAHTRGWFSRRKRRRARKALNEGLLKLFEGDFQQAEKLLRRNAEHG--EQPV 118 Query: 121 LVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRI----GDLNSAQRY 176 L YLL + A + A+ L ++ AV R+ D +A+ Sbjct: 119 LAYLLAAEAAQQRGDEDRANRYLAEAAELAGDDTLAV---ELTRARLLLNRRDYPAAREN 175 Query: 177 ATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKK----NAKEWNR--NRAILLI 230 + L+++P P V ++ Y+ W + L + +K + +E R +A + Sbjct: 176 VDQLLEMTPRHPEVLRLALRAYIRLGAWQALLAILPKLRKAGLLSDEEAARLEQQAWEGL 235 Query: 231 ARSLENADKGDMIASYHDAI-ESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKV 289 + + + + + ++ L+ N + A+ LI +A+ I+E K Sbjct: 236 LQQARDDNGSEGLKTWWKNQPRKLR---NDPELVVAYAERLIRLGDHDEAQEIIEDALKR 292 Query: 290 NPHPEIANIYTHLLSENTVGKLKRA 314 P + + L + +K A Sbjct: 293 QWDPRLCRLIPRLRPGDPEPLIKAA 317 >gnl|CDD|34394 COG4783, COG4783, Putative Zn-dependent protease, contains TPR repeats [General function prediction only]. Length = 484 Score = 34.5 bits (79), Expect = 0.069 Identities = 16/77 (20%), Positives = 28/77 (36%) Query: 168 GDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAI 227 A KAL + P++P + + Q + + AI LN+ N E + Sbjct: 354 NKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDL 413 Query: 228 LLIARSLENADKGDMIA 244 L A + ++A Sbjct: 414 LAQAYAELGNRAEALLA 430 >gnl|CDD|147584 pfam05478, Prominin, Prominin. The prominins are an emerging family of proteins that among the multispan membrane proteins display a novel topology. Mouse prominin and human prominin (mouse)-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration. Length = 809 Score = 32.3 bits (74), Expect = 0.32 Identities = 17/75 (22%), Positives = 31/75 (41%), Gaps = 16/75 (21%) Query: 7 YFFVISLVICSFIIVSHYPEDVSIT---WGNRLYRTSP------------FVILSILYFF 51 Y + LV+CS +++ + + G R +P F++ + + F Sbjct: 413 YRWYAGLVVCSLVLLVVLCNLLGLLCGICGLR-RPDTPSDRCCVSNTGGRFLMAGVGFSF 471 Query: 52 LFAWILLFAVSRFFL 66 LF+W L+ V FL Sbjct: 472 LFSWFLMLVVLALFL 486 >gnl|CDD|30116 cd01487, E1_ThiF_like, E1_ThiF_like. Member of superfamily of activating enzymes (E1) of the ubiquitin-like proteins. The common reaction mechanism catalyzed by E1-like enzymes begins with a nucleophilic attack of the C-terminal carboxylate of the ubiquitin-like substrate, on the alpha-phosphate of an ATP molecule bound at the active site of the activating enzymes, leading to the formation of a high-energy acyladenylate intermediate and subsequently to the formation of a thiocarboxylate at the C termini of the substrate. The exact function of this family is unknown.. Length = 174 Score = 32.2 bits (73), Expect = 0.34 Identities = 23/96 (23%), Positives = 37/96 (38%), Gaps = 20/96 (20%) Query: 270 LISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRALRLEEINKESVESLV 329 +SQ + K E + E + ++NP +I I + N +E L Sbjct: 45 FLSQIGEPKVEALKENLREINPFVKIEAINIKIDENN------------------LEGLF 86 Query: 330 IVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLA 365 I +E + D A KAMLA + K ++ A Sbjct: 87 GDCDIVVE--AFDNAETKAMLAESLLGNKNKPVVCA 120 >gnl|CDD|37213 KOG2002, KOG2002, KOG2002, TPR-containing nuclear phosphoprotein that regulates K(+) uptake [Inorganic ion transport and metabolism]. Length = 1018 Score = 31.5 bits (71), Expect = 0.57 Identities = 29/167 (17%), Positives = 65/167 (38%), Gaps = 6/167 (3%) Query: 178 TKALDISPDAPWVTEAVVQQYVLAKEWSRAITF-LNQKKKNAKEWNRNRAILLIARSLEN 236 +A + + P + + K++ R + K + + + + RS Sbjct: 260 QRAYKENNENPVALNHLANHFYFKKDYERVWHLAEHAIKNTENKSIKAESFYQLGRSYHA 319 Query: 237 ADKGDMIASYHDAIESLKLC-DNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPH-PE 294 +GD ++ +ESLK DN ++ + + I + +++ EK+ K P+ E Sbjct: 320 --QGDFEKAFKYYMESLKADNDNFVLPLVGLGQMYIKRGDLEESKFCFEKVLKQLPNNYE 377 Query: 295 IANIYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSI 341 I L + + + KR + + +E + S+ LE+ + Sbjct: 378 TMKILGCLYAHSAKKQEKRDKAS-NVLGKVLEQTPVDSEAWLELAQL 423 >gnl|CDD|133462 cd06852, GT_MraY, Phospho-N-acetylmuramoyl-pentapeptide-transferase (mraY) is an enzyme responsible for the formation of the first lipid intermediate in the synthesis of bacterial cell wall peptidoglycan. It catalyzes the formation of undecaprenyl-pyrophosphoryl-N-acetylmuramoyl- pentapeptide from UDP-MurNAc-pentapeptide and undecaprenyl-phosphate. It is an integral membrane protein with possibly ten transmembrane domains. Length = 280 Score = 29.0 bits (66), Expect = 2.8 Identities = 12/54 (22%), Positives = 22/54 (40%), Gaps = 5/54 (9%) Query: 9 FVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVS 62 F+I++V ++ IT + + + L ILY FA ++ S Sbjct: 78 FLIAIVFA-LLLYYFNGSGTLITL---PFFKNGLIDLGILYI-PFAIFVIVGSS 126 >gnl|CDD|35597 KOG0376, KOG0376, KOG0376, Serine-threonine phosphatase 2A, catalytic subunit [General function prediction only]. Length = 476 Score = 28.3 bits (63), Expect = 4.6 Identities = 33/155 (21%), Positives = 68/155 (43%), Gaps = 8/155 (5%) Query: 197 QYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLC 256 + + K + A+ ++ A E + N AI R+L + + HDA+++++L Sbjct: 13 EALKDKVFDVAVDLYSK----AIELDPNCAIYFANRALAHLKVESFGGALHDALKAIELD 68 Query: 257 DNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRALR 316 I A + ++++ + +KA + LEK+ K+ P+ A T + E + Sbjct: 69 PTYIKAYVRRGTAVMALGEFKKALLDLEKVKKLAPNDPDA---TRKIDECNKIVSEEKFE 125 Query: 317 LEEINKESVESLVIVSKIALE-MGSIDQAHAKAML 350 + E + V+ KI E M I+ ++ +L Sbjct: 126 KAILTPEGDKKSVVEMKIDEEDMDLIESDYSGPVL 160 >gnl|CDD|112041 pfam03209, PUCC, PUCC protein. This protein is required for high-level transcription of the PUC operon. Length = 402 Score = 27.7 bits (62), Expect = 7.1 Identities = 10/56 (17%), Positives = 19/56 (33%), Gaps = 10/56 (17%) Query: 7 YFFVISLVICSFIIVSHYPEDVSITWGNRL-YRTSPFVILSILYFFLFAWILLFAV 61 + F S T + L ++ +P++ L L I+ FA+ Sbjct: 25 LLVAPFRALIGFR---------SDTHRSALGWKRTPYIWLGTLLQAGGLAIMPFAL 71 >gnl|CDD|119408 cd06660, Aldo_ket_red, Aldo-keto reductases (AKRs) are a superfamily of soluble NAD(P)(H) oxidoreductases whose chief purpose is to reduce aldehydes and ketones to primary and secondary alcohols. AKRs are present in all phyla and are of importance to both health and industrial applications. Members have very distinct functions and include the prokaryotic 2,5-diketo-D-gluconic acid reductases and beta-keto ester reductases, the eukaryotic aldose reductases, aldehyde reductases, hydroxysteroid dehydrogenases, steroid 5beta-reductases, potassium channel beta-subunits and aflatoxin aldehyde reductases, among others.. Length = 285 Score = 27.9 bits (63), Expect = 7.3 Identities = 12/57 (21%), Positives = 21/57 (36%), Gaps = 9/57 (15%) Query: 296 ANIYTHLLSENTVGK-LKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLA 351 A++Y SE +G+ LK EE+ I +K+ G + + Sbjct: 49 ADVYGDGESEELLGEALKERGPREEV--------FIATKVGPRPGDGRDLSPEHIRR 97 >gnl|CDD|144949 pfam01545, Cation_efflux, Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells. Length = 273 Score = 27.6 bits (62), Expect = 7.9 Identities = 12/59 (20%), Positives = 24/59 (40%), Gaps = 5/59 (8%) Query: 10 VISLVICSFII-VSHYPEDVSITWGNRLYRTSPFVIL--SILYFFLFAWILLFAVSRFF 65 ++S ++ + +S P D +G R L S+L + +IL ++ R Sbjct: 29 LLSSLLALLALRLSSRPPDKRHPFG--HGRLEVLAALIVSLLLLGVGVFILYESIERLI 85 >gnl|CDD|146370 pfam03699, UPF0182, Uncharacterized protein family (UPF0182). This family contains uncharacterized integral membrane proteins. Length = 771 Score = 27.5 bits (62), Expect = 8.4 Identities = 14/70 (20%), Positives = 30/70 (42%), Gaps = 11/70 (15%) Query: 2 LRLIRYFFVISLVICSFII--VSHYPEDVSITWGNRLYRTSPFV-----ILSILYFFLFA 54 L L+ ++++L++ + I +++Y I R S +L L+F L A Sbjct: 156 LELL-LSWLLALLLLALIATLLTYYLYG-GIRLQGRFPGLSRAARRHLAVLLGLFFLLKA 213 Query: 55 WILLFAVSRF 64 + + R+ Sbjct: 214 A--GYWLDRY 221 >gnl|CDD|177151 MTH00086, CYTB, cytochrome b; Provisional. Length = 355 Score = 27.2 bits (61), Expect = 9.5 Identities = 14/57 (24%), Positives = 23/57 (40%), Gaps = 14/57 (24%) Query: 7 YFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILS----ILYFFLFAWILLF 59 F V S ++ + I+ +W + PF+ LS +YFFL +L Sbjct: 307 KFLVFSFIVVNVIL----------SWLGQCLVEYPFLFLSQIFSFIYFFLILLMLFL 353 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.321 0.134 0.400 Gapped Lambda K H 0.267 0.0664 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 6,051,277 Number of extensions: 321316 Number of successful extensions: 1083 Number of sequences better than 10.0: 1 Number of HSP's gapped: 1072 Number of HSP's successfully gapped: 57 Length of query: 492 Length of database: 6,263,737 Length adjustment: 98 Effective length of query: 394 Effective length of database: 4,146,055 Effective search space: 1633545670 Effective search space used: 1633545670 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 59 (26.6 bits)