RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddA
21,609 sequences; 6,263,737 total letters
Searching..................................................done
Query= gi|254781033|ref|YP_003065446.1| HemY domain-containing protein
[Candidatus Liberibacter asiaticus str. psy62]
(492 letters)
>gnl|CDD|33686 COG3898, COG3898, Uncharacterized membrane-bound protein [Function
unknown].
Length = 531
Score = 286 bits (733), Expect = 1e-77
Identities = 153/481 (31%), Positives = 245/481 (50%), Gaps = 25/481 (5%)
Query: 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFA 60
M+R++ + +++ + F ++ P ++S+ W + Y S V SI+ + A +LL+
Sbjct: 1 MIRVLFFLILVAALGFGFAWLADRPGELSLIWQGQQYEMSLMVAASIIVALVAAVLLLWW 60
Query: 61 VSRFFLSCPAMLFHMLHKRNYDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEY 120
+ R P +R D+GY+AL TGL++ A + LARKM + S+ + E
Sbjct: 61 LVRSIWESPYTARRYFRERKRDRGYQALSTGLIAAGAGDASLARKMTARASKLLSSDQEP 120
Query: 121 LVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKA 180
L++LLE Q AL E Y A +K E ML P TR + LY E+ R+G +A+ YA +A
Sbjct: 121 LIHLLEAQAALLEGDYEDARKKFEAMLDDPETRLLGLRGLYLEAQRLGAREAARHYAERA 180
Query: 181 LDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKK----NAKEWNRNRAILLIARSLEN 236
+ +P PW A ++ A +W A+ ++ ++ R+RA+LL A+++
Sbjct: 181 AEKAPQLPWAARATLEARCAAGDWDGALKLVDAQRAAKVIEKDVAERSRAVLLTAKAMSL 240
Query: 237 ADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIA 296
D D ++ DA+E+ KL + + A++ AA++L RK ILE WK PHP+IA
Sbjct: 241 LD-ADPASARDDALEANKLAPDLVPAAVVAARALFRDGNLRKGSKILETAWKAEPHPDIA 299
Query: 297 NIYTHLLSENTV-GKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIA 355
+Y S +T +LKRA +LE + + ES + V++ AL+ G A AKA A + A
Sbjct: 300 LLYVRARSGDTALDRLKRAKKLESLKPNNAESSLAVAEAALDAGEFSAARAKAEAAAREA 359
Query: 356 PRKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPDPLWISDDGYLSSVWLPLSPISKTL 415
PR+ +LLLA IE+A + + K+ W A+ A DP W + DG +S W P+SP++ L
Sbjct: 360 PRESAYLLLADIEEAETGDQGKVRQWLAQAVKAPRDPAW-TADGVVSEAWAPVSPVTGRL 418
Query: 416 CYFEWKIPTKSPEYISSENINFSLEMAYPADDLQSMLNNGKKNHLPSIKKVSSFEDSTIH 475
FEWK PT ++A+P +D LP S+ E+
Sbjct: 419 DAFEWKAPTG--------------QLAHPIEDRD----EAIMAPLPEAPAKSAIEEPADE 460
Query: 476 P 476
Sbjct: 461 L 461
>gnl|CDD|32885 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
[Coenzyme metabolism].
Length = 400
Score = 50.7 bits (121), Expect = 8e-07
Identities = 60/325 (18%), Positives = 117/325 (36%), Gaps = 19/325 (5%)
Query: 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFA 60
M R++ F ++ I + ++ V I N S ++ L L LL
Sbjct: 1 MRRVLLLFVLLLAGIGVGLAIAGQQGYVLIQTDNYNIEMSLTTLVIFLIIALVVLYLLEW 60
Query: 61 VSRFFLSCPAMLFHMLHKRNYDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEY 120
+ R L PA +R + KAL GL+ + + A K+ ++
Sbjct: 61 LLRRVLRTPAHTRGWFSRRKRRRARKALNEGLLKLFEGDFQQAEKLLRRNAEHG--EQPV 118
Query: 121 LVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRI----GDLNSAQRY 176
L YLL + A + A+ L ++ AV R+ D +A+
Sbjct: 119 LAYLLAAEAAQQRGDEDRANRYLAEAAELAGDDTLAV---ELTRARLLLNRRDYPAAREN 175
Query: 177 ATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKK----NAKEWNR--NRAILLI 230
+ L+++P P V ++ Y+ W + L + +K + +E R +A +
Sbjct: 176 VDQLLEMTPRHPEVLRLALRAYIRLGAWQALLAILPKLRKAGLLSDEEAARLEQQAWEGL 235
Query: 231 ARSLENADKGDMIASYHDAI-ESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKV 289
+ + + + + ++ L+ N + A+ LI +A+ I+E K
Sbjct: 236 LQQARDDNGSEGLKTWWKNQPRKLR---NDPELVVAYAERLIRLGDHDEAQEIIEDALKR 292
Query: 290 NPHPEIANIYTHLLSENTVGKLKRA 314
P + + L + +K A
Sbjct: 293 QWDPRLCRLIPRLRPGDPEPLIKAA 317
>gnl|CDD|34394 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
repeats [General function prediction only].
Length = 484
Score = 34.5 bits (79), Expect = 0.069
Identities = 16/77 (20%), Positives = 28/77 (36%)
Query: 168 GDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAI 227
A KAL + P++P + + Q + + AI LN+ N E +
Sbjct: 354 NKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDL 413
Query: 228 LLIARSLENADKGDMIA 244
L A + ++A
Sbjct: 414 LAQAYAELGNRAEALLA 430
>gnl|CDD|147584 pfam05478, Prominin, Prominin. The prominins are an emerging
family of proteins that among the multispan membrane
proteins display a novel topology. Mouse prominin and
human prominin (mouse)-like 1 (PROML1) are predicted to
contain five membrane spanning domains, with an
N-terminal domain exposed to the extracellular space
followed by four, alternating small cytoplasmic and
large extracellular, loops and a cytoplasmic C-terminal
domain. The exact function of prominin is unknown
although in humans defects in PROM1, the gene coding for
prominin, cause retinal degeneration.
Length = 809
Score = 32.3 bits (74), Expect = 0.32
Identities = 17/75 (22%), Positives = 31/75 (41%), Gaps = 16/75 (21%)
Query: 7 YFFVISLVICSFIIVSHYPEDVSIT---WGNRLYRTSP------------FVILSILYFF 51
Y + LV+CS +++ + + G R +P F++ + + F
Sbjct: 413 YRWYAGLVVCSLVLLVVLCNLLGLLCGICGLR-RPDTPSDRCCVSNTGGRFLMAGVGFSF 471
Query: 52 LFAWILLFAVSRFFL 66
LF+W L+ V FL
Sbjct: 472 LFSWFLMLVVLALFL 486
>gnl|CDD|30116 cd01487, E1_ThiF_like, E1_ThiF_like. Member of superfamily of
activating enzymes (E1) of the ubiquitin-like proteins.
The common reaction mechanism catalyzed by E1-like
enzymes begins with a nucleophilic attack of the
C-terminal carboxylate of the ubiquitin-like substrate,
on the alpha-phosphate of an ATP molecule bound at the
active site of the activating enzymes, leading to the
formation of a high-energy acyladenylate intermediate
and subsequently to the formation of a thiocarboxylate
at the C termini of the substrate. The exact function of
this family is unknown..
Length = 174
Score = 32.2 bits (73), Expect = 0.34
Identities = 23/96 (23%), Positives = 37/96 (38%), Gaps = 20/96 (20%)
Query: 270 LISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRALRLEEINKESVESLV 329
+SQ + K E + E + ++NP +I I + N +E L
Sbjct: 45 FLSQIGEPKVEALKENLREINPFVKIEAINIKIDENN------------------LEGLF 86
Query: 330 IVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLA 365
I +E + D A KAMLA + K ++ A
Sbjct: 87 GDCDIVVE--AFDNAETKAMLAESLLGNKNKPVVCA 120
>gnl|CDD|37213 KOG2002, KOG2002, KOG2002, TPR-containing nuclear phosphoprotein
that regulates K(+) uptake [Inorganic ion transport and
metabolism].
Length = 1018
Score = 31.5 bits (71), Expect = 0.57
Identities = 29/167 (17%), Positives = 65/167 (38%), Gaps = 6/167 (3%)
Query: 178 TKALDISPDAPWVTEAVVQQYVLAKEWSRAITF-LNQKKKNAKEWNRNRAILLIARSLEN 236
+A + + P + + K++ R + K + + + + RS
Sbjct: 260 QRAYKENNENPVALNHLANHFYFKKDYERVWHLAEHAIKNTENKSIKAESFYQLGRSYHA 319
Query: 237 ADKGDMIASYHDAIESLKLC-DNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPH-PE 294
+GD ++ +ESLK DN ++ + + I + +++ EK+ K P+ E
Sbjct: 320 --QGDFEKAFKYYMESLKADNDNFVLPLVGLGQMYIKRGDLEESKFCFEKVLKQLPNNYE 377
Query: 295 IANIYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSI 341
I L + + + KR + + +E + S+ LE+ +
Sbjct: 378 TMKILGCLYAHSAKKQEKRDKAS-NVLGKVLEQTPVDSEAWLELAQL 423
>gnl|CDD|133462 cd06852, GT_MraY, Phospho-N-acetylmuramoyl-pentapeptide-transferase
(mraY) is an enzyme responsible for the formation of the
first lipid intermediate in the synthesis of bacterial
cell wall peptidoglycan. It catalyzes the formation of
undecaprenyl-pyrophosphoryl-N-acetylmuramoyl-
pentapeptide from UDP-MurNAc-pentapeptide and
undecaprenyl-phosphate. It is an integral membrane
protein with possibly ten transmembrane domains.
Length = 280
Score = 29.0 bits (66), Expect = 2.8
Identities = 12/54 (22%), Positives = 22/54 (40%), Gaps = 5/54 (9%)
Query: 9 FVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVS 62
F+I++V ++ IT + + + L ILY FA ++ S
Sbjct: 78 FLIAIVFA-LLLYYFNGSGTLITL---PFFKNGLIDLGILYI-PFAIFVIVGSS 126
>gnl|CDD|35597 KOG0376, KOG0376, KOG0376, Serine-threonine phosphatase 2A,
catalytic subunit [General function prediction only].
Length = 476
Score = 28.3 bits (63), Expect = 4.6
Identities = 33/155 (21%), Positives = 68/155 (43%), Gaps = 8/155 (5%)
Query: 197 QYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLC 256
+ + K + A+ ++ A E + N AI R+L + + HDA+++++L
Sbjct: 13 EALKDKVFDVAVDLYSK----AIELDPNCAIYFANRALAHLKVESFGGALHDALKAIELD 68
Query: 257 DNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRALR 316
I A + ++++ + +KA + LEK+ K+ P+ A T + E +
Sbjct: 69 PTYIKAYVRRGTAVMALGEFKKALLDLEKVKKLAPNDPDA---TRKIDECNKIVSEEKFE 125
Query: 317 LEEINKESVESLVIVSKIALE-MGSIDQAHAKAML 350
+ E + V+ KI E M I+ ++ +L
Sbjct: 126 KAILTPEGDKKSVVEMKIDEEDMDLIESDYSGPVL 160
>gnl|CDD|112041 pfam03209, PUCC, PUCC protein. This protein is required for
high-level transcription of the PUC operon.
Length = 402
Score = 27.7 bits (62), Expect = 7.1
Identities = 10/56 (17%), Positives = 19/56 (33%), Gaps = 10/56 (17%)
Query: 7 YFFVISLVICSFIIVSHYPEDVSITWGNRL-YRTSPFVILSILYFFLFAWILLFAV 61
+ F S T + L ++ +P++ L L I+ FA+
Sbjct: 25 LLVAPFRALIGFR---------SDTHRSALGWKRTPYIWLGTLLQAGGLAIMPFAL 71
>gnl|CDD|119408 cd06660, Aldo_ket_red, Aldo-keto reductases (AKRs) are a
superfamily of soluble NAD(P)(H) oxidoreductases whose
chief purpose is to reduce aldehydes and ketones to
primary and secondary alcohols. AKRs are present in all
phyla and are of importance to both health and
industrial applications. Members have very distinct
functions and include the prokaryotic
2,5-diketo-D-gluconic acid reductases and beta-keto
ester reductases, the eukaryotic aldose reductases,
aldehyde reductases, hydroxysteroid dehydrogenases,
steroid 5beta-reductases, potassium channel
beta-subunits and aflatoxin aldehyde reductases, among
others..
Length = 285
Score = 27.9 bits (63), Expect = 7.3
Identities = 12/57 (21%), Positives = 21/57 (36%), Gaps = 9/57 (15%)
Query: 296 ANIYTHLLSENTVGK-LKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLA 351
A++Y SE +G+ LK EE+ I +K+ G + +
Sbjct: 49 ADVYGDGESEELLGEALKERGPREEV--------FIATKVGPRPGDGRDLSPEHIRR 97
>gnl|CDD|144949 pfam01545, Cation_efflux, Cation efflux family. Members of this
family are integral membrane proteins, that are found
to increase tolerance to divalent metal ions such as
cadmium, zinc, and cobalt. These proteins are thought
to be efflux pumps that remove these ions from cells.
Length = 273
Score = 27.6 bits (62), Expect = 7.9
Identities = 12/59 (20%), Positives = 24/59 (40%), Gaps = 5/59 (8%)
Query: 10 VISLVICSFII-VSHYPEDVSITWGNRLYRTSPFVIL--SILYFFLFAWILLFAVSRFF 65
++S ++ + +S P D +G R L S+L + +IL ++ R
Sbjct: 29 LLSSLLALLALRLSSRPPDKRHPFG--HGRLEVLAALIVSLLLLGVGVFILYESIERLI 85
>gnl|CDD|146370 pfam03699, UPF0182, Uncharacterized protein family (UPF0182). This
family contains uncharacterized integral membrane
proteins.
Length = 771
Score = 27.5 bits (62), Expect = 8.4
Identities = 14/70 (20%), Positives = 30/70 (42%), Gaps = 11/70 (15%)
Query: 2 LRLIRYFFVISLVICSFII--VSHYPEDVSITWGNRLYRTSPFV-----ILSILYFFLFA 54
L L+ ++++L++ + I +++Y I R S +L L+F L A
Sbjct: 156 LELL-LSWLLALLLLALIATLLTYYLYG-GIRLQGRFPGLSRAARRHLAVLLGLFFLLKA 213
Query: 55 WILLFAVSRF 64
+ + R+
Sbjct: 214 A--GYWLDRY 221
>gnl|CDD|177151 MTH00086, CYTB, cytochrome b; Provisional.
Length = 355
Score = 27.2 bits (61), Expect = 9.5
Identities = 14/57 (24%), Positives = 23/57 (40%), Gaps = 14/57 (24%)
Query: 7 YFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILS----ILYFFLFAWILLF 59
F V S ++ + I+ +W + PF+ LS +YFFL +L
Sbjct: 307 KFLVFSFIVVNVIL----------SWLGQCLVEYPFLFLSQIFSFIYFFLILLMLFL 353
Database: CddA
Posted date: Feb 4, 2011 9:38 PM
Number of letters in database: 6,263,737
Number of sequences in database: 21,609
Lambda K H
0.321 0.134 0.400
Gapped
Lambda K H
0.267 0.0664 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21609
Number of Hits to DB: 6,051,277
Number of extensions: 321316
Number of successful extensions: 1083
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1072
Number of HSP's successfully gapped: 57
Length of query: 492
Length of database: 6,263,737
Length adjustment: 98
Effective length of query: 394
Effective length of database: 4,146,055
Effective search space: 1633545670
Effective search space used: 1633545670
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 59 (26.6 bits)