RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254781161|ref|YP_003065574.1| glutamate--cysteine ligase [Candidatus Liberibacter asiaticus str. psy62] (457 letters) >gnl|CDD|33374 COG3572, GshA, Gamma-glutamylcysteine synthetase [Coenzyme metabolism]. Length = 456 Score = 532 bits (1372), Expect = e-152 Identities = 231/457 (50%), Positives = 303/457 (66%), Gaps = 1/457 (0%) Query: 1 MTCNPLANTIVTSIDDLVQHIASGIKPQEEFRIGTEHESFIFSRADHRPLPYDGEKSIVT 60 M + T +TS+ +L ++A G K + ++RIGTEHE F F P+PY+GE I Sbjct: 1 MARDTTTETPLTSVAELTDYLAKGDKEKTDWRIGTEHEKFGFYLDGLSPVPYEGEAGIFA 60 Query: 61 ILQAIQKKLAWKEIMDKGNIIGLANPLSKAGISIEPGGQLELSTTTLQNVHQIKGEILGY 120 +L +Q+ L W+ IMD GNIIGL P+ + IS+EPGGQ ELS L+ +HQ GE+ + Sbjct: 61 LLDGMQR-LGWEPIMDVGNIIGLVEPIGQGAISLEPGGQFELSGAPLETIHQTCGEMNQH 119 Query: 121 IQILKEITQNLDLGILGMGFNPKWKLDEMPIMPKSRYVLMKKYMPQVGTHGLDMMFRTCT 180 + +L+EI L LG +G+G +PKW E+P+MPKSRY +M +YMP+VG GLDMM RTCT Sbjct: 120 LAVLREIAAELGLGFVGLGGSPKWTRAEVPVMPKSRYAIMTRYMPKVGVKGLDMMTRTCT 179 Query: 181 TQVSLDFSSEHDMATKLRVSFKLQPLATAIFASSPFAEGRINGFQSWRSEIWRHTDSDRT 240 QV+LDFSSE DM K+RVS LQPLATA+FA+SPF EG+ NG SWR +IWR TD R+ Sbjct: 180 IQVNLDFSSETDMRRKMRVSLALQPLATALFANSPFTEGKPNGLLSWRGDIWRDTDPQRS 239 Query: 241 EILPFILRDNSNFEHYAQWALDIPMYFILRKKEYYCCTDITFRQFMNGALKGRIKEWHPT 300 +LPF D+ F Y +ALD+PMYF+ R Y+ C ++FRQFM GALKG + PT Sbjct: 240 GVLPFAFSDDFGFIDYVNYALDVPMYFVRRTGHYHDCAHVSFRQFMAGALKGELPGRRPT 299 Query: 301 LEDWENHLSTLFPAVRLRNCLEMRGADSGRLENIFAVAAFWTGILYDSSALQNADHLTSS 360 ++DW NHLSTLFP VRL+ LEMRGADSG I A+ AFW G+LYD AL A+ LT Sbjct: 300 MKDWTNHLSTLFPDVRLKRFLEMRGADSGPWRRICALPAFWVGLLYDPEALDAAEDLTKD 359 Query: 361 WSFYDINKLNNTVPSKGMRSTVRGQSLKDIAIQILTFAHQGLKNRSAKNHLQEDETIFLK 420 W++ ++ L N VP KG+ + G +L +IA +L + GLKNR N DETIFL Sbjct: 360 WTYEEVLALRNAVPKKGLAAEFAGNTLLEIAKDVLPISRIGLKNRLVLNGDGGDETIFLD 419 Query: 421 PLEKIIHNNQTTADEMLAAYHTRWGKSIDPCFEEYAY 457 PL++++ T A+ ML+ YH WG SI+P F EYAY Sbjct: 420 PLDEVLAGGTTIAEAMLSLYHGAWGGSIEPVFNEYAY 456 >gnl|CDD|146638 pfam04107, GCS2, Glutamate-cysteine ligase family 2(GCS2). Also known as gamma-glutamylcysteine synthetase and gamma-ECS (EC:6.3.2.2). This enzyme catalyses the first and rate limiting step in de novo glutathione biosynthesis. Members of this family are found in archaea, bacteria and plants. May and Leaver discuss the possible evolutionary origins of glutamate-cysteine ligase enzymes in different organisms and suggest that it evolved independently in different eukaryotes, from an ancestral bacterial enzyme. They also state that Arabidopsis thaliana gamma-glutamylcysteine synthetase is structurally unrelated to mammalian, yeast and Escherichia coli homologues. In plants, there are separate cytosolic and chloroplast forms of the enzyme. Length = 291 Score = 223 bits (569), Expect = 1e-58 Identities = 105/307 (34%), Positives = 151/307 (49%), Gaps = 24/307 (7%) Query: 48 RPLPYDGEKSIVTILQAIQKKLAWKEIMDKGNIIGLANPLSKAGISIEPGGQLELSTTTL 107 R L + E +V +L W I++ IGL L + PGGQ+ELST L Sbjct: 1 RTLGVEEEFGVVDLLGGDL--RGWSPILEDEAKIGL--SLGGGFVKELPGGQVELSTPPL 56 Query: 108 QNVHQIKGEILGYIQILKEITQNLDLGILGMGFNPKWKLDEMPIMPKSRYVLMKKYMPQV 167 +++ + EI + + L+ + L LG+LG+G +P P+MPK RY M +YMP+V Sbjct: 57 ESLAEAAEEISQHREELRHVADELGLGLLGLGTHPFALRSRDPVMPKGRYRRMYEYMPRV 116 Query: 168 GTHGLDMMFRTCTTQVSLDFSSEHDMATKLRVSFKLQPLATAIFASSPFAEGRINGFQSW 227 G +G MM C QV++D SSE MA LR+ L P+ A+ A+SPF GR G+ S Sbjct: 117 GVYGRQMMVAGCHVQVNIDSSSEAIMA-VLRLVRALLPVLLALSANSPFWGGRDTGYAST 175 Query: 228 RSEIWRHTDSDRTEILPFILRDNSNFEHYAQWALDIPMYFILRKKEYYCCTDITFRQFMN 287 R+ I+ T + + LP D + FE YA++ALD + F+ R + + Sbjct: 176 RALIF--TQTPQAGPLPLAFEDGA-FERYARYALDTGIIFVRR--------RLWWD---- 220 Query: 288 GALKGRIKEWHPTLEDWENHLSTLFPAVRLRNCLEMRGADSGRLENIFAVAAFWTGILYD 347 GR + H +T FP VRLR LE R D+ + A+ A WT L D Sbjct: 221 ----GRPPGLPGETLELRIHDTTAFPPVRLRALLEARLLDAQPDWRLDALPAAWTVALLD 276 Query: 348 SSALQNA 354 A +NA Sbjct: 277 DEAEENA 283 >gnl|CDD|32353 COG2170, COG2170, Uncharacterized conserved protein [Function unknown]. Length = 369 Score = 45.7 bits (108), Expect = 2e-05 Identities = 33/161 (20%), Positives = 59/161 (36%), Gaps = 11/161 (6%) Query: 99 QLELSTTTLQNVHQIKGEILGYIQILKEITQNLDLGILGMGFNPKWKLDEMPIMPKSRYV 158 +EL+T + + + ++ L + + L I G G +P + RY Sbjct: 49 TVELATGVCRLLAEAAAQLRALRDYLVQAASDHGLRICGGGTHPFADWRRQEVPDNPRY- 107 Query: 159 LMKKYMPQVGTHGLDMMFRTCTTQVSLDFSSEHDMATKLRVSFKLQPLATAIFASSPFAE 218 ++ + + G G M V + S D L + P A+ ASSPF + Sbjct: 108 --QRLIERTGYLGRQMT--VAGQHVHVGIPSPDDAMYLLHRLLRYVPHLLALSASSPFWQ 163 Query: 219 GRINGFQSWRSEIWRHTDSDRTEILPFILRDNSNFEHYAQW 259 G G+ S R+ I+ T LP ++ + + Sbjct: 164 GTDTGYASARANIFSQLP---TNGLPPAF---QSWAAFEAF 198 >gnl|CDD|147656 pfam05603, DUF775, Protein of unknown function (DUF775). This family consists of several eukaryotic proteins of unknown function. Length = 195 Score = 27.7 bits (62), Expect = 6.8 Identities = 20/76 (26%), Positives = 26/76 (34%), Gaps = 2/76 (2%) Query: 68 KLAWKEIMDKGNIIGLANPLSKA--GISIEPGGQLELSTTTLQNVHQIKGEILGYIQILK 125 K++ M NP S A GISIEP QL L+ + Sbjct: 80 KVSNDIDMLDDGNPATGNPQSTAQIGISIEPLDQLAQQLAALKQSQSGSQAAQQNAGVTP 139 Query: 126 EITQNLDLGILGMGFN 141 T+ L I+ FN Sbjct: 140 VSTKQLAQKIVENLFN 155 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.321 0.135 0.413 Gapped Lambda K H 0.267 0.0728 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 5,674,375 Number of extensions: 297940 Number of successful extensions: 561 Number of sequences better than 10.0: 1 Number of HSP's gapped: 557 Number of HSP's successfully gapped: 8 Length of query: 457 Length of database: 6,263,737 Length adjustment: 97 Effective length of query: 360 Effective length of database: 4,167,664 Effective search space: 1500359040 Effective search space used: 1500359040 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 59 (26.5 bits)