RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254781040|ref|YP_003065453.1| hypothetical protein CLIBASIA_04710 [Candidatus Liberibacter asiaticus str. psy62] (143 letters) >gnl|CDD|32770 COG2947, COG2947, Uncharacterized conserved protein [Function unknown]. Length = 156 Score = 174 bits (443), Expect = 6e-45 Identities = 76/157 (48%), Positives = 96/157 (61%), Gaps = 17/157 (10%) Query: 1 MAYWLVKSEPSEWSWKMQQDKGRVGEAWTGVRNYQARNNMRKMRVGDKGFFYHSN-KGRE 59 AYWL+KSEP +S +M + G E W GVRNYQARN MR M++GD GFFYHSN K Sbjct: 2 AAYWLMKSEPDVFSIEMLKAVGE--EPWDGVRNYQARNFMRDMKIGDLGFFYHSNCKPPG 59 Query: 60 IVGIFEVITCTYPDPTAEQSSC-------------WECVDICAVCSMPCPVSLMAIKANP 106 IVG+ EV +PDPT + W CVD+ V +P PV+L +KANP Sbjct: 60 IVGLAEVCALAHPDPTQFDPASPYYDPKATPEDPRWYCVDVRFVRKLPRPVTLKELKANP 119 Query: 107 RLSSMILIV-SSRLSVQPVTTDEYLEVCRMGKLSNPP 142 L+ M L+V +RLSVQPVT +E+ E+ R+G L PP Sbjct: 120 ELAEMSLLVKGNRLSVQPVTPEEWKEILRLGGLDAPP 156 >gnl|CDD|145181 pfam01878, DUF55, Protein of unknown function DUF55. This family of proteins have no known function. Length = 143 Score = 155 bits (395), Expect = 3e-39 Identities = 66/145 (45%), Positives = 84/145 (57%), Gaps = 13/145 (8%) Query: 2 AYWLVKSEPSEWSWKMQQDKGRVGEAWTGVRNYQARNNMRKMRVGDKGFFYHSN-KGREI 60 AYWL+KSEP +S + +G G W GVRNYQARN +R M+VGD FFYHSN K I Sbjct: 1 AYWLMKSEPDHYSIDDLKGEG--GTPWDGVRNYQARNFLRAMKVGDLVFFYHSNEKPPAI 58 Query: 61 VGIFEVITCTYPDPTAEQSSC---------WECVDICAVCSMPCPVSLMAIKANPRLSSM 111 VGI EV++ YPDPTA W VD+ V + PV+L +KA P L + Sbjct: 59 VGIAEVVSEAYPDPTAFDPESPYYDPENPRWYRVDVKFVEKLKRPVTLKELKAEPELKDL 118 Query: 112 ILI-VSSRLSVQPVTTDEYLEVCRM 135 L+ SRLSV PVT +E+ + + Sbjct: 119 SLVRQGSRLSVMPVTEEEWKLILEL 143 >gnl|CDD|38593 KOG3383, KOG3383, KOG3383, Uncharacterized conserved protein [Function unknown]. Length = 187 Score = 111 bits (279), Expect = 5e-26 Identities = 60/166 (36%), Positives = 81/166 (48%), Gaps = 27/166 (16%) Query: 3 YWLVKSEPS---------EWSWKMQQDKGRVGEAWTGVRNYQARNNMRKMRVGDKGFFYH 53 YWL+K+EP E+SW+ + G W GVRNYQARNN+R M++GD FFYH Sbjct: 10 YWLMKTEPEPRYVKGINVEFSWEDLESNGGKISKWDGVRNYQARNNLRSMKLGDLAFFYH 69 Query: 54 SN-KGREIVGIFEVITCTYPDPTA-------------EQSSCWECVDICAVCSMPCPVSL 99 SN K IVG+ E+ YPD Q+ W VD+ A+ M V L Sbjct: 70 SNCKSPCIVGLVEISREWYPDDAEGVENDPYYDPESTPQNPRWGAVDVQAIGEMRRCVDL 129 Query: 100 MAIKAN----PRLSSMILIVSSRLSVQPVTTDEYLEVCRMGKLSNP 141 +K + L M+L RLS+ PV +E+ +C +G P Sbjct: 130 KELKMHKDKGIILKGMVLFRRPRLSIVPVEEEEWNFICELGNGFEP 175 >gnl|CDD|30596 COG0247, GlpC, Fe-S oxidoreductase [Energy production and conversion]. Length = 388 Score = 27.6 bits (60), Expect = 1.4 Identities = 13/55 (23%), Positives = 20/55 (36%), Gaps = 2/55 (3%) Query: 80 SCWECVDICAVCSMPCPVSLMAIKANPR-LSSMILIVSSRLSVQPVTTDEYLEVC 133 S +CV C C+ CP +PR ++ V + E L+ C Sbjct: 7 SLDKCVH-CGFCTNVCPSYRATEALSPRGRIVLVREVLRGKAPGDEEVYEALDTC 60 >gnl|CDD|112456 pfam03639, Glyco_hydro_81, Glycosyl hydrolase family 81. Family of eukaryotic beta-1,3-glucanases. Length = 694 Score = 27.6 bits (61), Expect = 1.5 Identities = 9/20 (45%), Positives = 12/20 (60%) Query: 12 EWSWKMQQDKGRVGEAWTGV 31 EW+WK+ VGE W G+ Sbjct: 635 EWNWKIPPIINSVGEGWKGI 654 >gnl|CDD|39007 KOG3803, KOG3803, KOG3803, Transcription factor containing C2HC type Zn finger [Transcription]. Length = 968 Score = 27.1 bits (59), Expect = 1.6 Identities = 9/37 (24%), Positives = 15/37 (40%) Query: 44 RVGDKGFFYHSNKGREIVGIFEVITCTYPDPTAEQSS 80 ++ HS+ GRE V + C+ D + S Sbjct: 138 DEAEECLLIHSDDGREKVDDSQPPLCSSDDNESNSES 174 >gnl|CDD|38284 KOG3074, KOG3074, KOG3074, Transcriptional regulator of the PUR family, single-stranded-DNA-binding [Transcription]. Length = 263 Score = 26.1 bits (57), Expect = 3.2 Identities = 9/49 (18%), Positives = 17/49 (34%), Gaps = 7/49 (14%) Query: 39 NMRKMRVGDKGFFYHSNKGREIVGIFEVITCTYPDPTAEQS-----SCW 82 + V +K F + K + G+F I+ + + S W Sbjct: 179 EGTSITVDNKRFLFDVGKNKR--GVFVRISEVKSNGSYRNSITVPFKAW 225 >gnl|CDD|34275 COG4656, RnfC, Predicted NADH:ubiquinone oxidoreductase, subunit RnfC [Energy production and conversion]. Length = 529 Score = 26.0 bits (57), Expect = 3.5 Identities = 11/38 (28%), Positives = 15/38 (39%), Gaps = 9/38 (23%) Query: 71 YPDPTAEQSS--CWECVDICAVCSMPCPVSLMAIKANP 106 +P EQS C C D CPV+L+ + Sbjct: 358 LLEPIKEQSCIRCSLCADA-------CPVNLLPQQLYW 388 >gnl|CDD|38190 KOG2979, KOG2979, KOG2979, Protein involved in DNA repair [General function prediction only]. Length = 262 Score = 26.2 bits (57), Expect = 3.5 Identities = 22/79 (27%), Positives = 29/79 (36%), Gaps = 3/79 (3%) Query: 63 IFEVITCTYPDPTAEQSSCWECVDICAVCSMPCPVSLMAIKANPRLSSMILIVSSRLSVQ 122 + I PDP E E + V S P+S I NP +S V R S+ Sbjct: 149 AAKDINMKIPDPEEEDED--EELIGQEVFSNRDPISKKPIV-NPVISKKCGHVYDRDSIM 205 Query: 123 PVTTDEYLEVCRMGKLSNP 141 + DE C + NP Sbjct: 206 QILCDEITIRCPVLGCENP 224 >gnl|CDD|34436 COG4827, COG4827, Predicted transporter [General function prediction only]. Length = 239 Score = 26.0 bits (57), Expect = 3.7 Identities = 13/46 (28%), Positives = 21/46 (45%), Gaps = 6/46 (13%) Query: 88 CAVCSMPCPVSLMAIKANPRLSSMILIVSSRLSVQPVTTDEYLEVC 133 SMPCPV L A ++ I+++S LS+ V + + Sbjct: 107 FLAISMPCPVCLGA------VAVSIMLLSQILSLSGVKIGLLVGIA 146 >gnl|CDD|113611 pfam04845, PurA, PurA ssDNA and RNA-binding protein. This family represents most of the length of the protein. Length = 218 Score = 25.6 bits (56), Expect = 4.8 Identities = 9/30 (30%), Positives = 14/30 (46%), Gaps = 2/30 (6%) Query: 39 NMRKMRVGDKGFFYH--SNKGREIVGIFEV 66 + V +K F++ SNK + I EV Sbjct: 160 EGTSLTVDNKRFYFDVGSNKRGVFMRISEV 189 >gnl|CDD|38865 KOG3661, KOG3661, KOG3661, Uncharacterized conserved protein [Function unknown]. Length = 1019 Score = 25.4 bits (55), Expect = 5.0 Identities = 16/64 (25%), Positives = 24/64 (37%), Gaps = 5/64 (7%) Query: 67 ITCTYPDPTAEQSSCWECVDICAVCSMPCPVSLMAIKANPRL-----SSMILIVSSRLSV 121 +T PT QS+ W + P+S+ ++ PR S IL S LS Sbjct: 174 VTALPLHPTRAQSNLWNPQGPLSDGPGSLPLSIAPVRKRPRHDPGAPSPGILQDSDGLSG 233 Query: 122 QPVT 125 + Sbjct: 234 SYLD 237 >gnl|CDD|36489 KOG1275, KOG1275, KOG1275, PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]. Length = 1118 Score = 25.7 bits (56), Expect = 5.2 Identities = 12/52 (23%), Positives = 20/52 (38%), Gaps = 13/52 (25%) Query: 96 PVSLMA----IKANPRLSSMILIVSSRLSVQPVTTDEYLEVCRMGKLSNPPL 143 P+ ++ +P L++ + + S Q V T LSNPP Sbjct: 261 PIQFPYGPQFLRFHPSLTTRLAVTSQSGQFQFVDTAT---------LSNPPA 303 >gnl|CDD|38461 KOG3251, KOG3251, KOG3251, Golgi SNAP receptor complex member [Intracellular trafficking, secretion, and vesicular transport]. Length = 213 Score = 25.6 bits (56), Expect = 5.3 Identities = 12/49 (24%), Positives = 20/49 (40%), Gaps = 6/49 (12%) Query: 5 LVKSEP-----SEWSWKMQQDKGRVGEAWTGVRNYQARNNMRKMRVGDK 48 LV EP + ++ Q V T +R RNN R+ + ++ Sbjct: 55 LVSKEPPKSRQAAR-LRVDQLLEDVEHLQTSLRTSMNRNNRREQQARER 102 >gnl|CDD|176946 CHL00001, rpoB, RNA polymerase beta subunit. Length = 1070 Score = 25.6 bits (57), Expect = 5.6 Identities = 9/17 (52%), Positives = 12/17 (70%) Query: 41 RKMRVGDKGFFYHSNKG 57 R+++VGDK H NKG Sbjct: 812 REIQVGDKVAGRHGNKG 828 >gnl|CDD|39784 KOG4584, KOG4584, KOG4584, Uncharacterized conserved protein [General function prediction only]. Length = 348 Score = 25.3 bits (55), Expect = 6.8 Identities = 13/45 (28%), Positives = 17/45 (37%) Query: 65 EVITCTYPDPTAEQSSCWECVDICAVCSMPCPVSLMAIKANPRLS 109 EVI C P + E ++ A + C V L AI L Sbjct: 230 EVILCANSSPALNDVTYSELKELAAELANDCNVLLKAIDTGQLLV 274 >gnl|CDD|31237 COG1034, NuoG, NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) [Energy production and conversion]. Length = 693 Score = 24.9 bits (54), Expect = 8.2 Identities = 10/21 (47%), Positives = 11/21 (52%) Query: 70 TYPDPTAEQSSCWECVDICAV 90 TY D E C CVD+C V Sbjct: 178 TYLDQPLESELCGNCVDVCPV 198 >gnl|CDD|133131 cd06600, GH31_MGAM-like, This family includes the following closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), lysosomal acid alpha-glucosidase (GAA), neutral alpha-glucosidase C (GANC), the alpha subunit of neutral alpha-glucosidase AB (GANAB), and alpha-glucosidase II. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end and a C-terminal luminal domain. Both of the tandem GH31 domains of MGAM and SI are included in this family. The domain architecture of GAA includes an N-terminal TFF (trefoil factor family) domain in addition to the GH31 catalytic domain. Deficient GAA expression causes pompe disease, an autosomal recessive genetic disorder also known as glycogen storage disease type II (GSDII). GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. Alpha-glucosidase II is a GH31 enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This family also includes the MalA alpha-glucosidase from Sulfolobus sulfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source. The MGAM-like family corresponds to subgroup 1 in the Ernst et al classification of GH31 enzymes. Length = 317 Score = 24.8 bits (54), Expect = 9.1 Identities = 13/36 (36%), Positives = 14/36 (38%) Query: 47 DKGFFYHSNKGREIVGIFEVITCTYPDPTAEQSSCW 82 DKG F G VG T YPD T + W Sbjct: 102 DKGKFCEIESGELFVGKMWPGTTVYPDFTNPDTREW 137 >gnl|CDD|31788 COG1600, COG1600, Uncharacterized Fe-S protein [Energy production and conversion]. Length = 337 Score = 24.6 bits (53), Expect = 9.9 Identities = 8/25 (32%), Positives = 9/25 (36%), Gaps = 4/25 (16%) Query: 72 PDPTAEQSSCWECVDICAVCSMPCP 96 PD E+ C C C CP Sbjct: 178 PDRPEEEDHCGSCT----RCLDACP 198 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.320 0.133 0.433 Gapped Lambda K H 0.267 0.0630 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 1,776,672 Number of extensions: 81345 Number of successful extensions: 206 Number of sequences better than 10.0: 1 Number of HSP's gapped: 195 Number of HSP's successfully gapped: 26 Length of query: 143 Length of database: 6,263,737 Length adjustment: 85 Effective length of query: 58 Effective length of database: 4,426,972 Effective search space: 256764376 Effective search space used: 256764376 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 52 (24.0 bits)