RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780556|ref|YP_003064969.1| hypothetical protein CLIBASIA_02215 [Candidatus Liberibacter asiaticus str. psy62] (120 letters) >gnl|CDD|162371 TIGR01456, CECR5, HAD-superfamily class IIA hydrolase, TIGR01456, CECR5. The Schizosaccharomyces pombe sequence (EGAD|138276) is annotated as "phosphatidyl synthase," however this is due entirely to a C-terminal region of the protein (outside the region of similarity of this model) which is highly homologous to a family of CDP-alcohol phosphatidyltransferases. (Thus, the annotation of GP|4226073 from C. elegans as similar to phosphatidyl synthase, is a mistake as this gene does not contain the C-terminal portion). The physical connection of the phosphatidyl synthase and the HAD-superfamily hydrolase domain in S. pombe may, however, be an important clue to the substrate for the hydrolases in this equivalog. Length = 321 Score = 31.8 bits (72), Expect = 0.046 Identities = 18/66 (27%), Positives = 33/66 (50%), Gaps = 6/66 (9%) Query: 29 IESLRMVKEAKMEVLEAHKLAKEYVEQANQRVKEAEEQSNARLLKGLG------MDDLVR 82 I SL V + ++V+++H K V + +R+ S + +G G D++VR Sbjct: 57 ISSLLGVDVSPLQVIQSHSPYKSLVNKYEKRILAVGTGSVRGVAEGYGFQNVVHQDEIVR 116 Query: 83 YFMNLD 88 YF ++D Sbjct: 117 YFRDID 122 >gnl|CDD|179290 PRK01388, PRK01388, arginine deiminase; Provisional. Length = 406 Score = 29.0 bits (66), Expect = 0.31 Identities = 16/54 (29%), Positives = 25/54 (46%), Gaps = 2/54 (3%) Query: 41 EVLEAHKLAKEYVEQANQRVKEAEEQSNARLLKGLGMDDLVR-YFMNLDSQNQA 93 EVL L E + R + Q + + GLG+ D +R Y +LD++ A Sbjct: 66 EVLYLEDLLAETLANPEAREWFLDRQISEARV-GLGLADELRAYLESLDNRELA 118 >gnl|CDD|178963 PRK00290, dnaK, molecular chaperone DnaK; Provisional. Length = 627 Score = 27.4 bits (62), Expect = 0.86 Identities = 21/47 (44%), Positives = 24/47 (51%), Gaps = 11/47 (23%) Query: 16 ITIIGCSLVEDNRIESLRMVKEAKMEVLEAH----KLAKEYVEQANQ 58 ITI S + D IE RMVK+A EA+ K KE VE NQ Sbjct: 492 ITITASSGLSDEEIE--RMVKDA-----EANAEEDKKRKELVEARNQ 531 >gnl|CDD|177896 PLN02254, PLN02254, gibberellin 3-beta-dioxygenase. Length = 358 Score = 27.4 bits (61), Expect = 0.87 Identities = 11/36 (30%), Positives = 20/36 (55%), Gaps = 1/36 (2%) Query: 77 MDDLVRYFMNLDSQNQAFFIDTIQNKYQDEMEELSE 112 M + + L Q+ F D ++ +YQ EM++L+E Sbjct: 145 MGSPLEHARQLWPQDHTKFCDVME-EYQKEMKKLAE 179 >gnl|CDD|183692 PRK12704, PRK12704, phosphodiesterase; Provisional. Length = 520 Score = 27.4 bits (62), Expect = 1.0 Identities = 16/40 (40%), Positives = 23/40 (57%), Gaps = 1/40 (2%) Query: 22 SLVEDNRIESLRMVKEAKMEV-LEAHKLAKEYVEQANQRV 60 + E+ R E+ ++KE + E EA K AKE + QA QR Sbjct: 162 KVEEEARHEAAVLIKEIEEEAKEEADKKAKEILAQAIQRC 201 >gnl|CDD|169386 PRK08334, PRK08334, translation initiation factor IF-2B subunit beta; Validated. Length = 356 Score = 26.9 bits (59), Expect = 1.3 Identities = 16/39 (41%), Positives = 24/39 (61%), Gaps = 3/39 (7%) Query: 22 SLVEDNRIESLRMVKEAKMEVLEAHKLAKEYVEQANQRV 60 LVE++ + L +K ++ V EA K+A E VE AN R+ Sbjct: 115 KLVEEHLEDPLDEIK--RLIVEEAQKIADEDVE-ANLRM 150 >gnl|CDD|180492 PRK06256, PRK06256, biotin synthase; Validated. Length = 336 Score = 26.7 bits (60), Expect = 1.4 Identities = 7/17 (41%), Positives = 12/17 (70%) Query: 26 DNRIESLRMVKEAKMEV 42 ++RI++ MVK A +E Sbjct: 187 EDRIDTCEMVKAAGIEP 203 >gnl|CDD|131951 TIGR02905, spore_yutH, spore coat protein YutH. Members of this family are homologs of the Bacillus subtilis spore coat protein CotS. Members of this family, designated YutH, are found only in the family Bacillaceae from among the endospore-forming members of the Firmicutes branch of the Bacteria. Length = 313 Score = 26.3 bits (58), Expect = 2.0 Identities = 11/44 (25%), Positives = 20/44 (45%), Gaps = 2/44 (4%) Query: 74 GLGMDDLVRY--FMNLDSQNQAFFIDTIQNKYQDEMEELSEFGE 115 GL D+ R + + NQA+ I + ++E+ EL + Sbjct: 9 GLYCDERFRVGRYEAFRANNQAYLIVPVDGLEEEELAELMAMSD 52 >gnl|CDD|114591 pfam05873, Mt_ATP-synt_D, ATP synthase D chain, mitochondrial (ATP5H). This family consists of several ATP synthase D chain, mitochondrial (ATP5H) proteins. Subunit d has no extensive hydrophobic sequences, and is not apparently related to any subunit described in the simpler ATP synthases in bacteria and chloroplasts. Length = 160 Score = 26.0 bits (57), Expect = 2.4 Identities = 12/56 (21%), Positives = 21/56 (37%), Gaps = 4/56 (7%) Query: 33 RMVKEAKMEVLEAHKLAKEYVEQANQRVKEAEEQ----SNARLLKGLGMDDLVRYF 84 + E E K EY ++ R++E E++ N + M+D F Sbjct: 84 KYTALVDAEEKEDVKRIAEYKTKSADRIQEYEKELEKWKNMIPYDQMTMEDFNEAF 139 >gnl|CDD|181845 PRK09425, prpD, 2-methylcitrate dehydratase; Provisional. Length = 480 Score = 26.0 bits (58), Expect = 2.6 Identities = 10/29 (34%), Positives = 18/29 (62%), Gaps = 4/29 (13%) Query: 26 DNRIESLRMVKEAKMEVLEAHKLAKEYVE 54 D RI++LR KM V+E + ++Y++ Sbjct: 370 DPRIDALR----EKMVVVEDPQFTRDYLD 394 >gnl|CDD|130736 TIGR01675, plant-AP, plant acid phosphatase. This model explicitly excludes the VSPs which lack the nucleophilc aspartate. The possibility exists, however, that some members of this family may, while containing all of the conserved HAD-superfamily catalytic residues, lack activity and have a function related to the function of the VSPs rather than the acid phosphatases. Length = 229 Score = 25.9 bits (57), Expect = 3.0 Identities = 16/55 (29%), Positives = 24/55 (43%) Query: 24 VEDNRIESLRMVKEAKMEVLEAHKLAKEYVEQANQRVKEAEEQSNARLLKGLGMD 78 VE N I V + +E + +K+Y + V EA + + L G GMD Sbjct: 24 VETNNIRDWDTVPAECKDYVEDYMTSKQYKRDVKRVVDEAYFYAKSLALSGDGMD 78 >gnl|CDD|162330 TIGR01381, E1_like_apg7, E1-like protein-activating enzyme Gsa7p/Apg7p. This model represents a family of eukaryotic proteins found in animals, plants, and yeasts, including Apg7p (YHR171W) from Saccharomyces cerevisiae and GSA7 from Pichia pastoris. Members are about 650 to 700 residues in length and include a central domain of about 150 residues shared with the ThiF/MoeB/HesA family of proteins. A low level of similarity to ubiquitin-activating enzyme E1 is described in a paper on peroxisome autophagy mediated by GSA7, and is the basis of the name ubiquitin activating enzyme E1-like protein. Members of the family appear to be involved in protein lipidation events analogous to ubiquitination and required for membrane fusion events during autophagy. Length = 664 Score = 25.7 bits (56), Expect = 3.3 Identities = 9/24 (37%), Positives = 13/24 (54%), Gaps = 3/24 (12%) Query: 90 QNQAFFIDTIQNKYQDEMEELSEF 113 N FF+ + Q+ +M ELSE Sbjct: 181 NNFPFFLYSKQS---SKMLELSEL 201 >gnl|CDD|183190 PRK11548, PRK11548, outer membrane biogenesis protein BamE; Provisional. Length = 113 Score = 25.5 bits (56), Expect = 3.9 Identities = 12/25 (48%), Positives = 15/25 (60%), Gaps = 1/25 (4%) Query: 1 MRAKTLLASTLVTTAITIIGCSLVE 25 MR KTL A+ V +T GCS +E Sbjct: 1 MRCKTLTAAAAVLLMLT-AGCSTLE 24 >gnl|CDD|162191 TIGR01073, pcrA, ATP-dependent DNA helicase PcrA. Designed to identify pcrA members of the uvrD/rep subfamily. Length = 726 Score = 25.1 bits (55), Expect = 4.3 Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 3/73 (4%) Query: 34 MVKEAKMEVLEAHKLAKEYVEQANQRVKEAEEQSNARLLKG--LGMDDLVRYFMNLDSQN 91 + AK E+L AKE + V E ++ RLL+ L DDL+ +NL + Sbjct: 140 TISNAKNELLPPEDFAKEATNYFEKVVAEVYQEYQKRLLRNNALDFDDLIMTTINLFQRV 199 Query: 92 QAFFIDTIQNKYQ 104 ++ Q K+Q Sbjct: 200 PD-VLEYYQRKFQ 211 >gnl|CDD|132362 TIGR03319, YmdA_YtgF, conserved hypothetical protein YmdA/YtgF. Length = 514 Score = 25.3 bits (56), Expect = 4.4 Identities = 17/46 (36%), Positives = 26/46 (56%), Gaps = 1/46 (2%) Query: 15 AITIIGCSLVEDNRIESLRMVKEAKMEV-LEAHKLAKEYVEQANQR 59 A I+ + E+ R E+ +++KE + E EA K AKE + A QR Sbjct: 149 AKEILLEEVEEEARHEAAKLIKEIEEEAKEEADKKAKEILATAIQR 194 >gnl|CDD|115046 pfam06364, DUF1068, Protein of unknown function (DUF1068). This family consists of several hypothetical plant proteins from Arabidopsis thaliana and Oryza sativa. The function of this family is unknown. Length = 176 Score = 25.0 bits (54), Expect = 4.6 Identities = 15/50 (30%), Positives = 26/50 (52%), Gaps = 6/50 (12%) Query: 30 ESLRMVKEAKMEVLEAHKLAKEYVEQANQ------RVKEAEEQSNARLLK 73 ES K A + +LEA KL Y ++A++ +EA E++ L++ Sbjct: 100 ESTEKHKRADVGLLEAKKLTSSYQKEADKCNSGMETCEEAREKAEEALVE 149 >gnl|CDD|169691 PRK09173, PRK09173, F0F1 ATP synthase subunit B; Validated. Length = 159 Score = 24.7 bits (54), Expect = 5.4 Identities = 11/34 (32%), Positives = 19/34 (55%), Gaps = 4/34 (11%) Query: 36 KEAKMEVLEAHKLAKEYVEQAN----QRVKEAEE 65 +EA+ EA + +EYV + N Q++ +AE Sbjct: 79 REAEALTAEAKRKTEEYVARRNKLAEQKIAQAET 112 >gnl|CDD|180251 PRK05776, PRK05776, DNA topoisomerase I; Provisional. Length = 670 Score = 25.0 bits (55), Expect = 5.7 Identities = 15/56 (26%), Positives = 28/56 (50%), Gaps = 3/56 (5%) Query: 27 NRIESLRMVKEAKMEVLEAHKLAKEYVEQANQRVKEAEEQSNARLLKGLGMDDLVR 82 ++E +R K + EV+E AKE + + + K+ +++ L K LG+ V Sbjct: 545 EKLEMIRTGKATREEVIEE---AKETLNKLLEEFKKNKDEIGEELAKALGLIKPVG 597 >gnl|CDD|130575 TIGR01511, ATPase-IB1_Cu, copper-(or silver)-translocating P-type ATPase. One member from Halobacterium is annotated as "molybdenum-binding protein" although no evidence can be found for this classification. Length = 562 Score = 24.2 bits (53), Expect = 8.6 Identities = 19/72 (26%), Positives = 31/72 (43%), Gaps = 6/72 (8%) Query: 5 TLLASTLVTTAITIIGCSLVEDNRIESLRMVKEAKMEVLEAHKLAKEYVEQANQRVKEAE 64 TL T + + G + +R E L + A +E H LAK V A ++ Sbjct: 287 TLTQGKPTVTDVHVFG----DRDRTELLALA--AALEAGSEHPLAKAIVSYAKEKGITLV 340 Query: 65 EQSNARLLKGLG 76 E S+ + + G+G Sbjct: 341 EVSDFKAIPGIG 352 >gnl|CDD|181923 PRK09513, fruK, 1-phosphofructokinase; Provisional. Length = 312 Score = 24.3 bits (53), Expect = 8.8 Identities = 15/39 (38%), Positives = 24/39 (61%), Gaps = 3/39 (7%) Query: 69 ARLLKGLGMDDLVRYFMNLDSQN--QAFFIDT-IQNKYQ 104 A++LK LG+D V F+ D+Q+ Q F + I N++Q Sbjct: 46 AKVLKDLGIDVTVGGFLGKDNQDGFQQLFSELGIANRFQ 84 >gnl|CDD|179583 PRK03482, PRK03482, phosphoglycerate mutase; Provisional. Length = 215 Score = 24.3 bits (53), Expect = 9.0 Identities = 25/90 (27%), Positives = 38/90 (42%), Gaps = 17/90 (18%) Query: 2 RAKTL-----LASTLVTTAIT--II----GCSLVEDNRIESLRMVKEAKMEVLEAHKLAK 50 RAK L ++S L T T II GC ++ D R+ +E M VLE + Sbjct: 42 RAKELGITHIISSDLGRTRRTAEIIAQACGCDIIFDPRL------RELNMGVLEKRHIDS 95 Query: 51 EYVEQANQRVKEAEEQSNARLLKGLGMDDL 80 E+ R + + R+ +G M +L Sbjct: 96 LTEEEEGWRRQLVNGTVDGRIPEGESMQEL 125 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.314 0.129 0.334 Gapped Lambda K H 0.267 0.0710 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 1,781,557 Number of extensions: 101228 Number of successful extensions: 435 Number of sequences better than 10.0: 1 Number of HSP's gapped: 432 Number of HSP's successfully gapped: 83 Length of query: 120 Length of database: 5,994,473 Length adjustment: 81 Effective length of query: 39 Effective length of database: 4,244,225 Effective search space: 165524775 Effective search space used: 165524775 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 51 (23.5 bits)