RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254781034|ref|YP_003065447.1| hypothetical protein CLIBASIA_04680 [Candidatus Liberibacter asiaticus str. psy62] (344 letters) >gnl|CDD|150410 pfam09731, Mitofilin, Mitochondrial inner membrane protein. Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains. Length = 561 Score = 43.2 bits (102), Expect = 1e-04 Identities = 27/81 (33%), Positives = 44/81 (54%), Gaps = 3/81 (3%) Query: 224 IPTTIEILAKFPKVSEEMVFASESLEKDSGFANYLLFQLTRLVKVRPIGGNI--EGDAIT 281 I + ++ +F ++ E+ AS L +++G +LL L + +P G +GD + Sbjct: 445 ILSEEQLRNRFNLLAPELRKAS-LLPENAGLLGHLLSYLFSKLLFKPKQGEADPDGDDVE 503 Query: 282 DVIARIENNLKTGDLVKAAAE 302 V+AR E NL+ GDL KAA E Sbjct: 504 SVLARAEYNLERGDLDKAARE 524 >gnl|CDD|116029 pfam07407, Seadorna_VP6, Seadornavirus VP6 protein. This family consists of several VP6 proteins from the Banna virus as well as a related protein VP5 from the Kadipiro virus. Members of this family are typically of around 420 residues in length. The function of this family is unknown. Length = 420 Score = 31.2 bits (70), Expect = 0.37 Identities = 18/56 (32%), Positives = 30/56 (53%), Gaps = 7/56 (12%) Query: 276 EGDAITDVIARIENNLKTGDLVKAAAEWD-------KIPEKARQPSMFLRNALEAH 324 E +A+ I R+E++ T D+V+ D KI +K R+P +F R+ +E H Sbjct: 47 ENEALKTKIHRLESDWTTSDIVEKIELMDAQFDRIGKIMDKMREPMLFKRDDIELH 102 >gnl|CDD|178000 PLN02372, PLN02372, violaxanthin de-epoxidase. Length = 455 Score = 31.0 bits (70), Expect = 0.43 Identities = 26/97 (26%), Positives = 43/97 (44%), Gaps = 8/97 (8%) Query: 73 KFLRTDNNLLLLPSVSPLKEDPKDISPVIEKEIISQNLSIAQQKDEETADKELANTQNFN 132 F+RTDN P PL E + EK I+ + I ++ ++E F Sbjct: 348 DFVRTDNTCGPEP---PLLERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFK 404 Query: 133 IKPLLEEIASLKQ----LISDLSKNYQDIVTRLTKME 165 L E + L+Q + +LSK ++++ +L KME Sbjct: 405 RVALEEGLKELEQDEENFLKELSKEEKELLEKL-KME 440 >gnl|CDD|178699 PLN03154, PLN03154, putative allyl alcohol dehydrogenase; Provisional. Length = 348 Score = 28.3 bits (63), Expect = 3.4 Identities = 11/34 (32%), Positives = 18/34 (52%) Query: 272 GGNIEGDAITDVIARIENNLKTGDLVKAAAEWDK 305 G IEG ++ V+ + N K GDL+ W++ Sbjct: 77 GQRIEGFGVSKVVDSDDPNFKPGDLISGITGWEE 110 >gnl|CDD|163414 TIGR03702, lip_kinase_YegS, lipid kinase YegS. Members of this protein family are designated YegS, an apparent lipid kinase family in the Proteobacteria. Bakali, et al. report phosphatidylglycerol kinase activity for the member from Escherichia coli, but refrain from calling that activity synonymous with its biological role. Note that a broader, subfamily-type model (TIGR00147), includes this family but also multiple paralogs in some species and varied functions. Length = 293 Score = 28.2 bits (63), Expect = 3.5 Identities = 12/33 (36%), Positives = 16/33 (48%) Query: 245 SESLEKDSGFANYLLFQLTRLVKVRPIGGNIEG 277 SE L+K G A YL+ LTR ++ G Sbjct: 147 SEKLKKALGGAAYLITGLTRFSELTAASCEFRG 179 >gnl|CDD|184822 PRK14791, PRK14791, lipoprotein signal peptidase; Provisional. Length = 146 Score = 28.0 bits (63), Expect = 3.7 Identities = 13/35 (37%), Positives = 17/35 (48%) Query: 42 ILSNKTFFKILALVCVIVLTFIFIFTALFTEKFLR 76 IL NKT F IL + VI+ +I T K + Sbjct: 51 ILQNKTLFLILITIIVILGLIYYIIKYKPTSKLYK 85 >gnl|CDD|185619 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein; Provisional. Length = 2722 Score = 27.9 bits (62), Expect = 4.4 Identities = 16/62 (25%), Positives = 26/62 (41%) Query: 124 ELANTQNFNIKPLLEEIASLKQLISDLSKNYQDIVTRLTKMETLTANPLRNPNTQRMVSL 183 E+ + I L + I LK LIS + Y I++ M++ + N Q V Sbjct: 443 EIKKKYDEKINELKKSINQLKTLISIMKSFYDLIISEKDSMDSKEKKESSDSNYQEKVDE 502 Query: 184 LI 185 L+ Sbjct: 503 LL 504 >gnl|CDD|149044 pfam07760, DUF1616, Protein of unknown function (DUF1616). This is a family of sequences from hypothetical archaeal proteins. The region in question is approximately 330 amino acid residues long. Length = 278 Score = 27.7 bits (62), Expect = 4.5 Identities = 22/72 (30%), Positives = 32/72 (44%), Gaps = 13/72 (18%) Query: 51 ILALVCVIVLTFIFIFTALFTEKFLRTDNNLLLLPSV---------SPLKEDPKDISPVI 101 +L ++ +++LT I I E LR LL LP V + L D+ I Sbjct: 2 LLLIILLVLLTIIAILVPPLNESPLRV---LLGLPLVLFIPGYALIAALFPARDDLDG-I 57 Query: 102 EKEIISQNLSIA 113 E+ +S LSIA Sbjct: 58 ERAALSVGLSIA 69 >gnl|CDD|152155 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein. Genome duplication is precisely regulated by cyclin-dependent kinases CDKs, which bring about the onset of S phase by activating replication origins and then prevent relicensing of origins until mitosis is completed. The optimum sequence motif for CDK phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found to have at least 11 potential phosphorylation sites. Drc1 is required for DNA synthesis and S-M replication checkpoint control. Drc1 associates with Cdc2 and is phosphorylated at the onset of S phase when Cdc2 is activated. Thus Cdc2 promotes DNA replication by phosphorylating Drc1 and regulating its association with Cut5. Sld2 and Sld3 represent the minimal set of S-CDK substrates required for DNA replication. Length = 402 Score = 27.8 bits (62), Expect = 4.7 Identities = 25/116 (21%), Positives = 43/116 (37%), Gaps = 14/116 (12%) Query: 71 TEKFLRTDNNLLLLPSVSPLKEDPKDISPVIEKEIISQNLSIAQQKDEETADKELANTQN 130 T FLR ++ S + + SP+ + + + LS Q +EE+ D EL + Sbjct: 206 TPSFLRRNS-----ASPANEPDFSVSPSPLRPQRPVGKGLSELVQMEEESIDDELDVLRE 260 Query: 131 FNIKPLLEEIASLKQLISDLSKNYQDIV---------TRLTKMETLTANPLRNPNT 177 + E + + S + + V TR KM + A P P+ Sbjct: 261 IEAEEAESEPIEEEVVDSQAANDEPRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSL 316 >gnl|CDD|180067 PRK05420, PRK05420, aquaporin Z; Provisional. Length = 231 Score = 27.2 bits (61), Expect = 5.6 Identities = 9/17 (52%), Positives = 11/17 (64%) Query: 53 ALVCVIVLTFIFIFTAL 69 ALVC +VLT F+ L Sbjct: 134 ALVCEVVLTAFFLLVIL 150 >gnl|CDD|180919 PRK07282, PRK07282, acetolactate synthase catalytic subunit; Reviewed. Length = 566 Score = 27.1 bits (60), Expect = 6.1 Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 3/32 (9%) Query: 313 PSMFLRNALEAHICSDAILKEEMAKIPQTDLP 344 P F +NA AHI D E+ KI +TD+P Sbjct: 295 PKTFAKNAKVAHIDIDP---AEIGKIIKTDIP 323 >gnl|CDD|129982 TIGR00904, mreB, cell shape determining protein, MreB/Mrl family. A close homolog is found in the Archaeon Methanobacterium thermoautotrophicum, and a more distant homolog in Archaeoglobus fulgidus. The family is related to cell division protein FtsA and heat shock protein DnaK. Length = 333 Score = 27.0 bits (60), Expect = 6.6 Identities = 9/22 (40%), Positives = 16/22 (72%) Query: 36 KFFWEKILSNKTFFKILALVCV 57 K+F +++ S K+FFK ++CV Sbjct: 84 KYFIKQVHSRKSFFKPRIVICV 105 >gnl|CDD|181068 PRK07656, PRK07656, long-chain-fatty-acid--CoA ligase; Validated. Length = 513 Score = 27.2 bits (61), Expect = 6.8 Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 9/40 (22%) Query: 206 FSVLKPCT----ATLMQF-----ANIKIPTTIEILAKFPK 236 + VLKP L+ + A K+P +IE L + PK Sbjct: 460 YVVLKPGAELTEEELIAYCREHLAKYKVPRSIEFLDELPK 499 >gnl|CDD|180125 PRK05550, PRK05550, bifunctional methionine sulfoxide reductase B/A protein; Provisional. Length = 283 Score = 26.8 bits (60), Expect = 8.4 Identities = 7/29 (24%), Positives = 15/29 (51%) Query: 138 EEIASLKQLISDLSKNYQDIVTRLTKMET 166 E+ ++LI++L+K +VT + Sbjct: 229 EQKQIAEKLIAELTKKGYPVVTEVEAAGP 257 >gnl|CDD|152698 pfam12263, DUF3611, Protein of unknown function (DUF3611). This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 180 and 205 amino acids in length. There are two completely conserved residues (W and G) that may be functionally important. Length = 183 Score = 26.9 bits (60), Expect = 8.5 Identities = 11/35 (31%), Positives = 15/35 (42%) Query: 54 LVCVIVLTFIFIFTALFTEKFLRTDNNLLLLPSVS 88 LV +V I +F LF+ D NL L + Sbjct: 26 LVLGVVSALILLFAVLFSGLVRSPDRNLGLGIGLF 60 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.316 0.132 0.364 Gapped Lambda K H 0.267 0.0677 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 5,400,018 Number of extensions: 344646 Number of successful extensions: 858 Number of sequences better than 10.0: 1 Number of HSP's gapped: 858 Number of HSP's successfully gapped: 53 Length of query: 344 Length of database: 5,994,473 Length adjustment: 94 Effective length of query: 250 Effective length of database: 3,963,321 Effective search space: 990830250 Effective search space used: 990830250 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 57 (25.8 bits)