RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780427|ref|YP_003064840.1| hypothetical protein CLIBASIA_01560 [Candidatus Liberibacter asiaticus str. psy62] (171 letters) >gnl|CDD|145339 pfam02130, UPF0054, Uncharacterized protein family UPF0054. Length = 141 Score = 136 bits (345), Expect = 3e-33 Identities = 55/110 (50%), Positives = 66/110 (60%), Gaps = 1/110 (0%) Query: 55 VELSLVFTDSHRIETLNFEYRGIDKPTNVLSFPTAFASSDGCSSLM-LGDIVLAYEIIEI 113 ELS++F D I LN EYRG DKPT+VLSFP L+ LGDIV++ E Sbjct: 32 FELSILFVDDEEIRELNREYRGKDKPTDVLSFPMDEPEEGDEPLLLLLGDIVISPETAAR 91 Query: 114 EANVLGKEFENHLVHLIIHGFLHLLGYDHVDDKDACVMEGLERSILEDLG 163 EA G E L HL++HG LHLLGYDH +D++A ME LE IL LG Sbjct: 92 EAKEYGHSLERELAHLLVHGLLHLLGYDHEEDEEAEEMEALEEEILAKLG 141 >gnl|CDD|30667 COG0319, COG0319, Predicted metal-dependent hydrolase [General function prediction only]. Length = 153 Score = 130 bits (328), Expect = 2e-31 Identities = 63/158 (39%), Positives = 82/158 (51%), Gaps = 8/158 (5%) Query: 11 LDLQIAVENALWGDEIHLRTLCEVVFAKAVSNLISKGYFVKENIVELSLVFTDSHRIETL 70 L + I V + R F + ++ + K + ELS+V D I L Sbjct: 1 LMMDIEVSDEDGQV----RLPELKQFEEWLAAALKPE---KISTAELSVVLVDDAEIREL 53 Query: 71 NFEYRGIDKPTNVLSFPTAFASSDGCSSLMLGDIVLAYEIIEIEANVLGKEFENHLVHLI 130 N EYRG DKPT+VLSFP + +LGDIV+ E+ E +A G E L HL Sbjct: 54 NLEYRGKDKPTDVLSFP-YNEPGEAGEPPLLGDIVICPEVAEEQAKEQGHSLERELAHLT 112 Query: 131 IHGFLHLLGYDHVDDKDACVMEGLERSILEDLGINDPY 168 IHG LHLLGYDH +D++ ME LE IL LG++DPY Sbjct: 113 IHGILHLLGYDHEEDEEEEEMEELEEEILAALGLSDPY 150 >gnl|CDD|32732 COG2908, COG2908, Uncharacterized protein conserved in bacteria [Function unknown]. Length = 237 Score = 32.1 bits (73), Expect = 0.086 Identities = 14/56 (25%), Positives = 22/56 (39%), Gaps = 2/56 (3%) Query: 107 AYEIIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHVDDKDACVME--GLERSILE 160 A I+++ + E H V +IHG H ++ + E SILE Sbjct: 168 AVNIMDVNPAAVADEARRHGVDGVIHGHTHRPAIHNIPGITYINLGDWVSEGSILE 223 >gnl|CDD|58577 cd04277, ZnMc_serralysin_like, Zinc-dependent metalloprotease, serralysin_like subfamily. Serralysins and related proteases are important virulence factors in pathogenic bacteria. They may be secreted into the medium via a mechanism found in gram-negative bacteria, that does not require n-terminal signal sequences which are cleaved after the transmembrane translocation. A calcium-binding domain c-terminal to the metalloprotease domain, which contains multiple tandem repeats of a nine-residue motif including the pattern GGxGxD, and which forms a parallel beta roll may be involved in the translocation mechanism and/or substrate binding. Serralysin family members may have a broad spectrum of substrates each, including host immunoglobulins, complement proteins, cell matrix and cytoskeletal proteins, as well as antimicrobial peptides.. Length = 186 Score = 28.1 bits (62), Expect = 1.4 Identities = 17/97 (17%), Positives = 32/97 (32%), Gaps = 8/97 (8%) Query: 52 ENIVELSLVFTDSHRIETLNFEYRGIDKPTNVLSFPTAFASSDGCSSLMLGDIVLAYEII 111 E++ ++ V + + F G + + A+ G + GDI Sbjct: 47 EDVADIDFVEVSDNSGADIRF---GNSSDPDGNTAGYAYYPGSGSGTAYGGDIWFNSS-- 101 Query: 112 EIEANVLGKEFENHLVHLIIHGFLHLLGYDHVDDKDA 148 + ++ IIH H LG +H D + Sbjct: 102 ---YDTNSDSPGSYGYQTIIHEIGHALGLEHPGDYNG 135 >gnl|CDD|39400 KOG4199, KOG4199, KOG4199, Uncharacterized conserved protein [Function unknown]. Length = 461 Score = 26.6 bits (58), Expect = 3.6 Identities = 15/68 (22%), Positives = 30/68 (44%), Gaps = 3/68 (4%) Query: 40 VSNLISKGYF---VKENIVELSLVFTDSHRIETLNFEYRGIDKPTNVLSFPTAFASSDGC 96 ++ +IS+ F VKEN+VE S+ +++ FE +GI+ + + Sbjct: 1 MAKVISQDTFDDVVKENVVEFSMTPSEAKEETIKQFEAQGINLANIIKDLSVNPQTGQPV 60 Query: 97 SSLMLGDI 104 + + I Sbjct: 61 INETVDKI 68 >gnl|CDD|185695 cd01081, Aldose_epim, aldose 1-epimerase superfamily. Aldose 1-epimerases or mutarotases are key enzymes of carbohydrate metabolism; they catalyze the interconversion of the alpha- and beta-anomers of hexose sugars such as glucose and galactose. This interconversion is an important step that allows anomer specific metabolic conversion of sugars. Studies of the catalytic mechanism of the best known member of the family, galactose mutarotase, have shown a glutamate and a histidine residue to be critical for catalysis; the glutamate serves as the active site base to initiate the reaction by removing the proton from the C-1 hydroxyl group of the sugar substrate and the histidine as the active site acid to protonate the C-5 ring oxygen. Length = 284 Score = 26.3 bits (58), Expect = 4.6 Identities = 6/21 (28%), Positives = 9/21 (42%) Query: 131 IHGFLHLLGYDHVDDKDACVM 151 IHGF+ L + V + Sbjct: 75 IHGFVRNLPWRVVATDEEEAS 95 >gnl|CDD|37227 KOG2016, KOG2016, KOG2016, NEDD8-activating complex, APP-BP1/UBA5 component [Posttranslational modification, protein turnover, chaperones]. Length = 523 Score = 26.1 bits (57), Expect = 4.7 Identities = 14/66 (21%), Positives = 30/66 (45%) Query: 103 DIVLAYEIIEIEANVLGKEFENHLVHLIIHGFLHLLGYDHVDDKDACVMEGLERSILEDL 162 +V+A + E L + V L++ L G + K+ ++E + L+DL Sbjct: 121 TVVIATNLNEQTLLKLAEILREANVPLLLTRSYGLAGTIRISIKEHTIIESHPDNPLDDL 180 Query: 163 GINDPY 168 +++P+ Sbjct: 181 RLDNPW 186 >gnl|CDD|39858 KOG4659, KOG4659, KOG4659, Uncharacterized conserved protein (Rhs family) [Function unknown]. Length = 1899 Score = 25.7 bits (56), Expect = 6.0 Identities = 15/81 (18%), Positives = 24/81 (29%), Gaps = 3/81 (3%) Query: 4 KKVNLPRLDLQIAVE---NALWGDEIHLRTLCEVVFAKAVSNLISKGYFVKENIVELSLV 60 LPR +L E A G++ VF + + V + + Sbjct: 930 ANGTLPRENLANRWEWRHVAQRGEQSKGSLGRVTVFGRKLRVHGRNLLSVDFDRDTRTEK 989 Query: 61 FTDSHRIETLNFEYRGIDKPT 81 D R L +Y +P Sbjct: 990 IYDDDRQFLLRIDYDQSGRPI 1010 >gnl|CDD|38059 KOG2848, KOG2848, KOG2848, 1-acyl-sn-glycerol-3-phosphate acyltransferase [Lipid transport and metabolism]. Length = 276 Score = 25.6 bits (56), Expect = 7.2 Identities = 19/76 (25%), Positives = 30/76 (39%), Gaps = 10/76 (13%) Query: 99 LMLGDIVLAYEIIEIEANVL--GKEFENHLV-HLIIHGFLHLLGYD-------HVDDKDA 148 G I Y +I +L G+ ENH + L H +LLG ++ Sbjct: 31 AFYGAICSFYGVIASPVCLLRGGRSVENHFIAKLWFHSMKYLLGLRFEVRGEENLPKSKP 90 Query: 149 CVMEGLERSILEDLGI 164 V+ +S L+ LG+ Sbjct: 91 AVIVSNHQSSLDILGM 106 >gnl|CDD|146601 pfam04054, Not1, CCR4-Not complex component, Not1. The Ccr4-Not complex is a global regulator of transcription that affects genes positively and negatively and is thought to regulate transcription factor TFIID. Length = 375 Score = 25.3 bits (56), Expect = 8.5 Identities = 12/35 (34%), Positives = 19/35 (54%) Query: 134 FLHLLGYDHVDDKDACVMEGLERSILEDLGINDPY 168 LHL G D + + + E + R +LE L +N P+ Sbjct: 300 ILHLFGSDPWESQKEDIQEQITRVLLERLIVNKPH 334 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.321 0.141 0.408 Gapped Lambda K H 0.267 0.0764 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 2,041,711 Number of extensions: 103158 Number of successful extensions: 207 Number of sequences better than 10.0: 1 Number of HSP's gapped: 205 Number of HSP's successfully gapped: 16 Length of query: 171 Length of database: 6,263,737 Length adjustment: 87 Effective length of query: 84 Effective length of database: 4,383,754 Effective search space: 368235336 Effective search space used: 368235336 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 54 (24.5 bits)