RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780459|ref|YP_003064872.1| hypothetical protein CLIBASIA_01720 [Candidatus Liberibacter asiaticus str. psy62] (129 letters) >gnl|CDD|150271 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid). Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown. Length = 104 Score = 62.7 bits (153), Expect = 3e-11 Identities = 37/108 (34%), Positives = 55/108 (50%), Gaps = 5/108 (4%) Query: 1 MAKPELGTKRTCPDTGKRFYDLNKQVIVSPYTQNSWPLAYFEAPTSPEKEKEICSLEEET 60 MAKPE GTKRTCP GKRFYDLNK IV P P ++ ++ +EE Sbjct: 1 MAKPEWGTKRTCPTCGKRFYDLNKDPIVCPKCGEEVPPEVAKSRAPAADAEDAAKKDEEE 60 Query: 61 EVKPFVKHTSEEEEESPISKEEDDTSLDMEYVQEMDLDDDDDFLEQND 108 E + V +++++ ++DD + ++D DDDD + +D Sbjct: 61 EDEDDVVLDDDDDDD-----DDDDLPDLDDDDVDLDDDDDDFLEDDDD 103 >gnl|CDD|131353 TIGR02300, FYDLN_acid, conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown. Length = 129 Score = 51.9 bits (124), Expect = 4e-08 Identities = 39/123 (31%), Positives = 65/123 (52%), Gaps = 3/123 (2%) Query: 1 MAKPELGTKRTCPDTGKRFYDLNKQVIVSPYTQNSWPLAYFEAPT---SPEKEKEICSLE 57 MAKP+LGTKR CP+TG +FYDLN++ VSPYT +P + ++++ + E Sbjct: 1 MAKPDLGTKRICPNTGSKFYDLNRRPAVSPYTGEQFPPEEALKSRRVRADDEKEAVAVKE 60 Query: 58 EETEVKPFVKHTSEEEEESPISKEEDDTSLDMEYVQEMDLDDDDDFLEQNDTDTDTDVVD 117 + E+ + S EE + + + + ++ D+ DDDD + + + D D+ D Sbjct: 61 VDPEIDEAAELVSLEEADEEVDPGAPTPAGGDDLAEDEDIADDDDDVTFLEDEEDDDIDD 120 Query: 118 MDI 120 DI Sbjct: 121 EDI 123 >gnl|CDD|116469 pfam07857, DUF1632, CEO family (DUF1632). These sequences are found in hypothetical eukaryotic proteins of unknown function. The region concerned is approximately 280 residues long. This family has been termed the CEO family for C. elegans ORF. Length = 254 Score = 28.2 bits (63), Expect = 0.58 Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 8/48 (16%) Query: 65 FVKHTSEEEEESPISKEEDDTSLDMEYVQEMDLDDDDDFLEQNDTDTD 112 F+K EE+ K D TSL++E+ E+ + D D+D D Sbjct: 134 FIKS-----EETEPEKRYDRTSLELEHQDEI---EQDLSNATEDSDFD 173 >gnl|CDD|151022 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is a family of uncharacterized proteins. Length = 447 Score = 28.4 bits (63), Expect = 0.61 Identities = 22/71 (30%), Positives = 32/71 (45%), Gaps = 9/71 (12%) Query: 49 KEKEICSLEEETEVKPFVKHTSEEEEESPISKEEDDTSLDMEYVQEMDLDDDDDFLEQND 108 KE I L +E E EE E E+DD D + + D DDD+D +++D Sbjct: 34 KENAIRKLGKEAE---------EEAMEEEDDDEDDDDDDDDDDEDDDDDDDDEDDDDEDD 84 Query: 109 TDTDTDVVDMD 119 D D+ + D Sbjct: 85 DDDDSTLHDGS 95 >gnl|CDD|131585 TIGR02533, type_II_gspE, general secretory pathway protein E. This family describes GspE, the E protein of the type II secretion system, also called the main terminal branch of the general secretion pathway. This model separates GspE from the PilB protein of type IV pilin biosynthesis. Length = 486 Score = 27.3 bits (61), Expect = 1.1 Identities = 15/59 (25%), Positives = 27/59 (45%), Gaps = 6/59 (10%) Query: 69 TSEEEEESPISKEEDDTSLDM-EYVQEMDLDDDDDFLEQNDTDTDTDVVDMDISIPDDA 126 + E + I+ T + V+E++ +DD LE D D++D++ DDA Sbjct: 53 ATASEIDDAINSVYARTGSAAAQIVEEIEGEDDLSALEL-DEPKIEDLLDLE----DDA 106 >gnl|CDD|162051 TIGR00811, sit, silicon transporter. Marine diatoms such as Cylindrotheca fusiformis encode at least six silicon transport protein homologues which exhibit similar size and topology. One characterized member of the family (Sit1) functions in the energy-dependent uptake of either Silicic acid [Si(OH)4] or Silicate [Si(OH)3O-] by a Na+ symport mechanism. The system is found in marine diatoms which make their "glass houses" out of silicon. Length = 545 Score = 26.1 bits (57), Expect = 2.6 Identities = 17/44 (38%), Positives = 19/44 (43%), Gaps = 1/44 (2%) Query: 42 EAPTSPEKEKEICSLEEETEVKPFVKHTSEEEEESPISKEEDDT 85 S E EK EVK K +EEEEE+ KE DT Sbjct: 499 NNAISDE-EKTTFEAALAIEVKALDKLNAEEEEEATNKKEGKDT 541 >gnl|CDD|163126 TIGR03074, PQQ_membr_DH, membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases. Length = 764 Score = 26.1 bits (58), Expect = 2.9 Identities = 8/40 (20%), Positives = 15/40 (37%), Gaps = 2/40 (5%) Query: 13 PDTGKRF--YDLNKQVIVSPYTQNSWPLAYFEAPTSPEKE 50 TGK +D + Q ++Y++AP + Sbjct: 211 AATGKEKWKFDPKLKTEAGRQHQTCRGVSYYDAPAAAAGP 250 >gnl|CDD|180310 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional. Length = 509 Score = 25.7 bits (57), Expect = 3.4 Identities = 13/91 (14%), Positives = 39/91 (42%) Query: 39 AYFEAPTSPEKEKEICSLEEETEVKPFVKHTSEEEEESPISKEEDDTSLDMEYVQEMDLD 98 + S +K ++ +L+++ ++ + + ++DD D + + D + Sbjct: 95 KLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDE 154 Query: 99 DDDDFLEQNDTDTDTDVVDMDISIPDDADEN 129 DDD+ + +D D + + + +D++ Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKELEKLSDDD 185 Score = 24.2 bits (53), Expect = 9.2 Identities = 14/70 (20%), Positives = 29/70 (41%), Gaps = 3/70 (4%) Query: 48 EKEKEICSLEEETEVKPFVKHTSEEEEESPISKEEDDTSLDMEYVQEMDLDDDDDFLEQN 107 + + + +E+ + +E+EE +KE + S D ++V + +DD + L Q Sbjct: 144 DDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWD---EDDSEALRQA 200 Query: 108 DTDTDTDVVD 117 D Sbjct: 201 RKDAKLTATA 210 >gnl|CDD|183416 PRK12298, obgE, GTPase CgtA; Reviewed. Length = 390 Score = 25.2 bits (56), Expect = 5.1 Identities = 16/54 (29%), Positives = 28/54 (51%), Gaps = 3/54 (5%) Query: 65 FVKHTSEEEEESPISKEEDDTSLD---MEYVQEMDLDDDDDFLEQNDTDTDTDV 115 F++ EE E + E+ + D E ++E++ +DDDD+ + D D D V Sbjct: 331 FIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDDDEGV 384 >gnl|CDD|139048 PRK12538, PRK12538, RNA polymerase sigma factor; Provisional. Length = 233 Score = 24.8 bits (54), Expect = 6.1 Identities = 12/54 (22%), Positives = 21/54 (38%), Gaps = 1/54 (1%) Query: 60 TEVKPFVKHTSEEEEESPISKEEDDTSLDMEYVQEMDLDDDDDFLEQNDTDTDT 113 E P S E+P + D V + D+D++ L++ TD + Sbjct: 11 AEAAPATTAPSPIVAEAPDA-ASVSIVPDPPPVADAVFDEDEELLDRLATDDEA 63 >gnl|CDD|165590 PHA03346, PHA03346, US22 family homolog; Provisional. Length = 520 Score = 24.7 bits (54), Expect = 6.3 Identities = 9/28 (32%), Positives = 15/28 (53%) Query: 98 DDDDDFLEQNDTDTDTDVVDMDISIPDD 125 DDDD+ +++D D + D+D D Sbjct: 427 DDDDEGDDEDDDDWEDLGFDLDEDDVYD 454 >gnl|CDD|149439 pfam08375, Rpn3_C, Proteasome regulatory subunit C-terminal. This eukaryotic domain is found at the C-terminus of 26S proteasome regulatory subunits such as the non-ATPase Rpn3 subunit which is essential for proteasomal function. It occurs together with the PCI/PINT domain (pfam01399). Length = 68 Score = 24.6 bits (54), Expect = 7.3 Identities = 9/33 (27%), Positives = 17/33 (51%) Query: 70 SEEEEESPISKEEDDTSLDMEYVQEMDLDDDDD 102 E++EE + + L++E +E+ D DD Sbjct: 36 KEKKEEEKAEEARERDQLELELAKELSEGDLDD 68 >gnl|CDD|151809 pfam11369, DUF3160, Protein of unknown function (DUF3160). This family of proteins has no known function. Length = 637 Score = 24.7 bits (54), Expect = 7.9 Identities = 19/61 (31%), Positives = 25/61 (40%), Gaps = 12/61 (19%) Query: 38 LAYFEAPTS--PEKEKEICSLEEETEVKPFVKHTSEEE----------EESPISKEEDDT 85 +AYF S K ++I + EV FVK E E SPI K ++D Sbjct: 105 VAYFAVALSLLEPKPEQIKQKKYSFEVPAFVKSDVEAELALIEAHEGFAPSPIFKYKEDY 164 Query: 86 S 86 S Sbjct: 165 S 165 >gnl|CDD|178493 PLN02905, PLN02905, beta-amylase. Length = 702 Score = 24.6 bits (53), Expect = 8.1 Identities = 8/23 (34%), Positives = 15/23 (65%) Query: 79 SKEEDDTSLDMEYVQEMDLDDDD 101 S+E+D+ +DM+ +E D D + Sbjct: 12 SEEDDEEEMDMDVKEEDDGDRRN 34 >gnl|CDD|177782 PLN00188, PLN00188, enhanced disease resistance protein (EDR2); Provisional. Length = 719 Score = 24.4 bits (53), Expect = 9.9 Identities = 13/59 (22%), Positives = 27/59 (45%) Query: 56 LEEETEVKPFVKHTSEEEEESPISKEEDDTSLDMEYVQEMDLDDDDDFLEQNDTDTDTD 114 ++E+++ + E+E E DT+++ E ++DL L ++D D D Sbjct: 441 MDEDSDDDEEFQIPESEQEPETTKNETKDTAMEEEPQDKIDLSCFSGNLRRDDRDKARD 499 >gnl|CDD|150551 pfam09891, DUF2118, Uncharacterized protein conserved in archaea (DUF2118). This domain, found in various hypothetical archaeal proteins, has no known function. Length = 149 Score = 24.2 bits (53), Expect = 9.9 Identities = 18/67 (26%), Positives = 25/67 (37%), Gaps = 7/67 (10%) Query: 59 ETEVKPFVKHTSEEEEESPISKEEDDT-SLDMEYVQEMDLDDDDDFLEQNDTDTDTDVVD 117 E E + EE EE E+ + + V E+ D+ D LE T D Sbjct: 8 EGEESENFEVALEEGEEVIKFLEKVEEYFGYGKVVYEVIYDEIVDLLENKVT------RD 61 Query: 118 MDISIPD 124 I +PD Sbjct: 62 FIIVLPD 68 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.305 0.128 0.361 Gapped Lambda K H 0.267 0.0730 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 2,150,204 Number of extensions: 125852 Number of successful extensions: 393 Number of sequences better than 10.0: 1 Number of HSP's gapped: 310 Number of HSP's successfully gapped: 106 Length of query: 129 Length of database: 5,994,473 Length adjustment: 83 Effective length of query: 46 Effective length of database: 4,201,009 Effective search space: 193246414 Effective search space used: 193246414 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 43 (21.9 bits) S2: 51 (23.4 bits)