RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780780|ref|YP_003065193.1| histidine triad (HIT) protein [Candidatus Liberibacter asiaticus str. psy62] (155 letters) >gnl|CDD|182648 PRK10687, PRK10687, purine nucleoside phosphoramidase; Provisional. Length = 119 Score = 64.9 bits (158), Expect = 9e-12 Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 7/105 (6%) Query: 12 QNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIA 71 + IF KIIR E + VY+D+++ A DI P+ P H+LIIP I + + E Sbjct: 4 ETIFSKIIRREIPSDIVYQDELVTAFRDISPQAPTHILIIPNILIPTVNDVSAE---HEQ 60 Query: 72 FLIKKIAIACKSAFQA----DGIQILQFNGHAAGQTVPHLHFHVI 112 L + I +A K A Q DG +++ GQ V H+H H++ Sbjct: 61 ALGRMITVAAKIAEQEGIAEDGYRLIMNTNRHGGQEVYHIHMHLL 105 >gnl|CDD|152404 pfam11969, DcpS_C, Scavenger mRNA decapping enzyme C-term binding. This family consists of several scavenger mRNA decapping enzymes (DcpS) and is the C-terminal region. DcpS is a scavenger pyrophosphatase that hydrolyses the residual cap structure following 3' to 5' decay of an mRNA. The association of DcpS with 3' to 5' exonuclease exosome components suggests that these two activities are linked and there is a coupled exonucleolytic decay-dependent decapping pathway. The C-terminal domain contains a histidine triad (HIT) sequence with three histidines separated by hydrophobic residues. The central histidine within the DcpS HIT motif is critical for decapping activity and defines the HIT motif as a new mRNA decapping domain, making DcpS the first member of the HIT family of proteins with a defined biological function. Length = 113 Score = 60.3 bits (147), Expect = 2e-10 Identities = 33/99 (33%), Positives = 48/99 (48%), Gaps = 1/99 (1%) Query: 14 IFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFL 73 +F II E VYEDD +A DI P+ P H+L+IPK I+ + + E L + + Sbjct: 3 VFCIIINGEEPERIVYEDDGFVAFKDIKPKAPLHLLVIPKKHIKSLRDLTKEHLPLLGHM 62 Query: 74 IKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVI 112 + K + +L+ H G +V HLH HVI Sbjct: 63 REVAKKIAKEKYNEGYRDVLRLGFH-YGPSVYHLHLHVI 100 >gnl|CDD|182570 PRK10586, PRK10586, putative oxidoreductase; Provisional. Length = 362 Score = 30.8 bits (70), Expect = 0.14 Identities = 16/59 (27%), Positives = 25/59 (42%), Gaps = 7/59 (11%) Query: 54 SRIRDIFEAPPEILSQIAFLIKKIAIACK-----SAFQADGIQILQFNGHAAGQTVPHL 107 + D F E LS+ ++ + AIA AF+ G + + F GH + V L Sbjct: 23 DHLHDFF--TDEQLSRAVWIYGERAIAAAQPYLPPAFELPGAKHILFRGHCSESDVAQL 79 >gnl|CDD|129313 TIGR00209, galT_1, galactose-1-phosphate uridylyltransferase, family 1. This enzyme is involved in glucose and galactose interconversion. This model describes one of two extremely distantly related branches of the model pfam01087 from PFAM. Length = 347 Score = 30.7 bits (69), Expect = 0.17 Identities = 17/71 (23%), Positives = 29/71 (40%), Gaps = 5/71 (7%) Query: 45 PGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQAD-----GIQILQFNGHA 99 P L++PK+ + I + S +A ++KK+ + F+ G FNG Sbjct: 230 PFETLLLPKAHVLRITDLTDAQRSDLALILKKLTSKYDNLFETSFPYSMGWHGAPFNGEE 289 Query: 100 AGQTVPHLHFH 110 H HF+ Sbjct: 290 NQHWQLHAHFY 300 >gnl|CDD|131798 TIGR02751, PEPCase_arch, phosphoenolpyruvate carboxylase, archaeal type. This family is the archaeal-type phosphoenolpyruvate carboxylase, although not every host species is archaeal. These sequences bear little resemblance to the bacterial/eukaryotic type. The members from Sulfolobus solfataricus and Methanothermobacter thermautotrophicus were verified experimentally, while the activity is known to be present in a number of other archaea. Length = 506 Score = 29.7 bits (67), Expect = 0.36 Identities = 17/72 (23%), Positives = 29/72 (40%), Gaps = 8/72 (11%) Query: 33 ILLAIMDIMPRNPGHVLIIPKSRIRDIFEA-------PPEILSQIAFLIKKIAIACKSAF 85 +LL ++ +PRN RI IFE EIL+ + +K +S Sbjct: 97 LLLETLESIPRNYDVAREFYDERIAPIFEVILPMTTSADEILN-VHQYYEKAVAGKQSIE 155 Query: 86 QADGIQILQFNG 97 D + + ++ G Sbjct: 156 LYDEVTVKEWLG 167 >gnl|CDD|178650 PLN03103, PLN03103, GDP-L-galactose-hexose-1-phosphate guanyltransferase; Provisional. Length = 403 Score = 28.1 bits (63), Expect = 1.0 Identities = 18/66 (27%), Positives = 32/66 (48%), Gaps = 11/66 (16%) Query: 46 GHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQI-LQFNGHAAGQTV 104 GHVL++P+ + + P+ + +FL+ A A +A+ + +N A T+ Sbjct: 183 GHVLLVPR-----VLDCLPQRIDPDSFLL-----ALYMAAEANNPYFRVGYNSLGAFATI 232 Query: 105 PHLHFH 110 HLHF Sbjct: 233 NHLHFQ 238 >gnl|CDD|165505 PHA03246, PHA03246, large tegument protein UL36; Provisional. Length = 3095 Score = 27.6 bits (61), Expect = 1.4 Identities = 12/50 (24%), Positives = 23/50 (46%), Gaps = 11/50 (22%) Query: 32 DILLAIMDIMPRNPGHVLIIPKSRIRD--------IFEAPPEILSQIAFL 73 D+++ IM P GHVL K + + ++ + +S +AF+ Sbjct: 2163 DVMVTIMACTP---GHVLTFSKLDLINQYDFMDKTLYNVLTDSISTVAFV 2209 >gnl|CDD|183070 PRK11275, pstC, phosphate transporter permease subunit PstC; Provisional. Length = 319 Score = 27.0 bits (60), Expect = 2.2 Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 13/44 (29%) Query: 33 ILLAIMDIMPRNPGHVLIIP--KSRIRDIFEAPPEILSQIAFLI 74 ++LAIM IIP + +RD+FE P ++ + A+ I Sbjct: 175 VILAIM-----------IIPYIAAVMRDVFEQTPVMMKESAYGI 207 >gnl|CDD|182329 PRK10245, adrA, diguanylate cyclase AdrA; Provisional. Length = 366 Score = 26.3 bits (58), Expect = 3.7 Identities = 10/23 (43%), Positives = 15/23 (65%) Query: 18 IIRNETNACRVYEDDILLAIMDI 40 ++RNE + CR + D L I+DI Sbjct: 222 LLRNEFDNCRRHHRDATLLIIDI 244 >gnl|CDD|163375 TIGR03649, ergot_EASG, ergot alkaloid biosynthesis protein, AFUA_2G17970 family. This family consists of fungal proteins of unknown function associated with secondary metabolite biosynthesis, such as of the ergot alkaloids such as ergovaline. Nomenclature differs because gene order differs - this is EasG in Neotyphodium lolii but is designated ergot alkaloid biosynthetic protein A in several other fungi. Length = 285 Score = 26.2 bits (58), Expect = 4.1 Identities = 26/98 (26%), Positives = 38/98 (38%), Gaps = 11/98 (11%) Query: 53 KSRIRDIFEAPPEILSQIAFLIKKIAIACKSA---FQADGIQILQFNGHAAGQTVPHLHF 109 + I ++ P I +IK I A F I++ G A GQ H H Sbjct: 65 EPEISAVYLVAPPIPDLAPPMIKFIDFARSKGVRRFVLLSASIIEKGGPAMGQV--HAHL 122 Query: 110 HVIPCKNGDNASHTNIHPTQKIENFAKLEINAQKIRKE 147 + +T + PT +ENF E + + IRKE Sbjct: 123 DSL-----GGVEYTVLRPTWFMENF-SEEFHVEAIRKE 154 >gnl|CDD|148079 pfam06254, DUF1019, Protein of unknown function (DUF1019). Family of uncharacterized proteins found in Proteobacteria. Length = 88 Score = 25.8 bits (57), Expect = 4.7 Identities = 13/63 (20%), Positives = 25/63 (39%), Gaps = 1/63 (1%) Query: 12 QNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIA 71 Q IF + + +T A R ++ AI+ +P L S + + A E+ Sbjct: 11 QKIF-RWLDGDTKAAREKVQALIPAILAALPPELRARLTSGDSTMYLLALAEKELSEAKE 69 Query: 72 FLI 74 ++ Sbjct: 70 AVL 72 >gnl|CDD|131701 TIGR02653, Lon_rel_chp, conserved hypothetical protein. This model describes a protein family of unknown function, about 690 residues in length, in which some members show C-terminal sequence similarity to Pfam model pfam05362, which is the Lon protease C-terminal proteolytic domain, from MEROPS family S16. However, the annotated catalytic sites of E. coli Lon protease are not conserved in members of this family. Members have a motif GP[RK][GS]TGKS, similar to the ATP-binding P-loop motif GxxGxGK[ST]. Length = 675 Score = 24.8 bits (54), Expect = 9.5 Identities = 13/36 (36%), Positives = 18/36 (50%), Gaps = 2/36 (5%) Query: 50 IIPKSRIRDIFEAPPEILS--QIAFLIKKIAIACKS 83 +IP S RDI P E+ S QI+F + K+ Sbjct: 636 LIPMSSARDIPTVPAELFSKFQISFYSDPVDAVYKA 671 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.323 0.137 0.402 Gapped Lambda K H 0.267 0.0711 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 2,501,564 Number of extensions: 150782 Number of successful extensions: 316 Number of sequences better than 10.0: 1 Number of HSP's gapped: 313 Number of HSP's successfully gapped: 19 Length of query: 155 Length of database: 5,994,473 Length adjustment: 85 Effective length of query: 70 Effective length of database: 4,157,793 Effective search space: 291045510 Effective search space used: 291045510 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 53 (24.2 bits)