RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780410|ref|YP_003064823.1| hypothetical protein CLIBASIA_01475 [Candidatus Liberibacter asiaticus str. psy62] (362 letters) >gnl|CDD|31753 COG1565, COG1565, Uncharacterized conserved protein [Function unknown]. Length = 370 Score = 305 bits (783), Expect = 1e-83 Identities = 138/366 (37%), Positives = 200/366 (54%), Gaps = 19/366 (5%) Query: 8 KIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEMLAIFL 67 I LI + G ++ + L + DPE GYYS+ G GDF+TAPE+SQ+FGE+LA Sbjct: 7 IIRALIAQGGPISFSDFMELALYDPEHGYYSSAVKIGRKGDFITAPELSQLFGELLAEQF 66 Query: 68 ICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERLTLIQK 127 + W++ G P+ ++LVE+G GRG + DILR + +L P+ + LS Y++E S L QK Sbjct: 67 LQLWQELGRPAPLKLVEIGAGRGTLASDILRTLRRLYPELYEALSYYIIEPSPELRARQK 126 Query: 128 KQLASYGDKINWYTSLADVPLGFT-FLVANEFFDSLPIKQFVMTEHGIRERMIDIDQHDS 186 + L + D I W + D+P F +V+NE FD+LP++QF+ T+ ER++ +D Sbjct: 127 ETLKATEDLIRWVEWVEDLPKKFPGIVVSNELFDALPVEQFIRTKGLFVERVVVLDAEGR 186 Query: 187 LVFNIGDHEIKSNFLTCSD---YFLGAIFENSPCRDREMQSISDRLACDGGTAIVIDYGY 243 LVF+ +E+ L D G I E SP R+ +++I++RL G + IDYGY Sbjct: 187 LVFSHAINELIDEALLPLDAPEAEDGYILEVSPAREALLKAIAERLE--RGVFLFIDYGY 244 Query: 244 L-------QSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAILYKLYINGLT 296 + G TLQA + H + PL NPG ADL++HVDF L+ A L + G Sbjct: 245 PAEEYYHPRRAYGTTLQAYRHHVHDDPLANPGLADLTAHVDFTALAKAAKAAGLEVLGFK 304 Query: 297 TQGKFLEGLGIWQRAFSLMKQT--ARKDILLDSVKRLVSTSADKKSMGELFKILVVSHEK 354 TQG FL LG+ +R +L AR + VKRL MGELFK+L + Sbjct: 305 TQGDFLLDLGLLERLATLSAGKDDARYLKIAAQVKRL----ISPGGMGELFKVLAFGKKL 360 Query: 355 VELMPF 360 +PF Sbjct: 361 KIDLPF 366 >gnl|CDD|38112 KOG2901, KOG2901, KOG2901, Uncharacterized conserved protein [Function unknown]. Length = 415 Score = 253 bits (648), Expect = 5e-68 Identities = 128/364 (35%), Positives = 194/364 (53%), Gaps = 27/364 (7%) Query: 3 NKLIRKIVNLIKKNGQMTVDQYFALCVADPEFGYYSTCNPFGAVGDFVTAPEISQIFGEM 62 L++ + + IK G +TV +Y + +P+ GYY + FGA GDF+T+PEISQIFGEM Sbjct: 33 PHLVKHLKSKIKSTGPITVAEYMKEVLTNPKAGYYMNRDVFGAKGDFITSPEISQIFGEM 92 Query: 63 LAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSERL 122 + ++ + WEQ G P +LVELGPGRG +M D+LRV+ K K + SV ++VE S L Sbjct: 93 IGVWTVSEWEQMGRPERFQLVELGPGRGTLMADVLRVLTKFKDEDLSV---HLVEVSPAL 149 Query: 123 TLIQKKQLASY--------------GDKINWYTSLADVPLGFTFLVANEFFDSLPIKQFV 168 + +Q + L G I+W+ +L DVP GFT ++A+EFFD+LP+ QF Sbjct: 150 SKLQAQNLCCTDESLSEYKKGSTLSGTPIHWHRTLQDVPSGFTLIIAHEFFDALPVHQFQ 209 Query: 169 MTEHGIRERMIDIDQHDSLVFNIGDHE-IKSNFLTCSDYFLGAIFENSPCRDREMQSISD 227 + G E M+D+ + F + + +L + E+SP + + ++ Sbjct: 210 KSTRGWCEVMVDVGEDSKFRFVLSPSPTPAALYLMPATDETREKMEHSPESGKSVDLLAK 269 Query: 228 RLACDGGTAIVIDYGYLQSRVGDTLQAVKGHTYVSPLVNPGQADLSSHVDFQRLSSIAIL 287 R+ DGG A++IDYG+ + D+ +A K H V L PG ADL++ VDF L A Sbjct: 270 RIGSDGGGALIIDYGHDGIKT-DSFRAFKKHKLVDILDMPGSADLTADVDFTYLRHAA-E 327 Query: 288 YKLYINGLTTQGKFLEGLGIWQRAFSLM---KQTARKDILLDSVKRLVSTSADKKSMGEL 344 + NG Q FL G+GI R L+ + ++++ LL L+ + K MGE Sbjct: 328 GNVDKNGPRKQHCFLNGMGIEIRLKILLDKSNEPSQQEQLLQGYDMLM----NPKKMGER 383 Query: 345 FKIL 348 F Sbjct: 384 FNFF 387 >gnl|CDD|145672 pfam02636, DUF185, Uncharacterized ACR, COG1565. This family contains several uncharacterized proteins. One member has been described as an ATP synthase beta subunit transcription termination factor rho protein. Length = 240 Score = 212 bits (542), Expect = 1e-55 Identities = 82/232 (35%), Positives = 123/232 (53%), Gaps = 11/232 (4%) Query: 62 MLAIFLICAWEQHGFPSCVRLVELGPGRGIMMLDILRVICKLKPDFFSVLSIYMVETSER 121 +LA +L W+ G P +RLVE+G GRG + DILR + KL P+F+ L ++VE S Sbjct: 1 LLARWLAQLWKALGSPPPLRLVEIGAGRGTLAADILRALRKLLPEFYEALEYHLVEISPA 60 Query: 122 LTLIQKKQLASYGDK-INWYTSLADVPLGFTFLVANEFFDSLPIKQFVMTEHGIRERMID 180 L QK++LA D + W L ++P ++ANE FD+LP+ F G RER + Sbjct: 61 LRERQKERLAGSADLLVKWLDELPELP-FPGVVLANEVFDALPVDLFRRDGGGWRERYVG 119 Query: 181 IDQHDSLVFNIG---DHEIKSNFL-TCSDYFLGAIFENSPCRDREMQSISDRLACDGGTA 236 +D+ VF +G + + G E P + ++ +++RL G Sbjct: 120 LDEDGKFVFELGPLSTPSLSDYLPELALNLPEGYRLEIPPAAEAFLEQLAERL--PRGAL 177 Query: 237 IVIDYGYLQS--RVGDTLQAVKGHTYV-SPLVNPGQADLSSHVDFQRLSSIA 285 ++IDYGY S R TL+A + H PLVNPG+AD+++HVDF L+ A Sbjct: 178 LIIDYGYPASEYRNNGTLRAYRRHRVHDDPLVNPGEADITAHVDFTALAEAA 229 >gnl|CDD|99984 cd03813, GT1_like_3, This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility. The members of this family are found mainly in bacteria, while some of them are also found in Archaea and eukaryotes.. Length = 475 Score = 30.3 bits (69), Expect = 0.76 Identities = 12/22 (54%), Positives = 16/22 (72%), Gaps = 2/22 (9%) Query: 167 FVMTEHGI--RERMIDIDQHDS 186 F++TEHGI RER I++ Q D Sbjct: 200 FLLTEHGIYTRERKIELLQADW 221 >gnl|CDD|146598 pfam04051, TRAPP, Transport protein particle (TRAPP) component. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is a large multimeric protein that contains at least 10 subunits. This family contains many TRAPP family proteins. The Bet3 subunit is one of the better characterized TRAPP proteins and has a dimeric structure with hydrophobic channels. The channel entrances are located on a putative membrane-interacting surface that is distinctively flat, wide and decorated with positively charged residues. Bet3 is proposed to localize TRAPP to the Golgi. Length = 148 Score = 27.5 bits (62), Expect = 6.2 Identities = 13/45 (28%), Positives = 22/45 (48%), Gaps = 8/45 (17%) Query: 72 EQHGFPSCVRLVEL-----GPGRGIMMLDILRVICKLKPDFFSVL 111 E+ G+ RL+E R +L++L+ ICK D + +L Sbjct: 25 EKMGYNIGQRLIERLLARSKSKRCTDLLEVLKFICK---DLWKML 66 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.324 0.140 0.417 Gapped Lambda K H 0.267 0.0799 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 4,403,252 Number of extensions: 237222 Number of successful extensions: 592 Number of sequences better than 10.0: 1 Number of HSP's gapped: 580 Number of HSP's successfully gapped: 9 Length of query: 362 Length of database: 6,263,737 Length adjustment: 95 Effective length of query: 267 Effective length of database: 4,210,882 Effective search space: 1124305494 Effective search space used: 1124305494 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.6 bits) S2: 58 (26.0 bits)