RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780746|ref|YP_003065159.1| hypothetical protein CLIBASIA_03165 [Candidatus Liberibacter asiaticus str. psy62] (215 letters) >gnl|CDD|148442 pfam06835, DUF1239, Protein of unknown function (DUF1239). This family consists of several hypothetical bacterial proteins of around 190 residues in length. The function of this family is unknown. Length = 176 Score = 32.2 bits (74), Expect = 0.10 Identities = 24/170 (14%), Positives = 54/170 (31%), Gaps = 11/170 (6%) Query: 38 IVGWFLLSSWIRISTFSKSSIDLLDFEPMIMKKFILSDYSKD-RVKYSLVAERAKTSFNS 96 + GW L S + + Y + +++Y L A++ + Sbjct: 7 LAGWLLWLLSEESSQDEPQVKPDPSEPDYTAENLTTTQYDEQGKLQYRLTADKLE--HYP 64 Query: 97 GKGIIFLQDFELTV-PTQRSEYGDMYLFAHSARFNLANHTLYISQPFKMK--VKDNLRLD 153 + L + + + + A A+ N TLY+ ++ + Sbjct: 65 DDDTTEFTNPVLIIYQDGKPPW---TITADRAKLNKDGQTLYLYGNVVIQSLEDKSRLTT 121 Query: 154 FET--AVLDVKNITINSSDPVIITHSDFVLSANFARIENSSRSAVFAGQV 201 T +++ ++ PV IT +FV + + + +A V Sbjct: 122 ITTENLTVNLDTQDASTDQPVTITGPNFVSTGTGMKANLKTGTAELLSNV 171 >gnl|CDD|152094 pfam11658, DUF3260, Protein of unknown function (DUF3260). Some members in this family of proteins are annotated as YhjU however this cannot be confirmed. Currently this family has no known function. Length = 516 Score = 31.5 bits (72), Expect = 0.17 Identities = 9/20 (45%), Positives = 14/20 (70%) Query: 34 ATVTIVGWFLLSSWIRISTF 53 V +V + LLS W+R++TF Sbjct: 100 LFVLLVAYLLLSRWLRVTTF 119 >gnl|CDD|132411 TIGR03368, cellulose_yhjU, cellulose synthase operon protein YhjU. This protein was identified by the partial phylogenetic profiling algorithm (PubMed:16930487) as part of the system for cellulose biosynthesis in bacteria, and in fact is found in cellulose biosynthesis gene regions. The protein was designated YhjU in Salmonella enteritidis, where disruption of its gene disrupts cellulose biosynthesis and biofilm formation (PubMed:11929533). Length = 518 Score = 30.8 bits (70), Expect = 0.25 Identities = 9/21 (42%), Positives = 14/21 (66%) Query: 34 ATVTIVGWFLLSSWIRISTFS 54 V +V + LL+ WIR++TF Sbjct: 101 IFVVLVAYLLLNQWIRVTTFV 121 >gnl|CDD|162076 TIGR00867, deg-1, degenerin. This model is designed from the invertebrate members of the ENaC family. Length = 601 Score = 30.5 bits (69), Expect = 0.29 Identities = 21/72 (29%), Positives = 35/72 (48%), Gaps = 6/72 (8%) Query: 64 EPMIMKKFILSDYSKDRVKYSLVAERAKTSFNSGKGIIFLQDFELTV-PTQRSEYGDMYL 122 E +I LS+ +++ + Y+ K SFN GK +DF L + PT +G+ Y Sbjct: 285 ENLIFAMAALSEEAREALSYTKSELILKCSFN-GKPCDIDRDFTLHIDPT----FGNCYT 339 Query: 123 FAHSARFNLANH 134 F ++ NL + Sbjct: 340 FNYNRSVNLTSS 351 >gnl|CDD|181530 PRK08664, PRK08664, aspartate-semialdehyde dehydrogenase; Reviewed. Length = 349 Score = 30.2 bits (69), Expect = 0.41 Identities = 8/24 (33%), Positives = 14/24 (58%) Query: 160 DVKNITINSSDPVIITHSDFVLSA 183 +V ++ + S+DP + D V SA Sbjct: 60 EVADMEVVSTDPEAVDDVDIVFSA 83 >gnl|CDD|130324 TIGR01257, rim_protein, retinal-specific rim ABC transporter. This model describes the photoreceptor protein (rim protein) in eukaryotes. It is the member of ABC transporter superfamily. Rim protein is a membrane glycoprotein which is localized in the photoreceptor outer segment discs. Mutation/s in its genetic loci is implicated in the recessive Stargardt's disease. Length = 2272 Score = 27.7 bits (61), Expect = 2.1 Identities = 17/36 (47%), Positives = 23/36 (63%), Gaps = 6/36 (16%) Query: 23 FIQFLKFFFPFATVTIVGWFLLSSWIRISTFSKSSI 58 FI FL F F+T TI+ FLLS++ FSK+S+ Sbjct: 731 FILFL-FLLAFSTATIMQCFLLSTF-----FSKASL 760 >gnl|CDD|140227 PTZ00200, PTZ00200, cysteine proteinase; Provisional. Length = 448 Score = 27.0 bits (60), Expect = 3.6 Identities = 11/39 (28%), Positives = 18/39 (46%) Query: 24 IQFLKFFFPFATVTIVGWFLLSSWIRISTFSKSSIDLLD 62 I FL F F +IV + L+S + +F + +D Sbjct: 47 ISFLVFLFLAIGGSIVSYVLVSKSKMVKSFKSDLEEHID 85 >gnl|CDD|162144 TIGR00978, asd_EA, aspartate-semialdehyde dehydrogenase (non-peptidoglycan organisms). Two closely related families of aspartate-semialdehyde dehydrogenase are found. They differ by a deep split in phylogenetic and percent identity trees and in gap patterns. Separate models are built for the two types in order to exclude the USG-1 protein, found in several species, which is specifically related to the Bacillus subtilis type of aspartate-semialdehyde dehydrogenase. Members of this type are found primarily in organisms that lack peptidoglycan. Length = 341 Score = 26.6 bits (59), Expect = 4.7 Identities = 8/23 (34%), Positives = 12/23 (52%) Query: 161 VKNITINSSDPVIITHSDFVLSA 183 V+++ I +PV D V SA Sbjct: 58 VRDLPIVEPEPVASKDVDIVFSA 80 >gnl|CDD|182579 PRK10598, PRK10598, lipoprotein; Provisional. Length = 186 Score = 26.2 bits (58), Expect = 7.3 Identities = 9/21 (42%), Positives = 14/21 (66%) Query: 89 RAKTSFNSGKGIIFLQDFELT 109 +A+ F+ KG I+L+D EL Sbjct: 99 KAQPVFDKEKGAIYLKDMELV 119 >gnl|CDD|177842 PLN02190, PLN02190, cellulose synthase-like protein. Length = 756 Score = 25.6 bits (56), Expect = 9.1 Identities = 10/22 (45%), Positives = 14/22 (63%) Query: 10 ILLQRSKYRQHVRFIQFLKFFF 31 I LQ+SK H F+QF + F+ Sbjct: 311 IFLQKSKNSNHCAFVQFPQEFY 332 >gnl|CDD|150265 pfam09531, Ndc1_Nup, Nucleoporin protein Ndc1-Nup. Ndc1 is a nucleoporin protein that is a component of the Nuclear Pore Complex, and, in fungi, also of the Spindle Pole Body. It consists of six transmembrane segments, three lumenal loops, both concentrated at the N-terminus and cytoplasmic domains largely at the C-terminus, all of which are well conserved. Length = 554 Score = 25.7 bits (57), Expect = 9.4 Identities = 5/27 (18%), Positives = 13/27 (48%) Query: 22 RFIQFLKFFFPFATVTIVGWFLLSSWI 48 R L F + + + +++S+W+ Sbjct: 79 RIATILGSLFSLSFLLTLLTYVVSAWL 105 >gnl|CDD|162785 TIGR02251, HIF-SF_euk, Dullard-like phosphatase domain. This domain is related to domains found in FCP1-like phosphatases (TIGR02250), and together both are detected by the Pfam model pfam03031. Length = 162 Score = 25.7 bits (57), Expect = 9.9 Identities = 9/24 (37%), Positives = 13/24 (54%), Gaps = 1/24 (4%) Query: 2 DHLDRRNKILLQRSKYRQHVRFIQ 25 D LDR K++ +R YR+ F Sbjct: 76 DILDRGGKVISRRL-YRESCVFTN 98 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.325 0.137 0.392 Gapped Lambda K H 0.267 0.0716 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 3,411,883 Number of extensions: 205476 Number of successful extensions: 465 Number of sequences better than 10.0: 1 Number of HSP's gapped: 465 Number of HSP's successfully gapped: 21 Length of query: 215 Length of database: 5,994,473 Length adjustment: 89 Effective length of query: 126 Effective length of database: 4,071,361 Effective search space: 512991486 Effective search space used: 512991486 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.6 bits) S2: 55 (25.0 bits)