RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780730|ref|YP_003065143.1| hypothetical protein CLIBASIA_03085 [Candidatus Liberibacter asiaticus str. psy62] (120 letters) >gnl|CDD|33638 COG3847, Flp, Flp pilus assembly protein, pilin Flp [Intracellular trafficking and secretion]. Length = 58 Score = 60.3 bits (146), Expect = 1e-10 Identities = 23/53 (43%), Positives = 36/53 (67%) Query: 3 MNIIKKILKNGSGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQTISTELD 55 ++++ L++ GATAIEYGL+A+L++V II+ STLG +KG + I L Sbjct: 2 KKLLRRFLRDEDGATAIEYGLIAALIAVVIIAGGSTLGTALKGAFTAIGAALT 54 >gnl|CDD|113726 pfam04964, Flp_Fap, Flp/Fap pilin component. Length = 47 Score = 56.8 bits (138), Expect = 1e-09 Identities = 25/46 (54%), Positives = 35/46 (76%) Query: 10 LKNGSGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQTISTELD 55 LK+ SGATAIEYGL+A+L++V II+ V+TLG +K + +I T L Sbjct: 2 LKDESGATAIEYGLIAALIAVVIIAYVTTLGTALKTKFTSIGTALT 47 >gnl|CDD|36658 KOG1445, KOG1445, KOG1445, Tumor-specific antigen (contains WD repeats) [Cytoskeleton]. Length = 1012 Score = 29.7 bits (66), Expect = 0.22 Identities = 16/56 (28%), Positives = 23/56 (41%) Query: 60 PPTKPGSVPMQPESSNPSTRLQPPAKPTSIPVKTKSSKKSPKRIQSPAKNKKSYVK 115 PP +P P ++ + PPA P + SS P + PA KK V+ Sbjct: 398 PPPEPVPTPKVAQTPSFVPVPTPPAAPRPMSNNNSSSNNVPDVQEQPAVPKKEEVR 453 >gnl|CDD|37888 KOG2677, KOG2677, KOG2677, Stoned B synaptic vesicle biogenesis protein [Intracellular trafficking, secretion, and vesicular transport]. Length = 922 Score = 28.6 bits (63), Expect = 0.41 Identities = 17/61 (27%), Positives = 26/61 (42%), Gaps = 1/61 (1%) Query: 61 PTKPGSVPMQPESSN-PSTRLQPPAKPTSIPVKTKSSKKSPKRIQSPAKNKKSYVKPNKS 119 P P + P++PE P+T P A + IP +S S K+ P ++ K K Sbjct: 243 PLPPVTSPLKPEIRRVPNTPAIPSASRSVIPDVPYNSMGSFKKRDRPKSTLMNFSKVQKL 302 Query: 120 S 120 Sbjct: 303 D 303 >gnl|CDD|35734 KOG0514, KOG0514, KOG0514, Ankyrin repeat protein [General function prediction only]. Length = 452 Score = 27.4 bits (60), Expect = 0.86 Identities = 21/97 (21%), Positives = 34/97 (35%) Query: 24 LASLVSVAIISAVSTLGDRMKGVYQTISTELDKGDVPPTKPGSVPMQPESSNPSTRLQPP 83 LA S A +A T G +G K PP+ M+P +S +T P Sbjct: 6 LARGFSKASPTATRTGGAVRRGTSADSRIPRPKHSSPPSPVRKSFMRPNTSTLATPTPEP 65 Query: 84 AKPTSIPVKTKSSKKSPKRIQSPAKNKKSYVKPNKSS 120 + S S + + +++ +P K S Sbjct: 66 KARLPSLQEKPPSVSSEQASEPLPESQSFIRQPQKDS 102 >gnl|CDD|33967 COG4244, COG4244, Predicted membrane protein [Function unknown]. Length = 160 Score = 27.5 bits (61), Expect = 0.93 Identities = 10/46 (21%), Positives = 22/46 (47%) Query: 3 MNIIKKILKNGSGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQ 48 + + + +N + A GLL SL +V +++ LG ++ + Sbjct: 103 LTAWRYVHRNDAVAAVSPAGLLLSLATVLLVALQGYLGAQLVYEHG 148 >gnl|CDD|35779 KOG0559, KOG0559, KOG0559, Dihydrolipoamide succinyltransferase (2-oxoglutarate dehydrogenase, E2 subunit) [Energy production and conversion]. Length = 457 Score = 27.3 bits (60), Expect = 0.96 Identities = 11/59 (18%), Positives = 15/59 (25%) Query: 53 ELDKGDVPPTKPGSVPMQPESSNPSTRLQPPAKPTSIPVKTKSSKKSPKRIQSPAKNKK 111 E P P PPA + PV + K S + + P Sbjct: 164 EPKTAPAAAAPPKPSSKPPPKEAAPVAESPPAPSSPEPVPASAKKPSVAQPKPPPSEGA 222 >gnl|CDD|35384 KOG0162, KOG0162, KOG0162, Myosin class I heavy chain [Cytoskeleton]. Length = 1106 Score = 26.9 bits (59), Expect = 1.3 Identities = 19/92 (20%), Positives = 31/92 (33%), Gaps = 5/92 (5%) Query: 34 SAVSTLGDRMKGVYQTISTELDKGDVP----PTKPGSVPMQPESSNPSTRLQPPAKPT-S 88 + L D K + ++ T L P P K ++++ TR P K Sbjct: 903 EDLKVLKDIYKSLTVSVGTGLPPNSKPSRKKPRKATGYSSGRDAASTPTRRAPQNKQAYG 962 Query: 89 IPVKTKSSKKSPKRIQSPAKNKKSYVKPNKSS 120 + ++K SP Q P P +S Sbjct: 963 QNGVSPAAKGSPLPAQKPVNTYNQRPPPVSTS 994 Score = 24.9 bits (54), Expect = 4.7 Identities = 11/45 (24%), Positives = 18/45 (40%), Gaps = 1/45 (2%) Query: 57 GDVPPTKPGSVPMQPESSNPSTRLQPPAKPTSIPVKTKSSKKSPK 101 G P K +P Q N + PP ++ + S++ S K Sbjct: 965 GVSPAAKGSPLPAQK-PVNTYNQRPPPVSTSTTTSQQPSARPSSK 1008 >gnl|CDD|38063 KOG2852, KOG2852, KOG2852, Possible oxidoreductase [General function prediction only]. Length = 380 Score = 26.5 bits (58), Expect = 1.7 Identities = 20/76 (26%), Positives = 28/76 (36%), Gaps = 12/76 (15%) Query: 14 SGATAIEYGLLASLVSVAIISAVST--------LGDRMKGV----YQTISTELDKGDVPP 61 GA+ G LA +II ++T L D GV Y+ ++T K D Sbjct: 52 GGASGKASGFLAKWCQPSIIQPLATLSFKLHEELSDEYDGVNNWGYRALTTWSCKADWEN 111 Query: 62 TKPGSVPMQPESSNPS 77 T P VP + Sbjct: 112 TNPAKVPEGLDWIQRE 127 >gnl|CDD|146909 pfam04502, DUF572, Family of unknown function (DUF572). Family of eukaryotic proteins with undetermined function. Length = 324 Score = 25.9 bits (57), Expect = 2.5 Identities = 12/51 (23%), Positives = 20/51 (39%) Query: 70 QPESSNPSTRLQPPAKPTSIPVKTKSSKKSPKRIQSPAKNKKSYVKPNKSS 120 + +P + P+KPTSI K+ + + KN KP + Sbjct: 223 NDNTPSPKSGSSSPSKPTSILKKSAAKRSEAPSSSKAKKNSLGIPKPKSAL 273 >gnl|CDD|36309 KOG1093, KOG1093, KOG1093, Predicted protein kinase (contains TBC and RHOD domains) [General function prediction only]. Length = 725 Score = 26.1 bits (57), Expect = 2.6 Identities = 11/31 (35%), Positives = 17/31 (54%), Gaps = 1/31 (3%) Query: 71 PESSNPSTRLQPPAKPTSIPVKTKSSKKSPK 101 P+S P P KPTS+ ++ SS+ P+ Sbjct: 594 PKSITPRQHANPF-KPTSLSLQQLSSEHCPR 623 >gnl|CDD|38799 KOG3592, KOG3592, KOG3592, Microtubule-associated proteins [Cytoskeleton]. Length = 934 Score = 25.9 bits (56), Expect = 3.0 Identities = 15/56 (26%), Positives = 24/56 (42%), Gaps = 1/56 (1%) Query: 63 KPGSVPMQPESSNPSTRLQPPAKPTSIPVKTKSSKKSPKRIQSPAKNKKSYVKPNK 118 P + E+ P+TR P K P K ++K P ++ PAK+ + K Sbjct: 319 AGPEKPTKTETKEPATRTAVPKKAPK-PSKAAVAEKQPTEVKRPAKSAPALKKGAP 373 >gnl|CDD|32941 COG3127, COG3127, Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component [Secondary metabolites biosynthesis, transport, and catabolism]. Length = 829 Score = 25.6 bits (56), Expect = 3.1 Identities = 13/37 (35%), Positives = 20/37 (54%) Query: 23 LLASLVSVAIISAVSTLGDRMKGVYQTISTELDKGDV 59 L A ++VA I+A+ +L DRM+ + E GD Sbjct: 25 LAALALAVAAIAALGSLSDRMEKGLAQQAREFLAGDR 61 >gnl|CDD|144551 pfam00999, Na_H_Exchanger, Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metabolising cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus and a large cytoplasmic region at the carboxyl terminus. The transmembrane regions M3-M12 share identity with other members of the family. The M6 and M7 regions are highly conserved. Thus, this is thought to be the region that is involved in the transport of sodium and hydrogen ions. The cytoplasmic region has little similarity throughout the family. Length = 371 Score = 25.6 bits (57), Expect = 3.6 Identities = 11/44 (25%), Positives = 16/44 (36%), Gaps = 9/44 (20%) Query: 20 EYGLLASLVSVAIISAVST------LGDRM---KGVYQTISTEL 54 LL +L+ A +SA S L +R + I E Sbjct: 105 GIPLLEALLFGAALSATSPVVVLAILKERGRLNTRLGTLILGES 148 >gnl|CDD|147212 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory factor. Replicative DNA polymerases are capable of polymerising tens of thousands of nucleotides without dissociating from their DNA templates. The high processivity of these polymerases is dependent upon accessory proteins that bind to the catalytic subunit of the polymerase or to the substrate. The Epstein-Barr virus (EBV) BMRF1 protein is an essential component of the viral DNA polymerase and is absolutely required for lytic virus replication. BMRF1 is also a transactivator. This family is predicted to have a UL42 like structure. Length = 381 Score = 25.4 bits (56), Expect = 3.8 Identities = 14/61 (22%), Positives = 18/61 (29%) Query: 51 STELDKGDVPPTKPGSVPMQPESSNPSTRLQPPAKPTSIPVKTKSSKKSPKRIQSPAKNK 110 D PP S P S PPA + K+ + Q K+K Sbjct: 312 HLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPKRKREGDKKQKKKKSK 371 Query: 111 K 111 K Sbjct: 372 K 372 >gnl|CDD|35481 KOG0260, KOG0260, KOG0260, RNA polymerase II, large subunit [Transcription]. Length = 1605 Score = 24.6 bits (53), Expect = 6.2 Identities = 16/52 (30%), Positives = 22/52 (42%), Gaps = 1/52 (1%) Query: 61 PTKPGSVPMQPESSNPSTRLQPPAKPTSIPVKTKSSKKSPKRIQSPAKNKKS 112 PT P P P S +P++ P P+ P S SP SP+ + S Sbjct: 1538 PTSPSYSPTSP-SYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSTSPSYSPTS 1588 >gnl|CDD|38806 KOG3600, KOG3600, KOG3600, Thyroid hormone receptor-associated protein complex, subunit TRAP240 [Transcription]. Length = 2238 Score = 24.7 bits (53), Expect = 6.4 Identities = 10/45 (22%), Positives = 18/45 (40%) Query: 76 PSTRLQPPAKPTSIPVKTKSSKKSPKRIQSPAKNKKSYVKPNKSS 120 S+ P P I ++ + K+SP + KK + + K Sbjct: 551 VSSPQPPKLSPYPIDMQKEIVKRSPSQPLVLLSYKKKFQQYLKEV 595 >gnl|CDD|145975 pfam03117, Herpes_UL49_1, UL49 family. Members of this family, found in several herpesviruses, include EBV BFRF2 and other UL49 proteins (e.g. HCMVA UL49, HSV6 U33). There are eight conserved cysteine residues in this alignment, all lying towards the C-terminus. Their function is unknown. Length = 234 Score = 24.6 bits (54), Expect = 6.5 Identities = 8/32 (25%), Positives = 18/32 (56%), Gaps = 1/32 (3%) Query: 7 KKILKNGSGATAIEYGLLASLVSVAIISAVST 38 KI+ + + + Y LL+S++S+ ++ T Sbjct: 53 PKIVTDL-LSKVLSYNLLSSVISLPVLCPCVT 83 >gnl|CDD|35840 KOG0621, KOG0621, KOG0621, Phospholipid scramblase [Cell wall/membrane/envelope biogenesis]. Length = 292 Score = 24.5 bits (53), Expect = 6.8 Identities = 11/50 (22%), Positives = 18/50 (36%) Query: 59 VPPTKPGSVPMQPESSNPSTRLQPPAKPTSIPVKTKSSKKSPKRIQSPAK 108 T P + E+S +L+PPAK + P+ +P Sbjct: 1 TSLTSAYRSPEEVETSADKKKLEPPAKRARFGRMRGGGTRLPQARDTPMV 50 >gnl|CDD|36310 KOG1094, KOG1094, KOG1094, Discoidin domain receptor DDR1 [Signal transduction mechanisms]. Length = 807 Score = 24.6 bits (53), Expect = 6.9 Identities = 12/51 (23%), Positives = 20/51 (39%) Query: 58 DVPPTKPGSVPMQPESSNPSTRLQPPAKPTSIPVKTKSSKKSPKRIQSPAK 108 VPP P SVP E+ + + +++P K P ++ P Sbjct: 488 LVPPPPPNSVPHYGEADIVTLQGVSGGNSSAVPYLPPGVGKGPALVEFPRS 538 >gnl|CDD|35969 KOG0750, KOG0750, KOG0750, Mitochondrial solute carrier protein [Energy production and conversion]. Length = 304 Score = 24.5 bits (53), Expect = 7.2 Identities = 16/47 (34%), Positives = 22/47 (46%) Query: 10 LKNGSGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQTISTELDK 56 K+GSGA LA LV+ + + V T D +K QT+ D Sbjct: 202 KKDGSGAAVFYQSFLAGLVAGSASAIVVTPLDVVKTRIQTLGDNEDN 248 >gnl|CDD|34568 COG4961, TadG, Flp pilus assembly protein TadG [Intracellular trafficking and secretion]. Length = 185 Score = 24.3 bits (52), Expect = 7.5 Identities = 7/31 (22%), Positives = 18/31 (58%) Query: 6 IKKILKNGSGATAIEYGLLASLVSVAIISAV 36 +++ ++ GA A+E+ L+A + + + V Sbjct: 12 LRRFRRDRRGAAAVEFALVAPPLLLLVFGIV 42 >gnl|CDD|146621 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit, N-terminal. SRP is a complex of six distinct polypeptides and a 7S RNA that is essential for transferring nascent polypeptide chains that are destined for export from the cell to the translocation apparatus of the endoplasmic reticulum (ER) membrane. SRP binds hydrophobic signal sequences as they emerge from the ribosome, and arrests translation. Length = 235 Score = 24.3 bits (53), Expect = 7.9 Identities = 19/73 (26%), Positives = 27/73 (36%), Gaps = 18/73 (24%) Query: 55 DKGDVPPTKPGSVPMQPESSNPST----------------RLQPPAKPTSIPVKTKSSKK 98 K + + S P SS PST R + AK +S +K Sbjct: 103 KKQESATSAESSPSSTPNSSRPSTPSLLKAKEGPGGRTSRRAKKAAKLSSTASSGD--EK 160 Query: 99 SPKRIQSPAKNKK 111 SPK+ ++P K K Sbjct: 161 SPKKKKAPKKGGK 173 >gnl|CDD|144452 pfam00860, Xan_ur_permease, Permease family. This family includes permeases for diverse substrates such as xanthine, uracil, and vitamin C. However many members of this family are functionally uncharacterized and may transport other substrates. Members of this family have ten predicted transmembrane helices. Length = 389 Score = 24.2 bits (53), Expect = 9.0 Identities = 8/27 (29%), Positives = 17/27 (62%) Query: 15 GATAIEYGLLASLVSVAIISAVSTLGD 41 G GL+ ++++VA+++ V + GD Sbjct: 225 GTPLFNPGLILTVLAVALVAIVESTGD 251 >gnl|CDD|99895 cd05834, HDGF_related, The PWWP domain is an essential part of the Hepatoma Derived Growth Factor (HDGF) family of proteins, and is necessary for DNA binding by HDGF. This family of endogenous nuclear-targeted mitogens includes HRP (HDGF-related proteins 1, 2, 3, 4, or HPR1, HPR2, HPR3, HPR4, respectively) and lens epithelium-derived growth factor, LEDGF. Members of the HDGF family have been linked to human diseases, and HDGF is a prognostic factor in several types of cancer. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.. Length = 83 Score = 24.2 bits (53), Expect = 9.3 Identities = 7/19 (36%), Positives = 11/19 (57%) Query: 100 PKRIQSPAKNKKSYVKPNK 118 P+ + +NKK + KP K Sbjct: 51 PEDLFPYTENKKKFGKPKK 69 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.305 0.123 0.329 Gapped Lambda K H 0.267 0.0666 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 1,257,468 Number of extensions: 53714 Number of successful extensions: 256 Number of sequences better than 10.0: 1 Number of HSP's gapped: 250 Number of HSP's successfully gapped: 91 Length of query: 120 Length of database: 6,263,737 Length adjustment: 82 Effective length of query: 38 Effective length of database: 4,491,799 Effective search space: 170688362 Effective search space used: 170688362 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 43 (21.9 bits) S2: 51 (23.6 bits)