RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254781202|ref|YP_003065615.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter asiaticus str. psy62] (864 letters) >gnl|CDD|177329 PHA01972, PHA01972, structural protein. Length = 828 Score = 210 bits (536), Expect = 1e-54 Identities = 189/916 (20%), Positives = 332/916 (36%), Gaps = 141/916 (15%) Query: 1 MKPECIQVLNKAAGRELSKKELRRLEDGIVRAYVSL------DGKGLSKAER-YRLAGLK 53 M +C++ + +AAGR+L+ E++ +ED I A S+ K S A+R Y+ Sbjct: 1 MPQKCVEAVAQAAGRQLTADEIKGIEDRIKEAMRSVARKDPKGWKEESIAKRFYKTDADL 60 Query: 54 AEEDFQKELIRSVNDAIDEAYKRHQLRSDLDRVQAGVYGKSQALFNKLFFKAGSA----- 108 A ++ + I+ K+ + ++ + L + Sbjct: 61 ARQELVHDAIKKKKRVALAIAKQAEA---TKKINEVLTADKDPAAALLGMLSRDPNEEAK 117 Query: 109 EVPLEMKIKAAETKVLSKFNEYAEVG---------SKNLG---FTLDKQFGL-DVFDEMK 155 + +E +I A +K +++ G F +Q L D+ E+ Sbjct: 118 FLSVEQRINATRDVSKAKISDFMAALDPTTRQIFAGIATGERRFDKAQQRLLDDIVHELY 177 Query: 156 GKKTQNEQASRLVKQYFETQRELHSQAHEAGLDYKFFEN-RIPQPMSVDKLRATKKDDFV 214 G++T N A + K + + +L ++ ++AG D ++ R+PQ + K+ D +V Sbjct: 178 GRQTGNADAKKAAKGWKKVTEDLRARFNDAGGDIGELDDWRLPQKHNRMKISKAGADAWV 237 Query: 215 RSMLDWLDLSRYKDIDGTPLSRSEIASFVGEVFAERVRSTSFKDPSIPSSEVGVKR---- 270 + D LD S+Y DG P++ +E+ + V+ T + P S+ G Sbjct: 238 DDVWDLLDRSKYVKKDGKPMNDAELREALYSVYNTIA--TDGMNKIDPGSKTGSGARANR 295 Query: 271 -EFERVFHFKDSQAHMDYMEHFGVSTNVNTILTSELASLSKDIVIARELGPNADSFVKQM 329 ERV HFKD+ +H+ Y FG T+V + L +S+DI + GP+ D + Sbjct: 296 GSAERVIHFKDADSHLQYQREFG-DTSVYASMVGHLDGMSRDIGLVETFGPDPDRNFNTL 354 Query: 330 IVQTIANDQEASAGNKVLKDWLGRNKLEVRQEAMLQMWE-VMRYGETVENTGWANWMAGL 388 + S + + + + +M Y EN W +AGL Sbjct: 355 LDAVKTKKGTTS------------RQPTGAMKMVEPTFNMLMGYNMVEENPVWGRRVAGL 402 Query: 389 RSAAGASMLGQHPIGALLEDGFISRQM----LSRVGIDKEAIQRINKMPLKERMELLSDV 444 R+ AS LG + AL + G +S + + + + + K D Sbjct: 403 RNLWTASKLGAAVLSALSDSGTMSMAASYNAMPPARLLRNMLSEMMP-GSKSEASRAQDF 461 Query: 445 GLYAEGVVAHGRNMMEGSD-AFQIGHKLHSKMHKWSGAEYLDKKRISSHALIVYNQIGRM 503 GL AE + N + L + K SG ++ + +G Sbjct: 462 GLGAES-LLDRVNRWGTDNLGGHRSRNLAEAVMKVSGLTAWTDAARAAFQFEMMTALGEA 520 Query: 504 TDTYASLKDLKADPRLDPSIKAFFKQLDDTDFTVIKRAKAMSSPDGYLYARTPSTIKNLK 563 AD L ++D ++ R K ++ D + A P Sbjct: 521 -----------ADSDWS--------DLPESDRRILGR-KGITESDWAIIAAAP------- 553 Query: 564 DADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQL-ADLERKEINILKDKVSNKMH 622 R +K L+P+ + + + A+ + K + ++ + M Sbjct: 554 -----------------RTNYGGAKMLTPDAIYAVPRNMDAEAQTKLLGMVDGETM--MA 594 Query: 623 ALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMFLNILDL 682 D ++ K GT GE R F F + P + Sbjct: 595 VPTPDARTRAIMAG--------------GTKSGTFGGELHRSFFLFKSFPIATIMR---H 637 Query: 683 SNSAKMPKGASMALNHVWIQYSATM----ALAGIGVASIKALLRGEDPSLPEVIYDGTLA 738 A KG A + + Y+A M L G G K LL G+D P + D Sbjct: 638 WRRAFTGKGYPGAFDR--MAYAAIMVASTTLLGAGSIQAKDLLNGKD---PRSMTDPKFW 692 Query: 739 NGALLP------YMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLT----SSAVELATKDNE 788 ALL Y D L + + GPV S LT + + A E Sbjct: 693 IEALLKGGSFGYYGDFLRNAHTGYGSDMTSYIGGPVLSYGDQLTKWVAMNPQDAAQGKAE 752 Query: 789 NSKVNATKAIRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK-GIELFQ 847 ++ + K + PF N+WY K + D LI++Q+ E +PGY +Q ++ +++ + Sbjct: 753 SAGADLLKFATQQTPFANLWYTKAATDRLIMDQLQELSSPGYDKKQLNRSRREFNQTYWW 812 Query: 848 NMDEGLPHRLPFPFGE 863 + G P R+P F Sbjct: 813 SPGIGTPQRIPSLFEA 828 >gnl|CDD|165202 PHA02869, PHA02869, C4L/C10L-like gene family protein; Provisional. Length = 418 Score = 34.7 bits (79), Expect = 0.11 Identities = 29/98 (29%), Positives = 47/98 (47%), Gaps = 19/98 (19%) Query: 561 NLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEI---NILKDKV 617 NL +DL D+ + + +SK PE+R EL + D + K+I N L D + Sbjct: 32 NLGISDLNDIKNICE----------DSKIFFPEKRTELLS-IKDRKSKQIVFENSLNDDL 80 Query: 618 SNKMHALVLDNVQTSVRGAMHTSLFDRQRLGLLTYKRG 655 K+HAL+ D + T V S+ + L+ Y++G Sbjct: 81 LKKLHALIYDELSTVV-----DSVTVENTVTLIMYEKG 113 >gnl|CDD|178882 PRK00124, PRK00124, hypothetical protein; Validated. Length = 151 Score = 30.6 bits (70), Expect = 1.7 Identities = 16/64 (25%), Positives = 25/64 (39%), Gaps = 8/64 (12%) Query: 539 KRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQEL 598 K A ++ P GY+Y T I L +RDL + + K + E R Sbjct: 85 KGAIVLN-PRGYIY--TNDNIDQL--LAMRDL---MATLRRSGIRTGGPKPFTQEDRSRF 136 Query: 599 QQQL 602 + +L Sbjct: 137 EAEL 140 >gnl|CDD|179891 PRK04914, PRK04914, ATP-dependent helicase HepA; Validated. Length = 956 Score = 30.6 bits (70), Expect = 1.7 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 3/52 (5%) Query: 36 LDGKGLSKAERYRLAGLKAEEDFQKELIRSVNDAIDEAYK-RHQLRSDL-DR 85 L G+ LS L L E+D + L+++ N +EA R +L S+L DR Sbjct: 359 LAGEKLSDDALNALGELLGEQDIE-PLLQAANSDSEEAQAARQELISELLDR 409 >gnl|CDD|184419 PRK13959, PRK13959, phosphoribosylaminoimidazole-succinocarboxamide synthase; Provisional. Length = 341 Score = 30.0 bits (68), Expect = 2.4 Identities = 14/39 (35%), Positives = 23/39 (58%), Gaps = 4/39 (10%) Query: 41 LSKAERYRLAGLKAEEDFQK--ELIRSVNDAI-DEAYKR 76 LS+ E +AGL + + ++ EL V++ I +EA KR Sbjct: 184 LSRKEADEIAGL-SRAEIEELEELALKVDEIITEEAEKR 221 >gnl|CDD|183727 PRK12757, PRK12757, cell division protein FtsN; Provisional. Length = 256 Score = 29.6 bits (67), Expect = 3.1 Identities = 12/23 (52%), Positives = 16/23 (69%), Gaps = 1/23 (4%) Query: 586 NSKT-LSPEQRQELQQQLADLER 607 NS T L+ EQRQ L+Q AD+ + Sbjct: 75 NSPTQLTDEQRQLLEQMQADMRQ 97 >gnl|CDD|150989 pfam10408, Ufd2P_core, Ubiquitin elongating factor core. This is the most conserved part of the core region of Ufd2P ubiquitin elongating factor or E4, running from helix alpha-11 to alpha-38. It consists of 31 helices of variable length connected by loops of variable size forming a compact unit; the helical packing pattern of the compact unit consists of five structural repeats that resemble tandem Armadillo (ARM) repeats. This domain is involved in ubiquitination as it binds Cdc48p and escorts ubiquitinated proteins from Cdc48p to the proteasome for degradation. The core is structurally similar to the nuclear transporter protein importin-alpha. The core is associated with the U-box at the C-terminus, pfam04564, which has ligase activity. Length = 624 Score = 29.2 bits (66), Expect = 4.9 Identities = 10/27 (37%), Positives = 16/27 (59%) Query: 582 KKLKNSKTLSPEQRQELQQQLADLERK 608 + ++LS E+RQE + +L ERK Sbjct: 466 QDRAEWESLSQEERQERESELHQAERK 492 >gnl|CDD|149418 pfam08349, DUF1722, Protein of unknown function (DUF1722). This domain of unknown function is found in bacteria and archaea and is homologous to the hypothetical protein ybgA from E. coli. Length = 117 Score = 29.0 bits (66), Expect = 5.4 Identities = 7/27 (25%), Positives = 14/27 (51%) Query: 587 SKTLSPEQRQELQQQLADLERKEINIL 613 K LS E++QEL + + + ++ Sbjct: 63 KKQLSSEEKQELLELIEQYREGLVPLV 89 >gnl|CDD|179046 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha; Validated. Length = 339 Score = 28.9 bits (66), Expect = 5.6 Identities = 19/75 (25%), Positives = 28/75 (37%), Gaps = 21/75 (28%) Query: 559 IKNLKDADLRDLARMSD-------KIAYHRKK------LKNSKTLSPEQRQELQQQLADL 605 ++ L + L +A SD ++ Y KK LK L PE+R+E Sbjct: 5 LEELVEEALAAIAAASDLEALEALRVKYLGKKGELTELLKGLGKLPPEERKEAGA----- 59 Query: 606 ERKEINILKDKVSNK 620 IN LK + Sbjct: 60 ---LINELKQAIEAA 71 >gnl|CDD|181687 PRK09192, PRK09192, acyl-CoA synthetase; Validated. Length = 579 Score = 28.8 bits (65), Expect = 6.2 Identities = 24/57 (42%), Positives = 28/57 (49%), Gaps = 10/57 (17%) Query: 200 MSVDKLRATKKDDFVRSMLDWLDL-SRYKDIDGTPLSRSEIASFVGEVFAERVRSTS 255 +SVD L DF R L WLDL SR + GT +S S F E+ A RV S Sbjct: 245 LSVDYLPT---RDFARRPLQWLDLISRNR---GT-ISYSP--PFGYELCARRVNSKD 292 >gnl|CDD|178947 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed. Length = 865 Score = 28.5 bits (65), Expect = 7.4 Identities = 21/94 (22%), Positives = 42/94 (44%), Gaps = 18/94 (19%) Query: 538 IKRAKAMSSPDGYLYARTPSTIKNLKDADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQE 597 ++R +A++ Y ++A L++LA + LK + PE+ + Sbjct: 681 VRRIEAVTGEAALEYLNE-------QEALLKELAAL----------LKAKPSELPERVEA 723 Query: 598 LQQQLADLERKEINILKDKVSNKMHALVLDNVQT 631 L ++L +LE KE+ LK K++ +L + Sbjct: 724 LLEELKELE-KELEQLKAKLAAAAAGDLLAQAKE 756 >gnl|CDD|179924 PRK05074, PRK05074, inosine/xanthosine triphosphatase; Reviewed. Length = 173 Score = 28.3 bits (64), Expect = 8.0 Identities = 16/46 (34%), Positives = 18/46 (39%), Gaps = 5/46 (10%) Query: 722 RGEDPS----LPEVIYDGTLANGALLPYMDRLTKLV-SKGDRAAIG 762 RGE S LP V+ + L MDRL K AIG Sbjct: 97 RGESRSASLPLPAVVLEALRQGEELGDVMDRLFGTDNIKQKGGAIG 142 >gnl|CDD|150853 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This protein, which interacts with both microtubules and TRAF3 (tumour necrosis factor receptor-associated factor 3), is conserved from worms to humans. The N-terminal region is the microtubule binding domain and is well-conserved; the C-terminal 100 residues, also well-conserved, constitute the coiled-coil region which binds to TRAF3. The central region of the protein is rich in lysine and glutamic acid and carries KKE motifs which may also be necessary for tubulin-binding, but this region is the least well-conserved. Length = 506 Score = 28.3 bits (63), Expect = 8.8 Identities = 15/56 (26%), Positives = 28/56 (50%), Gaps = 1/56 (1%) Query: 564 DADLRDLARMSDKIAYHRKKLKNSKTLSPEQRQELQQQLADLERKEINILKDKVSN 619 D+ R+L + + L+ + + E + L+ QLA+LE + I + K+SN Sbjct: 433 DSMQRELEMWRSEYRQQAQALQQERRATEEALEPLKDQLAELE-QAIKDQRSKISN 487 >gnl|CDD|180242 PRK05761, PRK05761, DNA polymerase I; Reviewed. Length = 787 Score = 28.1 bits (63), Expect = 9.0 Identities = 17/79 (21%), Positives = 27/79 (34%), Gaps = 6/79 (7%) Query: 726 PSLPEVIYDGTLANGALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATK 785 P L V YDG L + Y K+ DR G P +T+L ++ K Sbjct: 19 PYLLSVDYDGKLGKAVVKLYDPETGKIYKWYDRT------GHKPYFLTDLDPDEIDKIPK 72 Query: 786 DNENSKVNATKAIRKTLPF 804 + + + + K Sbjct: 73 ILRHPSFDHLEIVEKYDGL 91 >gnl|CDD|180982 PRK07452, PRK07452, DNA polymerase III subunit delta; Validated. Length = 326 Score = 28.3 bits (64), Expect = 9.1 Identities = 16/51 (31%), Positives = 23/51 (45%), Gaps = 9/51 (17%) Query: 581 RKKLKNSKTLSPEQRQELQQQLADLE------RKEINILKDKVSNKMHALV 625 RK+++ LS +Q +L L DLE IN L+DK+ L Sbjct: 279 RKEVQG---LSSQQLLKLLPLLLDLEASLKQGANPINALQDKLIELCQLLQ 326 >gnl|CDD|179383 PRK02201, PRK02201, putative inner membrane protein translocase component YidC; Provisional. Length = 357 Score = 28.1 bits (63), Expect = 9.2 Identities = 10/36 (27%), Positives = 19/36 (52%) Query: 589 TLSPEQRQELQQQLADLERKEINILKDKVSNKMHAL 624 T + E+++ELQ + A ++ K + KDK + Sbjct: 156 TFNQEKQEELQGKKAKIDAKYKDYKKDKQMKQRKQQ 191 >gnl|CDD|181287 PRK08203, PRK08203, hydroxydechloroatrazine ethylaminohydrolase; Reviewed. Length = 451 Score = 28.3 bits (64), Expect = 9.2 Identities = 18/50 (36%), Positives = 23/50 (46%), Gaps = 6/50 (12%) Query: 356 LEVRQEAMLQMWEVMRYGETVENTGWANWMAGLRSAAGASMLGQHPIGAL 405 E RQ +LQ +RYG A A L GA +LG+ IG+L Sbjct: 333 GEARQALLLQR---LRYGPDAMTAREALEWATL---GGARVLGRDDIGSL 376 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.317 0.132 0.372 Gapped Lambda K H 0.267 0.0684 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 14,055,137 Number of extensions: 940213 Number of successful extensions: 2382 Number of sequences better than 10.0: 1 Number of HSP's gapped: 2369 Number of HSP's successfully gapped: 41 Length of query: 864 Length of database: 5,994,473 Length adjustment: 102 Effective length of query: 762 Effective length of database: 3,790,457 Effective search space: 2888328234 Effective search space used: 2888328234 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 62 (27.8 bits)