RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780773|ref|YP_003065186.1| zinc metallopeptidase [Candidatus Liberibacter asiaticus str. psy62] (349 letters) >gnl|CDD|161683 TIGR00054, TIGR00054, RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain. Length = 420 Score = 121 bits (305), Expect = 3e-28 Identities = 76/228 (33%), Positives = 116/228 (50%), Gaps = 12/228 (5%) Query: 120 PVVSNVSPASPAAIAGVKKGDCIISLDGITVSAFEEVAPYVRENPLHEISLVLYREHVGV 179 PV+S+V+P SPA AG+K+GD I S++G + ++ + V+ENP + + + R Sbjct: 205 PVLSDVTPNSPAEKAGLKEGDYIQSINGEKLRSWTDFVSAVKENPGKSMDIKVERNG-ET 263 Query: 180 LHLKVMPRLQDTVDRFGIKRQVPSVGISFSYDETKLHSRTVLQSFSRGLDEISSITRGFL 239 L + + P + + +GIS S S +L +F++G I + L Sbjct: 264 LSISLTPEAKGKI----------GIGISPSL-APLEVSYGILNAFAKGASATVDIVKLIL 312 Query: 240 GVLSSAFGKDTRLNQISGPVGIARIAKNFFDHGFNAYIAFLAMFSWAIGFMNLLPIPILD 299 L +L +SGPVGI + A + + G + F A S +G MNLLPIP LD Sbjct: 313 TNLGKLITGSFKLKNLSGPVGIVKGAGSSANSGIVYLLQFGAFLSINLGIMNLLPIPALD 372 Query: 300 GGHLITFLLEMIRGKSLGVSVTRVITRMGLCIILFLFFLGIRNDIYGL 347 GG L+ +E IRGK L V + R+G+ +LFL LG+ ND+ L Sbjct: 373 GGQLLFLFIEAIRGKPLPEKVQAFVYRIGVAFLLFLMGLGLFNDLLRL 420 Score = 104 bits (261), Expect = 3e-23 Identities = 57/165 (34%), Positives = 94/165 (56%), Gaps = 19/165 (11%) Query: 9 LYTVSLII----IVVIHEFGHYMVARLCNIRVLSFSVGFGPELIGITSRSGVRWKVSLIP 64 L+ ++ I+ ++ +HE GH++ ARLC I+V FS+GFGP+++ ++G + +SLIP Sbjct: 4 LWILASILALAVLIFVHELGHFLAARLCGIKVERFSIGFGPKILKF-KKNGTEYAISLIP 62 Query: 65 LGGYVSFSEDEKDMR---------SFFCAAPWKKILTVLAGPLANCVMAILFFTFFFYNT 115 LGGYV +K+M F + ++K + + AGPLAN + AI F F Sbjct: 63 LGGYVKMKGLDKEMEVKPPETDGDLFNNKSVFQKAIIIFAGPLANFIFAI-FVYIFISLI 121 Query: 116 GV----MKPVVSNVSPASPAAIAGVKKGDCIISLDGITVSAFEEV 156 GV + PV+ + S A AG++ GD I+S++G + F++V Sbjct: 122 GVPGYEVGPVIELLDKNSIALEAGIEPGDEILSVNGNKIPGFKDV 166 >gnl|CDD|182723 PRK10779, PRK10779, zinc metallopeptidase RseP; Provisional. Length = 449 Score = 99.4 bits (248), Expect = 1e-21 Identities = 54/148 (36%), Positives = 83/148 (56%), Gaps = 10/148 (6%) Query: 10 YTVSLIIIVVIHEFGHYMVARLCNIRVLSFSVGFGPELIGITSRSGVRWKVSLIPLGGYV 69 + V+L +++ +HEFGH+ VAR C +RV FS+GFG L T R G + ++LIPLGGYV Sbjct: 10 FIVALGVLITVHEFGHFWVARRCGVRVERFSIGFGKALWRRTDRQGTEYVIALIPLGGYV 69 Query: 70 SFSED-------EKDMRSFFCAAPWKKILTVLAGPLANCVMAILFFTFFFYNTGV--MKP 120 ++ E +F ++ + AGP+AN + AI + F GV ++P Sbjct: 70 KMLDERVEPVAPELRHHAFNNKTVGQRAAIIAAGPIANFIFAIFAYWLVFI-IGVPGVRP 128 Query: 121 VVSNVSPASPAAIAGVKKGDCIISLDGI 148 VV ++P S AA A + G + ++DGI Sbjct: 129 VVGEIAPNSIAAQAQIAPGTELKAVDGI 156 Score = 95.1 bits (237), Expect = 2e-20 Identities = 75/231 (32%), Positives = 119/231 (51%), Gaps = 15/231 (6%) Query: 120 PVVSNVSPASPAAIAGVKKGDCIISLDGITVSAFEEVAPYVRENPLHEISLVLYREHVGV 179 PV++ V P S A+ AG++ GD I+ +DG ++ ++ VR+NP ++L + R Sbjct: 223 PVLAEVQPNSAASKAGLQAGDRIVKVDGQPLTQWQTFVTLVRDNPGKPLALEIER-QGSP 281 Query: 180 LHLKVMPRLQDTVDRF-GIKRQVPSVGISFSYDETKLHSRTVLQ--SFSRGLDEISSITR 236 L L + P + + G VP V I DE K TV Q FS + E + T Sbjct: 282 LSLTLTPDSKPGNGKAEGFAGVVPKV-IPLP-DEYK----TVRQYGPFS-AIYEATDKTW 334 Query: 237 GFLGVLSSAFGK----DTRLNQISGPVGIARIAKNFFDHGFNAYIAFLAMFSWAIGFMNL 292 + + S GK D +LN +SGP+ IA+ A ++G Y+ FLA+ S +G +NL Sbjct: 335 QLMKLTVSMLGKLITGDVKLNNLSGPISIAQGAGMSAEYGLVYYLMFLALISVNLGIINL 394 Query: 293 LPIPILDGGHLITFLLEMIRGKSLGVSVTRVITRMGLCIILFLFFLGIRND 343 P+P+LDGGHL+ +E ++G + V R+G +++ L L + ND Sbjct: 395 FPLPVLDGGHLLFLAIEKLKGGPVSERVQDFSYRIGSILLVLLMGLALFND 445 >gnl|CDD|128524 smart00228, PDZ, Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities. Length = 85 Score = 45.4 bits (108), Expect = 2e-05 Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 3/57 (5%) Query: 121 VVSNVSPASPAAIAGVKKGDCIISLDGITVSAF--EEVAPYVRENPLHEISLVLYRE 175 VVS+V P SPAA AG+K GD I+ ++G +V E +++ +++L + R Sbjct: 29 VVSSVVPGSPAAKAGLKVGDVILEVNGTSVEGLTHLEAVDLLKKAG-GKVTLTVLRG 84 >gnl|CDD|162670 TIGR02037, degP_htrA_DO, periplasmic serine protease, Do/DeqQ family. This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures. Length = 428 Score = 42.6 bits (101), Expect = 2e-04 Identities = 16/39 (41%), Positives = 26/39 (66%) Query: 122 VSNVSPASPAAIAGVKKGDCIISLDGITVSAFEEVAPYV 160 V+ V P SPA AG+K GD I+S++G +S+F ++ + Sbjct: 261 VAQVLPGSPAEKAGLKAGDVILSVNGKPISSFADLRRAI 299 Score = 38.7 bits (91), Expect = 0.002 Identities = 16/36 (44%), Positives = 24/36 (66%) Query: 121 VVSNVSPASPAAIAGVKKGDCIISLDGITVSAFEEV 156 VV+ V SPAA AG++ GD I+S++ VS+ E+ Sbjct: 365 VVTKVVSGSPAARAGLQPGDVILSVNQQPVSSVAEL 400 >gnl|CDD|182855 PRK10942, PRK10942, serine endoprotease; Provisional. Length = 473 Score = 41.7 bits (98), Expect = 3e-04 Identities = 18/32 (56%), Positives = 22/32 (68%) Query: 122 VSNVSPASPAAIAGVKKGDCIISLDGITVSAF 153 VS V P S AA AG+K GD I SL+G +S+F Sbjct: 315 VSQVLPNSSAAKAGIKAGDVITSLNGKPISSF 346 Score = 33.2 bits (76), Expect = 0.091 Identities = 18/31 (58%), Positives = 20/31 (64%), Gaps = 1/31 (3%) Query: 114 NTGVMKPV-VSNVSPASPAAIAGVKKGDCII 143 N G K V V NV P +PAA G+KKGD II Sbjct: 403 NKGGDKGVVVDNVKPGTPAAQIGLKKGDVII 433 >gnl|CDD|182262 PRK10139, PRK10139, serine endoprotease; Provisional. Length = 455 Score = 38.4 bits (89), Expect = 0.002 Identities = 18/35 (51%), Positives = 24/35 (68%) Query: 122 VSNVSPASPAAIAGVKKGDCIISLDGITVSAFEEV 156 VS V P S +A AGVK GD I SL+G +++F E+ Sbjct: 294 VSEVLPNSGSAKAGVKAGDIITSLNGKPLNSFAEL 328 >gnl|CDD|182820 PRK10898, PRK10898, serine endoprotease; Provisional. Length = 353 Score = 38.1 bits (89), Expect = 0.003 Identities = 17/35 (48%), Positives = 23/35 (65%), Gaps = 1/35 (2%) Query: 121 VVSNVSPASPAAIAGVKKGDCIISLDGI-TVSAFE 154 VV+ VSP PAA AG++ D IIS++ +SA E Sbjct: 282 VVNEVSPDGPAAKAGIQVNDLIISVNNKPAISALE 316 >gnl|CDD|162671 TIGR02038, protease_degS, periplasmic serine pepetdase DegS. This family consists of the periplasmic serine protease DegS (HhoB), a shorter paralog of protease DO (HtrA, DegP) and DegQ (HhoA). It is found in E. coli and several other Proteobacteria of the gamma subdivision. It contains a trypsin domain and a single copy of PDZ domain (in contrast to DegP with two copies). A critical role of this DegS is to sense stress in the periplasm and partially degrade an inhibitor of sigma(E). Length = 351 Score = 36.0 bits (83), Expect = 0.014 Identities = 16/42 (38%), Positives = 22/42 (52%) Query: 121 VVSNVSPASPAAIAGVKKGDCIISLDGITVSAFEEVAPYVRE 162 V++ V P PAA AG+ D I+ DG V EE+ + E Sbjct: 281 VITGVDPNGPAARAGILVRDVILKYDGKDVIGAEELMDRIAE 322 >gnl|CDD|163044 TIGR02860, spore_IV_B, stage IV sporulation protein B. SpoIVB, the stage IV sporulation protein B of endospore-forming bacteria such as Bacillus subtilis, is a serine proteinase, expressed in the spore (rather than mother cell) compartment, that participates in a proteolytic activation cascade for Sigma-K. It appears to be universal among endospore-forming bacteria and occurs nowhere else. Length = 402 Score = 35.8 bits (83), Expect = 0.016 Identities = 11/46 (23%), Positives = 26/46 (56%) Query: 129 SPAAIAGVKKGDCIISLDGITVSAFEEVAPYVRENPLHEISLVLYR 174 SP AG++ GD I+ ++G + +++A + + +++L + R Sbjct: 124 SPGEEAGIQIGDRILKINGEKIKNMDDLANLINKAGGEKLTLTIER 169 >gnl|CDD|161775 TIGR00225, prc, C-terminal peptidase (prc). A C-terminal peptidase with different substrates in different species including processing of D1 protein of the photosystem II reaction center in higher plants and cleavage of a peptide of 11 residues from the precursor form of penicillin-binding protein in E.coli E.coli and H influenza have the most distal branch of the tree and their proteins have an N-terminal 200 amino acids that show no homology to other proteins in the database. Length = 334 Score = 32.7 bits (75), Expect = 0.15 Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 3/36 (8%) Query: 118 MKP-VVSNVSP--ASPAAIAGVKKGDCIISLDGITV 150 M + VSP SPA AG+K GD II ++G +V Sbjct: 59 MDDGEIVIVSPFEGSPAEKAGIKPGDKIIKINGKSV 94 >gnl|CDD|177681 PLN00049, PLN00049, carboxyl-terminal processing protease; Provisional. Length = 389 Score = 32.0 bits (73), Expect = 0.24 Identities = 13/30 (43%), Positives = 20/30 (66%) Query: 121 VVSNVSPASPAAIAGVKKGDCIISLDGITV 150 VV +P PAA AG++ GD I+++DG + Sbjct: 105 VVVAPAPGGPAARAGIRPGDVILAIDGTST 134 >gnl|CDD|183852 PRK13030, PRK13030, 2-oxoacid ferredoxin oxidoreductase; Provisional. Length = 1159 Score = 30.7 bits (70), Expect = 0.58 Identities = 21/87 (24%), Positives = 36/87 (41%), Gaps = 19/87 (21%) Query: 156 VAPYVRENPLHEISLVLYREHVGVL-----HLKVMPRLQ------DTVDRFGIKRQVPSV 204 V + R N + VG++ HL +M L+ + GI+ + V Sbjct: 256 VRAFARANSIDRWVAPSPDARVGIVTCGKAHLDLMEALRRLGLDDADLRAAGIR--IYKV 313 Query: 205 GISFSYDETKLHSRTVLQSFSRGLDEI 231 G+S+ + T L+ F+ GL+EI Sbjct: 314 GLSWPLE------PTRLREFADGLEEI 334 >gnl|CDD|151346 pfam10898, DUF2716, Protein of unknown function (DUF2716). This bacterial family of proteins has no known function. Length = 143 Score = 30.4 bits (69), Expect = 0.62 Identities = 13/41 (31%), Positives = 19/41 (46%), Gaps = 2/41 (4%) Query: 58 WKVSLIPLGGYVSFSEDEKDMRSFFCAAPWKKILTVLAGPL 98 W V + P G Y F + KD + PW+K +T+ L Sbjct: 104 WPVPVFPNGDYYFFLQ--KDFKWGLLGHPWEKSITIFGEEL 142 >gnl|CDD|173403 PTZ00111, PTZ00111, DNA replication licensing factor MCM4; Provisional. Length = 915 Score = 30.7 bits (69), Expect = 0.63 Identities = 16/60 (26%), Positives = 26/60 (43%), Gaps = 1/60 (1%) Query: 140 DCIISLDGITVSAFEEVAPYVRENPLHEISLVLYREHVGVLHLKVMPRLQDTVDRFGIKR 199 D + + D +TV +V P +R + LY V V+H+KV+ G+K Sbjct: 358 DSVKTGDRVTVVGILKVTP-IRTSTTRRTLKSLYTYFVNVIHVKVINSTNANQPEKGLKY 416 >gnl|CDD|148084 pfam06262, DUF1025, Domain of unknown function (DUF1025). Family of bacterial protein with undetermined function. Length = 96 Score = 29.0 bits (66), Expect = 1.9 Identities = 8/14 (57%), Positives = 10/14 (71%) Query: 14 LIIIVVIHEFGHYM 27 L+ VVIHE GH+ Sbjct: 71 LVRHVVIHEIGHHF 84 >gnl|CDD|131798 TIGR02751, PEPCase_arch, phosphoenolpyruvate carboxylase, archaeal type. This family is the archaeal-type phosphoenolpyruvate carboxylase, although not every host species is archaeal. These sequences bear little resemblance to the bacterial/eukaryotic type. The members from Sulfolobus solfataricus and Methanothermobacter thermautotrophicus were verified experimentally, while the activity is known to be present in a number of other archaea. Length = 506 Score = 28.9 bits (65), Expect = 2.1 Identities = 14/53 (26%), Positives = 20/53 (37%), Gaps = 5/53 (9%) Query: 35 RVLSF-----SVGFGPELIGITSRSGVRWKVSLIPLGGYVSFSEDEKDMRSFF 82 R + F S+G PELIGI S S + + Y + D + Sbjct: 374 RAIKFTGALYSIGLPPELIGIGSLSELDEDDLEVIKELYPNLRSDLSFAARYL 426 >gnl|CDD|181525 PRK08655, PRK08655, prephenate dehydrogenase; Provisional. Length = 437 Score = 27.6 bits (62), Expect = 4.7 Identities = 14/35 (40%), Positives = 19/35 (54%), Gaps = 2/35 (5%) Query: 133 IAGVKKGDC-IISLD-GITVSAFEEVAPYVRENPL 165 I K D IIS+ +T +EVAP+V+E L Sbjct: 53 IDAAKDADIVIISVPINVTEDVIKEVAPHVKEGSL 87 >gnl|CDD|132354 TIGR03311, Se_dep_Molyb_1, selenium-dependent molybdenum hydroxylase 1. Members of this protein family show full length homology to the molybdenum-containing aldehyde oxido-reductase of Desulfovibrio gigas. Members, however, are found only within species that have, and near those genes that encode, a set of predicted accessory proteins for selenium-dependent molybdenum hydroxylases. The best known examples of such enzymes are forms of xanthine dehydrogenase and purine hydroxylase; this family appears to be another such enzyme. Length = 848 Score = 27.1 bits (60), Expect = 6.3 Identities = 11/27 (40%), Positives = 16/27 (59%), Gaps = 1/27 (3%) Query: 236 RGFLGVLSSAFGKDTRLNQISGPVGIA 262 RGF GV S F + LN ++ VG++ Sbjct: 506 RGF-GVTQSNFAMECNLNLLAEKVGLS 531 >gnl|CDD|169151 PRK07912, PRK07912, salicylate synthase MbtI; Reviewed. Length = 449 Score = 27.1 bits (60), Expect = 7.3 Identities = 17/54 (31%), Positives = 23/54 (42%), Gaps = 11/54 (20%) Query: 26 YMVARLCNIRVLSF--------SVGFGPELIGITSRSGVRWKVSLIPLGGYVSF 71 Y + R N V SF ++G+ PEL+ GV V PL G +F Sbjct: 223 YRLGRRHNTPVRSFLLRLGGIRALGYSPELVTAVRADGV---VITEPLAGTRAF 273 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.330 0.145 0.443 Gapped Lambda K H 0.267 0.0624 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 5,886,143 Number of extensions: 394471 Number of successful extensions: 1020 Number of sequences better than 10.0: 1 Number of HSP's gapped: 1012 Number of HSP's successfully gapped: 38 Length of query: 349 Length of database: 5,994,473 Length adjustment: 94 Effective length of query: 255 Effective length of database: 3,963,321 Effective search space: 1010646855 Effective search space used: 1010646855 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.8 bits) S2: 58 (26.3 bits)