RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780700|ref|YP_003065113.1| serine protease DO-like protease [Candidatus Liberibacter asiaticus str. psy62] (489 letters) >gnl|CDD|162670 TIGR02037, degP_htrA_DO, periplasmic serine protease, Do/DeqQ family. This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures. Length = 428 Score = 311 bits (800), Expect = 2e-85 Identities = 144/437 (32%), Positives = 225/437 (51%), Gaps = 30/437 (6%) Query: 39 DLPPVIARVSPSIVSVMVEPKKKVSVEQMFNAYGFGNLPEDHPLKNYFRKDFHKFFSGEE 98 P++ +V+P++V++ VE K + +F D F + Sbjct: 2 SFAPLVEKVAPAVVNISVEGTVKRRNRPPAL---------PPFFRQFFGDDMPNFPRQQ- 51 Query: 99 PILSDTVERLMFGSGFFITDDGYILTSNHIVEDGASFSVILSDDTELPAKLVGTDALFDL 158 + R + GSG I+ DGYILT+NH+V+ +V LSD E AKLVG D D+ Sbjct: 52 ---RERKVRGL-GSGVIISADGYILTNNHVVDGADEITVTLSDGREFKAKLVGKDPRTDI 107 Query: 159 AVLKVQSDRKFIPVEFEDANNIRVGEAVFTIGNPFRLRGTVSAGIVSALDRDIP--DRPG 216 AVLK+ + + ++ D++ +RVG+ V IGNPF L TV++GIVSAL R Sbjct: 108 AVLKIDAKKNLPVIKLGDSDKLRVGDWVLAIGNPFGLGQTVTSGIVSALGRSGLGIGDYE 167 Query: 217 TFTQIDAPINQGNSGGPCFNALGHVIGVNAMIVTSGQFHMGVGLIIPLSIIKKAIPSLIS 276 F Q DA IN GNSGGP N G VIG+N I + ++G+G IP ++ K + LI Sbjct: 168 NFIQTDAAINPGNSGGPLVNLRGEVIGINTAIYSPSGGNVGIGFAIPSNMAKNVVDQLIE 227 Query: 277 KGRVDHGWFGIMTQNLTQELAIPLGLRGTKGSLITAVVKESPADKAGMKVGDVICMLDGR 336 G+V GW G+ Q +T +LA LGL +G+L+ V+ SPA+KAG+K GDVI ++G+ Sbjct: 228 GGKVQRGWLGVTIQEVTSDLAKSLGLEKQRGALVAQVLPGSPAEKAGLKAGDVILSVNGK 287 Query: 337 IIKSHQDFVWQIASRSPKEQVKISLCKEGSKHSVAVVLGSSPTAKNDMHLEVGDKELLGM 396 I S D I + P ++V + + ++G + ++ V LG+SP + LG+ Sbjct: 288 PISSFADLRRAIGTLKPGKKVTLGILRKGKEKTITVTLGASPEEQAS-----SSNPFLGL 342 Query: 397 VLQDINDGNKKLVR-------IVALNPNREREVEAKGIQKGMTIVSVNTHEVSCIKDVER 449 + +++ +K +R +V G+Q G I+SVN VS + ++ + Sbjct: 343 TVANLSPEIRKELRLKGDVKGVVVTKVVSGSPAARAGLQPGDVILSVNQQPVSSVAELRK 402 Query: 450 LIGKAKEKKRDSVLLQI 466 ++ +AK+ V L I Sbjct: 403 VLDRAKKG--GRVALLI 417 >gnl|CDD|182262 PRK10139, PRK10139, serine endoprotease; Provisional. Length = 455 Score = 192 bits (488), Expect = 2e-49 Identities = 142/485 (29%), Positives = 231/485 (47%), Gaps = 46/485 (9%) Query: 3 KRQILSVKSICTVALTCVIFSSTYLVLEAKLP--PSSVDLPPVIARVSPSIVSVMVEPKK 60 + Q+LS ++ +V LT + ++ L P++ +V P++VSV VE Sbjct: 4 QTQLLSALAL-SVGLTLSASFQAVASIPGQVAGQAPLPSLAPMLEKVLPAVVSVRVEGTA 62 Query: 61 KVSVEQMFNAYGFGNLPEDHPLKNYFRKDFHKFFSGEEP-ILSDTVERLMFGSGFFI-TD 118 +PE+ F KFF + P + E L GSG I Sbjct: 63 SQGQ----------KIPEE----------FKKFFGDDLPDQPAQPFEGL--GSGVIIDAA 100 Query: 119 DGYILTSNHIVEDGASFSVILSDDTELPAKLVGTDALFDLAVLKVQSDRKFIPVEFEDAN 178 GY+LT+NH++ S+ L+D E AKL+G+D D+A+L++Q+ K + D++ Sbjct: 101 KGYVLTNNHVINQAQKISIQLNDGREFDAKLIGSDDQSDIALLQIQNPSKLTQIAIADSD 160 Query: 179 NIRVGEAVFTIGNPFRLRGTVSAGIVSALDRDIPDRPG--TFTQIDAPINQGNSGGPCFN 236 +RVG+ +GNPF L T ++GI+SAL R + G F Q DA IN+GNSGG N Sbjct: 161 KLRVGDFAVAVGNPFGLGQTATSGIISALGRSGLNLEGLENFIQTDASINRGNSGGALLN 220 Query: 237 ALGHVIGVNAMIVTSGQFHMGVGLIIPLSIIKKAIPSLISKGRVDHGWFGIMTQNLTQEL 296 G +IG+N I+ G +G+G IP ++ + LI G + G GI ++ ++ Sbjct: 221 LNGELIGINTAILAPGGGSVGIGFAIPSNMARTLAQQLIDFGEIKRGLLGIKGTEMSADI 280 Query: 297 AIPLGLRGTKGSLITAVVKESPADKAGMKVGDVICMLDGRIIKSHQDFVWQIASRSPKEQ 356 A L +G+ ++ V+ S + KAG+K GD+I L+G+ + S + +IA+ P + Sbjct: 281 AKAFNLDVQRGAFVSEVLPNSGSAKAGVKAGDIITSLNGKPLNSFAELRSRIATTEPGTK 340 Query: 357 VKISLCKEGSKHSVAVVLGSSPTAKNDMHLEVGDKELLGMVLQD--INDGNKKLVRIVAL 414 VK+ L + G V V L +S ++ E+ L G L D + DG K I Sbjct: 341 VKLGLLRNGKPLEVEVTLDTSTSSSASA--EMITPALQGATLSDGQLKDGTKG---IKID 395 Query: 415 NPNREREVEAKGIQKGMTIVSVNTHEVSCIKDVERLIGKAKEKKRDSVLLQIKYDPDMQS 474 + G+QK I+ VN V+ I ++ +++ K + LQI Sbjct: 396 EVVKGSPAAQAGLQKDDVIIGVNRDRVNSIAEMRKVL----AAKPAIIALQI------VR 445 Query: 475 GNDNM 479 GN+++ Sbjct: 446 GNESI 450 >gnl|CDD|182855 PRK10942, PRK10942, serine endoprotease; Provisional. Length = 473 Score = 181 bits (461), Expect = 4e-46 Identities = 129/431 (29%), Positives = 214/431 (49%), Gaps = 39/431 (9%) Query: 40 LPPVIARVSPSIVSVMVEPKKKVSVEQM---FNAYGFGNLP---EDHPLKNYFRKDFHKF 93 L P++ +V PS+VS+ VE V+ +M F + N P E P ++ F Sbjct: 40 LAPMLEKVMPSVVSINVEGSTTVNTPRMPRQFQQFFGDNSPFCQEGSPFQS------SPF 93 Query: 94 FSGEEPILSDTVER--LMFGSGFFI-TDDGYILTSNHIVEDGASFSVILSDDTELPAKLV 150 G + ++ + GSG I D GY++T+NH+V++ V LSD + AK+V Sbjct: 94 CQGGQGGNGGGQQQKFMALGSGVIIDADKGYVVTNNHVVDNATKIKVQLSDGRKFDAKVV 153 Query: 151 GTDALFDLAVLKVQSDRKFIPVEFEDANNIRVGEAVFTIGNPFRLRGTVSAGIVSALDR- 209 G D D+A++++Q+ + ++ D++ +RVG+ IGNP+ L TV++GIVSAL R Sbjct: 154 GKDPRSDIALIQLQNPKNLTAIKMADSDALRVGDYTVAIGNPYGLGETVTSGIVSALGRS 213 Query: 210 --DIPDRPGTFTQIDAPINQGNSGGPCFNALGHVIGVNAMIVTSGQFHMGVGLIIPLSII 267 ++ + F Q DA IN+GNSGG N G +IG+N I+ ++G+G IP +++ Sbjct: 214 GLNVENYE-NFIQTDAAINRGNSGGALVNLNGELIGINTAILAPDGGNIGIGFAIPSNMV 272 Query: 268 KKAIPSLISKGRVDHGWFGIMTQNLTQELAIPLGLRGTKGSLITAVVKESPADKAGMKVG 327 K ++ G+V G GIM L ELA + + +G+ ++ V+ S A KAG+K G Sbjct: 273 KNLTSQMVEYGQVKRGELGIMGTELNSELAKAMKVDAQRGAFVSQVLPNSSAAKAGIKAG 332 Query: 328 DVICMLDGRIIKSHQDFVWQIASRSPKEQVKISLCKEGSKHSVAVVLGSSPTAKNDMHLE 387 DVI L+G+ I S Q+ + ++ + L ++G +V V L S + D Sbjct: 333 DVITSLNGKPISSFAALRAQVGTMPVGSKLTLGLLRDGKPVNVNVELQQSSQNQVD---- 388 Query: 388 VGDKELLGMVLQDI------NDGNKKLVRIVALNPNREREVEAK-GIQKGMTIVSVNTHE 440 + I N G K V + + P A+ G++KG I+ N Sbjct: 389 ------SSNIFNGIEGAELSNKGGDKGVVVDNVKPG---TPAAQIGLKKGDVIIGANQQP 439 Query: 441 VSCIKDVERLI 451 V I ++ +++ Sbjct: 440 VKNIAELRKIL 450 >gnl|CDD|162671 TIGR02038, protease_degS, periplasmic serine pepetdase DegS. This family consists of the periplasmic serine protease DegS (HhoB), a shorter paralog of protease DO (HtrA, DegP) and DegQ (HhoA). It is found in E. coli and several other Proteobacteria of the gamma subdivision. It contains a trypsin domain and a single copy of PDZ domain (in contrast to DegP with two copies). A critical role of this DegS is to sense stress in the periplasm and partially degrade an inhibitor of sigma(E). Length = 351 Score = 160 bits (407), Expect = 6e-40 Identities = 93/273 (34%), Positives = 146/273 (53%), Gaps = 7/273 (2%) Query: 111 GSGFFITDDGYILTSNHIVEDGASFSVILSDDTELPAKLVGTDALFDLAVLKVQSDR-KF 169 GSG ++ +GYILT+ H+++ V L D + A+LVG+D L DLAVLK++ D Sbjct: 80 GSGVIMSKEGYILTNYHVIKKADQIVVALQDGRKFEAELVGSDPLTDLAVLKIEGDNLPT 139 Query: 170 IPVEFEDANNIRVGEAVFTIGNPFRLRGTVSAGIVSALDRDIPDRPG--TFTQIDAPINQ 227 IPV VG+ V IGNP+ L T++ GI+SA R+ G F Q DA IN Sbjct: 140 IPVNL--DRPPHVGDVVLAIGNPYNLGQTITQGIISATGRNGLSSVGRQNFIQTDAAINA 197 Query: 228 GNSGGPCFNALGHVIGVN--AMIVTSGQFHMGVGLIIPLSIIKKAIPSLISKGRVDHGWF 285 GNSGG N G ++G+N + + G+ IP+ + K + +I GRV G+ Sbjct: 198 GNSGGALINTNGELVGINTASFQKGGDEGGEGINFAIPIKLAHKIMGKIIRDGRVIRGYI 257 Query: 286 GIMTQNLTQELAIPLGLRGTKGSLITAVVKESPADKAGMKVGDVICMLDGRIIKSHQDFV 345 G+ +++ +A LGL +G +IT V PA +AG+ V DVI DG+ + ++ + Sbjct: 258 GVSGEDINSVVAQGLGLPDLRGIVITGVDPNGPAARAGILVRDVILKYDGKDVIGAEELM 317 Query: 346 WQIASRSPKEQVKISLCKEGSKHSVAVVLGSSP 378 +IA P +V +++ ++G + + V + P Sbjct: 318 DRIAETRPGSKVMVTVLRQGKQLELPVTIDEKP 350 >gnl|CDD|182820 PRK10898, PRK10898, serine endoprotease; Provisional. Length = 353 Score = 152 bits (387), Expect = 1e-37 Identities = 89/275 (32%), Positives = 141/275 (51%), Gaps = 8/275 (2%) Query: 111 GSGFFITDDGYILTSNHIVEDGASFSVILSDDTELPAKLVGTDALFDLAVLKVQSDRKFI 170 GSG + GYILT+ H++ D V L D A LVG+D+L DLAVLK+ + + Sbjct: 80 GSGVIMDQRGYILTNKHVINDADQIIVALQDGRVFEALLVGSDSLTDLAVLKINATN--L 137 Query: 171 PVEFEDANNI-RVGEAVFTIGNPFRLRGTVSAGIVSALDRD--IPDRPGTFTQIDAPINQ 227 PV + + +G+ V IGNP+ L T++ GI+SA R P F Q DA IN Sbjct: 138 PVIPINPKRVPHIGDVVLAIGNPYNLGQTITQGIISATGRIGLSPTGRQNFLQTDASINH 197 Query: 228 GNSGGPCFNALGHVIGVNAMIV---TSGQFHMGVGLIIPLSIIKKAIPSLISKGRVDHGW 284 GNSGG N+LG ++G+N + G+ G+G IP + K + LI GRV G+ Sbjct: 198 GNSGGALVNSLGELMGINTLSFDKSNDGETPEGIGFAIPTQLATKIMDKLIRDGRVIRGY 257 Query: 285 FGIMTQNLTQELAIPLGLRGTKGSLITAVVKESPADKAGMKVGDVICMLDGRIIKSHQDF 344 GI + + A G+ +G ++ V + PA KAG++V D+I ++ + S + Sbjct: 258 IGIGGREIAPLHAQGGGIDQLQGIVVNEVSPDGPAAKAGIQVNDLIISVNNKPAISALET 317 Query: 345 VWQIASRSPKEQVKISLCKEGSKHSVAVVLGSSPT 379 + Q+A P + + + ++ + ++ V + P Sbjct: 318 MDQVAEIRPGSVIPVVVMRDDKQLTLQVTIQEYPA 352 >gnl|CDD|128524 smart00228, PDZ, Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities. Length = 85 Score = 46.2 bits (110), Expect = 2e-05 Identities = 21/85 (24%), Positives = 36/85 (42%), Gaps = 9/85 (10%) Query: 281 DHGWFGIMTQNLTQELAIPLGLRGTKGSLITAVVKESPADKAGMKVGDVICMLDGRIIKS 340 G G E G ++++VV SPA KAG+KVGDVI ++G ++ Sbjct: 10 GGGGLGFSLVGGKDEG---------GGVVVSSVVPGSPAAKAGLKVGDVILEVNGTSVEG 60 Query: 341 HQDFVWQIASRSPKEQVKISLCKEG 365 + +V +++ + G Sbjct: 61 LTHLEAVDLLKKAGGKVTLTVLRGG 85 >gnl|CDD|161683 TIGR00054, TIGR00054, RIP metalloprotease RseP. A model that detects fragments as well matches a number of members of the PEPTIDASE FAMILY S2C. The region of match appears not to overlap the active site domain. Length = 420 Score = 40.2 bits (94), Expect = 0.001 Identities = 30/94 (31%), Positives = 49/94 (52%), Gaps = 6/94 (6%) Query: 290 QNLTQELAIPLGLRGTKGSLITA-VVKESPADKAGMKVGDVICMLDGRIIKSHQDFVWQI 348 +N T E+ L RG K + + V SPA+KAG+K GD I ++G ++S DFV + Sbjct: 186 ENWTFEVMKELIPRGPKIEPVLSDVTPNSPAEKAGLKEGDYIQSINGEKLRSWTDFVSAV 245 Query: 349 ASRSPKEQVKISLCKEGSKHSVAVVLGSSPTAKN 382 +P + + I + + G S+++ P AK Sbjct: 246 -KENPGKSMDIKVERNGETLSISLT----PEAKG 274 >gnl|CDD|182723 PRK10779, PRK10779, zinc metallopeptidase RseP; Provisional. Length = 449 Score = 39.3 bits (92), Expect = 0.003 Identities = 27/92 (29%), Positives = 47/92 (51%), Gaps = 10/92 (10%) Query: 294 QELAIPLGLRGTKGSLITAVVKE----SPADKAGMKVGDVICMLDGRIIKSHQDFVWQIA 349 Q+ LG+R +G I V+ E S A KAG++ GD I +DG+ + Q FV + Sbjct: 206 QDPVSSLGIR-PRGPQIEPVLAEVQPNSAASKAGLQAGDRIVKVDGQPLTQWQTFVTLV- 263 Query: 350 SRSPKEQVKISLCKEGSKHSVAVVLGSSPTAK 381 +P + + + + ++GS + L +P +K Sbjct: 264 RDNPGKPLALEIERQGS----PLSLTLTPDSK 291 >gnl|CDD|131023 TIGR01968, minD_bact, septum site-determining protein MinD. This model describes the bacterial and chloroplast form of MinD, a multifunctional cell division protein that guides correct placement of the septum. The homologous archaeal MinD proteins, with many archaeal genomes having two or more forms, are described by a separate model. Length = 261 Score = 33.1 bits (76), Expect = 0.16 Identities = 20/65 (30%), Positives = 32/65 (49%), Gaps = 11/65 (16%) Query: 426 GIQKG---------MTIVSVNTHEVSCIKDVERLIGKAKEKKRDSVLLQI-KYDPDMQSG 475 GI+ G IV V T EVS ++D +R+IG + K + + L + + P+M Sbjct: 122 GIESGFRNAVAPADEAIV-VTTPEVSAVRDADRVIGLLEAKGIEKIHLIVNRLRPEMVKK 180 Query: 476 NDNMS 480 D +S Sbjct: 181 GDMLS 185 >gnl|CDD|163044 TIGR02860, spore_IV_B, stage IV sporulation protein B. SpoIVB, the stage IV sporulation protein B of endospore-forming bacteria such as Bacillus subtilis, is a serine proteinase, expressed in the spore (rather than mother cell) compartment, that participates in a proteolytic activation cascade for Sigma-K. It appears to be universal among endospore-forming bacteria and occurs nowhere else. Length = 402 Score = 32.7 bits (75), Expect = 0.23 Identities = 13/53 (24%), Positives = 31/53 (58%), Gaps = 1/53 (1%) Query: 315 KESPADKAGMKVGDVICMLDGRIIKSHQDFVWQIASRSPKEQVKISLCKEGSK 367 SP ++AG+++GD I ++G IK+ D + +++ E++ +++ + G Sbjct: 122 IHSPGEEAGIQIGDRILKINGEKIKNMDDLA-NLINKAGGEKLTLTIERGGKI 173 >gnl|CDD|161775 TIGR00225, prc, C-terminal peptidase (prc). A C-terminal peptidase with different substrates in different species including processing of D1 protein of the photosystem II reaction center in higher plants and cleavage of a peptide of 11 residues from the precursor form of penicillin-binding protein in E.coli E.coli and H influenza have the most distal branch of the tree and their proteins have an N-terminal 200 amino acids that show no homology to other proteins in the database. Length = 334 Score = 31.9 bits (73), Expect = 0.35 Identities = 12/27 (44%), Positives = 19/27 (70%) Query: 309 LITAVVKESPADKAGMKVGDVICMLDG 335 +I + + SPA+KAG+K GD I ++G Sbjct: 65 VIVSPFEGSPAEKAGIKPGDKIIKING 91 >gnl|CDD|177681 PLN00049, PLN00049, carboxyl-terminal processing protease; Provisional. Length = 389 Score = 31.6 bits (72), Expect = 0.46 Identities = 16/43 (37%), Positives = 22/43 (51%), Gaps = 1/43 (2%) Query: 295 ELAIPLGLRGTKGSL-ITAVVKESPADKAGMKVGDVICMLDGR 336 E+ P G G L + A PA +AG++ GDVI +DG Sbjct: 90 EVGYPTGSDGPPAGLVVVAPAPGGPAARAGIRPGDVILAIDGT 132 >gnl|CDD|169409 PRK08374, PRK08374, homoserine dehydrogenase; Provisional. Length = 336 Score = 30.9 bits (70), Expect = 0.83 Identities = 34/130 (26%), Positives = 54/130 (41%), Gaps = 23/130 (17%) Query: 98 EPILSDTVERLMFGSGFFITDDGYILTSNHIVEDGASFSVILSDDTEL------PAK-LV 150 E +L DTV+R+ +ILT +E G +F L + L P+K + Sbjct: 162 ENLLGDTVKRI---EAVVNATTTFILTR---MEQGKTFEEALKEAQTLGIAERDPSKDID 215 Query: 151 GTDALFDLAVLKVQSDRKFIPVEFEDAN-----NIRVGEA--VFTIGNPFRLRGTVSAGI 203 G DA + +L + F P+ FE+ ++ GE G RL TV G Sbjct: 216 GIDAGYKATILHWVA---FPPITFEEVGIRGIKDVTEGEIERAKAKGRNVRLVATVEEGR 272 Query: 204 VSALDRDIPD 213 +S + +P+ Sbjct: 273 ISVKPKKLPE 282 >gnl|CDD|184804 PRK14720, PRK14720, transcript cleavage factor/unknown domain fusion protein; Provisional. Length = 906 Score = 30.1 bits (68), Expect = 1.2 Identities = 10/45 (22%), Positives = 19/45 (42%) Query: 432 TIVSVNTHEVSCIKDVERLIGKAKEKKRDSVLLQIKYDPDMQSGN 476 +V N+ + +ER + +E R LL+ Y+P + Sbjct: 194 KLVHYNSDDFDFFLRIERKVLGHREFTRLVGLLEDLYEPYKALED 238 >gnl|CDD|179776 PRK04192, PRK04192, V-type ATP synthase subunit A; Provisional. Length = 586 Score = 30.1 bits (69), Expect = 1.3 Identities = 10/33 (30%), Positives = 20/33 (60%), Gaps = 1/33 (3%) Query: 160 VLKVQSDRKFIPVEFEDANNIRVGEAVFTIGNP 192 +++++ D+ I V +E+ + I+ GE V G P Sbjct: 40 IIRIEGDKATIQV-YEETSGIKPGEPVEFTGEP 71 >gnl|CDD|151032 pfam10459, Peptidase_S46, Peptidase S46. Dipeptidyl-peptidase 7 (DPP-7) is the best characterized member of this family. It is a serine peptidase that is located on the cell surface and is predicted to have two N-terminal transmembrane domains. Length = 696 Score = 30.3 bits (69), Expect = 1.3 Identities = 8/18 (44%), Positives = 11/18 (61%) Query: 228 GNSGGPCFNALGHVIGVN 245 GNSG P N G ++G+ Sbjct: 631 GNSGSPVLNGKGELVGLA 648 >gnl|CDD|182074 PRK09784, PRK09784, hypothetical protein; Provisional. Length = 417 Score = 29.6 bits (66), Expect = 1.9 Identities = 22/89 (24%), Positives = 40/89 (44%), Gaps = 11/89 (12%) Query: 314 VKESPADKAGMKVGDVICMLDGRIIKSHQDFVWQIASRSPKEQVKISLCKEGSKHSVAVV 373 +K +P D GD C+L ++ D+ W + S +++V K K ++ + Sbjct: 200 LKYAPVD------GDGYCLLRAILVLKQHDYSWALGSHKTQKEVYEEFIKIVDKQTIEAL 253 Query: 374 LGSSPTAKNDMHLEVGDKELLGMVLQDIN 402 + TA ++ +V K L G+ LQ N Sbjct: 254 VD---TAFYNLREDV--KTLFGVDLQSDN 277 >gnl|CDD|149458 pfam08404, Baculo_p74_N, Baculoviridae P74 N-terminal. This domain is found at the N-terminus of P74 occlusion-derived virus (ODV) envelope proteins which are required for oral infectivity. The envelope proteins are found in baculoviruses which are insect pathogens. The C-terminus of P74 is anchored to the membrane whereas the N-terminus is exposed to the virion surface. Furthermore P74 is unusual for a virus envelope protein as it lacks an N-terminal localisation signal sequence. Also see pfam04583. Length = 301 Score = 28.8 bits (65), Expect = 3.2 Identities = 15/56 (26%), Positives = 23/56 (41%), Gaps = 10/56 (17%) Query: 100 ILSDTVERLMFGSGFFITDDGYILTSNHI---VEDGASFSVILSDDTELPAKLVGT 152 IL D+ ++ S Y+ T H+ V+D F+V D P + GT Sbjct: 135 ILVDSFTKMYMNSP-------YLRTDEHLIRGVDDVPGFNVEPDTDPLFPERFKGT 183 >gnl|CDD|164900 PHA02107, PHA02107, hypothetical protein. Length = 216 Score = 28.8 bits (64), Expect = 3.4 Identities = 19/69 (27%), Positives = 27/69 (39%), Gaps = 14/69 (20%) Query: 125 SNHIVEDGASFSVILSDDTELPAKLVGTDALFDLAVLKVQSDRKFIPVEFEDANNI---- 180 SNH+V +FSV L + + P K V T V + P + D NI Sbjct: 117 SNHVV----TFSVDLPNGDKAPVKNVETKT------EVVSTVDYTEPAKEPDGQNIFFAP 166 Query: 181 RVGEAVFTI 189 +AV + Sbjct: 167 ETMKAVQLV 175 >gnl|CDD|165191 PHA02854, PHA02854, putative host range protein; Provisional. Length = 178 Score = 28.3 bits (63), Expect = 5.0 Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 2/79 (2%) Query: 20 VIFSSTYLVLEAKLPPSSVDLPPVIARVSPSIVSVMVEPKKKVSVEQMFNAYGFGNLPED 79 V++S+++ + D P + P V++ V PKKK V N Y F P D Sbjct: 83 VVYSASFSINSDSYVKVFSDNPDKYKHMYP-TVTINV-PKKKFKVVDQGNTYMFIQSPID 140 Query: 80 HPLKNYFRKDFHKFFSGEE 98 K F KD +++ + Sbjct: 141 DCDKEQFLKDEFEYYDSDN 159 >gnl|CDD|180936 PRK07334, PRK07334, threonine dehydratase; Provisional. Length = 403 Score = 28.3 bits (64), Expect = 5.1 Identities = 17/38 (44%), Positives = 22/38 (57%), Gaps = 2/38 (5%) Query: 195 LRGTVSAGIVSALDRDIPDRPGTFTQIDAPINQGNSGG 232 LRG V AG ++ L DI DRPG ++ A I G +G Sbjct: 317 LRGLVRAGRLARLRVDIRDRPGALARVTALI--GEAGA 352 >gnl|CDD|130430 TIGR01363, strep_his_triad, streptococcal histidine triad protein. This model represents the N-terminal half of a family of Streptococcal proteins that contain a signal peptide and then up to five repeats of a region that includes a His-X-X-His-X-His (histidine triad) motif. Three repeats are found in the seed alignment. Members of this family from Streptococcus pneumoniae are suggested to cleave human C3, and the member PhpA has been shown in vaccine studies to be a protective antigen in mice. Length = 348 Score = 28.0 bits (62), Expect = 5.6 Identities = 10/19 (52%), Positives = 14/19 (73%) Query: 113 GFFITDDGYILTSNHIVED 131 G + TDDGYI ++ I+ED Sbjct: 181 GRYTTDDGYIFNASDIIED 199 >gnl|CDD|180494 PRK06260, PRK06260, threonine synthase; Validated. Length = 397 Score = 27.6 bits (62), Expect = 7.0 Identities = 12/26 (46%), Positives = 15/26 (57%), Gaps = 2/26 (7%) Query: 191 NPFRLRG--TVSAGIVSALDRDIPDR 214 NPFRL G T+ I L ++PDR Sbjct: 195 NPFRLEGQKTIGFEIADQLGWEVPDR 220 >gnl|CDD|151593 pfam11150, DUF2927, Protein of unknown function (DUF2927). This family is conserved in Proteobacteria. Several members are described as being putative lipoproteins, but otherwise the function is not known. Length = 212 Score = 27.6 bits (62), Expect = 8.4 Identities = 9/22 (40%), Positives = 14/22 (63%), Gaps = 3/22 (13%) Query: 460 DSVLLQIKYDPDMQSGNDNMSR 481 D +LL+ YDP ++ G M+R Sbjct: 184 DELLLRALYDPRLRPG---MTR 202 >gnl|CDD|183675 PRK12678, PRK12678, transcription termination factor Rho; Provisional. Length = 672 Score = 27.2 bits (61), Expect = 8.9 Identities = 11/23 (47%), Positives = 14/23 (60%), Gaps = 4/23 (17%) Query: 294 QELAIPLGLRGT----KGSLITA 312 + LA LG++GT KG LI A Sbjct: 32 RALAKQLGIKGTSGMRKGELIAA 54 >gnl|CDD|183712 PRK12739, PRK12739, elongation factor G; Reviewed. Length = 691 Score = 27.1 bits (61), Expect = 9.8 Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 15/45 (33%) Query: 27 LVLEAKLPPSSVDLP-PVIARVSPSIVSVMVEPKKKVSVEQMFNA 70 ++LE S++ P PVI S+ VEPK K ++M A Sbjct: 394 IILE------SMEFPEPVI--------SLAVEPKTKADQDKMGLA 424 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.319 0.137 0.393 Gapped Lambda K H 0.267 0.0715 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 7,921,859 Number of extensions: 526620 Number of successful extensions: 946 Number of sequences better than 10.0: 1 Number of HSP's gapped: 929 Number of HSP's successfully gapped: 38 Length of query: 489 Length of database: 5,994,473 Length adjustment: 97 Effective length of query: 392 Effective length of database: 3,898,497 Effective search space: 1528210824 Effective search space used: 1528210824 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 59 (26.5 bits)