RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780915|ref|YP_003065328.1| oligoendopeptidase F [Candidatus Liberibacter asiaticus str. psy62] (626 letters) >gnl|CDD|31358 COG1164, COG1164, Oligoendopeptidase F [Amino acid transport and metabolism]. Length = 598 Score = 441 bits (1136), Expect = e-124 Identities = 204/605 (33%), Positives = 303/605 (50%), Gaps = 15/605 (2%) Query: 27 MDSQENLGNLPRWNLEDLYPSHDSQEISNDMECIEHESLAFKTRWEGNLAHATNQKNCHS 86 + ++ + W+L DLYP E E E L L + Sbjct: 1 LKNRSEVPEKYTWDLSDLYP---------GDELWEAEFLEESKELLKALEFYELILSAED 51 Query: 87 LGAAIAEYERICELIGRIASYAMLSYNCNLSSPTIRKFYTDINAKLADFEKVLIFFALEI 146 L A+ EYE++ EL+GR+ +YA + + + + +K Y + ADF L FF E+ Sbjct: 52 LLEALEEYEKLEELLGRLYAYASMKLSTDTTDEEAQKLYQKLEEFYADFSSALSFFEPEL 111 Query: 147 NTLDEALLEQSYAQDPLTLKYSAWIKNIRKIKKHLLSNDMECLLSDTSQVGREALKRFFC 206 LDE +++ P Y +++ + + K H LS + E LL+ S VG A FF Sbjct: 112 LELDEEVIQSLLTSGPELADYRFYLEELFRRKPHQLSAEEEKLLAQLSPVGNSASNIFFD 171 Query: 207 ENIESLRFKI----NDQKIPLTKAYKSFFDSDREVRKSAAKALSHTFNKSSHIFSFITNT 262 L F +K+ L + D DREVRK+A +AL + K + + + NT Sbjct: 172 LTNSDLTFPDIGDAKGKKLTLEQLLNLLEDPDREVRKAAYEALYKAYEKHRNTLAALLNT 231 Query: 263 LAKDEEIQDRWRKYEKIADSRHLSNNVEPYVIEALMQSVKNYYPKTSHRYYELKKKWLKL 322 L K R R Y+ + DS L N V+ V++ L++SVK + HRYY+L+ K L L Sbjct: 232 LVKVLAFLARARNYDDVIDSALLRNEVDREVVDNLIESVKEAFLPLLHRYYKLRAKVLGL 291 Query: 323 DTMYFWDRLAPLP-GTSQDIIPFEVARDLVLQSYAKFSPQMSIIAEKFFTHNWIDAPQYD 381 + + +D APL +E A++LVL++ A P+ + IA + F WID Sbjct: 292 EKLRPYDLYAPLLDKDPSPEYSYEEAKELVLKALAPLGPEYAKIARRAFDERWIDVYPRK 351 Query: 382 GKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHFVLSSETQGILTNNSSLT 441 GK SGA++ G HP+IL+NY G +DV TLAHELGH +H S + Q L + S+ Sbjct: 352 GKRSGAYSIGFYKGDHPFILMNYDGTLRDVFTLAHELGHSVHSYFSRKHQPYLYADYSIF 411 Query: 442 LAETASIFGETLTFDSLLQAASSKEERKILLANKIEDMLNSIVRQISFYDFELKLHTERR 501 LAE AS F E L FD LL+ EER +L K+E ++ RQ F +FE ++H Sbjct: 412 LAEIASTFNEMLLFDYLLERFKDPEERLAILEEKLEGFFATLFRQTLFAEFEHRVHELIE 471 Query: 502 STGDIPTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIESSFYVYAYAFGNCLV 561 ++ +NE++LE QKE G A L D G W +PHF S FYVY YA G Sbjct: 472 EGEELTAEELNELYLELQKEYYGDAVKL-DELSGLEWARIPHFYHSPFYVYQYATGQLAA 530 Query: 562 NSLYDIYKSNTVDCFKEKYLNILRAGNSKHYSELLLPLNINLSDPNFWERGLQTVEKMID 621 +LY +N + F++ Y+ L++G SK ELL I+L+ P+ WE L E++ID Sbjct: 531 LALYAKILTNDAEAFEKYYIAFLKSGGSKSPLELLKIAGIDLTTPDPWEEALAEFERLID 590 Query: 622 DVEKM 626 ++E++ Sbjct: 591 ELEEL 595 >gnl|CDD|107215 cd06459, M3B_Oligoendopeptidase_F, Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid. Length = 427 Score = 386 bits (994), Expect = e-108 Identities = 150/430 (34%), Positives = 226/430 (52%), Gaps = 6/430 (1%) Query: 191 SDTSQVGREALKRFFCENIESLRFKI--NDQKIPLTKAYKSFF-DSDREVRKSAAKALSH 247 + S+ G A ++ L+F + +++ L++ + DREVRK A +AL Sbjct: 1 AKLSETGNSAWSEYYDLLNSDLKFIFEFDGEELTLSQNLSNLLESPDREVRKKAFEALYK 60 Query: 248 TFNKSSHIFSFITNTLAKDEEIQDRWRKYEKIADSRHLSNNVEPYVIEALMQSVKNYYPK 307 + K + + I NTL K + R Y+ ++ +NN+ V + L+ VK P Sbjct: 61 AYEKYENTLAAILNTLVKLRLTLAKLRGYDSYLEAALFNNNIPEDVYDFLIAVVKENVP- 119 Query: 308 TSHRYYELKKKWLKLDTMYFWDRLAPLPGTSQDIIPFEVARDLVLQSYAKFSPQMSIIAE 367 HRY +LKKK L LD + +D APL + +E A++LVL++ + P+ + A+ Sbjct: 120 LLHRYLKLKKKLLGLDKLRPYDLYAPLVSGNPPKYTYEEAKELVLEALSPLGPEYAEFAK 179 Query: 368 KFFTHNWIDAPQYDGKGSGAFAHGTIPSVHPYILLNYLGKPQDVMTLAHELGHGIHFVLS 427 + F WID GK SGA+ G P HP+IL+N+ G DV TLAHELGH H LS Sbjct: 180 RAFEERWIDVEPRKGKRSGAYCTGLPPGKHPFILMNFNGTLDDVFTLAHELGHAFHSYLS 239 Query: 428 SETQGILTNNSSLTLAETASIFGETLTFDSLLQAASSKEERKILLANKIEDMLNSIVRQI 487 + Q L ++ + LAE AS F E L FD LL+ A EE+ LL + +ED+ ++ RQ Sbjct: 240 RDNQPYLYSDYPIFLAEIASTFNELLLFDYLLKFAKDPEEKLYLLEHLLEDIRATLPRQT 299 Query: 488 SFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFIES 547 F +FE +++ + +NEI+ E +K+ G ++ D E+G W +PHF Sbjct: 300 MFAEFEHEVYENPEEGEPLTAEELNEIYRELEKKYGGDLVEI-DEEHGYEWARIPHFYYV 358 Query: 548 SFYVYAYAFGNCLVNSLYDIYKSNTVDCFKEKYLNILRAGNSKHYSELLLPLNINLSDPN 607 FYVY YAFG +LY YK + EKYL +L+AG SK ELL ++L+ P+ Sbjct: 359 PFYVYPYAFGQLAALALYAKYKEDGEG-AVEKYLELLKAGGSKSPLELLKKAGVDLTSPD 417 Query: 608 FWERGLQTVE 617 FWE + +E Sbjct: 418 FWEEAIDVIE 427 >gnl|CDD|144870 pfam01432, Peptidase_M3, Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but functions within the mitochondria to remove the leader sequence. Length = 448 Score = 136 bits (344), Expect = 2e-32 Identities = 97/434 (22%), Positives = 155/434 (35%), Gaps = 78/434 (17%) Query: 232 DSDREVRKSAAKALSHTFNKSSHIFSFITNTLAKDEEIQDRWRKYEKIADSRHLSNNVEP 291 DRE RK A +A + + S + L K + Y A++ L + + Sbjct: 5 SPDRETRKKAYRAFYSRAEANRNENSALLEELLKLRAELAKLLGYPSYAEAS-LEDKMAK 63 Query: 292 Y---VIEALMQSVKNYYPKTSHRYYEL----KKKWLKLDTMYFWDRLAPLPGTSQDII-- 342 V + L + V P HR EL KKK L L+ + WD +++ Sbjct: 64 IPETVYDFLEELVNKLRPLL-HRELELLKKLKKKELGLEELQPWDVAYYSEKQREELYDP 122 Query: 343 ----------PFEVARDLVLQSYAKFSPQMSIIAEK---------------------FFT 371 P E + L + ++ + E Sbjct: 123 LDQEELRPYFPLEQVLEKGLFGLFERLFGITFVLEPLGEVWHEDVRFYSVFDELSGGLIG 182 Query: 372 HNWIDAPQYDGKGSGAFAHGTIPS---VHPYILLNY---------LGKPQDVMTLAHELG 419 ++D GK GA++ G +P PY+L N+ L DV TL HE G Sbjct: 183 EFYLDLYPRKGKRGGAYSFGLVPGRLDPVPYLLCNFTKPSSGKPSLLTHDDVETLFHEFG 242 Query: 420 HGIHFVLSSETQGILTN-NSSLTLAETASIFGETLTFDSLLQAASSK--EERKILLANKI 476 H +H +LS ++ N + AE S F E ++ LL S+ E + + A + Sbjct: 243 HSMHSLLSRTEYSYVSGTNVPIDFAEIPSQFNENWLWEPLLLNLLSRHYETGEPIPAELL 302 Query: 477 EDMLNS--------IVRQISFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLGPAFD 528 E ++ S + RQ+ F F+ ++H + + E + E K+ G Sbjct: 303 EKLIKSKNVNAGLFLFRQLMFAAFDQEIHEAAEEDQKLDF--LLEEYAELNKKYYGDPVT 360 Query: 529 LSDLEYGSFWMMVPHFIESSFYVYAYAFGNCLVNSLYDIYKS------NTVDCFKEKYLN 582 + SF + PH +++Y Y YA G L DI++ + YL Sbjct: 361 PDEASPLSFSHIFPHGYAANYYSYLYATGLAL-----DIFEKFFEQDPLNRETGLRYYLE 415 Query: 583 ILRAGNSKHYSELL 596 L G S ELL Sbjct: 416 FLSRGGSLDPLELL 429 >gnl|CDD|107211 cd06258, Peptidase_M3_like, The peptidase M3-like family, also called neurolysin-like family, is part of the "zincins" metallopeptidases, and includes M3, M2 and M32 families of metallopeptidases. The M3 family is subdivided into two subfamilies: the widespread M3A, which comprises a number of high-molecular mass endo- and exopeptidases from bacteria, archaea, protozoa, fungi, plants and animals, and the small M3B, whose members are enzymes primarily from bacteria. Well-known mammalian/eukaryotic M3A endopeptidases are the thimet oligopeptidase (TOP; endopeptidase 3.4.24.15), neurolysin (alias endopeptidase 3.4.24.16), and the mitochondrial intermediate peptidase. The first two are intracellular oligopeptidases, which act only on relatively short substrates of less than 20 amino acid residues, while the latter cleaves N-terminal octapeptides from proteins during their import into the mitochondria. The M3A subfamily also contains several bacterial endopeptidases, collectively called oligopeptidases A, as well as a large number of bacterial carboxypeptidases, called dipeptidyl peptidases (Dcp; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). The peptidases in the M3 family contain the HEXXH motif that forms the active site in conjunction with a C-terminally-located Glutamic acid (Glu) residue. A single zinc ion is ligated by the side-chains of the two Histidine (His) residues, and the more C-terminal Glu. Most of the peptidases are synthesized without signal peptides or propeptides, and function intracellularly. The structure of neurolysin shows similarities to those of angiotensin-converting enzyme (ACE; peptidyl-dipeptidase A) peptidase unit 2 belonging to peptidase family M2. ACE is an enzyme responsible for cleavage of dipeptides from the C-termini of proteins, notably converting angiotensin I to angiotensin II in mammals. There are similarities to the thermostable carboxypeptidases from Pyrococcus furiosus carboxypeptidase (PfuCP), and Thermus aquaticus (TaqCP), belonging to peptidase family M32. Little is known about function of this family, including carboxypeptidases Taq and Pfu. Length = 365 Score = 67.8 bits (166), Expect = 8e-12 Identities = 61/263 (23%), Positives = 90/263 (34%), Gaps = 43/263 (16%) Query: 365 IAEKFFTHNWIDAPQYDGKGSGAFAHGTIPSVH-----PYILLNY---------LGKPQD 410 A FF +D GK F G P + IL N+ L D Sbjct: 98 GALPFFY---LDLYDRKGKYPHGFCTGLDPGFNRQDKDVRILANFTSPAAPDPVLLGHDD 154 Query: 411 VMTLAHELGHGIHFVLS-SETQGILTNNSSLTLAETASIFGETLTFD-SLLQAASSKEER 468 + TL HE GH +HF+L +S AE S+F E+ D L+ + + Sbjct: 155 INTLFHEFGHAVHFLLIQQRYPFQERTPTSTDFAEAQSMFLESFATDPEWLERYARHYQG 214 Query: 469 KILLANKIEDMLNSIV--------RQISFYDFELKLHTERRSTGDIPTHRINEIWLETQK 520 ++ IE ++ + + R + FE L+ + ++W + K Sbjct: 215 GVVPDELIEKLIAARLPNTLYETRRILVVAKFEKALYEN-----PDRELELQKLWRDLVK 269 Query: 521 ESLGPAFDLSDLEYGSFWMMVPHFI-ESSFYVYAYAFGNCLVNSLYDIYKSNTVDCF--- 576 E LG D PH S Y Y Y L + L +K Sbjct: 270 EILGVRPD----PSTPDPAAFPHLAGGSPAYYYGYLLAEMLASQLRATFKKKVGYLTDNP 325 Query: 577 --KEKYLN-ILRAGNSKHYSELL 596 + ILR GNS+ + ELL Sbjct: 326 EAGPRLREHILRPGNSEPWKELL 348 >gnl|CDD|30687 COG0339, Dcp, Zn-dependent oligopeptidases [Amino acid transport and metabolism]. Length = 683 Score = 47.9 bits (114), Expect = 8e-06 Identities = 61/259 (23%), Positives = 95/259 (36%), Gaps = 58/259 (22%) Query: 381 DGKGSGAF------------AHGTIPSVHPYILLNYL----GKP-----QDVMTLAHELG 419 DGK GA+ G P + Y++ N+ GKP +V TL HE G Sbjct: 419 DGKRGGAWMDDFVSQRRLDDGGGQKPVI--YLVCNFTKPVGGKPALLSHDEVTTLFHEFG 476 Query: 420 HGIHFVLSS-ETQGILTNNSSLTLAETASIFGETLTFD--SLLQAASSKEERKILLANKI 476 HG+H +L+ + G+ N E S F E ++ L + A + + L + Sbjct: 477 HGLHHLLTRVKYPGVSGTNVPWDFVELPSQFMENWCWEPEVLAKYARHYQTGEPLPKELL 536 Query: 477 EDMLNS--------IVRQISFYDFELKLHTERRSTGDIPTHRINEIWLETQKESLGPAFD 528 + ML + +RQ+ F F+++LHTE N LE + E L Sbjct: 537 DKMLAAKNFQAGLFTLRQLEFALFDMRLHTEFDPDA-------NADILEFEAEVLKKVAV 589 Query: 529 LSDLEYGSFWMMVPHF--IESSFYV---YAYAFGNCLVNSLY------DIYKSNTVDCFK 577 L + F I + Y Y+Y + L + + T F+ Sbjct: 590 LPSIPPRRR---PHSFGHIFAGGYSAGYYSYLWAEVLSADAFAAFEEEGPFNRETGQRFR 646 Query: 578 EKYLNILRAGNSKHYSELL 596 + IL G S+ EL Sbjct: 647 D---AILSRGGSRDPMELF 662 >gnl|CDD|107214 cd06457, M3A_MIP, Peptidase M3 mitochondrial intermediate peptidase (MIP; EC 3.4.24.59) belongs to the widespread subfamily M3A, that show similarity to the Thimet oligopeptidase (TOP). It is one of three peptidases responsible for the proteolytic processing of both, nuclear and mitochondrial encoded precursor polypeptides targeted to the various subcompartments of the mitochondria. It cleaves intermediate-size proteins initially processed by mitochondrial processing peptidase (MPP) to yield a processing intermediate with a typical N-terminal octapeptide that is sequentially cleaved by MIP to mature-size protein. MIP cleaves precursor proteins of respiratory components, including subunits of the electron transport chain and tri-carboxylic acid cycle enzymes, and components of the mitochondrial genetic machinery, including ribosomal proteins, translation factors, and proteins required for mitochondrial DNA metabolism. It has been suggested that the human MIP (HMIP polypeptide; gene symbol MIPEP) may be one of the loci predicted to influence the clinical manifestations of Friedreich's ataxia (FRDA), an autosomal recessive neurodegenerative disease caused by lack of human frataxin. These proteins are enriched in cysteine residues, two of which are highly conserved, suggesting their importance to stability as well as in formation of metal binding sites, thus playing a role in MIP activity. Length = 458 Score = 39.2 bits (92), Expect = 0.004 Identities = 43/225 (19%), Positives = 73/225 (32%), Gaps = 65/225 (28%) Query: 408 PQDVMTLAHELGHGIHFVLSS-ETQGILTNNSSLTLAETASIF----------------- 449 P +V TL HE+GH +H +L E Q + + E SI Sbjct: 246 PHEVETLFHEMGHAMHSMLGRTEYQHVSGTRCATDFVEVPSILMEYFASDPRVLKLFARH 305 Query: 450 ---GETLTFDSLLQAASSKEERKILLANKIEDMLNSIVRQISFYDFELKLHTERRSTGDI 506 GE L + L A + + IV + D +LH E + Sbjct: 306 YSTGEPLPEEML---ARLLASKNSF---AALETQQQIV--YALLD--QELHGE-QPLSPT 354 Query: 507 PTHRINEIWLETQKESLGPAFDLSDLEYG--------SFWMMVPH-FIESSFYVYAYAFG 557 T + D +++ YG ++ + H + Y Y+Y F Sbjct: 355 FTSDVL--------------RDSTEIFYGLPYVPGGTAWQLRFGHLVGYGATY-YSYLFD 399 Query: 558 NCLVNSLY------DIYKSNTVDCFKEKYLNILRAGNSKHYSELL 596 + + ++ D + +E+ +L+ G K ELL Sbjct: 400 RAIASKIWQKLFAADPLSREAGERLREE---LLKHGGGKDPWELL 441 >gnl|CDD|37300 KOG2089, KOG2089, KOG2089, Metalloendopeptidase family - saccharolysin & thimet oligopeptidase [Posttranslational modification, protein turnover, chaperones]. Length = 718 Score = 34.9 bits (80), Expect = 0.067 Identities = 24/101 (23%), Positives = 42/101 (41%), Gaps = 12/101 (11%) Query: 410 DVMTLAHELGHGIHFVLSSETQGIL--TNNSSLTLAETASIFGETLTF--DSLLQAASSK 465 +V TL HE GH + +L+ N E S F E + D+L + Sbjct: 495 EVETLFHEFGHVLQHLLTQADFARFSGPRNVEWDAVEVPSQFLENWVWDPDTLRSLSKHY 554 Query: 466 EERKILLANKIEDMLNSI--------VRQISFYDFELKLHT 498 + + L ++ ++ + +RQ+ DF+L+LHT Sbjct: 555 KTGEPLPEELLKKLILTRTVNAGLFTLRQLVLADFDLELHT 595 >gnl|CDD|107213 cd06456, M3A_DCP_Oligopeptidase_A, Peptidase family M3 dipeptidyl carboxypeptidase (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This metal-binding M3A family also includes oligopeptidase A (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off the C-termini of various peptides and proteins, the smallest substrate being N-blocked tripeptides and unblocked tetrapeptides. DCP from E. coli is inhibited by the anti-hypertensive drug captopril, an inhibitor of the mammalian angiotensin converting enzyme (ACE, also called peptidyl dipeptidase A). Oligopeptidase A (OpdA) may play a specific role in the degradation of signal peptides after they are released from precursor forms of secreted proteins. It can also cleave N-acetyl-L-Ala. Length = 422 Score = 34.8 bits (81), Expect = 0.073 Identities = 20/66 (30%), Positives = 28/66 (42%), Gaps = 21/66 (31%) Query: 381 DGKGSGA----------FAHGTIPSVHPYILLNYL----GKPQ-----DVMTLAHELGHG 421 +GK GA G P Y++ N+ GKP +V TL HE GH Sbjct: 162 EGKRGGAWMNNLRSQSKNGLGQKPVA--YLVCNFTKPAGGKPALLTHDEVTTLFHEFGHA 219 Query: 422 IHFVLS 427 +H +L+ Sbjct: 220 LHHLLT 225 >gnl|CDD|107212 cd06455, M3A_TOP, Peptidase M3 Thimet oligopeptidase (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase 24.15; soluble metallo-endopeptidase; EC 3.4.24.15) family also includes neurolysin (endopeptidase 24.16, microsomal endopeptidase, mitochondrial oligopeptidase M, neurotensin endopeptidase, soluble angiotensin II-binding protein, thimet oligopeptidase II) which hydrolyzes oligopeptides such as neurotensin, bradykinin and dynorphin A. TOP and neurolysin are neuropeptidases expressed abundantly in the testis, but also found in the liver, lung and kidney. They are involved in the metabolism of neuropeptides under 20 amino acid residues long and cleave most bioactive peptides at the same sites, but recognize different positions on some naturally occurring and synthetic peptides; they cleave at distinct sites on the 13-residue bioactive peptide neurotensin, which modulates central dopaminergic and cholinergic circuits. TOP has been shown to degrade peptides released by the proteasome, limiting the extent of antigen presentation by major histocompatibility complex class I molecules, and has been associated with amyloid protein precursor processing. Length = 472 Score = 33.7 bits (78), Expect = 0.14 Identities = 49/229 (21%), Positives = 84/229 (36%), Gaps = 72/229 (31%) Query: 407 KPQDVMTLAHELGHGIHFVLSSETQGILTNNSSLTLAETASIFGETLTFDSLLQAAS--- 463 + +V T HE GH IH +L + A G + D ++A S Sbjct: 260 RHDEVETFFHEFGHVIH--------HLLGR------TKYARFSGTRVERD-FVEAPSQML 304 Query: 464 -----SKEERKILLAN-----KI-EDMLNSIV------------RQISFYDFELKLHTER 500 E K L + KI ++++ ++ RQ+ F F+L LHT Sbjct: 305 ENWCWEPEVLKRLSKHYKTGEKIPDELIERLIASRHFNRGLFYLRQLFFALFDLALHT-G 363 Query: 501 RSTGDIPTHRINEIWLETQKESLGPAFDLSDLEYGSFWMMVPHFI---ESSFYVYAYAFG 557 T N++ ++ SL P+ + + SF H ++ +Y Y + Sbjct: 364 DPADLDTTKLYNDL---REEISLIPSTEGTH-GPASF----GHLAGGYDAGYYGYLW--- 412 Query: 558 NCLVNSLYDIYKSNTVDCFKEKYLN----------ILRAGNSKHYSELL 596 V + D++ S FK+ LN +L G SK +++L Sbjct: 413 -SEVFAA-DMFSS----FFKDGLLNPEVGLRYRDTVLAPGGSKDAADML 455 >gnl|CDD|37869 KOG2658, KOG2658, KOG2658, NADH:ubiquinone oxidoreductase, NDUFV1/51kDa subunit [Energy production and conversion]. Length = 478 Score = 31.8 bits (72), Expect = 0.61 Identities = 15/42 (35%), Positives = 22/42 (52%) Query: 452 TLTFDSLLQAASSKEERKILLANKIEDMLNSIVRQISFYDFE 493 + FDSL A S +++ NK D++ +I R I FY E Sbjct: 351 LMDFDSLKAAQSGLGTGAVIVMNKSTDIVKAIARLIKFYKHE 392 >gnl|CDD|33254 COG3451, VirB4, Type IV secretory pathway, VirB4 components [Intracellular trafficking and secretion]. Length = 796 Score = 30.3 bits (68), Expect = 1.4 Identities = 20/76 (26%), Positives = 33/76 (43%), Gaps = 14/76 (18%) Query: 433 ILTNNSSLTLAETASIFGETLTFDSLLQAASSKEERKILLANKIEDMLNSIVRQISFYDF 492 ILT + SL T + G ++F++L EE N+ N+++R ++ +F Sbjct: 32 ILTKDGSLL--GTLKLEG--ISFETL-----DDEEL-----NERHAERNNLLRNLASGNF 77 Query: 493 ELKLHTERRSTGDIPT 508 H RR D P Sbjct: 78 AFYFHAVRRKVIDYPE 93 >gnl|CDD|146482 pfam03872, RseA_N, Anti sigma-E protein RseA, N-terminal domain. Sigma-E is important for the induction of proteins involved in heat shock response. RseA binds sigma-E via its N-terminal domain, sequestering sigma-E and preventing transcription from heat-shock promoters. The C-terminal domain is located in the periplasm, and may interact with other protein that signal periplasmic stress. Length = 88 Score = 30.1 bits (68), Expect = 1.8 Identities = 11/33 (33%), Positives = 15/33 (45%) Query: 252 SSHIFSFITNTLAKDEEIQDRWRKYEKIADSRH 284 S + N L +DEE+Q W +Y I D Sbjct: 15 SDDEARRLLNALDQDEELQQTWARYHLIGDVLR 47 >gnl|CDD|37035 KOG1824, KOG1824, KOG1824, TATA-binding protein-interacting protein [General function prediction only]. Length = 1233 Score = 29.9 bits (67), Expect = 2.1 Identities = 29/107 (27%), Positives = 34/107 (31%), Gaps = 32/107 (29%) Query: 181 LLSNDMECLLSDTSQVGREALKRFFCENIESL-----------------------RFKIN 217 LL EC T V E L + ESL +F I+ Sbjct: 936 LLFKHCECAEEGTRNVVAECLGKLVLIEPESLLPKLKLLLRSEASNTRSSVVSAVKFSIS 995 Query: 218 DQKIPL--------TKAYKSFFDSDREVRKSAAKAL-SHTFNKSSHI 255 DQ P+ K D D EVR+ A L S NK S I Sbjct: 996 DQPQPIDPLLKQQIGDFLKLLRDPDLEVRRVALVVLNSAAHNKPSLI 1042 >gnl|CDD|37301 KOG2090, KOG2090, KOG2090, Metalloendopeptidase family - mitochondrial intermediate peptidase [Posttranslational modification, protein turnover, chaperones]. Length = 704 Score = 29.5 bits (66), Expect = 3.0 Identities = 15/45 (33%), Positives = 21/45 (46%), Gaps = 1/45 (2%) Query: 408 PQDVMTLAHELGHGIHFVLS-SETQGILTNNSSLTLAETASIFGE 451 +V TL HE+GH +H +L + Q + AE SI E Sbjct: 477 LSEVETLFHEMGHAMHSMLGRTHYQHVTGTRCPTDFAEIPSILME 521 >gnl|CDD|36238 KOG1020, KOG1020, KOG1020, Sister chromatid cohesion protein SCC2/Nipped-B [Chromatin structure and dynamics, Cell cycle control, cell division, chromosome partitioning, Replication, recombination and repair]. Length = 1692 Score = 29.2 bits (65), Expect = 3.5 Identities = 16/50 (32%), Positives = 19/50 (38%), Gaps = 6/50 (12%) Query: 188 CLLSDTS--QVGREALKRFFCENIESLRFKINDQKIPLTKAYKSFFDSDR 235 LL+DT V A FF N SL+ K L S +D R Sbjct: 450 DLLTDTDVHAVSSIAKTPFFVNNSSSLQIS----KAILVSTIFSRYDKQR 495 >gnl|CDD|144683 pfam01180, DHO_dh, Dihydroorotate dehydrogenase. Length = 290 Score = 28.8 bits (65), Expect = 3.9 Identities = 17/78 (21%), Positives = 30/78 (38%), Gaps = 13/78 (16%) Query: 88 GAAIAEYERICELIGRIASYAMLSYNCNLSSPTI---RKFYTD------INAKLADFEKV 138 G+ + +Y + IG A Y L N+S P R TD + + + KV Sbjct: 102 GSTVEDYVEVARKIGPFADYLEL----NVSCPNTPGLRALQTDPELAAILLKVVKEVSKV 157 Query: 139 LIFFALEINTLDEALLEQ 156 + L + D +++ Sbjct: 158 PVLVKLAPDLTDIVIIDI 175 >gnl|CDD|40019 KOG4822, KOG4822, KOG4822, Predicted nuclear membrane protein involved in mRNA transport and sex determination via splicing modulation [RNA processing and modification, Signal transduction mechanisms]. Length = 1906 Score = 28.9 bits (64), Expect = 4.6 Identities = 16/53 (30%), Positives = 21/53 (39%), Gaps = 15/53 (28%) Query: 81 QKNCH---SLGAAIAEYERICELIGRIA------------SYAMLSYNCNLSS 118 QK CH SL I RI E+I R S+ ++N N+ S Sbjct: 353 QKPCHEIASLAIYILRRLRIYEVISRYEFAVLSALEGLSNSHGAATHNLNMLS 405 >gnl|CDD|32684 COG2856, COG2856, Predicted Zn peptidase [Amino acid transport and metabolism]. Length = 213 Score = 28.1 bits (62), Expect = 6.9 Identities = 15/57 (26%), Positives = 22/57 (38%), Gaps = 4/57 (7%) Query: 387 AFAHGTIPSVHPYILLN-YLGKPQDVMTLAHELGHGIHFVLSSETQGILTNNSSLTL 442 A+G P I +N + TLAHELG H +L ++ +L Sbjct: 48 IDAYGLYDEEKPVIYINANNSLERKRFTLAHELG---HALLHTDLNTRFDAEPTLQQ 101 >gnl|CDD|37683 KOG2472, KOG2472, KOG2472, Phenylalanyl-tRNA synthetase beta subunit [Translation, ribosomal structure and biogenesis]. Length = 578 Score = 28.0 bits (62), Expect = 8.5 Identities = 17/67 (25%), Positives = 29/67 (43%), Gaps = 4/67 (5%) Query: 463 SSKEERKILLANKIEDM-LNSIVRQISFYDFELKLHT---ERRSTGDIPTHRINEIWLET 518 + + R + I ++ L+ SF D + KLH R+ I TH ++ I Sbjct: 114 ETSQIRPFAVCAVIRNVSLDPPDSYKSFIDLQDKLHQNICRNRTLVAIGTHDLDTIQGPF 173 Query: 519 QKESLGP 525 + E+L P Sbjct: 174 EYEALPP 180 >gnl|CDD|32799 COG2980, RlpB, Rare lipoprotein B [Cell envelope biogenesis, outer membrane]. Length = 178 Score = 28.0 bits (62), Expect = 9.1 Identities = 14/66 (21%), Positives = 28/66 (42%), Gaps = 5/66 (7%) Query: 429 ETQGILTNNSSLTLAETASIFGETLT-----FDSLLQAASSKEERKILLANKIEDMLNSI 483 E Q LT + + ++ ++ FD+ A + EER++L D + Sbjct: 94 EYQLTLTVEAQVLDPNDDVLYPISVKVFRSFFDNPNMALAKDEEREMLWNEMRRDAAEQL 153 Query: 484 VRQISF 489 VR+++ Sbjct: 154 VRRLAS 159 >gnl|CDD|36452 KOG1238, KOG1238, KOG1238, Glucose dehydrogenase/choline dehydrogenase/mandelonitrile lyase (GMC oxidoreductase family) [General function prediction only]. Length = 623 Score = 27.6 bits (61), Expect = 9.4 Identities = 23/101 (22%), Positives = 35/101 (34%), Gaps = 8/101 (7%) Query: 322 LDTMYFWD-RLAPLPGTSQDIIPFEVARDLVLQSYAKFSPQMSIIAEKFFTH-NWIDAPQ 379 ++T+ F + + L DI VA L + I + F D+ Sbjct: 397 VETLGFINTVSSNLSLDWPDIELHFVAGSLSSDGLTALRKALGEIYQALFGELTNSDSFV 456 Query: 380 YDGKGSGAFAHGTI------PSVHPYILLNYLGKPQDVMTL 414 K + G + P +P I NY P+DV TL Sbjct: 457 IFPKLLRPKSRGRLKLRSTNPRDNPLITPNYFTHPEDVATL 497 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.319 0.134 0.396 Gapped Lambda K H 0.267 0.0679 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 7,646,437 Number of extensions: 406491 Number of successful extensions: 1068 Number of sequences better than 10.0: 1 Number of HSP's gapped: 1050 Number of HSP's successfully gapped: 35 Length of query: 626 Length of database: 6,263,737 Length adjustment: 100 Effective length of query: 526 Effective length of database: 4,102,837 Effective search space: 2158092262 Effective search space used: 2158092262 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 60 (27.0 bits)