RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780524|ref|YP_003064937.1| flagellar hook-associated protein FlgK [Candidatus Liberibacter asiaticus str. psy62] (480 letters) >gnl|CDD|31448 COG1256, FlgK, Flagellar hook-associated protein [Cell motility and secretion]. Length = 552 Score = 170 bits (432), Expect = 7e-43 Identities = 116/555 (20%), Positives = 214/555 (38%), Gaps = 78/555 (14%) Query: 1 MSLLAVFNKAQNICNNASQQFASIVRNIENADNKNYVRRDT-MTTISPEN---------V 50 MSL ++ N A + N A NI NA+ Y R+ TT P V Sbjct: 1 MSLFSLLNTALSGLNAAQAALDVTGHNISNANTPGYSRQRVIQTTNIPYLGGGLNVGTGV 60 Query: 51 TVVT-QRSENKNMFDKVLHAHSSAIGQQRLLQSFEALKEMMSADDHYNNSPTHYISKLRD 109 VV+ QR ++ + ++ +A+S + L+ ++S +S + ++ + Sbjct: 61 NVVSIQRLRDEFLTNQYRNANSQSSYLDTRASQLSQLESLLSEPS--ESSLSTLLNDFFN 118 Query: 110 SLESYSKNLSEDILGKTVIDNAEEVVQNFNTSAREVQKIRADADKEIELEISNLRRFLSE 169 SL+ + N S+ + V+ A+ +V N + ++ +R D + EI + + L + Sbjct: 119 SLQELASNPSDTAARQAVLSKAQTLVNQINNTYEQLTDLRKDINAEIAATVDEVNSLLKQ 178 Query: 170 LTVVNDAIKFKTASKHDAHDFLDQRDVLLQKISEIIGISTIVRNNNDMVVYTSDGTTLFE 229 + +N I+ A+ +D +D LDQRD L+ ++S++IGIS R + D + T +GTTL E Sbjct: 179 IADLNKQIRKVKAAGNDPNDLLDQRDQLVDELSQLIGISVSKREDGDYNLTTGNGTTLVE 238 Query: 230 TVPRDINFEKMNSYTVTSESNPIFIDGVVVSSNQGSA-VPKGKIKALLQIR--------D 280 + F + + N I++DG + + + GK+ LL +R + Sbjct: 239 GSEAKV-FAPLPGSDDETRGNVIYVDGDDEKAGNITNTLSGGKLGGLLDLRSDELAPTLN 297 Query: 281 SVVPIFQNQLDEMARVLIGSF-------SEKDPIAGNSENVPGLFIADGIK--------- 324 ++ + + LDE + ++ I G + I Sbjct: 298 TLGQLALSFLDEFNKGHGAGLDLNGAEGADFFNIEGLPVILTNDKSNGNISSGGFTVSDA 357 Query: 325 --------DLDNKLCQGISETICVNPQYRSN-------PSFLRDGGSVSKKFLWNVKNLA 369 + + GI+ + N ++ + P+ DGG+ Sbjct: 358 NKVGATTIPVTGDVVPGIATRLSDNVTFQISPDAANGNPTLTFDGGANFAGGYVVNDAFT 417 Query: 370 GYPDLINHYCDSFNK------------NFIFDPVAGI------------NTNVTLLEYAR 405 G + + IFD + T +Y Sbjct: 418 GKAASSSLLNLAVLATPSTKVAFAAPDFTIFDNANALALLNLQNNKKLVGGKSTFSDYYA 477 Query: 406 NSIGWLEKNRSDSHDLHMKSQVVFNHISESYSNTVGVNLQEELSFLIKVEQSYNISNKLI 465 +GW+ S + + + N ++ + GVNL EE++ LI+ +Q+Y S K+I Sbjct: 478 GLVGWIGIKASTASSESDAKEALLNQLTSERQSISGVNLDEEMANLIQFQQAYQASAKVI 537 Query: 466 MFTDKMLQTLLEGVK 480 D+ML TLL V Sbjct: 538 QTVDEMLDTLLNIVG 552 >gnl|CDD|146311 pfam03607, DCX, Doublecortin. Length = 60 Score = 30.9 bits (71), Expect = 0.75 Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 9/43 (20%) Query: 183 SKHDAHDFLDQRDVLLQKISEIIGISTIVRNNNDMVVYTSDGT 225 S+ F D LL ++E + + T VR +YT DG Sbjct: 4 SRRRFKSF----DALLDDLTEKVKLPTGVRK-----LYTPDGH 37 >gnl|CDD|31391 COG1198, PriA, Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]. Length = 730 Score = 29.9 bits (67), Expect = 1.4 Identities = 9/38 (23%), Positives = 18/38 (47%) Query: 254 IDGVVVSSNQGSAVPKGKIKALLQIRDSVVPIFQNQLD 291 + G+VV + S V K+K + ++ D+ + L Sbjct: 42 VVGIVVELSSESDVDGRKLKEIERVLDTEPVLTPELLR 79 >gnl|CDD|39610 KOG4409, KOG4409, KOG4409, Predicted hydrolase/acyltransferase (alpha/beta hydrolase superfamily) [General function prediction only]. Length = 365 Score = 29.9 bits (67), Expect = 1.4 Identities = 17/126 (13%), Positives = 37/126 (29%), Gaps = 17/126 (13%) Query: 333 GISETICVNP-QYRSNPSFLRDGGSVSKKF--LWNVKNLAGY---------PDLINHYCD 380 G E P + P + + V+ F L ++ + PD + Sbjct: 195 GFPEKPDSEPEFTKPPPEWYKALFLVATNFNPLALLRLMGPLGPKLVSRLRPDRFRKFPS 254 Query: 381 SFNKNFIFDPVAGIN----TNVTLLEYARNSIGWLEKNRSDS-HDLHMKSQVVFNHISES 435 ++F+ + + N + T + GW + +L V F + Sbjct: 255 LIEEDFLHEYIYHCNAQNPSGETAFKNLFEPGGWARRPMIQRLRELKKDVPVTFIYGDRD 314 Query: 436 YSNTVG 441 + + Sbjct: 315 WMDKNA 320 >gnl|CDD|31935 COG1749, FlgE, Flagellar hook protein FlgE [Cell motility and secretion]. Length = 423 Score = 29.9 bits (67), Expect = 1.5 Identities = 21/67 (31%), Positives = 33/67 (49%) Query: 410 WLEKNRSDSHDLHMKSQVVFNHISESYSNTVGVNLQEELSFLIKVEQSYNISNKLIMFTD 469 + N S + L F IS V+L +EL+ LI ++ Y + K+I +D Sbjct: 353 YAATNNSGAASLGAAGVGGFGKISSGALEMSNVDLAKELTNLIVAQRGYQANAKVITTSD 412 Query: 470 KMLQTLL 476 +MLQTL+ Sbjct: 413 QMLQTLV 419 >gnl|CDD|146444 pfam03805, CLAG, Cytoadherence-linked asexual protein. Clag (cytoadherence linked asexual gene) is a malaria surface protein which has been shown to be involved in the binding of Plasmodium falciparum infected erythrocytes to host endothelial cells, a process termed cytoadherence. The cytoadherence phenomenon is associated with the sequestration of infected erythrocytes in the blood vessels of the brain, cerebral malaria. Clag is a multi-gene family in Plasmodium falciparum with at least 9 members identified to date. Orthologous proteins in the rodent malaria species Plasmodium chabaudi (Lawson D Unpubl. obs.) suggest that the gene family is found in other malaria species and may play a more generic role in cytoadherence. Length = 1282 Score = 29.3 bits (66), Expect = 2.7 Identities = 11/59 (18%), Positives = 25/59 (42%), Gaps = 3/59 (5%) Query: 411 LEKNRSDSHDLHMKSQVVFNHISESYSNTVGVNLQEELSFLIKVEQSYNISNKLIMFTD 469 + + + + ++V+ I +Y Q + L K +NI+NK+++ D Sbjct: 708 VLSQKFVAPKYNKWNEVLKKIIESAYEMYFN---QRHVKNLFKYHDIFNINNKIMLMRD 763 >gnl|CDD|39054 KOG3850, KOG3850, KOG3850, Predicted membrane protein [Function unknown]. Length = 455 Score = 28.9 bits (64), Expect = 3.1 Identities = 23/86 (26%), Positives = 35/86 (40%), Gaps = 8/86 (9%) Query: 55 QRSENKNMFDKV--LHAHSSAIGQQRLLQSFEALKEMMSADDHYNNSPTHYISKLRDSLE 112 Q + K +F+K AH+ A Q++L Q LKE+ + + S K RD Sbjct: 81 QVARIKQVFEKKNQKSAHTIAQLQKKLEQYHRRLKEIENGE--SRPSKD----KSRDFPT 134 Query: 113 SYSKNLSEDILGKTVIDNAEEVVQNF 138 K I+ A+EV + F Sbjct: 135 GIRKAKGMTEAMVNPIEFAQEVKKAF 160 >gnl|CDD|34397 COG4786, FlgG, Flagellar basal body rod protein [Cell motility and secretion]. Length = 265 Score = 28.3 bits (63), Expect = 4.6 Identities = 12/44 (27%), Positives = 26/44 (59%) Query: 432 ISESYSNTVGVNLQEELSFLIKVEQSYNISNKLIMFTDKMLQTL 475 I + + VN+ EE++ +I+ +++Y ++K+I D+ML Sbjct: 217 IRQGFLEASNVNVVEEMTDMIEAQRAYEANSKVIQTADEMLGKA 260 >gnl|CDD|30404 COG0055, AtpD, F0F1-type ATP synthase, beta subunit [Energy production and conversion]. Length = 468 Score = 28.2 bits (63), Expect = 5.6 Identities = 21/69 (30%), Positives = 33/69 (47%), Gaps = 9/69 (13%) Query: 109 DSLESYSKNLSEDILGKTVIDNAEEVVQNFNTSAREVQKIRA--------DADKEIELEI 160 D L+S S+ L I+G+ + A E VQ+ +E+Q I A + DK Sbjct: 341 DPLDSTSRALDPKIVGEEHYEVARE-VQSILQRYKELQDIIAILGMDELSEEDKLTVARA 399 Query: 161 SNLRRFLSE 169 ++RFLS+ Sbjct: 400 RKIQRFLSQ 408 >gnl|CDD|39793 KOG4593, KOG4593, KOG4593, Mitotic checkpoint protein MAD1 [Cell cycle control, cell division, chromosome partitioning]. Length = 716 Score = 28.1 bits (62), Expect = 5.7 Identities = 26/160 (16%), Positives = 56/160 (35%), Gaps = 13/160 (8%) Query: 48 ENVTVVTQRSENKNMFDKVLHAHSSAIGQQRLLQSFEALKEMMSADDHYNNSPTHYISKL 107 +T +R E K L + S+ Q L Q +E + K Sbjct: 461 REITGQKKRLEKLEHELKDLQSQLSSREQSLLFQR----EESELLREKIEQ-----YLKE 511 Query: 108 RDSLESYSKNLSEDILGKTVIDNAEE----VVQNFNTSAREVQKIRADADKEIELEISNL 163 + LE + L + + + + EE V+ + ++I+ + +E++ E+ L Sbjct: 512 LELLEEENDRLRAQLERRLLQGDYEENITRVLHMSTNPTSKARQIKKNRLEELQAELERL 571 Query: 164 RRFLSELTVVNDAIKFKTASKHDAHDFLDQRDVLLQKISE 203 + L+ L + + H F + L +++ Sbjct: 572 KERLTALEGDKMQFRDGEIAVHSLLAFSKEVAQLKKEVES 611 >gnl|CDD|173725 cd06608, STKc_myosinIII_like, Catalytic domain of Class III myosin-like Protein Serine/Threonine Kinases. Serine/threonine kinases (STKs), Class III myosin-like subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The class III myosin-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Class III myosins are motor proteins with an N-terminal kinase catalytic domain and a C-terminal actin-binding motor domain. Class III myosins are present in the photoreceptors of invertebrates and vertebrates and in the auditory hair cells of mammals. The kinase domain of myosin III can phosphorylate several cytoskeletal proteins, conventional myosin regulatory light chains, and can autophosphorylate the C-terminal motor domain. Myosin III may play an important role in maintaining the structural integrity of photoreceptor cell microvilli. It may also function as a cargo carrier during light-dependent translocation, in photoreceptor cells, of proteins such as transducin and arrestin. The Drosophila class III myosin, called NinaC (Neither inactivation nor afterpotential protein C), is critical in normal adaptation and termination of photoresponse. Vertebrates contain two isoforms of class III myosin, IIIA and IIIB. This subfamily also includes mammalian NIK-like embryo-specific kinase (NESK), Traf2- and Nck-interacting kinase (TNIK), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6). MAP4Ks are involved in some MAPK signaling pathways by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAPK signaling cascades are important in mediating cellular responses to extracellular signals. Length = 275 Score = 27.6 bits (62), Expect = 6.8 Identities = 8/25 (32%), Positives = 16/25 (64%) Query: 142 AREVQKIRADADKEIELEISNLRRF 166 A ++ I D ++EI+ E + LR++ Sbjct: 35 AIKIMDIIEDEEEEIKEEYNILRKY 59 >gnl|CDD|147689 pfam05667, DUF812, Protein of unknown function (DUF812). This family consists of several eukaryotic proteins of unknown function. Length = 536 Score = 27.8 bits (62), Expect = 7.4 Identities = 28/161 (17%), Positives = 57/161 (35%), Gaps = 24/161 (14%) Query: 76 QQRLLQSFEALKEMMSADDHYNNSPTHYISKLRDS-----------------LESYSKNL 118 + L+ + +LKE + I KLR+ L +N Sbjct: 363 RTPLIDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELESELQTKEQLYKQLLDEYENA 422 Query: 119 SEDILGKTVIDNAEEVVQNFNTSAREVQKIRADADKEIELEISNLRRFLSELTVVNDAIK 178 + + E+++N ++ KI +D + ++ EI+N+ L V D + Sbjct: 423 PKSVSRSAYTRRILEIIKNIKKQKEDIDKILSDT-RSLQKEINNITGKLDRTFTVTDELL 481 Query: 179 FKTASKHD-----AHDFLDQRDVLLQKISEIIGISTIVRNN 214 F+ A K D A+ L ++ E + + ++ Sbjct: 482 FRDA-KKDEHAKKAYKLLAALHENCSELIETVEETGNIKRE 521 >gnl|CDD|144701 pfam01201, Ribosomal_S8e, Ribosomal protein S8e. Length = 130 Score = 27.5 bits (62), Expect = 7.6 Identities = 8/24 (33%), Positives = 13/24 (54%) Query: 20 QFASIVRNIENADNKNYVRRDTMT 43 + I+ + N N YVRR+ +T Sbjct: 75 RKVRILGVVYNPANNEYVRRNIIT 98 >gnl|CDD|34145 COG4467, COG4467, Uncharacterized protein conserved in bacteria [Function unknown]. Length = 114 Score = 27.6 bits (61), Expect = 8.9 Identities = 13/69 (18%), Positives = 25/69 (36%), Gaps = 6/69 (8%) Query: 125 KTVIDNAEEVVQNFNTSAREVQKIRADADKEIE------LEISNLRRFLSELTVVNDAIK 178 K + D + + + E+ ++ +E LE LR L E T+ A+K Sbjct: 4 KEIFDQVDNLEEQLGVLLAELGGLKQHLGSLVEENTALRLENEKLRERLGEPTLEKTAVK 63 Query: 179 FKTASKHDA 187 + + Sbjct: 64 KEKPAVKKK 72 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.315 0.131 0.362 Gapped Lambda K H 0.267 0.0662 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 5,382,206 Number of extensions: 282966 Number of successful extensions: 727 Number of sequences better than 10.0: 1 Number of HSP's gapped: 726 Number of HSP's successfully gapped: 35 Length of query: 480 Length of database: 6,263,737 Length adjustment: 98 Effective length of query: 382 Effective length of database: 4,146,055 Effective search space: 1583793010 Effective search space used: 1583793010 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 59 (26.6 bits)