RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780140|ref|YP_003064553.1| putative ABC transporter, substrate-binding protein [Candidatus Liberibacter asiaticus str. psy62] (452 letters) >gnl|CDD|162151 TIGR00996, Mtu_fam_mce, virulence factor Mce family protein. Members of this paralogous family are found as six tandem homologous proteins in the same orientation per cassette, in four separate cassettes in Mycobacterium tuberculosis. The six members of each cassette represent six subfamilies. One subfamily includes the protein mce (mycobacterial cell entry), a virulence protein required for invasion of non-phagocytic cells. Length = 291 Score = 48.8 bits (117), Expect = 3e-06 Identities = 52/356 (14%), Positives = 106/356 (29%), Gaps = 83/356 (23%) Query: 8 TSVGLFVVSILFFSFFSI-YWLSRSNQYDGPMAEVIIRIPGSVDGLSTDSSVRFNGIPVG 66 +V L V +L F + + Y A GL S VR G+PVG Sbjct: 6 AAVVLAAVVVLGVVFLRLPFRGFGPGTY-TVYAY-----FTDAGGLYPGSKVRVRGVPVG 59 Query: 67 RIVGLFLDQEYPNHSLAKALIRPDTPLYPSTTATIRTQGLAGITYIELSTLRKEKKTIFQ 126 ++ + L N + + + + TA IR+ L G Y+EL+ + Sbjct: 60 KVTAISLP---GNGARVTFSLDRGVTIPANATAAIRSTTLLGSRYVELTPPKGPGGPPLP 116 Query: 127 ----IATERNQRAMITATPSGINYFISNAENTSKKISDSSRHIQKIIENIEKPLTTTIAN 182 I R T+ P I+ + + + Sbjct: 117 PGGVIPLAR------TSVPPEIDDLLGSLTRLLNGLD--------------------PEK 150 Query: 183 IETISTVLANNISHIDKMMHTTQVTPHSSDSKNTFNTITDLITSLDKMIKAID--LQKVN 240 + I LA P + +L+ L ++ A++ + Sbjct: 151 LNAILNALAEA---------LAGQGPQ----------LRNLLDGLAQLTAALNARDGDIG 191 Query: 241 QILENIQVSSNNFVKSSDQVINTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKSKETSAF 300 +++N+ + SDQ+ + ++ T + +D L+ S Sbjct: 192 ALIDNLNRVLDVLADRSDQLDRLLDNLATLTAQLADRDDALDDALAA---------LSGA 242 Query: 301 LENIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKI 356 + D + + R + + + + + L D + +L+ + Sbjct: 243 SAQVRDL-------------LAENRPNLPQALANLAPVLTLLVDYHPELEQLLHLL 285 >gnl|CDD|128579 smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer). Thought to undergo reversible methylation in response to attractants or repellants during bacterial chemotaxis. Length = 262 Score = 41.5 bits (98), Expect = 4e-04 Identities = 23/124 (18%), Positives = 46/124 (37%), Gaps = 17/124 (13%) Query: 237 QKVNQILENIQVSSNNFVKSSDQVINTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKSKE 296 + +I E + +SS +V V V ET + +E+ ++ E Sbjct: 151 SLIKEIQEETNEAVAAMEESSSEVEEGVELVEETGEALEEIVDSVE-------------E 197 Query: 297 TSAFLENIADSTSNMRSSIS----AIREITDQRQKIISTINTIENITSNLNDSSQKFAEL 352 + ++ IA +T + AI EI Q+ + I L+ +++ EL Sbjct: 198 IADLVQEIAAATDEQAAGSEEVNAAIDEIAQVTQETAAMSEEISAAAEELSGLAEELKEL 257 Query: 353 MSKI 356 + + Sbjct: 258 VEQF 261 Score = 31.9 bits (73), Expect = 0.32 Identities = 21/104 (20%), Positives = 44/104 (42%), Gaps = 9/104 (8%) Query: 266 DVRETTQTFQEVGQKIDHLLSDFSSKMKS-----KETSAFLENIADSTSNMRSSISAIRE 320 DV E + ++ L + + +M+ +E +A + IA + + + RE Sbjct: 1 DVSEAVEEIAAGAEEQAEELEELAERMEELSASIEEVAANADEIAATAQSAAEAAEEGRE 60 Query: 321 ITDQRQKIISTIN-TIENITSN---LNDSSQKFAELMSKINNIS 360 + + I +E S L +SS + E++S I++I+ Sbjct: 61 AVEDAITAMDQIREVVEEAVSAVEELEESSDEIGEIVSVIDDIA 104 Score = 29.9 bits (68), Expect = 1.3 Identities = 26/225 (11%), Positives = 62/225 (27%), Gaps = 88/225 (39%) Query: 148 ISNAENTSKKISDSSRHIQKIIENIEKPLTTTI----------ANIE------------- 184 + A + +++ +SS I +I+ I I A IE Sbjct: 76 VEEAVSAVEELEESSDEIGEIVSVI-----DDIADQTNLLALNAAIEAARAGEAGRGFAV 130 Query: 185 ---------------------TISTVLANNISHIDKMMHTTQ----VTPHSSDSKNTFNT 219 I + + M ++ ++ Sbjct: 131 VADEVRKLAERSAESAKEIESLIKEIQEETNEAVAAMEESSSEVEEGVELVEETGEALEE 190 Query: 220 ITDLITSLDKMIKAIDLQKVNQILENIQVSSNNFVKSSDQVINTVHDVRETTQTFQEVGQ 279 I D + + +++ I +++ S++V + ++ + TQ Sbjct: 191 IVDSVEE------------IADLVQEIAAATDEQAAGSEEVNAAIDEIAQVTQ------- 231 Query: 280 KIDHLLSDFSSKMKSKETSAFLENIADSTSNMRSSISAIREITDQ 324 ET+A E I+ + + ++E+ +Q Sbjct: 232 ----------------ETAAMSEEISAAAEELSGLAEELKELVEQ 260 >gnl|CDD|185619 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein; Provisional. Length = 2722 Score = 34.4 bits (79), Expect = 0.057 Identities = 32/195 (16%), Positives = 80/195 (41%), Gaps = 2/195 (1%) Query: 239 VNQILENIQVSSNNFVKSSDQVI--NTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKSKE 296 + + E I F K + NT+ + +G+ ++ + FSS + KE Sbjct: 2362 IIKTKEKINNLKEEFSKLLKNIKRNNTLCNNNNIKDFISNIGKSVETIKQRFSSNLPEKE 2421 Query: 297 TSAFLENIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKI 356 +E + N+ + I + KI+ I+ +N +N ++ + + + Sbjct: 2422 KLHQIEENLNEIKNIMNETKRISNVDAFTNKILQDIDNEKNKENNNMNAEKIDDLIENVT 2481 Query: 357 NNISALKENNSLFKDAQRAMHTFRDTSEKINRYIPSIGNNLQNFSQSGLNDIQNLVRKLQ 416 ++ +K + DA R + +D K+ + NN N +++ +++ ++ +L+ Sbjct: 2482 SHNEKIKSELLIINDALRRVKEKKDEMNKLFNSLTENNNNNNNSAKNIVDNSTYIINELE 2541 Query: 417 ETVNHFDDCLNNFER 431 V+ ++ L+ + Sbjct: 2542 SHVSKLNELLSYIDN 2556 Score = 31.0 bits (70), Expect = 0.68 Identities = 20/141 (14%), Positives = 55/141 (39%), Gaps = 10/141 (7%) Query: 306 DSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKINNISA-LKE 364 DS + S+ ++ +++ IN+I+ + +N++ + + I + ++ Sbjct: 481 DSMDSKEKKESSDSNYQEKVDELLQIINSIKEKNNIVNNNFKNIEDYYITIEGLKNEIEG 540 Query: 365 NNSLFKDAQRAMHTFRDTSEKINRYIPSIGNNLQNFSQ---------SGLNDIQNLVRKL 415 L K +++ T + I N ++ + S ++I N+++++ Sbjct: 541 LIELIKYYLQSIETLIKDEKLKRSMKNDIKNKIKYIEENVDHIKDIISLNDEIDNIIQQI 600 Query: 416 QETVNHFDDCLNNFERNPQDI 436 +E +N F D+ Sbjct: 601 EELINEALFNKEKFINEKNDL 621 Score = 30.2 bits (68), Expect = 1.1 Identities = 39/255 (15%), Positives = 92/255 (36%), Gaps = 33/255 (12%) Query: 199 KMMHTTQVTPHSSDSKNTFNTITDLITSLDKMIKAIDLQKVNQILENIQVSSNNFVKSSD 258 + + T + + I ++I +L K ++ + K N I + + + S + Sbjct: 665 QTLLNTSKNEYEKLEFMKSDNIDNIIKNLKKELQNLLSLKENIIKKQLNNIEQDISNSLN 724 Query: 259 QVINTVHDVRETTQTFQEVGQKIDHLLSDFSSKMK-------------------SKETSA 299 Q +D++ + + ++E +K++ ++ +E Sbjct: 725 QYTIKYNDLKSSIEEYKEEEEKLEVYKHQIINRKNEFILHLYENDKDLPDGKNTYEEFLQ 784 Query: 300 FLENIADSTSNMRSSISAIREITDQRQKIISTINTI--------ENITSNLNDSSQKFAE 351 + + I + + + + I+ ++E Q ++++ N + E L QKF Sbjct: 785 YKDTILNKENKISNDINILKENKKNNQDLLNSYNILIQKLEAHTEKNDEELKQLLQKFPT 844 Query: 352 LMSKINNISALKENNSLFKDAQRAMHTFRDTSEKINRYIPSIG--NNLQNFSQSGLNDIQ 409 +N LKE F + + + E +N+ I I N N S S ++ Sbjct: 845 EDENLN----LKELEKEFNENNQIVDNIIKDIENMNKNINIIKTLNIAINRSNSNKQLVE 900 Query: 410 NLVRKLQETVNHFDD 424 +L+ + N + Sbjct: 901 HLLNNKIDLKNKLEQ 915 Score = 29.8 bits (67), Expect = 1.6 Identities = 49/258 (18%), Positives = 96/258 (37%), Gaps = 36/258 (13%) Query: 211 SDSKNTFNTITDLITSLDKMIKAID------------LQKVNQILENIQVSSNNFVKSSD 258 S K+ ++ I S+D K LQ +N I E + +NNF D Sbjct: 467 SIMKSFYDLIISEKDSMDSKEKKESSDSNYQEKVDELLQIINSIKEKNNIVNNNFKNIED 526 Query: 259 QVI------NTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKSK-ETSAFLENIADSTSNM 311 I N + + E + + Q I+ L+ D K K + ++ I ++ ++ Sbjct: 527 YYITIEGLKNEIEGLIELIKYYL---QSIETLIKDEKLKRSMKNDIKNKIKYIEENVDHI 583 Query: 312 RSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKINN------ISALKEN 365 + IS EI + Q+I IN + E + I N + L + Sbjct: 584 KDIISLNDEIDNIIQQIEELINEALFNKEKFINEKNDLQEKVKYILNKFYKGDLQELLDE 643 Query: 366 NSLFKDAQRAMHTFRDTSEKINRYIPSIGNNLQNFSQSGLNDIQNLVRKLQETVNHF--- 422 S F D + ++ + E + + + N + ++I N+++ L++ + + Sbjct: 644 LSHFLDDHKYLYHEAKSKEDLQTLLNTSKNEYEKLEFMKSDNIDNIIKNLKKELQNLLSL 703 Query: 423 -----DDCLNNFERNPQD 435 LNN E++ + Sbjct: 704 KENIIKKQLNNIEQDISN 721 Score = 27.9 bits (62), Expect = 5.6 Identities = 45/270 (16%), Positives = 100/270 (37%), Gaps = 33/270 (12%) Query: 152 ENTSKKISDSSRHIQKIIENIE-KPLTTTIANIETISTVLANNISHIDKMMHTTQVTPHS 210 + KI + H KI ++E K + I IE I ++N I+K + + Sbjct: 1329 KQVEAKIEQAKEHKNKIYGSLEDKQIDDEIKKIEQIKEEISNKRKEINKYLSNIKSNKEK 1388 Query: 211 SDSKNTFNTITDLITSLDKMIKAIDLQKVNQILENIQVSSNNFVKSSDQVINTVHDVRET 270 D + + + ID ++ +E N +K +D + E Sbjct: 1389 CDLH---------VRNASRGKDKIDFLNKHEAIEPSNSKEVNIIKITDNINKCKQYSNEA 1439 Query: 271 TQTFQEVGQKIDHLLSDFSSKMKSKETSAFLENIADSTSNM--RSSISAIREITDQRQKI 328 +T + + D ++ +N+ SSI + ++++K Sbjct: 1440 METENKADENNDSII-----------------KYEKEITNILNNSSILGKKTKLEKKKKE 1482 Query: 329 ISTINT-IENITSNLNDSSQKFAELMSKINNISALKENNSLFKDAQ--RAMHTFRDTSEK 385 + I I S + K +E ++++N +K + + + A T + + Sbjct: 1483 ATNIMDDINGEHSIIKTKLTKSSEKLNQLNEQPNIKREGDVLNNDKSTIAYETIQYNLGR 1542 Query: 386 INRYIPSIGNNLQNFSQSGLNDIQNLVRKL 415 + + +I N+++ ++ LN Q+L+R + Sbjct: 1543 VKHNLLNI-LNIKDEIETILNKAQDLMRDI 1571 >gnl|CDD|183759 PRK12804, PRK12804, flagellin; Provisional. Length = 301 Score = 33.8 bits (77), Expect = 0.10 Identities = 19/66 (28%), Positives = 35/66 (53%), Gaps = 5/66 (7%) Query: 316 SAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKINNISALKE-----NNSLFK 370 AI++++DQR K+ + N +E+ +NL S + S+I ++ KE N++ Sbjct: 220 GAIKQVSDQRSKLGAVQNRLEHTINNLGASGENLTAAESRIRDVDMAKEMSEFTKNNILS 279 Query: 371 DAQRAM 376 A +AM Sbjct: 280 QASQAM 285 >gnl|CDD|148180 pfam06419, COG6, Conserved oligomeric complex COG6. COG6 is a component of the conserved oligomeric golgi complex, which is composed of eight different subunits and is required for normal golgi morphology and localisation. Length = 615 Score = 33.4 bits (77), Expect = 0.13 Identities = 36/200 (18%), Positives = 78/200 (39%), Gaps = 29/200 (14%) Query: 235 DLQKVNQILENIQVSSNNFVKSSDQVINTVHDVRETTQTFQEVGQKIDHLLSDFSS-KMK 293 + V + L+ I + + +++ + +++ ++ T+ ++ ++ L+ ++K Sbjct: 39 EFGPVAEQLKRIGEDLSKLNNTCEEMKSQLNNAKQDTK---DLLEEASTLMEQKRQIELK 95 Query: 294 SKETSAFLENIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNL--NDSSQKFAE 351 K AFL S + S I D+ +++ + I L ++ + E Sbjct: 96 KKLLKAFLSRFTLSEDELDVLTSGEEPIDDEFFQVLDKVKKIHEDCEILLGGENQRAGLE 155 Query: 352 LMSKINNI--SALKENNSLFKDAQRAMHTFRDTSEKINRYIPSIGNNLQNFSQSGLNDIQ 409 +M K+N +A N L++ QR FR NL++ ++ Sbjct: 156 IMEKMNRNLEAA---NEKLYRWTQR---EFR-------------NLNLESPQILST--LR 194 Query: 410 NLVRKLQETVNHFDDCLNNF 429 +R L E F CL+ F Sbjct: 195 KALRVLAERPVLFQSCLDEF 214 >gnl|CDD|150562 pfam09903, DUF2130, Uncharacterized protein conserved in bacteria (DUF2130). This domain, found in various hypothetical prokaryotic proteins, has no known function. Length = 267 Score = 33.4 bits (77), Expect = 0.13 Identities = 17/85 (20%), Positives = 40/85 (47%), Gaps = 5/85 (5%) Query: 274 FQEVGQKIDHLLSDFSSKMKSKETSAFLENIADSTSNMRSSISAIREITDQRQKIISTIN 333 QE + + + +F + + + A ++N + ++ I AI + + +K I + Sbjct: 178 EQENVKDKNIDIYNFEKEFRQFKE-AIVKNFELAQKDLDKEIKAIDKAWKKLEKQIEAV- 235 Query: 334 TIENITSNLNDSSQKFAEL-MSKIN 357 + + T+NL ++ K A L + K+ Sbjct: 236 -LGS-TNNLRGANNKIAGLTIKKLT 258 >gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional. Length = 895 Score = 33.3 bits (76), Expect = 0.14 Identities = 52/321 (16%), Positives = 114/321 (35%), Gaps = 48/321 (14%) Query: 132 NQRAMITATPSGINYFISNAENTSKKISDSSRHIQKIIENIEKPLTTTIANIETISTVLA 191 NQ + N ++ + E+ KKI + S++I+++ I + L + + I L Sbjct: 353 NQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEILKIQEIDPDAIKKELN 412 Query: 192 NNISHIDKMMHTTQVTPHSSDSKNTFNTITDLITSLDKMIKAIDLQKVNQILENIQV--- 248 + + SS + I L +LD +L + ++L V Sbjct: 413 EINVKLQDI---------SSKVSSLNQRIRALRENLD------ELSRNMEMLNGQSVCPV 457 Query: 249 --------SSNNFVKSSDQVINTVH-DVRETTQTFQEVGQKIDHLLS--DFSSKMKSKET 297 SN+ + ++ + + +RE +++ +KI L ++ + ++ Sbjct: 458 CGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKS 517 Query: 298 SAFLENIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAE---LMS 354 I + +++ I E+ D+ K N +++ L D K ++ Sbjct: 518 INEYNKIESARADLEDIKIKINELKDKHDKYEEIKNRYKSL--KLEDLDSKRTSWLNALA 575 Query: 355 KINNI------SALKENNSLFKDAQRAMHTFRDTSEKINRYIPSI-------GNNLQNFS 401 I+ I S E D + + YI NNL N Sbjct: 576 VISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEANNLNN-K 634 Query: 402 QSGLNDIQNLVRKLQETVNHF 422 + + + + L+ KL+ ++++ Sbjct: 635 YNEIQENKILIEKLRGKIDNY 655 Score = 32.2 bits (73), Expect = 0.28 Identities = 46/285 (16%), Positives = 106/285 (37%), Gaps = 19/285 (6%) Query: 142 SGINYFISNAENTSKKISDSSRHIQKIIENIE-KPLTTTIANIETISTVLANNISHIDKM 200 S + I E K I + ++K E +E + + +I I + A+ K+ Sbjct: 479 SRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNKIESARADLEDIKIKI 538 Query: 201 MHTTQVTPHSSDSKNTFNTI---------TDLITSLDKMIKAIDLQKVNQILENIQVSSN 251 + KN + ++ T + +L +I ID++ I+ N Sbjct: 539 NELKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLNAL-AVISLIDIETNRSRSNEIKKQLN 597 Query: 252 NFVKSSDQVINTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKS-KETSAFLENIADSTSN 310 + ++ D + ++I++ ++ ++K +E +E + N Sbjct: 598 DLESRLQEIEIGFPDDKSYIDKSI---REIENEANNLNNKYNEIQENKILIEKLRGKIDN 654 Query: 311 MRSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKINNISALKE-NNSLF 369 + I+ I I ++I S IN IE+ NL S + + + + + E + Sbjct: 655 YKKQIAEIDSIIPDLKEITSRINDIED---NLKKSRKALDDAKANRARLESTIEILRTRI 711 Query: 370 KDAQRAMHTFRDTSEKINRYIPSIGNNLQNFSQSGLNDIQNLVRK 414 + ++ +T E + + +IG+ + + + ++RK Sbjct: 712 NELSDRINDINETLESMKKIKKAIGDLKRLREAFDKSGVPAMIRK 756 Score = 28.7 bits (64), Expect = 3.5 Identities = 50/297 (16%), Positives = 103/297 (34%), Gaps = 28/297 (9%) Query: 136 MITATPSGINYFISNAENTSKKISDSSRHIQKIIENIEKPLTTTIANIETISTVLANNIS 195 M+ A S I+Y E K + +I+K I + EK + T+ IE +S N + Sbjct: 177 MLRAEISNIDYL----EEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMD 232 Query: 196 HIDKMMHTTQVTPHSSDSKNTFNTITDLITSLDKMIKAIDLQKVNQILENIQVSSNNFVK 255 + + + N +++ D+ + IK + ++ +N N+ K Sbjct: 233 DYN----------NLKSALNELSSLEDMKNRYESEIKTAESDLSMELEKN------NYYK 276 Query: 256 SSDQVINTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKSKETSAFLENIADSTSNMRSSI 315 ++ + V + +++ F K + L NI + + I Sbjct: 277 ELEERHMKI--------INDPVYKNRNYINDYFKYKNDIENKKQILSNIDAEINKYHAII 328 Query: 316 SAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKINNISALKENNSLFKDAQRA 375 + + I + +++ + + + + S + +I +LK+ + Sbjct: 329 KKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIER 388 Query: 376 MHTFRDTSEKINRYIPSIGNNLQNFSQSGLNDIQNLVRKLQETVNHFDDCLNNFERN 432 M F KI P N L DI + V L + + + L+ RN Sbjct: 389 MSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRN 445 >gnl|CDD|177394 PHA02562, 46, endonuclease subunit; Provisional. Length = 562 Score = 32.3 bits (74), Expect = 0.25 Identities = 29/177 (16%), Positives = 68/177 (38%), Gaps = 25/177 (14%) Query: 218 NTITDLITSLDKMIKAI--DLQ-KVNQILENIQVSSNNFVKSSDQVINTVHDVRETTQTF 274 T I K Q K ++++E + + +D+++N V D+ + + Sbjct: 198 KTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAAL 257 Query: 275 QEVGQKIDHLLSDFSSKMKS--KETSAFLEN---------IADSTSNMRSSISAIREITD 323 ++ + SK++ K + + I++ + ++E+ Sbjct: 258 NKLNTA----AAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQH 313 Query: 324 QRQKIISTINTIENITSNLNDSSQKFAELMSKINN----ISALKENNSLFKDAQRAM 376 +K+ + I+ +E I N+ S+K EL +KI+ + L + K + A+ Sbjct: 314 SLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKA---KKVKAAI 367 >gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. These proteins are found in P. falciparum, P. vivax and P. yoelii. Length = 2757 Score = 31.6 bits (71), Expect = 0.41 Identities = 59/311 (18%), Positives = 113/311 (36%), Gaps = 54/311 (17%) Query: 152 ENTSKKISDSSRHIQKIIENIEKPLTTTIANI-ETISTVLANNISHIDKMMHTTQVTPHS 210 E + I K I NIE + T+I NI + I + NI ++ ++ + Sbjct: 1025 EKATNDIEQKIEDANKNIPNIEIAIHTSIYNIIDEIEKEIGKNIELLN-----KEILEEA 1079 Query: 211 SDSKNTFNTI--------------------TDLITSLDKMIKAIDLQKVNQI--LENIQV 248 + FN I D I + IK +D + + I LE I+ Sbjct: 1080 EINITNFNEIKEKLKHYNFDDFGKEENIKYADEINKIKDDIKNLDQKIDHHIKALEEIKK 1139 Query: 249 SSNNFVKSSDQVINTVHDVRETT----------QTFQEVGQKID----------HLLSDF 288 S N++ IN + DV + + + + KID LL++ Sbjct: 1140 KSENYIDEIKAQINDLEDVADKAISNDDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEI 1199 Query: 289 SSKMKSKETSAFLENIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDSSQK 348 + K K + ++ I S + + +I ++++K I +E +L++ +K Sbjct: 1200 AEIEKDKTSLEEVKGINLSYGKNLGKL-FLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEK 1258 Query: 349 FAELMSKINNISALKENNSLFKDAQRAMHTFRDTSEKINRYIPSIGNNL-----QNFSQS 403 E+ +++ +K F + S+K + I I +S Sbjct: 1259 SPEIENEMGIEMDIKAEMETFNISHDDDKDHHIISKKHDENISDIREKSLKIIEDFSEES 1318 Query: 404 GLNDIQNLVRK 414 +NDI+ ++K Sbjct: 1319 DINDIKKELQK 1329 Score = 30.4 bits (68), Expect = 0.96 Identities = 37/166 (22%), Positives = 76/166 (45%), Gaps = 10/166 (6%) Query: 201 MHTTQVTPHSSDSKNTFNTITDLITSLDKMIKAIDLQKVNQILENI-----QVSS--NNF 253 M T V H S+ +N N + D+I + K I + +N+ILE+ ++S+ N++ Sbjct: 715 METATVELHLSNIENKKNELLDIIVEIKKHIHGEINKDLNKILEDFKNKEKELSNKINDY 774 Query: 254 VKSSDQVINTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKSKETSAFLENIADSTSNMRS 313 K D++ + E + + ID+ + D +K ++ +++ I+ + Sbjct: 775 AKEKDELNKYKSKISEIKNHYND-QINIDN-IKDEDAKQNYDKSKEYIKTISIKEDEIFK 832 Query: 314 SISAIREITDQ-RQKIISTINTIENITSNLNDSSQKFAELMSKINN 358 I+ ++ + D K+ IN N ++ ++FAEL +KI Sbjct: 833 IINEMKFMKDDFLNKVDKFINFENNCKEKIDSEHEQFAELTNKIKA 878 Score = 29.6 bits (66), Expect = 1.6 Identities = 48/237 (20%), Positives = 102/237 (43%), Gaps = 35/237 (14%) Query: 158 ISDSSRHIQKIIENIEKPLTTTIANIETISTVLANNISHIDKMMHTTQVTPHSSDSKNTF 217 ++D+++ +IE EK T I N+ + T+ NN + Q+ +S+ + Sbjct: 2217 LADATKDHNNLIEK-EKEATKIINNLTELFTIDFNNADADILHNNKIQIIYFNSELHKSI 2275 Query: 218 NTITDLITSLDKMIKAIDLQKVNQILENIQVSSNNFVK-SSDQVINTVHDVRETTQTFQE 276 +I L + K +++ +N+ +I +N ++ ++ ++D++E Q + Sbjct: 2276 ESIKKLYKKI-NAFKLLNISHINEKYFDISKEFDNIIQLQKHKLTENLNDLKEIDQYISD 2334 Query: 277 VGQKIDHLLSDFSS-----------KMKSKETSAF-LENIADSTS-NMRSSISAIREITD 323 H L++ ++ + ++E A +ENI + + N+ I I ++T+ Sbjct: 2335 KKNIFLHALNENTNFNFNALKEIYDDIINRENKADEIENINNKENENIMQYIDTITKLTE 2394 Query: 324 QRQKIISTINTIEN-------------------ITSNLNDSSQKFAELMSKINNISA 361 + Q I+ + T EN I NL + Q F E+++KI+ I A Sbjct: 2395 KIQDILIFVTTYENDNNIIKQHIQDNDENDVSKIKDNLKKTIQSFQEILNKIDEIKA 2451 >gnl|CDD|148118 pfam06314, ADC, Acetoacetate decarboxylase (ADC). This family consists of several acetoacetate decarboxylase (ADC) proteins (EC:4.1.1.4). Length = 239 Score = 30.8 bits (70), Expect = 0.71 Identities = 11/57 (19%), Positives = 18/57 (31%), Gaps = 11/57 (19%) Query: 22 FFSIYWLSRSNQYDGPMAEVIIRIPGSVDGLSTDSSVRFNGIPVGRIVGLFLDQEYP 78 + S+ GP E I +P VR+ G +++D E Sbjct: 52 AVEFARMPDSSFGFGPYTEAGIVVP-----------VRYGGKEGAYCHFMYVDNEAA 97 >gnl|CDD|148076 pfam06248, Zw10, Centromere/kinetochore Zw10. Zw10 and rough deal proteins are both required for correct metaphase check-pointing during mitosis. These proteins bind to the centromere/kinetochore. Length = 571 Score = 30.4 bits (69), Expect = 1.0 Identities = 30/154 (19%), Positives = 60/154 (38%), Gaps = 18/154 (11%) Query: 222 DLITSLDKMIKAIDLQKVNQILENIQVSSNNFVKSSDQ--VINTVHDVRETTQTFQEVGQ 279 D T L K + ++ N+ +++ V++ N + S V+ T + +E + E + Sbjct: 357 DNDTDLLKYAEDVNTHFANKKCQDVLVTARNLMTSDIHNTVLVTPDEAKEGERLGNENEK 416 Query: 280 KIDHLLSDFSSKMKSKETSAFLENIADSTSNMRSSISAIREIT----DQRQKIISTINTI 335 + L M S+ ++ + D T + E T ++ TI I Sbjct: 417 HLSPLSFLLPRCMVSESVQQLMK-LLDKT---------LSEATFSSEGCADQLAYTIRNI 466 Query: 336 ENITSNLNDSSQKFAELMSKINNISALKENNSLF 369 + + +L+ KI +SAL NN ++ Sbjct: 467 --LQLYYDVVPTYHKKLLLKIPQLSALFHNNCMY 498 >gnl|CDD|162740 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. Length = 1164 Score = 30.0 bits (68), Expect = 1.2 Identities = 24/124 (19%), Positives = 47/124 (37%), Gaps = 18/124 (14%) Query: 316 SAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKI--------NNISALKENNS 367 +E++D +KI IE + ++ EL + N S LKE + Sbjct: 706 ELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEA 765 Query: 368 LFKDAQRAMHTFRDTSEKINRYIPSIGNNLQNFSQSGLNDIQNLVRKLQETVNHFDDCLN 427 ++ + +H + + + S S + +IQ + KL+E V+ + L Sbjct: 766 RIEELEEDLHKLEEALNDLEARL----------SHSRIPEIQAELSKLEEEVSRIEARLR 815 Query: 428 NFER 431 E+ Sbjct: 816 EIEQ 819 >gnl|CDD|173321 PTZ00021, PTZ00021, falcipain-2; Provisional. Length = 489 Score = 29.4 bits (66), Expect = 2.0 Identities = 29/135 (21%), Positives = 60/135 (44%), Gaps = 21/135 (15%) Query: 280 KIDHLLSDFSSKMKSKETSAFLENIADSTSNMRSSISAIREITDQRQKIISTINTIENIT 339 K+ L+S + + ++ET E + + +++ S + R+ + N + Sbjct: 95 KLQELISSYDKEKNNQETKNSGELKTFADFDGKNNKSV----ENFRKNFGNLKNVKKVYL 150 Query: 340 SNLNDSSQKFAELMSKINNISA----LKENNSLFKDA---QRAMHTFRDTSEKINRYIPS 392 N DS KF LM+ + N+++ +KE+ ++ Q+ +F + KIN + Sbjct: 151 INFADS--KF--LMTNLENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAH--- 203 Query: 393 IGNNLQNFS-QSGLN 406 NN +N + G+N Sbjct: 204 --NNKENVLYKKGMN 216 >gnl|CDD|162917 TIGR02550, flagell_flgL, flagellar hook-associated protein 3. This protein family consists of flagellar hook-associated proteins designated FlgL (or HAP3) encoded in bacterial flagellar operons. A N-terminal region of about 150 residues and a C-terminal region of about 85 residues are conserved. Members show considerable length heterogeneity between these two well-conserved terminal regions; members of the family vary between 287 to over 500 residues in length. This model distinguishes FlgL from the flagellin gene product FliC. Length = 306 Score = 28.9 bits (65), Expect = 3.2 Identities = 17/86 (19%), Positives = 32/86 (37%), Gaps = 3/86 (3%) Query: 182 NIETISTVLANNISHIDKMMHTTQVTPHSSDSKNTFNTITDLITSLDKMIKAIDLQKVNQ 241 + +A+N S ++ K+ FNT +LI +L + A+ Sbjct: 158 DGGQRKVQVASNRSMPVGHPGDKVFMSIPNNDKSVFNTGDNLIDALKSLEDALSSGDA-- 215 Query: 242 ILENIQVSSNNFVKSSDQVINTVHDV 267 + S N K+ D V++ +V Sbjct: 216 -QAALSASLNELDKALDNVLSARAEV 240 >gnl|CDD|184341 PRK13810, PRK13810, orotate phosphoribosyltransferase; Provisional. Length = 187 Score = 28.7 bits (64), Expect = 3.4 Identities = 16/69 (23%), Positives = 34/69 (49%) Query: 286 SDFSSKMKSKETSAFLENIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDS 345 S F +K ++ LE++ S ++R +I +RE + +I+ ++ E NL ++ Sbjct: 113 SRFVGDLKPEDRIVMLEDVTTSGGSVREAIEVVREAGAYIKYVITVVDREEGAEENLKEA 172 Query: 346 SQKFAELMS 354 + L+S Sbjct: 173 DVELVPLVS 181 >gnl|CDD|162462 TIGR01647, ATPase-IIIA_H, plasma-membrane proton-efflux P-type ATPase. This model describes the plasma membrane proton efflux P-type ATPase found in plants, fungi, protozoa, slime molds and archaea. The best studied representative is from yeast. Length = 755 Score = 28.4 bits (64), Expect = 3.4 Identities = 16/47 (34%), Positives = 22/47 (46%), Gaps = 2/47 (4%) Query: 135 AMITATPSGINYFISNAENTSKKISDSSRHIQKIIENIEKPLTTTIA 181 A++TAT G+N F A + S H+QKI+ I L I Sbjct: 171 AVVTAT--GMNTFFGKAAALVQSTETGSGHLQKILSKIGLFLIVLIG 215 >gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional. Length = 815 Score = 28.6 bits (64), Expect = 3.4 Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 12/94 (12%) Query: 269 ETTQTFQEVGQKIDHLLSDFSSKMKS--KETSAF-LENIADSTSNMRSSISAIREITDQR 325 E ++ +E GQK + + DFS+K S T F + +I + + S++ +R+I Sbjct: 508 EFVKSEKETGQKAEAMWLDFSNKWFSLVHSTRPFVIRDIDELADHAASALETVRDIN--- 564 Query: 326 QKIISTINTIENIT-SNLNDS-SQKFAELMSKIN 357 + S + I ++T S L+D S ++ +L I+ Sbjct: 565 --VASRL--IGDMTGSTLDDPLSDRYKKLGCSIS 594 >gnl|CDD|168168 PRK05674, PRK05674, gamma-carboxygeranoyl-CoA hydratase; Validated. Length = 265 Score = 28.6 bits (64), Expect = 3.5 Identities = 12/38 (31%), Positives = 26/38 (68%) Query: 23 FSIYWLSRSNQYDGPMAEVIIRIPGSVDGLSTDSSVRF 60 F+ WLSR+++ + A++I + ++D + +D+S+RF Sbjct: 16 FATLWLSRADKNNAFNAQMIRELILALDQVQSDASLRF 53 >gnl|CDD|177432 PHA02607, wac, fibritin; Provisional. Length = 454 Score = 28.5 bits (64), Expect = 4.2 Identities = 17/77 (22%), Positives = 31/77 (40%), Gaps = 4/77 (5%) Query: 214 KNTFNTITDLITSLDKMIKAIDLQKVNQILENIQVSSNNFVKSSDQVINTVHDVRETTQT 273 KN + + DK+ + +D VN I EN+ V + V DQ+ V D+ + Sbjct: 52 KNVEQLDENTKKTKDKINEVVDD--VNTIQENLDVIGDISV--IDQINQNVADIEVLKKD 107 Query: 274 FQEVGQKIDHLLSDFSS 290 + K+ ++ Sbjct: 108 VSDTTDKLAGTTNEVDE 124 >gnl|CDD|149682 pfam08702, Fib_alpha, Fibrinogen alpha/beta chain family. Fibrinogen is a protein involved in platelet aggregation and is essential for the coagulation of blood. This domain forms part of the central coiled coiled region of the protein which is formed from two sets of three non-identical chains (alpha, beta and gamma). Length = 146 Score = 27.7 bits (62), Expect = 6.3 Identities = 24/95 (25%), Positives = 41/95 (43%), Gaps = 12/95 (12%) Query: 165 IQKIIENIEKPLTTTIANIETISTVLANN-------ISHIDKMMHTTQVTPHSSDSKNTF 217 IQ +++ EK + I ++E + LAN+ + HI + Q + + N + Sbjct: 23 IQDLLDKYEKDVDKRIEDLENLLDQLANSTSSAHQYVKHIKDSLRGDQ--KQAQPNDNIY 80 Query: 218 NTITDLITSLDKMIKAIDLQKVNQILENIQVSSNN 252 N + SL KMI+ I K+N I+V Sbjct: 81 NAYSK---SLRKMIEYILETKINTQESQIRVLQEV 112 >gnl|CDD|163138 TIGR03110, exosort_Gpos, exosortase family protein. Members of this protein family are found in a modest number of non-pathogenic Gram-positive bacteria, including three species of Lactococcus and three paralogs in Clostridium acetobutylicum. This protein appears related to the conserved core region of a family of proposed transpeptidases, exosortase (previously EpsH), thought to act on PEP-CTERM proteins, and members of the seed alignment include all proposed active site residues. However, in contrast to mainline exosortase (TIGR02602) and archaeal (TIGR03762), and cyanobacterial (TIGR03763) variants, this family has not yet been matched to a cognate PEP-CTERM-like sorting signal. Length = 187 Score = 27.7 bits (62), Expect = 6.8 Identities = 10/22 (45%), Positives = 15/22 (68%) Query: 9 SVGLFVVSILFFSFFSIYWLSR 30 SVGLF + FF + +++LSR Sbjct: 31 SVGLFFILTFFFRNYLVWFLSR 52 >gnl|CDD|150104 pfam09325, Vps5, Vps5 C terminal like. Vps5 is a sorting nexin that functions in membrane trafficking. This is the C terminal dimerization domain. Length = 236 Score = 27.2 bits (61), Expect = 8.1 Identities = 37/211 (17%), Positives = 73/211 (34%), Gaps = 35/211 (16%) Query: 197 IDKMMHTTQVTPHSSDSKNT-----FNTITDLITSLDKMIKAIDLQKVNQILENIQVSSN 251 + + + S K F I SL+ L+K+ + LE + Sbjct: 1 LKGLFGKFTSSVSKSTYKFNEPDPWFEDKRQYIDSLES-----QLKKLYKALELLVNQRK 55 Query: 252 NFVKSSD---QVINTVHDVRETT------QTFQEVGQKIDHLLSDFSSKMKSKETSAFLE 302 ++ + + + + +T EV ++I LL + + + E Sbjct: 56 ELASATGEFAKSLAMLASLELSTGLSRALSQLAEVEERIKELLERQALQ----DVLTLGE 111 Query: 303 NIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKINN---- 358 I + +R I +++ + +QR K E +L+ + A+L + Sbjct: 112 TIDE---YLRL-IGSVKAVFNQRVKAWQQWQNAEQ---DLSKKRAQLAKLKGAGKSRQDK 164 Query: 359 -ISALKENNSLFKDAQRAMHTFRDTSEKINR 388 A KE + + Q+A F D SE I + Sbjct: 165 LQQAKKEVDEAERKVQQAKKEFEDISETIKK 195 >gnl|CDD|150441 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division. Length = 792 Score = 27.4 bits (61), Expect = 8.3 Identities = 16/93 (17%), Positives = 33/93 (35%), Gaps = 9/93 (9%) Query: 150 NAENTSKKISDSSRHIQKIIENIEKPL--------TTTIANIETISTVLANNISHIDKMM 201 ++ + S + + I IEN+ K L T I+ I A + ++++ Sbjct: 484 SSSLGIRDKSGTRKDILSAIENVYKLLLELEDLERTMPPPAIDEIMEWRAEGQALVEQLW 543 Query: 202 HTTQVTPHSSDSKNTFNTITDLITSLDKMIKAI 234 V D + + ++ S K K + Sbjct: 544 EELHVMEPIEDDSLSPHPFISML-SYPKGKKIL 575 >gnl|CDD|180134 PRK05567, PRK05567, inosine 5'-monophosphate dehydrogenase; Reviewed. Length = 486 Score = 27.1 bits (61), Expect = 8.6 Identities = 11/38 (28%), Positives = 20/38 (52%), Gaps = 2/38 (5%) Query: 343 NDSSQKFAELMSKINNISALKENNSLFKDAQRAMHTFR 380 D SQ +E+M+K ++ E +L ++A +H R Sbjct: 143 TDLSQPVSEVMTKERLVTV-PEGTTL-EEALELLHEHR 178 >gnl|CDD|182099 PRK09831, PRK09831, putative acyltransferase; Provisional. Length = 147 Score = 27.2 bits (60), Expect = 8.7 Identities = 18/54 (33%), Positives = 22/54 (40%), Gaps = 10/54 (18%) Query: 55 DSSVR---FNGIPVGRIVG-------LFLDQEYPNHSLAKALIRPDTPLYPSTT 98 S VR N PVG I LF+D EY +A AL++P T Sbjct: 52 KSQVRVAVINAQPVGFITCIEHYIDMLFVDPEYTRRGVASALLKPLIKSESELT 105 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.314 0.129 0.350 Gapped Lambda K H 0.267 0.0695 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 6,779,275 Number of extensions: 421045 Number of successful extensions: 1274 Number of sequences better than 10.0: 1 Number of HSP's gapped: 1245 Number of HSP's successfully gapped: 97 Length of query: 452 Length of database: 5,994,473 Length adjustment: 97 Effective length of query: 355 Effective length of database: 3,898,497 Effective search space: 1383966435 Effective search space used: 1383966435 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 59 (26.6 bits)