RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddB
21,608 sequences; 5,994,473 total letters
Searching..................................................done
Query= gi|254780140|ref|YP_003064553.1| putative ABC transporter,
substrate-binding protein [Candidatus Liberibacter asiaticus str.
psy62]
(452 letters)
>gnl|CDD|162151 TIGR00996, Mtu_fam_mce, virulence factor Mce family protein.
Members of this paralogous family are found as six
tandem homologous proteins in the same orientation per
cassette, in four separate cassettes in Mycobacterium
tuberculosis. The six members of each cassette represent
six subfamilies. One subfamily includes the protein mce
(mycobacterial cell entry), a virulence protein required
for invasion of non-phagocytic cells.
Length = 291
Score = 48.8 bits (117), Expect = 3e-06
Identities = 52/356 (14%), Positives = 106/356 (29%), Gaps = 83/356 (23%)
Query: 8 TSVGLFVVSILFFSFFSI-YWLSRSNQYDGPMAEVIIRIPGSVDGLSTDSSVRFNGIPVG 66
+V L V +L F + + Y A GL S VR G+PVG
Sbjct: 6 AAVVLAAVVVLGVVFLRLPFRGFGPGTY-TVYAY-----FTDAGGLYPGSKVRVRGVPVG 59
Query: 67 RIVGLFLDQEYPNHSLAKALIRPDTPLYPSTTATIRTQGLAGITYIELSTLRKEKKTIFQ 126
++ + L N + + + + TA IR+ L G Y+EL+ +
Sbjct: 60 KVTAISLP---GNGARVTFSLDRGVTIPANATAAIRSTTLLGSRYVELTPPKGPGGPPLP 116
Query: 127 ----IATERNQRAMITATPSGINYFISNAENTSKKISDSSRHIQKIIENIEKPLTTTIAN 182
I R T+ P I+ + + +
Sbjct: 117 PGGVIPLAR------TSVPPEIDDLLGSLTRLLNGLD--------------------PEK 150
Query: 183 IETISTVLANNISHIDKMMHTTQVTPHSSDSKNTFNTITDLITSLDKMIKAID--LQKVN 240
+ I LA P + +L+ L ++ A++ +
Sbjct: 151 LNAILNALAEA---------LAGQGPQ----------LRNLLDGLAQLTAALNARDGDIG 191
Query: 241 QILENIQVSSNNFVKSSDQVINTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKSKETSAF 300
+++N+ + SDQ+ + ++ T + +D L+ S
Sbjct: 192 ALIDNLNRVLDVLADRSDQLDRLLDNLATLTAQLADRDDALDDALAA---------LSGA 242
Query: 301 LENIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKI 356
+ D + + R + + + + + L D + +L+ +
Sbjct: 243 SAQVRDL-------------LAENRPNLPQALANLAPVLTLLVDYHPELEQLLHLL 285
>gnl|CDD|128579 smart00283, MA, Methyl-accepting chemotaxis-like domains
(chemotaxis sensory transducer). Thought to undergo
reversible methylation in response to attractants or
repellants during bacterial chemotaxis.
Length = 262
Score = 41.5 bits (98), Expect = 4e-04
Identities = 23/124 (18%), Positives = 46/124 (37%), Gaps = 17/124 (13%)
Query: 237 QKVNQILENIQVSSNNFVKSSDQVINTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKSKE 296
+ +I E + +SS +V V V ET + +E+ ++ E
Sbjct: 151 SLIKEIQEETNEAVAAMEESSSEVEEGVELVEETGEALEEIVDSVE-------------E 197
Query: 297 TSAFLENIADSTSNMRSSIS----AIREITDQRQKIISTINTIENITSNLNDSSQKFAEL 352
+ ++ IA +T + AI EI Q+ + I L+ +++ EL
Sbjct: 198 IADLVQEIAAATDEQAAGSEEVNAAIDEIAQVTQETAAMSEEISAAAEELSGLAEELKEL 257
Query: 353 MSKI 356
+ +
Sbjct: 258 VEQF 261
Score = 31.9 bits (73), Expect = 0.32
Identities = 21/104 (20%), Positives = 44/104 (42%), Gaps = 9/104 (8%)
Query: 266 DVRETTQTFQEVGQKIDHLLSDFSSKMKS-----KETSAFLENIADSTSNMRSSISAIRE 320
DV E + ++ L + + +M+ +E +A + IA + + + RE
Sbjct: 1 DVSEAVEEIAAGAEEQAEELEELAERMEELSASIEEVAANADEIAATAQSAAEAAEEGRE 60
Query: 321 ITDQRQKIISTIN-TIENITSN---LNDSSQKFAELMSKINNIS 360
+ + I +E S L +SS + E++S I++I+
Sbjct: 61 AVEDAITAMDQIREVVEEAVSAVEELEESSDEIGEIVSVIDDIA 104
Score = 29.9 bits (68), Expect = 1.3
Identities = 26/225 (11%), Positives = 62/225 (27%), Gaps = 88/225 (39%)
Query: 148 ISNAENTSKKISDSSRHIQKIIENIEKPLTTTI----------ANIE------------- 184
+ A + +++ +SS I +I+ I I A IE
Sbjct: 76 VEEAVSAVEELEESSDEIGEIVSVI-----DDIADQTNLLALNAAIEAARAGEAGRGFAV 130
Query: 185 ---------------------TISTVLANNISHIDKMMHTTQ----VTPHSSDSKNTFNT 219
I + + M ++ ++
Sbjct: 131 VADEVRKLAERSAESAKEIESLIKEIQEETNEAVAAMEESSSEVEEGVELVEETGEALEE 190
Query: 220 ITDLITSLDKMIKAIDLQKVNQILENIQVSSNNFVKSSDQVINTVHDVRETTQTFQEVGQ 279
I D + + +++ I +++ S++V + ++ + TQ
Sbjct: 191 IVDSVEE------------IADLVQEIAAATDEQAAGSEEVNAAIDEIAQVTQ------- 231
Query: 280 KIDHLLSDFSSKMKSKETSAFLENIADSTSNMRSSISAIREITDQ 324
ET+A E I+ + + ++E+ +Q
Sbjct: 232 ----------------ETAAMSEEISAAAEELSGLAEELKELVEQ 260
>gnl|CDD|185619 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 34.4 bits (79), Expect = 0.057
Identities = 32/195 (16%), Positives = 80/195 (41%), Gaps = 2/195 (1%)
Query: 239 VNQILENIQVSSNNFVKSSDQVI--NTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKSKE 296
+ + E I F K + NT+ + +G+ ++ + FSS + KE
Sbjct: 2362 IIKTKEKINNLKEEFSKLLKNIKRNNTLCNNNNIKDFISNIGKSVETIKQRFSSNLPEKE 2421
Query: 297 TSAFLENIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKI 356
+E + N+ + I + KI+ I+ +N +N ++ + + +
Sbjct: 2422 KLHQIEENLNEIKNIMNETKRISNVDAFTNKILQDIDNEKNKENNNMNAEKIDDLIENVT 2481
Query: 357 NNISALKENNSLFKDAQRAMHTFRDTSEKINRYIPSIGNNLQNFSQSGLNDIQNLVRKLQ 416
++ +K + DA R + +D K+ + NN N +++ +++ ++ +L+
Sbjct: 2482 SHNEKIKSELLIINDALRRVKEKKDEMNKLFNSLTENNNNNNNSAKNIVDNSTYIINELE 2541
Query: 417 ETVNHFDDCLNNFER 431
V+ ++ L+ +
Sbjct: 2542 SHVSKLNELLSYIDN 2556
Score = 31.0 bits (70), Expect = 0.68
Identities = 20/141 (14%), Positives = 55/141 (39%), Gaps = 10/141 (7%)
Query: 306 DSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKINNISA-LKE 364
DS + S+ ++ +++ IN+I+ + +N++ + + I + ++
Sbjct: 481 DSMDSKEKKESSDSNYQEKVDELLQIINSIKEKNNIVNNNFKNIEDYYITIEGLKNEIEG 540
Query: 365 NNSLFKDAQRAMHTFRDTSEKINRYIPSIGNNLQNFSQ---------SGLNDIQNLVRKL 415
L K +++ T + I N ++ + S ++I N+++++
Sbjct: 541 LIELIKYYLQSIETLIKDEKLKRSMKNDIKNKIKYIEENVDHIKDIISLNDEIDNIIQQI 600
Query: 416 QETVNHFDDCLNNFERNPQDI 436
+E +N F D+
Sbjct: 601 EELINEALFNKEKFINEKNDL 621
Score = 30.2 bits (68), Expect = 1.1
Identities = 39/255 (15%), Positives = 92/255 (36%), Gaps = 33/255 (12%)
Query: 199 KMMHTTQVTPHSSDSKNTFNTITDLITSLDKMIKAIDLQKVNQILENIQVSSNNFVKSSD 258
+ + T + + I ++I +L K ++ + K N I + + + S +
Sbjct: 665 QTLLNTSKNEYEKLEFMKSDNIDNIIKNLKKELQNLLSLKENIIKKQLNNIEQDISNSLN 724
Query: 259 QVINTVHDVRETTQTFQEVGQKIDHLLSDFSSKMK-------------------SKETSA 299
Q +D++ + + ++E +K++ ++ +E
Sbjct: 725 QYTIKYNDLKSSIEEYKEEEEKLEVYKHQIINRKNEFILHLYENDKDLPDGKNTYEEFLQ 784
Query: 300 FLENIADSTSNMRSSISAIREITDQRQKIISTINTI--------ENITSNLNDSSQKFAE 351
+ + I + + + + I+ ++E Q ++++ N + E L QKF
Sbjct: 785 YKDTILNKENKISNDINILKENKKNNQDLLNSYNILIQKLEAHTEKNDEELKQLLQKFPT 844
Query: 352 LMSKINNISALKENNSLFKDAQRAMHTFRDTSEKINRYIPSIG--NNLQNFSQSGLNDIQ 409
+N LKE F + + + E +N+ I I N N S S ++
Sbjct: 845 EDENLN----LKELEKEFNENNQIVDNIIKDIENMNKNINIIKTLNIAINRSNSNKQLVE 900
Query: 410 NLVRKLQETVNHFDD 424
+L+ + N +
Sbjct: 901 HLLNNKIDLKNKLEQ 915
Score = 29.8 bits (67), Expect = 1.6
Identities = 49/258 (18%), Positives = 96/258 (37%), Gaps = 36/258 (13%)
Query: 211 SDSKNTFNTITDLITSLDKMIKAID------------LQKVNQILENIQVSSNNFVKSSD 258
S K+ ++ I S+D K LQ +N I E + +NNF D
Sbjct: 467 SIMKSFYDLIISEKDSMDSKEKKESSDSNYQEKVDELLQIINSIKEKNNIVNNNFKNIED 526
Query: 259 QVI------NTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKSK-ETSAFLENIADSTSNM 311
I N + + E + + Q I+ L+ D K K + ++ I ++ ++
Sbjct: 527 YYITIEGLKNEIEGLIELIKYYL---QSIETLIKDEKLKRSMKNDIKNKIKYIEENVDHI 583
Query: 312 RSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKINN------ISALKEN 365
+ IS EI + Q+I IN + E + I N + L +
Sbjct: 584 KDIISLNDEIDNIIQQIEELINEALFNKEKFINEKNDLQEKVKYILNKFYKGDLQELLDE 643
Query: 366 NSLFKDAQRAMHTFRDTSEKINRYIPSIGNNLQNFSQSGLNDIQNLVRKLQETVNHF--- 422
S F D + ++ + E + + + N + ++I N+++ L++ + +
Sbjct: 644 LSHFLDDHKYLYHEAKSKEDLQTLLNTSKNEYEKLEFMKSDNIDNIIKNLKKELQNLLSL 703
Query: 423 -----DDCLNNFERNPQD 435
LNN E++ +
Sbjct: 704 KENIIKKQLNNIEQDISN 721
Score = 27.9 bits (62), Expect = 5.6
Identities = 45/270 (16%), Positives = 100/270 (37%), Gaps = 33/270 (12%)
Query: 152 ENTSKKISDSSRHIQKIIENIE-KPLTTTIANIETISTVLANNISHIDKMMHTTQVTPHS 210
+ KI + H KI ++E K + I IE I ++N I+K + +
Sbjct: 1329 KQVEAKIEQAKEHKNKIYGSLEDKQIDDEIKKIEQIKEEISNKRKEINKYLSNIKSNKEK 1388
Query: 211 SDSKNTFNTITDLITSLDKMIKAIDLQKVNQILENIQVSSNNFVKSSDQVINTVHDVRET 270
D + + + ID ++ +E N +K +D + E
Sbjct: 1389 CDLH---------VRNASRGKDKIDFLNKHEAIEPSNSKEVNIIKITDNINKCKQYSNEA 1439
Query: 271 TQTFQEVGQKIDHLLSDFSSKMKSKETSAFLENIADSTSNM--RSSISAIREITDQRQKI 328
+T + + D ++ +N+ SSI + ++++K
Sbjct: 1440 METENKADENNDSII-----------------KYEKEITNILNNSSILGKKTKLEKKKKE 1482
Query: 329 ISTINT-IENITSNLNDSSQKFAELMSKINNISALKENNSLFKDAQ--RAMHTFRDTSEK 385
+ I I S + K +E ++++N +K + + + A T + +
Sbjct: 1483 ATNIMDDINGEHSIIKTKLTKSSEKLNQLNEQPNIKREGDVLNNDKSTIAYETIQYNLGR 1542
Query: 386 INRYIPSIGNNLQNFSQSGLNDIQNLVRKL 415
+ + +I N+++ ++ LN Q+L+R +
Sbjct: 1543 VKHNLLNI-LNIKDEIETILNKAQDLMRDI 1571
>gnl|CDD|183759 PRK12804, PRK12804, flagellin; Provisional.
Length = 301
Score = 33.8 bits (77), Expect = 0.10
Identities = 19/66 (28%), Positives = 35/66 (53%), Gaps = 5/66 (7%)
Query: 316 SAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKINNISALKE-----NNSLFK 370
AI++++DQR K+ + N +E+ +NL S + S+I ++ KE N++
Sbjct: 220 GAIKQVSDQRSKLGAVQNRLEHTINNLGASGENLTAAESRIRDVDMAKEMSEFTKNNILS 279
Query: 371 DAQRAM 376
A +AM
Sbjct: 280 QASQAM 285
>gnl|CDD|148180 pfam06419, COG6, Conserved oligomeric complex COG6. COG6 is a
component of the conserved oligomeric golgi complex,
which is composed of eight different subunits and is
required for normal golgi morphology and localisation.
Length = 615
Score = 33.4 bits (77), Expect = 0.13
Identities = 36/200 (18%), Positives = 78/200 (39%), Gaps = 29/200 (14%)
Query: 235 DLQKVNQILENIQVSSNNFVKSSDQVINTVHDVRETTQTFQEVGQKIDHLLSDFSS-KMK 293
+ V + L+ I + + +++ + +++ ++ T+ ++ ++ L+ ++K
Sbjct: 39 EFGPVAEQLKRIGEDLSKLNNTCEEMKSQLNNAKQDTK---DLLEEASTLMEQKRQIELK 95
Query: 294 SKETSAFLENIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNL--NDSSQKFAE 351
K AFL S + S I D+ +++ + I L ++ + E
Sbjct: 96 KKLLKAFLSRFTLSEDELDVLTSGEEPIDDEFFQVLDKVKKIHEDCEILLGGENQRAGLE 155
Query: 352 LMSKINNI--SALKENNSLFKDAQRAMHTFRDTSEKINRYIPSIGNNLQNFSQSGLNDIQ 409
+M K+N +A N L++ QR FR NL++ ++
Sbjct: 156 IMEKMNRNLEAA---NEKLYRWTQR---EFR-------------NLNLESPQILST--LR 194
Query: 410 NLVRKLQETVNHFDDCLNNF 429
+R L E F CL+ F
Sbjct: 195 KALRVLAERPVLFQSCLDEF 214
>gnl|CDD|150562 pfam09903, DUF2130, Uncharacterized protein conserved in bacteria
(DUF2130). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 267
Score = 33.4 bits (77), Expect = 0.13
Identities = 17/85 (20%), Positives = 40/85 (47%), Gaps = 5/85 (5%)
Query: 274 FQEVGQKIDHLLSDFSSKMKSKETSAFLENIADSTSNMRSSISAIREITDQRQKIISTIN 333
QE + + + +F + + + A ++N + ++ I AI + + +K I +
Sbjct: 178 EQENVKDKNIDIYNFEKEFRQFKE-AIVKNFELAQKDLDKEIKAIDKAWKKLEKQIEAV- 235
Query: 334 TIENITSNLNDSSQKFAEL-MSKIN 357
+ + T+NL ++ K A L + K+
Sbjct: 236 -LGS-TNNLRGANNKIAGLTIKKLT 258
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 33.3 bits (76), Expect = 0.14
Identities = 52/321 (16%), Positives = 114/321 (35%), Gaps = 48/321 (14%)
Query: 132 NQRAMITATPSGINYFISNAENTSKKISDSSRHIQKIIENIEKPLTTTIANIETISTVLA 191
NQ + N ++ + E+ KKI + S++I+++ I + L + + I L
Sbjct: 353 NQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEILKIQEIDPDAIKKELN 412
Query: 192 NNISHIDKMMHTTQVTPHSSDSKNTFNTITDLITSLDKMIKAIDLQKVNQILENIQV--- 248
+ + SS + I L +LD +L + ++L V
Sbjct: 413 EINVKLQDI---------SSKVSSLNQRIRALRENLD------ELSRNMEMLNGQSVCPV 457
Query: 249 --------SSNNFVKSSDQVINTVH-DVRETTQTFQEVGQKIDHLLS--DFSSKMKSKET 297
SN+ + ++ + + +RE +++ +KI L ++ + ++
Sbjct: 458 CGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKS 517
Query: 298 SAFLENIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAE---LMS 354
I + +++ I E+ D+ K N +++ L D K ++
Sbjct: 518 INEYNKIESARADLEDIKIKINELKDKHDKYEEIKNRYKSL--KLEDLDSKRTSWLNALA 575
Query: 355 KINNI------SALKENNSLFKDAQRAMHTFRDTSEKINRYIPSI-------GNNLQNFS 401
I+ I S E D + + YI NNL N
Sbjct: 576 VISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEANNLNN-K 634
Query: 402 QSGLNDIQNLVRKLQETVNHF 422
+ + + + L+ KL+ ++++
Sbjct: 635 YNEIQENKILIEKLRGKIDNY 655
Score = 32.2 bits (73), Expect = 0.28
Identities = 46/285 (16%), Positives = 106/285 (37%), Gaps = 19/285 (6%)
Query: 142 SGINYFISNAENTSKKISDSSRHIQKIIENIE-KPLTTTIANIETISTVLANNISHIDKM 200
S + I E K I + ++K E +E + + +I I + A+ K+
Sbjct: 479 SRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNKIESARADLEDIKIKI 538
Query: 201 MHTTQVTPHSSDSKNTFNTI---------TDLITSLDKMIKAIDLQKVNQILENIQVSSN 251
+ KN + ++ T + +L +I ID++ I+ N
Sbjct: 539 NELKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLNAL-AVISLIDIETNRSRSNEIKKQLN 597
Query: 252 NFVKSSDQVINTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKS-KETSAFLENIADSTSN 310
+ ++ D + ++I++ ++ ++K +E +E + N
Sbjct: 598 DLESRLQEIEIGFPDDKSYIDKSI---REIENEANNLNNKYNEIQENKILIEKLRGKIDN 654
Query: 311 MRSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKINNISALKE-NNSLF 369
+ I+ I I ++I S IN IE+ NL S + + + + + E +
Sbjct: 655 YKKQIAEIDSIIPDLKEITSRINDIED---NLKKSRKALDDAKANRARLESTIEILRTRI 711
Query: 370 KDAQRAMHTFRDTSEKINRYIPSIGNNLQNFSQSGLNDIQNLVRK 414
+ ++ +T E + + +IG+ + + + ++RK
Sbjct: 712 NELSDRINDINETLESMKKIKKAIGDLKRLREAFDKSGVPAMIRK 756
Score = 28.7 bits (64), Expect = 3.5
Identities = 50/297 (16%), Positives = 103/297 (34%), Gaps = 28/297 (9%)
Query: 136 MITATPSGINYFISNAENTSKKISDSSRHIQKIIENIEKPLTTTIANIETISTVLANNIS 195
M+ A S I+Y E K + +I+K I + EK + T+ IE +S N +
Sbjct: 177 MLRAEISNIDYL----EEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMD 232
Query: 196 HIDKMMHTTQVTPHSSDSKNTFNTITDLITSLDKMIKAIDLQKVNQILENIQVSSNNFVK 255
+ + + N +++ D+ + IK + ++ +N N+ K
Sbjct: 233 DYN----------NLKSALNELSSLEDMKNRYESEIKTAESDLSMELEKN------NYYK 276
Query: 256 SSDQVINTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKSKETSAFLENIADSTSNMRSSI 315
++ + V + +++ F K + L NI + + I
Sbjct: 277 ELEERHMKI--------INDPVYKNRNYINDYFKYKNDIENKKQILSNIDAEINKYHAII 328
Query: 316 SAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKINNISALKENNSLFKDAQRA 375
+ + I + +++ + + + + S + +I +LK+ +
Sbjct: 329 KKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIER 388
Query: 376 MHTFRDTSEKINRYIPSIGNNLQNFSQSGLNDIQNLVRKLQETVNHFDDCLNNFERN 432
M F KI P N L DI + V L + + + L+ RN
Sbjct: 389 MSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRN 445
>gnl|CDD|177394 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 32.3 bits (74), Expect = 0.25
Identities = 29/177 (16%), Positives = 68/177 (38%), Gaps = 25/177 (14%)
Query: 218 NTITDLITSLDKMIKAI--DLQ-KVNQILENIQVSSNNFVKSSDQVINTVHDVRETTQTF 274
T I K Q K ++++E + + +D+++N V D+ + +
Sbjct: 198 KTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAAL 257
Query: 275 QEVGQKIDHLLSDFSSKMKS--KETSAFLEN---------IADSTSNMRSSISAIREITD 323
++ + SK++ K + + I++ + ++E+
Sbjct: 258 NKLNTA----AAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQH 313
Query: 324 QRQKIISTINTIENITSNLNDSSQKFAELMSKINN----ISALKENNSLFKDAQRAM 376
+K+ + I+ +E I N+ S+K EL +KI+ + L + K + A+
Sbjct: 314 SLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKA---KKVKAAI 367
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. These
proteins are found in P. falciparum, P. vivax and P.
yoelii.
Length = 2757
Score = 31.6 bits (71), Expect = 0.41
Identities = 59/311 (18%), Positives = 113/311 (36%), Gaps = 54/311 (17%)
Query: 152 ENTSKKISDSSRHIQKIIENIEKPLTTTIANI-ETISTVLANNISHIDKMMHTTQVTPHS 210
E + I K I NIE + T+I NI + I + NI ++ ++ +
Sbjct: 1025 EKATNDIEQKIEDANKNIPNIEIAIHTSIYNIIDEIEKEIGKNIELLN-----KEILEEA 1079
Query: 211 SDSKNTFNTI--------------------TDLITSLDKMIKAIDLQKVNQI--LENIQV 248
+ FN I D I + IK +D + + I LE I+
Sbjct: 1080 EINITNFNEIKEKLKHYNFDDFGKEENIKYADEINKIKDDIKNLDQKIDHHIKALEEIKK 1139
Query: 249 SSNNFVKSSDQVINTVHDVRETT----------QTFQEVGQKID----------HLLSDF 288
S N++ IN + DV + + + + KID LL++
Sbjct: 1140 KSENYIDEIKAQINDLEDVADKAISNDDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEI 1199
Query: 289 SSKMKSKETSAFLENIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDSSQK 348
+ K K + ++ I S + + +I ++++K I +E +L++ +K
Sbjct: 1200 AEIEKDKTSLEEVKGINLSYGKNLGKL-FLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEK 1258
Query: 349 FAELMSKINNISALKENNSLFKDAQRAMHTFRDTSEKINRYIPSIGNNL-----QNFSQS 403
E+ +++ +K F + S+K + I I +S
Sbjct: 1259 SPEIENEMGIEMDIKAEMETFNISHDDDKDHHIISKKHDENISDIREKSLKIIEDFSEES 1318
Query: 404 GLNDIQNLVRK 414
+NDI+ ++K
Sbjct: 1319 DINDIKKELQK 1329
Score = 30.4 bits (68), Expect = 0.96
Identities = 37/166 (22%), Positives = 76/166 (45%), Gaps = 10/166 (6%)
Query: 201 MHTTQVTPHSSDSKNTFNTITDLITSLDKMIKAIDLQKVNQILENI-----QVSS--NNF 253
M T V H S+ +N N + D+I + K I + +N+ILE+ ++S+ N++
Sbjct: 715 METATVELHLSNIENKKNELLDIIVEIKKHIHGEINKDLNKILEDFKNKEKELSNKINDY 774
Query: 254 VKSSDQVINTVHDVRETTQTFQEVGQKIDHLLSDFSSKMKSKETSAFLENIADSTSNMRS 313
K D++ + E + + ID+ + D +K ++ +++ I+ +
Sbjct: 775 AKEKDELNKYKSKISEIKNHYND-QINIDN-IKDEDAKQNYDKSKEYIKTISIKEDEIFK 832
Query: 314 SISAIREITDQ-RQKIISTINTIENITSNLNDSSQKFAELMSKINN 358
I+ ++ + D K+ IN N ++ ++FAEL +KI
Sbjct: 833 IINEMKFMKDDFLNKVDKFINFENNCKEKIDSEHEQFAELTNKIKA 878
Score = 29.6 bits (66), Expect = 1.6
Identities = 48/237 (20%), Positives = 102/237 (43%), Gaps = 35/237 (14%)
Query: 158 ISDSSRHIQKIIENIEKPLTTTIANIETISTVLANNISHIDKMMHTTQVTPHSSDSKNTF 217
++D+++ +IE EK T I N+ + T+ NN + Q+ +S+ +
Sbjct: 2217 LADATKDHNNLIEK-EKEATKIINNLTELFTIDFNNADADILHNNKIQIIYFNSELHKSI 2275
Query: 218 NTITDLITSLDKMIKAIDLQKVNQILENIQVSSNNFVK-SSDQVINTVHDVRETTQTFQE 276
+I L + K +++ +N+ +I +N ++ ++ ++D++E Q +
Sbjct: 2276 ESIKKLYKKI-NAFKLLNISHINEKYFDISKEFDNIIQLQKHKLTENLNDLKEIDQYISD 2334
Query: 277 VGQKIDHLLSDFSS-----------KMKSKETSAF-LENIADSTS-NMRSSISAIREITD 323
H L++ ++ + ++E A +ENI + + N+ I I ++T+
Sbjct: 2335 KKNIFLHALNENTNFNFNALKEIYDDIINRENKADEIENINNKENENIMQYIDTITKLTE 2394
Query: 324 QRQKIISTINTIEN-------------------ITSNLNDSSQKFAELMSKINNISA 361
+ Q I+ + T EN I NL + Q F E+++KI+ I A
Sbjct: 2395 KIQDILIFVTTYENDNNIIKQHIQDNDENDVSKIKDNLKKTIQSFQEILNKIDEIKA 2451
>gnl|CDD|148118 pfam06314, ADC, Acetoacetate decarboxylase (ADC). This family
consists of several acetoacetate decarboxylase (ADC)
proteins (EC:4.1.1.4).
Length = 239
Score = 30.8 bits (70), Expect = 0.71
Identities = 11/57 (19%), Positives = 18/57 (31%), Gaps = 11/57 (19%)
Query: 22 FFSIYWLSRSNQYDGPMAEVIIRIPGSVDGLSTDSSVRFNGIPVGRIVGLFLDQEYP 78
+ S+ GP E I +P VR+ G +++D E
Sbjct: 52 AVEFARMPDSSFGFGPYTEAGIVVP-----------VRYGGKEGAYCHFMYVDNEAA 97
>gnl|CDD|148076 pfam06248, Zw10, Centromere/kinetochore Zw10. Zw10 and rough deal
proteins are both required for correct metaphase
check-pointing during mitosis. These proteins bind to
the centromere/kinetochore.
Length = 571
Score = 30.4 bits (69), Expect = 1.0
Identities = 30/154 (19%), Positives = 60/154 (38%), Gaps = 18/154 (11%)
Query: 222 DLITSLDKMIKAIDLQKVNQILENIQVSSNNFVKSSDQ--VINTVHDVRETTQTFQEVGQ 279
D T L K + ++ N+ +++ V++ N + S V+ T + +E + E +
Sbjct: 357 DNDTDLLKYAEDVNTHFANKKCQDVLVTARNLMTSDIHNTVLVTPDEAKEGERLGNENEK 416
Query: 280 KIDHLLSDFSSKMKSKETSAFLENIADSTSNMRSSISAIREIT----DQRQKIISTINTI 335
+ L M S+ ++ + D T + E T ++ TI I
Sbjct: 417 HLSPLSFLLPRCMVSESVQQLMK-LLDKT---------LSEATFSSEGCADQLAYTIRNI 466
Query: 336 ENITSNLNDSSQKFAELMSKINNISALKENNSLF 369
+ + +L+ KI +SAL NN ++
Sbjct: 467 --LQLYYDVVPTYHKKLLLKIPQLSALFHNNCMY 498
>gnl|CDD|162740 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent.
Length = 1164
Score = 30.0 bits (68), Expect = 1.2
Identities = 24/124 (19%), Positives = 47/124 (37%), Gaps = 18/124 (14%)
Query: 316 SAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKI--------NNISALKENNS 367
+E++D +KI IE + ++ EL + N S LKE +
Sbjct: 706 ELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEA 765
Query: 368 LFKDAQRAMHTFRDTSEKINRYIPSIGNNLQNFSQSGLNDIQNLVRKLQETVNHFDDCLN 427
++ + +H + + + S S + +IQ + KL+E V+ + L
Sbjct: 766 RIEELEEDLHKLEEALNDLEARL----------SHSRIPEIQAELSKLEEEVSRIEARLR 815
Query: 428 NFER 431
E+
Sbjct: 816 EIEQ 819
>gnl|CDD|173321 PTZ00021, PTZ00021, falcipain-2; Provisional.
Length = 489
Score = 29.4 bits (66), Expect = 2.0
Identities = 29/135 (21%), Positives = 60/135 (44%), Gaps = 21/135 (15%)
Query: 280 KIDHLLSDFSSKMKSKETSAFLENIADSTSNMRSSISAIREITDQRQKIISTINTIENIT 339
K+ L+S + + ++ET E + + +++ S + R+ + N +
Sbjct: 95 KLQELISSYDKEKNNQETKNSGELKTFADFDGKNNKSV----ENFRKNFGNLKNVKKVYL 150
Query: 340 SNLNDSSQKFAELMSKINNISA----LKENNSLFKDA---QRAMHTFRDTSEKINRYIPS 392
N DS KF LM+ + N+++ +KE+ ++ Q+ +F + KIN +
Sbjct: 151 INFADS--KF--LMTNLENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAH--- 203
Query: 393 IGNNLQNFS-QSGLN 406
NN +N + G+N
Sbjct: 204 --NNKENVLYKKGMN 216
>gnl|CDD|162917 TIGR02550, flagell_flgL, flagellar hook-associated protein 3. This
protein family consists of flagellar hook-associated
proteins designated FlgL (or HAP3) encoded in bacterial
flagellar operons. A N-terminal region of about 150
residues and a C-terminal region of about 85 residues
are conserved. Members show considerable length
heterogeneity between these two well-conserved terminal
regions; members of the family vary between 287 to over
500 residues in length. This model distinguishes FlgL
from the flagellin gene product FliC.
Length = 306
Score = 28.9 bits (65), Expect = 3.2
Identities = 17/86 (19%), Positives = 32/86 (37%), Gaps = 3/86 (3%)
Query: 182 NIETISTVLANNISHIDKMMHTTQVTPHSSDSKNTFNTITDLITSLDKMIKAIDLQKVNQ 241
+ +A+N S ++ K+ FNT +LI +L + A+
Sbjct: 158 DGGQRKVQVASNRSMPVGHPGDKVFMSIPNNDKSVFNTGDNLIDALKSLEDALSSGDA-- 215
Query: 242 ILENIQVSSNNFVKSSDQVINTVHDV 267
+ S N K+ D V++ +V
Sbjct: 216 -QAALSASLNELDKALDNVLSARAEV 240
>gnl|CDD|184341 PRK13810, PRK13810, orotate phosphoribosyltransferase; Provisional.
Length = 187
Score = 28.7 bits (64), Expect = 3.4
Identities = 16/69 (23%), Positives = 34/69 (49%)
Query: 286 SDFSSKMKSKETSAFLENIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDS 345
S F +K ++ LE++ S ++R +I +RE + +I+ ++ E NL ++
Sbjct: 113 SRFVGDLKPEDRIVMLEDVTTSGGSVREAIEVVREAGAYIKYVITVVDREEGAEENLKEA 172
Query: 346 SQKFAELMS 354
+ L+S
Sbjct: 173 DVELVPLVS 181
>gnl|CDD|162462 TIGR01647, ATPase-IIIA_H, plasma-membrane proton-efflux P-type
ATPase. This model describes the plasma membrane proton
efflux P-type ATPase found in plants, fungi, protozoa,
slime molds and archaea. The best studied representative
is from yeast.
Length = 755
Score = 28.4 bits (64), Expect = 3.4
Identities = 16/47 (34%), Positives = 22/47 (46%), Gaps = 2/47 (4%)
Query: 135 AMITATPSGINYFISNAENTSKKISDSSRHIQKIIENIEKPLTTTIA 181
A++TAT G+N F A + S H+QKI+ I L I
Sbjct: 171 AVVTAT--GMNTFFGKAAALVQSTETGSGHLQKILSKIGLFLIVLIG 215
>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
Length = 815
Score = 28.6 bits (64), Expect = 3.4
Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 12/94 (12%)
Query: 269 ETTQTFQEVGQKIDHLLSDFSSKMKS--KETSAF-LENIADSTSNMRSSISAIREITDQR 325
E ++ +E GQK + + DFS+K S T F + +I + + S++ +R+I
Sbjct: 508 EFVKSEKETGQKAEAMWLDFSNKWFSLVHSTRPFVIRDIDELADHAASALETVRDIN--- 564
Query: 326 QKIISTINTIENIT-SNLNDS-SQKFAELMSKIN 357
+ S + I ++T S L+D S ++ +L I+
Sbjct: 565 --VASRL--IGDMTGSTLDDPLSDRYKKLGCSIS 594
>gnl|CDD|168168 PRK05674, PRK05674, gamma-carboxygeranoyl-CoA hydratase;
Validated.
Length = 265
Score = 28.6 bits (64), Expect = 3.5
Identities = 12/38 (31%), Positives = 26/38 (68%)
Query: 23 FSIYWLSRSNQYDGPMAEVIIRIPGSVDGLSTDSSVRF 60
F+ WLSR+++ + A++I + ++D + +D+S+RF
Sbjct: 16 FATLWLSRADKNNAFNAQMIRELILALDQVQSDASLRF 53
>gnl|CDD|177432 PHA02607, wac, fibritin; Provisional.
Length = 454
Score = 28.5 bits (64), Expect = 4.2
Identities = 17/77 (22%), Positives = 31/77 (40%), Gaps = 4/77 (5%)
Query: 214 KNTFNTITDLITSLDKMIKAIDLQKVNQILENIQVSSNNFVKSSDQVINTVHDVRETTQT 273
KN + + DK+ + +D VN I EN+ V + V DQ+ V D+ +
Sbjct: 52 KNVEQLDENTKKTKDKINEVVDD--VNTIQENLDVIGDISV--IDQINQNVADIEVLKKD 107
Query: 274 FQEVGQKIDHLLSDFSS 290
+ K+ ++
Sbjct: 108 VSDTTDKLAGTTNEVDE 124
>gnl|CDD|149682 pfam08702, Fib_alpha, Fibrinogen alpha/beta chain family.
Fibrinogen is a protein involved in platelet aggregation
and is essential for the coagulation of blood. This
domain forms part of the central coiled coiled region of
the protein which is formed from two sets of three
non-identical chains (alpha, beta and gamma).
Length = 146
Score = 27.7 bits (62), Expect = 6.3
Identities = 24/95 (25%), Positives = 41/95 (43%), Gaps = 12/95 (12%)
Query: 165 IQKIIENIEKPLTTTIANIETISTVLANN-------ISHIDKMMHTTQVTPHSSDSKNTF 217
IQ +++ EK + I ++E + LAN+ + HI + Q + + N +
Sbjct: 23 IQDLLDKYEKDVDKRIEDLENLLDQLANSTSSAHQYVKHIKDSLRGDQ--KQAQPNDNIY 80
Query: 218 NTITDLITSLDKMIKAIDLQKVNQILENIQVSSNN 252
N + SL KMI+ I K+N I+V
Sbjct: 81 NAYSK---SLRKMIEYILETKINTQESQIRVLQEV 112
>gnl|CDD|163138 TIGR03110, exosort_Gpos, exosortase family protein. Members of
this protein family are found in a modest number of
non-pathogenic Gram-positive bacteria, including three
species of Lactococcus and three paralogs in
Clostridium acetobutylicum. This protein appears
related to the conserved core region of a family of
proposed transpeptidases, exosortase (previously EpsH),
thought to act on PEP-CTERM proteins, and members of
the seed alignment include all proposed active site
residues. However, in contrast to mainline exosortase
(TIGR02602) and archaeal (TIGR03762), and
cyanobacterial (TIGR03763) variants, this family has
not yet been matched to a cognate PEP-CTERM-like
sorting signal.
Length = 187
Score = 27.7 bits (62), Expect = 6.8
Identities = 10/22 (45%), Positives = 15/22 (68%)
Query: 9 SVGLFVVSILFFSFFSIYWLSR 30
SVGLF + FF + +++LSR
Sbjct: 31 SVGLFFILTFFFRNYLVWFLSR 52
>gnl|CDD|150104 pfam09325, Vps5, Vps5 C terminal like. Vps5 is a sorting nexin
that functions in membrane trafficking. This is the C
terminal dimerization domain.
Length = 236
Score = 27.2 bits (61), Expect = 8.1
Identities = 37/211 (17%), Positives = 73/211 (34%), Gaps = 35/211 (16%)
Query: 197 IDKMMHTTQVTPHSSDSKNT-----FNTITDLITSLDKMIKAIDLQKVNQILENIQVSSN 251
+ + + S K F I SL+ L+K+ + LE +
Sbjct: 1 LKGLFGKFTSSVSKSTYKFNEPDPWFEDKRQYIDSLES-----QLKKLYKALELLVNQRK 55
Query: 252 NFVKSSD---QVINTVHDVRETT------QTFQEVGQKIDHLLSDFSSKMKSKETSAFLE 302
++ + + + + +T EV ++I LL + + + E
Sbjct: 56 ELASATGEFAKSLAMLASLELSTGLSRALSQLAEVEERIKELLERQALQ----DVLTLGE 111
Query: 303 NIADSTSNMRSSISAIREITDQRQKIISTINTIENITSNLNDSSQKFAELMSKINN---- 358
I + +R I +++ + +QR K E +L+ + A+L +
Sbjct: 112 TIDE---YLRL-IGSVKAVFNQRVKAWQQWQNAEQ---DLSKKRAQLAKLKGAGKSRQDK 164
Query: 359 -ISALKENNSLFKDAQRAMHTFRDTSEKINR 388
A KE + + Q+A F D SE I +
Sbjct: 165 LQQAKKEVDEAERKVQQAKKEFEDISETIKK 195
>gnl|CDD|150441 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 792
Score = 27.4 bits (61), Expect = 8.3
Identities = 16/93 (17%), Positives = 33/93 (35%), Gaps = 9/93 (9%)
Query: 150 NAENTSKKISDSSRHIQKIIENIEKPL--------TTTIANIETISTVLANNISHIDKMM 201
++ + S + + I IEN+ K L T I+ I A + ++++
Sbjct: 484 SSSLGIRDKSGTRKDILSAIENVYKLLLELEDLERTMPPPAIDEIMEWRAEGQALVEQLW 543
Query: 202 HTTQVTPHSSDSKNTFNTITDLITSLDKMIKAI 234
V D + + ++ S K K +
Sbjct: 544 EELHVMEPIEDDSLSPHPFISML-SYPKGKKIL 575
>gnl|CDD|180134 PRK05567, PRK05567, inosine 5'-monophosphate dehydrogenase;
Reviewed.
Length = 486
Score = 27.1 bits (61), Expect = 8.6
Identities = 11/38 (28%), Positives = 20/38 (52%), Gaps = 2/38 (5%)
Query: 343 NDSSQKFAELMSKINNISALKENNSLFKDAQRAMHTFR 380
D SQ +E+M+K ++ E +L ++A +H R
Sbjct: 143 TDLSQPVSEVMTKERLVTV-PEGTTL-EEALELLHEHR 178
>gnl|CDD|182099 PRK09831, PRK09831, putative acyltransferase; Provisional.
Length = 147
Score = 27.2 bits (60), Expect = 8.7
Identities = 18/54 (33%), Positives = 22/54 (40%), Gaps = 10/54 (18%)
Query: 55 DSSVR---FNGIPVGRIVG-------LFLDQEYPNHSLAKALIRPDTPLYPSTT 98
S VR N PVG I LF+D EY +A AL++P T
Sbjct: 52 KSQVRVAVINAQPVGFITCIEHYIDMLFVDPEYTRRGVASALLKPLIKSESELT 105
Database: CddB
Posted date: Feb 4, 2011 9:54 PM
Number of letters in database: 5,994,473
Number of sequences in database: 21,608
Lambda K H
0.314 0.129 0.350
Gapped
Lambda K H
0.267 0.0695 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21608
Number of Hits to DB: 6,779,275
Number of extensions: 421045
Number of successful extensions: 1274
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1245
Number of HSP's successfully gapped: 97
Length of query: 452
Length of database: 5,994,473
Length adjustment: 97
Effective length of query: 355
Effective length of database: 3,898,497
Effective search space: 1383966435
Effective search space used: 1383966435
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.6 bits)