RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddB
21,608 sequences; 5,994,473 total letters
Searching..................................................done
Query= gi|254780512|ref|YP_003064925.1| flagellar biosynthesis protein
FlhA [Candidatus Liberibacter asiaticus str. psy62]
(692 letters)
>gnl|CDD|183749 PRK12792, flhA, flagellar biosynthesis protein FlhA; Reviewed.
Length = 694
Score = 978 bits (2531), Expect = 0.0
Identities = 409/678 (60%), Positives = 537/678 (79%), Gaps = 4/678 (0%)
Query: 19 DFAFSFCIVLIICILFLPIPTVLLDVGLASSIALSILILMVALWIEKPLEFSSFPTVLLI 78
DF F+ IV I+ +LFLP+P VL+D+GLA SIALS+LILMVALWI++PLEFS+FPTVLLI
Sbjct: 17 DFGFAVGIVAILAVLFLPVPAVLIDIGLAFSIALSVLILMVALWIQRPLEFSAFPTVLLI 76
Query: 79 VTIIRLSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLVVFMILITINFIV 138
T++RL+LNIATTR ILS G EG AAG +IAGFS VMSGDFVIGLVVF ILIT+NF+V
Sbjct: 77 ATLLRLALNIATTRLILSNGQEGVDAAGHVIAGFSQFVMSGDFVIGLVVFAILITVNFLV 136
Query: 139 ITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMDG 198
ITKGATRIAEVGARFTLDAIPGKQMAIDADLS+GLI+++EA+RRR+ELEEESAFFG+MDG
Sbjct: 137 ITKGATRIAEVGARFTLDAIPGKQMAIDADLSAGLIDDKEAQRRRRELEEESAFFGSMDG 196
Query: 199 ASKFVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALII 258
ASKFVRGDAIAS+II A+NI GGI+IG R+ M + AADVF +LSVGDGLVSQ+PALI+
Sbjct: 197 ASKFVRGDAIASLIIIAVNIFGGIIIGVTRHGMPLGQAADVFTKLSVGDGLVSQIPALIV 256
Query: 259 SLSAAFLVSRTTSKGSTNTAIVEQLSHYPRALLISAFFMIVLSVMPNLPAFPFIMLGGFF 318
SL+A LVS+ ++GS A++ QL YPRAL ++A M VL+++P LP PF +LGG
Sbjct: 257 SLAAGLLVSKGGTRGSAEQAVLGQLGAYPRALSVAALLMFVLAIVPGLPFLPFALLGGVM 316
Query: 319 ACAGFYVPYKNELQRLAKVAQI----QEASKQNQHSAQLNFITSGVELVLGSLVSNRLLS 374
A + +P + + A+ A++ + A + + S + T+ +EL LG ++ +L
Sbjct: 317 AFVAYTIPRRRAARAAAEAAKVKREEESAQAEAKDSVKEQLRTAEIELCLGKQLAAQLQG 376
Query: 375 SQEDLFLRVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIRVYGTTVAISELRVGEVL 434
+ +L RV+K+RRKFA QYGF+VPEIK+T +SLP K Y I+++GT VA ELR GE+L
Sbjct: 377 AHAELAHRVAKMRRKFAKQYGFVVPEIKLTDSLSLPPKTYQIKIHGTVVATQELRPGELL 436
Query: 435 VILGSGQKPTFPGDEVKEPAFGMPAFAIMESFSDDLRRQGFHPIDNLAVVLTHLSEVIRN 494
V++G G +P PG+EV+EPAFGM A + ++F++++RR GF P+DN +V+LTHLSEVIRN
Sbjct: 437 VVVGDGPRPDVPGEEVREPAFGMKALWVPDAFANEVRRDGFEPVDNASVLLTHLSEVIRN 496
Query: 495 NLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLKLLLAEHVSIRNLPLIL 554
NL QLLSYKD++ L+ RLDPEY++L ++ C S ISYSG+QAVLKLLLAE VSIRNL LIL
Sbjct: 497 NLPQLLSYKDMRALLDRLDPEYKRLIDDICPSQISYSGLQAVLKLLLAERVSIRNLHLIL 556
Query: 555 ESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAPTGILNILKLGNHWDMIFYQAIQRDSK 614
E++AE+APH+R+ I E VR+R+AQQICGDL+ G+L +L+LGN WD+ F+Q+++RD+K
Sbjct: 557 EAVAEIAPHARRAEQIAEHVRMRIAQQICGDLSDNGVLKVLRLGNRWDLAFHQSLKRDAK 616
Query: 615 GECVDFNVEPRAVEMFSENATNSIRQYIDKGIPLTIVTLPEIRSYIRMILERNFPSLAVL 674
GE V+F+++PR VE F A+ +IR+ +D+G +VT PE R Y+RMI+ER FP+L VL
Sbjct: 617 GEVVEFDIDPRLVEQFGTEASEAIRERMDQGHQFVLVTAPEARPYVRMIIERLFPTLPVL 676
Query: 675 SHMEIAKGLKVNILGTIS 692
SH+EIA+G+++ LGTIS
Sbjct: 677 SHLEIARGVEIKSLGTIS 694
>gnl|CDD|180350 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
Length = 697
Score = 880 bits (2277), Expect = 0.0
Identities = 285/696 (40%), Positives = 433/696 (62%), Gaps = 6/696 (0%)
Query: 1 MIQSLIGISDSNDRNHLHDFAFSFCIVLIICILFLPIPTVLLDVGLASSIALSILILMVA 60
+ +++ + + D A + ++ I+ ++ LP+P LLD+ L +IALS+LIL+VA
Sbjct: 2 NLAAMLRLPGNLKLLKWRDLAVAILVLAILAMMILPLPPFLLDLLLTFNIALSVLILLVA 61
Query: 61 LWIEKPLEFSSFPTVLLIVTIIRLSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGD 120
L+I++PL+FS+FPT+LLI T++RL+LN+A+TR IL GHEG AAG +I F V+ G+
Sbjct: 62 LFIQRPLDFSAFPTLLLITTLLRLALNVASTRLILLEGHEGTDAAGKVIEAFGHFVVGGN 121
Query: 121 FVIGLVVFMILITINFIVITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAK 180
FV+G+VVF+IL+ INF+VITKGA RIAEV ARFTLDA+PGKQMAIDADL++GLI+EEEAK
Sbjct: 122 FVVGIVVFIILVIINFVVITKGAGRIAEVAARFTLDAMPGKQMAIDADLNAGLIDEEEAK 181
Query: 181 RRRKELEEESAFFGAMDGASKFVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVF 240
+RRKEL++E+ F+GAMDGASKFV+GDAIA I+IT INI+GG++IG ++ MS AA+ +
Sbjct: 182 KRRKELQQEADFYGAMDGASKFVKGDAIAGILITVINIIGGLIIGVVQHGMSFGEAAETY 241
Query: 241 VRLSVGDGLVSQVPALIISLSAAFLVSRTTSKGSTNTAIVEQLSHYPRALLISAFFMIVL 300
L++GDGLVSQ+PAL+IS +A +V+R +S G IV QL P+AL I+A + +L
Sbjct: 242 TLLTIGDGLVSQIPALLISTAAGIIVTRVSSDGDVGEQIVGQLFANPKALYIAAGVLFLL 301
Query: 301 SVMPNLPAFPFIMLGGFFACAGFYVPYKNELQRLAKVAQIQEASKQNQHSAQLNFITSG- 359
++P +P PF++L G + + + + + +E + +
Sbjct: 302 GLVPGMPHLPFLLLAGLLGFLAYRLRKREKKAAELAAEEAEEEEAAEPEEESWDDVLPVD 361
Query: 360 -VELVLGSLVSNRLLSSQ-EDLFLRVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIR 417
+EL +G + + +Q +L R+ IR+K A + GF+VP +++ ++ LP Y I+
Sbjct: 362 PLELEVGYGLIPLVDENQGGELLDRIRSIRKKIAQELGFLVPPVRIRDNLQLPPNEYRIK 421
Query: 418 VYGTTVAISELRVGEVLVILGSGQKPTFPGDEVKEPAFGMPAFAIMESFSDDLRRQGFHP 477
+ G V ELR G +L + G PG KEPAFG+PA I E+ + + G+
Sbjct: 422 IKGVEVGSGELRPGRLLAMNPGGVDGELPGIPTKEPAFGLPAVWIDEALREQAQLLGYTV 481
Query: 478 IDNLAVVLTHLSEVIRNNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVL 537
+D VV THL+EVI+N+ ++LL ++V+ L+ RL EY KL EE +S +Q VL
Sbjct: 482 VDPSTVVATHLTEVIKNHAAELLGRQEVQQLLDRLAKEYPKLVEELVPKVLSLGTLQKVL 541
Query: 538 KLLLAEHVSIRNLPLILESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAP-TGILNILK 596
+ LL E VSIR+L ILE++A+ AP ++ + E VR R+ +QI G L ++
Sbjct: 542 QNLLKERVSIRDLRTILETLADYAPITKDPDELTEHVRQRLGRQIVQQYKGEDGELPVIT 601
Query: 597 LGNHWDMIFYQAIQRDSKGECVDFNVEPRAVEMFSENATNSIRQYIDKGIPLTIVTLPEI 656
L + + Q++Q G +EP E ++ ++ + KG P ++ P +
Sbjct: 602 LDPELEQLLLQSLQGTGGGS--YLALEPGLAERLLQSLQEALERQEMKGEPPVLLVSPAL 659
Query: 657 RSYIRMILERNFPSLAVLSHMEIAKGLKVNILGTIS 692
R Y+R +LER P L VLS+ EI +++ I+GT+
Sbjct: 660 RPYLRRLLERFLPQLPVLSYNEIPDNIEIRIVGTVG 695
>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA. This model
describes flagellar biosynthesis protein FlhA, one of a
large number of genes associated with the biosynthesis
of functional bacterial flagella. Homologs of many such
proteins, including FlhA, function in type III protein
secretion systems. A separate model describes InvA
(Salmonella enterica), LcrD (Yersinia enterocolitica),
HrcV (Xanthomonas), etc., all of which score below the
noise cutoff for this model.
Length = 678
Score = 796 bits (2059), Expect = 0.0
Identities = 318/679 (46%), Positives = 448/679 (65%), Gaps = 9/679 (1%)
Query: 19 DFAFSFCIVLIICILFLPIPTVLLDVGLASSIALSILILMVALWIEKPLEFSSFPTVLLI 78
D + +V I+ ++ LP+P LLD+ LA +IALS+LIL+V L+I+KPL+FSSFPT+LLI
Sbjct: 3 DLLLAIGVVAILAVMILPLPAFLLDILLALNIALSLLILLVTLFIQKPLDFSSFPTLLLI 62
Query: 79 VTIIRLSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLVVFMILITINFIV 138
T+ RLSLN+A+TR IL+ GHEG AAG +I F V+ G++VIGL+VF+ILI +NFIV
Sbjct: 63 ATLFRLSLNVASTRLILTHGHEGPNAAGKVIEAFGQFVVGGNYVIGLIVFIILIIVNFIV 122
Query: 139 ITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMDG 198
ITKGATRIAEV ARFTLDA+PGKQMAIDADL++GLI EEEAK+RR+ELE+E+ F+GAMDG
Sbjct: 123 ITKGATRIAEVAARFTLDAMPGKQMAIDADLNAGLITEEEAKKRREELEQEADFYGAMDG 182
Query: 199 ASKFVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALII 258
ASKFV+GDAIA IIIT INI+GG++IG ++ MS+ AA + L++GDGLV+Q+PALII
Sbjct: 183 ASKFVKGDAIAGIIITLINIIGGLIIGVVQHGMSLSDAASTYTILTIGDGLVAQIPALII 242
Query: 259 SLSAAFLVSRTTSKGSTNTAIVEQLSHYPRALLISAFFMIVLSVMPNLPAFPFIMLGGFF 318
S + +V+R +S+GS AIV QL PRALLI A + +L+++P LP FPF+ L G
Sbjct: 243 STATGLIVTRASSEGSFGKAIVTQLGANPRALLIVAAVLGLLALVPGLPTFPFLFLAGAL 302
Query: 319 ACAGFYVPY----KNELQRLAKVAQIQEASKQNQHSAQLNFITSGVELVLGSLVSNRLLS 374
A +Y+ + E A AQ +EA+++ + S +EL LG + +
Sbjct: 303 AFLAWYLRRRSKQEEEAAAEAAKAQ-EEAAEEEEESINDILALDDLELELGYGLIPLVDD 361
Query: 375 SQ-EDLFLRVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIRVYGTTVAISELRVGEV 433
SQ DL R+ IR++ A +YGF++P I++ ++ LP Y I++ G VA ELR G+
Sbjct: 362 SQGGDLLDRIRSIRKQLAQEYGFVMPVIRIRDNLRLPPNEYRIKIKGVEVARGELRPGKY 421
Query: 434 LVILGSGQKPTFPGDEVKEPAFGMPAFAIMESFSDDLRRQGFHPIDNLAVVLTHLSEVIR 493
L + PG+E +EPAFG+PA+ I E ++ R G+ +D V+ THLSEVI+
Sbjct: 422 LAMNPGNADGEIPGEETREPAFGLPAYWISEKNKEEAERLGYTVVDPATVLATHLSEVIK 481
Query: 494 NNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLKLLLAEHVSIRNLPLI 553
NN ++LL+ ++V+NL+ RL EY KL EE + IQ VL+LLL E VSIRNLP I
Sbjct: 482 NNAAELLTRQEVQNLLDRLKEEYPKLVEELIPDKVPLGTIQKVLQLLLRERVSIRNLPTI 541
Query: 554 LESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAPT-GILNILKLGNHWDMIFYQAIQRD 612
LE++A+ AP ++ +VE VR R+ +QI G+L ++ L + +A+QRD
Sbjct: 542 LETLADYAPITKDPDLLVEHVRQRLGRQITQQYLDEDGVLPVITLDPDLEAALAEALQRD 601
Query: 613 SKGECVDFNVEPRAVEMFSENATNSIRQYIDKGIPLTIVTLPEIRSYIRMILERNFPSLA 672
+GE +D EP +E ++ + + G ++T P +R Y+R ILER FP L
Sbjct: 602 GEGELLDL--EPALLEELVRAVRKAVEKLANNGERPVLLTSPRVRPYVRRILERFFPELP 659
Query: 673 VLSHMEIAKGLKVNILGTI 691
VLS+ EI ++V +G +
Sbjct: 660 VLSYNEIPDNVRVETVGVV 678
>gnl|CDD|162339 TIGR01399, hrcV, type III secretion protein, HrcV family. Members
of this family are closely homologous to the flagellar
biosynthesis protein FlhA (TIGR01398) and should all
participate in type III secretion systems. Examples
include InvA (Salmonella enterica), LcrD (Yersinia
enterocolitica), HrcV (Xanthomonas), etc. Type III
secretion systems resemble flagellar biogenesis systems,
and may share the property of translocating special
classes of peptides through the membrane.
Length = 677
Score = 454 bits (1169), Expect = e-128
Identities = 231/683 (33%), Positives = 370/683 (54%), Gaps = 17/683 (2%)
Query: 18 HDFAFSFCIVLIICILFLPIPTVLLDVGLASSIALSILILMVALWIEKPLEFSSFPTVLL 77
D + ++ II ++ LP+PT+L+D+ +A +I +S+L+LM+A++I +PL S+FP+VLL
Sbjct: 3 SDLVLALLLLAIISMMILPLPTLLVDILIAINITISVLLLMIAIYIPRPLALSTFPSVLL 62
Query: 78 IVTIIRLSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLVVFMILITINFI 137
I T+ RL+L+I+TTR IL + AG II F V+ G+ +GLV+F+I+ + FI
Sbjct: 63 ITTLFRLALSISTTRLILL-----HADAGNIIEAFGQFVVGGNLAVGLVIFLIITIVQFI 117
Query: 138 VITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMD 197
VITKG+ R+AEV ARF+LDA+PGKQM+IDADL +G+I+ +EA+RRR LE+ES +GAMD
Sbjct: 118 VITKGSERVAEVSARFSLDAMPGKQMSIDADLRAGVIDADEARRRRSTLEKESQLYGAMD 177
Query: 198 GASKFVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALI 257
GA KFV+GDAIA III INI+GGI IG ++ MS A ++ L++GDGLVSQ+PAL+
Sbjct: 178 GAMKFVKGDAIAGIIIVLINIIGGISIGVTQHGMSASEALHLYTVLTIGDGLVSQIPALL 237
Query: 258 ISLSAAFLVSRTTSKGSTNTA--IVEQLSHYPRALLISAFFMIVLSVMPNLPAFPFIMLG 315
IS++A +V+R + N I QL+ PRALL++A ++ +++P P F +L
Sbjct: 238 ISVTAGIIVTRVPGEAERNLGREIGHQLTSQPRALLLAAVLLLGFALIPGFPLLVFALLA 297
Query: 316 GFFACAGFY--VPYKNELQRLAKVAQIQEASKQNQHSAQLNFITSGVELVLGSLVSNRLL 373
A AG+ ++ + A AS + N L +S L
Sbjct: 298 VLLAAAGYLLSRRKRSRAKANKAQASGAVASAPGAAAPIKNLDPFAEACPLILRLSPDLQ 357
Query: 374 SS--QEDLFLRVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIRVYGTTVAISELRVG 431
SS ++ L + ++R G +P I + SLP+ + I +Y V + G
Sbjct: 358 SSADKDTLDQEIERMRWALFEDLGIPLPGIIIRVGDSLPDNEFRILLYEVPVLRDTIPPG 417
Query: 432 EVLVILGSGQKPTF-PGDEVKEPAFGM-PAFAIMESFSDDLRRQGFHPIDNLAVVLTHLS 489
V + G + G + E ++ L+ G + V+ L
Sbjct: 418 HVALNDGVDNIEVAGIPAISGKRWPGESQRVWVTEEGAEKLQGAGLGYFSDSQVITHRLK 477
Query: 490 EVIRNNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLKLLLAEHVSIRN 549
+ N + + ++ + L+ +++ EY +L +E + I VL+ L++E VSIRN
Sbjct: 478 ATLLRNAQEFIGIQETRYLLDQMEREYPELVKEV-QRVLPLQRIAEVLQRLVSEQVSIRN 536
Query: 550 LPLILESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAPTG-ILNILKLGNHWDMIFYQA 608
L LILE++ E A + + E VRI + + IC A G L+ + + + + A
Sbjct: 537 LRLILETLIEWAQREKDVVMLTEYVRIALKRYICHRYANGGRQLSAVLIDPEIEELIRGA 596
Query: 609 IQRDSKGECVDFNVEPRAVEMFSENATNSIRQYIDKGIPLTIVTLPEIRSYIRMILERNF 668
I++ S G ++P E + ++ ++T +IR Y+R ++E F
Sbjct: 597 IRQTSTGTY--LALDPDDSEQLLDQIRQAVGDLPRAPSQPVLLTSMDIRRYVRKMIESEF 654
Query: 669 PSLAVLSHMEIAKGLKVNILGTI 691
P L VLS+ E+ + ++V +LG I
Sbjct: 655 PDLPVLSYQELGEEIEVQVLGRI 677
>gnl|CDD|185237 PRK15337, PRK15337, type III secretion system protein InvA;
Provisional.
Length = 686
Score = 406 bits (1046), Expect = e-114
Identities = 219/690 (31%), Positives = 367/690 (53%), Gaps = 48/690 (6%)
Query: 26 IVLIICILFLPIPTVLLDVGLASSIALSILILMVALWIEKPLEFSSFPTVLLIVTIIRLS 85
+V+II +L +P+PT L+D + +I L+IL+ M + +I++ L FS+FP++LLI T+ RL+
Sbjct: 21 MVMIIAMLIIPLPTYLVDFLIGLNIVLAILVFMGSFYIDRILSFSTFPSILLITTLFRLA 80
Query: 86 LNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLVVFMILITINFIVITKGATR 145
L+I+T+R IL AG II F V+ V+G V+F I+ + FIVITKG+ R
Sbjct: 81 LSISTSRLILLDAD-----AGEIITTFGQFVIGDSLVVGFVIFSIVTVVQFIVITKGSER 135
Query: 146 IAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMDGASKFVRG 205
+AEV ARF+LD +PGKQM+IDADL +G+I+ + K RR LE ES +G+ DGA KF++G
Sbjct: 136 VAEVAARFSLDGMPGKQMSIDADLKAGIIDADGVKERRSVLERESQLYGSFDGAMKFIKG 195
Query: 206 DAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALIISLSAAFL 265
DAIA III +N++GGI +G ++ M + A + L++GDGLV+Q+PAL+IS+SA F+
Sbjct: 196 DAIAGIIIIFVNLIGGISVGMTQHGMDLSSALSTYTILTIGDGLVAQIPALLISISAGFI 255
Query: 266 VSRTTSKGSTNT---AIVEQLSHYPRALLISAFFMIVLSVMPNLPAFPF----IMLGGFF 318
V+R G ++ I+ QL P L+++A + + ++P P F ++LG F
Sbjct: 256 VTRVN--GDSDNLGRNIMSQLLSNPFVLVVTAILALSIGLLPGFPLPVFLLLAVVLGVLF 313
Query: 319 ACAGFYVPYKNELQRLAKVAQIQ-EASKQNQHSAQL-----NFITSGVELVLGSLVSNRL 372
F K+ + + + ++ S L I V L+L + R
Sbjct: 314 YFKKFRKKKKSAAEPDTSGGEAPLDIDEKAGSSLGLIGDLDKVIPETVPLILLVPEARRP 373
Query: 373 LSSQEDLFLRVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIRVYGTTVAISELRVGE 432
+E+L ++R +F + YG +P+I + L + + V I+E+R +
Sbjct: 374 DLEKENL---AERLRSQFFIDYGVRLPDILLRYSEGLDDN----SI---VVLINEIRAAQ 423
Query: 433 VLVILGSGQKPTFPGDEVKEPAF--------GMPAFAIMESFSDDLRRQGFHPIDNLAVV 484
I + DE+ G + + ++ L + G+ + +
Sbjct: 424 -FTIYFDLHRVVNYSDELVSLGINPTIIDSGGEQYYWVPHEDTEKLAKLGYVLRSAIDEL 482
Query: 485 LTHLSEVIRNNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLKLLLAEH 544
LS ++ +N+++ ++ K+L+ +L+ +Y L +E H + I VL+ LL+E
Sbjct: 483 YHCLSVLLLHNINEFFGIQETKHLLDQLEKKYPDLLKE-VYRHATVQRISEVLQRLLSER 541
Query: 545 VSIRNLPLILESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAPTGILNILKLGNHWDMI 604
+SIRN+ LI+E++A AP + +VE VR +A+ IC A G L + L +
Sbjct: 542 ISIRNMKLIMEALALWAPREKDVIMLVEHVRGALARYICHKFAAGGELRAVVLSAEVEDA 601
Query: 605 FYQAIQRDSKGECVDFNVEPRAVEMFSEN---ATNSIRQYIDKGIPLTIVTLPEIRSYIR 661
+ I++ S G + N++P E + A + + + L V +IR +++
Sbjct: 602 IRKGIRQTSGGTFL--NLDPAESENLMDLLTLALDDLGIAHRDIVLLVSV---DIRRFVK 656
Query: 662 MILERNFPSLAVLSHMEIAKGLKVNILGTI 691
++E FP L VLS EIA ++VN++ TI
Sbjct: 657 KLIEGRFPELEVLSFGEIADSVEVNVIKTI 686
>gnl|CDD|183699 PRK12720, PRK12720, secretion system apparatus protein SsaV;
Provisional.
Length = 675
Score = 369 bits (950), Expect = e-102
Identities = 202/678 (29%), Positives = 354/678 (52%), Gaps = 27/678 (3%)
Query: 18 HDFAFSFCIVLIICILFLPIPTVLLDVGLASSIALSILILMVALWIEKPLEFSSFPTVLL 77
D + +++ + ++ LP+PT ++D+ +A ++ S+++LM+A+++ PLEFS FP++LL
Sbjct: 15 QDIVLAVMLLVAVFMMILPLPTWMVDILIAINLMFSVILLMIAIYLRDPLEFSVFPSLLL 74
Query: 78 IVTIIRLSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLVVFMILITINFI 137
I T+ RL+L I+T+R +L H+ AG I+ F V+ G+ +GL+VF I+ + FI
Sbjct: 75 ITTLYRLALTISTSRLVL-LQHD----AGEIVDAFGKFVVGGNLAVGLIVFTIITIVQFI 129
Query: 138 VITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMD 197
VITKG+ R+AEV ARF+LD +PGKQM+ID D+ +G+I+ +EA+R R+ +++ES GAMD
Sbjct: 130 VITKGSERVAEVSARFSLDGMPGKQMSIDGDMRAGVIDADEARRLRQHVQKESRLLGAMD 189
Query: 198 GASKFVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALI 257
GA KFV+GDAIA II+ +NI+GGI+IG ++DMS A + + LS+GDGL Q+P+L+
Sbjct: 190 GAMKFVKGDAIAGIIVILVNIIGGIIIGVMQHDMSASEAVNTYAVLSIGDGLCGQIPSLL 249
Query: 258 ISLSAAFLVSRTTSKGSTNTA--IVEQLSHYPRALLISAFFMIVLSVMPNLPAFPFIMLG 315
IS++A +V+R + N A + Q+ P+AL ++A +++ +++P P F L
Sbjct: 250 ISITAGIIVTRVPGEKRQNLANELSSQIGRQPQALWLAAVVLMLFALIPGFPFITFAFLA 309
Query: 316 GFFACAGFYVPYKNELQRLAKVAQIQEASKQNQHSAQLNFITSGVELVLGSLVSNRLLSS 375
A + K + A EA + + + L+L +
Sbjct: 310 ALVAAPAILLRRKKSVVS----ANGVEAGGSEEGPEGDSMVPGACPLMLRLAPTLHSADL 365
Query: 376 QEDLFLRVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIRVYGTTVAISELRVGEVLV 435
D+ +R G +PE+ + D L EK T+ +Y V L + +
Sbjct: 366 IRDI----DALRWFLFEDLGVPLPEVNIEVDPELTEKTLTVLLYQEPVL--SLSLPPQAL 419
Query: 436 ILGSGQKPTFPGDEVKEPAFGMPAFA-IMESFSDDLRRQGFHPIDNLAVVLTHLSEVIRN 494
+L G + GD P GM + + ++ + G + L V+
Sbjct: 420 LLLIGPDASLVGDSQTLP-NGMGQICWLTKDQAEQAQGFGLDVFAGSQRISALLKCVLLR 478
Query: 495 NLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLKLLLAEHVSIRNLPLIL 554
+ + + ++ + L+ ++ Y +L +E + I +L+ L++E VSIR+L I
Sbjct: 479 YMGEFIGVQETRYLMDAMEKRYGELVKEL-QRQLPVGKIAEILQRLVSERVSIRDLRTIF 537
Query: 555 ESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAPTG-ILNILKLGNHWDMIFYQAIQRDS 613
++ E AP + + E VRI + + I G L +L++G + + ++I++ S
Sbjct: 538 GTLVEWAPREKDVVMLTEYVRIALRRHILRRFNHEGKWLPVLRIGEGIENLIRESIRQTS 597
Query: 614 KGECVDFNVEPRAVEMFSENATNSIRQYIDKGIPLTIVTLPEIRSYIRMILERNFPSLAV 673
G + S I Q + + L +VT ++R ++R I+ER L V
Sbjct: 598 AGTYSALSSRH------STQILQLIEQALKQSQKLVLVTSVDVRRFLRKIIERTLFDLPV 651
Query: 674 LSHMEIAKGLKVNILGTI 691
LS E+ ++ ++G+I
Sbjct: 652 LSWQELGDEAEIKVVGSI 669
>gnl|CDD|168293 PRK05910, PRK05910, type III secretion system protein; Validated.
Length = 584
Score = 207 bits (529), Expect = 6e-54
Identities = 178/674 (26%), Positives = 288/674 (42%), Gaps = 114/674 (16%)
Query: 26 IVLI-ICILF---LPIPTVLLDVGLASSIALSILILMVALWIEKPLEFSSFPTVLLIVTI 81
+V I +CILF P+P LLD GL + ALS+ I+ + FP++ L + +
Sbjct: 12 MVAIPLCILFTLIFPLPQWLLDFGLCINFALSLSIVFWVFSLRSSASARLFPSLFLYLCL 71
Query: 82 IRLSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLVVFMILITINFIVITK 141
+RL LN+A+TR ILS G A +I + G L ++L +NF+VITK
Sbjct: 72 LRLGLNLASTRWILS-----SGWASPLIFSLGNFFSLGSLPAALTACLLLFLVNFLVITK 126
Query: 142 GATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMDGASK 201
G+ RIAEV ARF+L+A+PGKQM++DADL SG +++ L EES FF AM+G +
Sbjct: 127 GSERIAEVRARFSLEALPGKQMSLDADLVSGRASYSRVSKQKNSLLEESDFFSAMEGVFR 186
Query: 202 FVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALIISLS 261
FV+GDAI S I+ +NI+ +G + +A +GD LVSQVPAL+ S +
Sbjct: 187 FVKGDAIMSCILLGVNILAATFLGR-----ATGYAVGDLWLTVLGDALVSQVPALLTSCA 241
Query: 262 AAFLVSRTTSKGSTNTAIVEQLSHYPRALLISAFFMIVLSVMPNLPAFPFIMLGGFFACA 321
AA L+S+ K S +++ + A + L+ +P P P + F+
Sbjct: 242 AATLISKVGEKESLLQHLLDYYEQSRQHFRFIALLLCSLACIPGAPKAPILG----FSVL 297
Query: 322 GFYVPYKNELQRLAKVAQIQEASKQNQHSAQLNFITSGVELVLGSLVSNRLLSSQEDLFL 381
++ YKN + + L F E V +L + + + +L+
Sbjct: 298 -LFLAYKNP-------SSGET----------LLFQKERFEFVELAL-PDEGVGNPANLYR 338
Query: 382 RVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIRVYGTTVAISELRVGEVLVILGSGQ 441
R + + G + PE + E + G V + EL +L L +
Sbjct: 339 AA---REEIFQELGVVFPE---EIVVRHVESSPRLIFSGQEVYLRELSCPAILPSLRNLA 392
Query: 442 KPTFPGDEVKEPAFGMPAFAIMESFSDDLRRQGFHPIDNLAVVLTHLSEVIRNNLSQLLS 501
VK ++ + I+ ++ +SE N+L LL
Sbjct: 393 PEAISERFVK-------------RLVEEFQEVAGISIEE--IIPKKISE---NSLVFLL- 433
Query: 502 YKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLKLLLAEHVSIRNLPLILESIAEVA 561
+ L E S H+ P ILE+IA
Sbjct: 434 ---------------RALVRERVSLHL---------------------FPKILEAIAVYG 457
Query: 562 PHSRKTSHIVEQVRIRMAQQICGDLAPTGILNILKLGNHWDMIFYQAIQRDSKGECVDFN 621
+ + +VE+VR + +QI L N D + + I DS E +
Sbjct: 458 SQGKSSEELVEKVRKYLGKQIG-----------RSLWNRQDTL--EVITIDSHVEQFIRD 504
Query: 622 VEPRAVEMFSENATNSIRQYIDK---GIPLTIVTLPEIRSYIRMILERNFPSLAVLSHME 678
++ +E ++ +++ G IVT E R ++ +++ FP L VLSH E
Sbjct: 505 SYSKSNPDMNEKVVAQVKSLLERSGEGNFRAIVTGCETRFELKKMVDPYFPDLLVLSHSE 564
Query: 679 IAKGLKVNILGTIS 692
+ + + +++LG +S
Sbjct: 565 LPEEIPISLLGAVS 578
>gnl|CDD|177434 PHA02611, 51, baseplate hub assembly protein; Provisional.
Length = 249
Score = 32.4 bits (74), Expect = 0.36
Identities = 13/57 (22%), Positives = 30/57 (52%), Gaps = 1/57 (1%)
Query: 486 THLSEVIRNNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSH-ISYSGIQAVLKLLL 541
T LSE + L +S ++++ ++ ++P +L C+ + Y+ + + KLL+
Sbjct: 147 TELSEETQEALIDAISIEELEKIVKSMNPIKLELKISCCNRRDLKYTDLLELFKLLV 203
>gnl|CDD|147902 pfam05994, FragX_IP, Cytoplasmic Fragile-X interacting family.
CYFIP1/2 (Cytoplasmic fragile X mental retardation
interacting protein) like proteins for a highly
conserved protein family. The function of CYFIPs is
unclear, but CYFIP interaction with fragile X mental
retardation interacting protein (FMRP) involves the
domain of FMRP which also mediating homo- and
heteromerization.
Length = 855
Score = 32.0 bits (73), Expect = 0.58
Identities = 36/225 (16%), Positives = 78/225 (34%), Gaps = 45/225 (20%)
Query: 477 PIDNLAVVLTHLSEVIRNNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAV 536
ID ++ L+E+ R NL +S + +L S E ++L + +
Sbjct: 379 SIDLNRLIAQRLNELFRENLDAAISRFESNDLTSI--VELERLLD--------------I 422
Query: 537 LKL---LLAEHVSIRN-----LPLILESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAP 588
LKL LL+E + + + + +P SR H+ ++ D P
Sbjct: 423 LKLTHQLLSEFLIPLDPFDDMFSEANHNDSPNSPSSRILLHV--------FSELNNDFLP 474
Query: 589 TGILNILKL-GNHWDMIFYQAIQRDSKGECVDFNVEPRAVEMFSENATNSIRQYIDKGIP 647
LN + ++F + + R+ + + ++ Y
Sbjct: 475 NYFLNSTTMRFVRAKIVFSEELPREKPPIVQPI--YLLGTKDLNAAFATVLKLYT----- 527
Query: 648 LTIVTLPEIRSYIRMILERNFPSLAVLSHMEIAKGLKVNILGTIS 692
+ + R+ R++ + +AVL E+ K ++ + +
Sbjct: 528 -GFIGIEHFRALCRLL---GYGGIAVL-IQELLKIIESKLEEALL 567
>gnl|CDD|163153 TIGR03142, cytochro_ccmI, cytochrome c-type biogenesis protein
CcmI. This TPR repeat-containing protein is the CcmI
protein (also called CycH) of c-type cytochrome
biogenesis. CcmI is thought to act as an apo-cytochrome
c chaperone. This model describes the N-terminal region
of the protein, Members of this protein family.
Length = 117
Score = 31.8 bits (73), Expect = 0.61
Identities = 16/66 (24%), Positives = 34/66 (51%), Gaps = 3/66 (4%)
Query: 125 LVVFMILITINFIVIT--KGATRIAEVGARFTLDAIPGKQMA-IDADLSSGLIEEEEAKR 181
+ + L+ + F+++ + A V A+ ++A ++ DL+ GL++E EA+
Sbjct: 5 VAALLTLVALLFLLLPLLRRRRAAATVDRDELNLAVYRDRLAELERDLAEGLLDEAEAEA 64
Query: 182 RRKELE 187
R EL+
Sbjct: 65 ARAELQ 70
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 31.3 bits (72), Expect = 0.74
Identities = 15/78 (19%), Positives = 32/78 (41%), Gaps = 7/78 (8%)
Query: 479 DNLAVVLTHLSEVIRNNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLK 538
L + + + N S+L S + ++ + L+ +Y ++ E I+YS +Q L+
Sbjct: 327 KELKEEIDRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELE 386
Query: 539 LLL-------AEHVSIRN 549
+L E +
Sbjct: 387 EILKQLEEIEKEQEKLSE 404
>gnl|CDD|149595 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j has been shown to be required
for processing of 20S pre-rRNA and binds to 18S rRNA and
eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 30.8 bits (70), Expect = 1.3
Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 2/53 (3%)
Query: 161 KQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMD--GASKFVRGDAIASI 211
++ A+ +E K R ++LEEES A D G + + S+
Sbjct: 77 EEKALRELEEITPEDELAEKLRLQKLEEESDLKNAADLFGGPAPTKDTDLESL 129
>gnl|CDD|184040 PRK13412, fkp, bifunctional
fucokinase/L-fucose-1-P-guanylyltransferase;
Provisional.
Length = 974
Score = 29.8 bits (67), Expect = 2.5
Identities = 45/186 (24%), Positives = 65/186 (34%), Gaps = 48/186 (25%)
Query: 513 DPEYQKLAEETCSSHI-SYSGIQAVLKLLLAEHVSIRNLPLILESIAEVAPHSRKTSHIV 571
PEY+ H+ Y+GI K +LAE V + L S A + +H +
Sbjct: 821 QPEYRD-------CHLLYYTGITRTAKGILAEIVR----SMFLNSTAHLQLLHEMKAHAL 869
Query: 572 EQVRIRMAQQI-CGDLAPTGILNILKLGNHWDMIFYQAIQRDSKGECVDFNVEPRAVEMF 630
+ M + I G+ G L +G W+ + +D P AVE
Sbjct: 870 D-----MYEAIQRGEFEEFGRL----VGKTWEQ-----------NKALDSGTNPAAVEAI 909
Query: 631 SENATNSIRQYI---------DKGIPLTIVTLPEIRSYIRMILERNFP-SLAVLSHMEIA 680
E I+ Y G + P IR IL N P A M ++
Sbjct: 910 IE----LIKDYTLGYKLPGAGGGGYLYMVAKDPGAAERIRKILTENAPNPRARFVDMSLS 965
Query: 681 -KGLKV 685
KGL+V
Sbjct: 966 DKGLQV 971
>gnl|CDD|150431 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 28.9 bits (65), Expect = 4.4
Identities = 15/71 (21%), Positives = 27/71 (38%), Gaps = 20/71 (28%)
Query: 134 INFIVITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFF 193
IN+I + K + E+ A F L E+ R ++LE + +
Sbjct: 106 INYIKLKK-VVNLEELSAEFGLKT-------------------EDVIERIQDLEAQGSIT 145
Query: 194 GAMDGASKFVR 204
G +D K++
Sbjct: 146 GVIDDRGKYIY 156
>gnl|CDD|150924 pfam10328, 7TM_GPCR_Srx, Serpentine type 7TM GPCR chemoreceptor
Srx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srx is part of the Srg
superfamily of chemoreceptors. Chemoperception is one of
the central senses of soil nematodes like C. elegans
which are otherwise 'blind' and 'deaf'.
Length = 275
Score = 28.3 bits (64), Expect = 6.1
Identities = 34/151 (22%), Positives = 53/151 (35%), Gaps = 33/151 (21%)
Query: 21 AFSFCIVLIICILFLPIPTVLLD------------VGLASSIALSI---LILMVAL---- 61
A S I+ I LF +P LLD GL I L+++L
Sbjct: 36 AISNAIICTIF-LFYVVPMTLLDLSFLPSLLNSHIGGLILLGLYEISPLTHLLISLNRFC 94
Query: 62 ------WIEKPLEFSSFPTVLLIVTIIRLSLNIATTRAI-----LSFGHEGYGAAGGIIA 110
+K FS T ++I I +++ I+T + E +
Sbjct: 95 AVFFPFKYKK--IFSIKNTKIIITFIWIIAIIISTLFYFPLGCHFYYSPETLTWSFDEDP 152
Query: 111 GFSSLVMSGDFVIGLVVFMILITINFIVITK 141
+ + GDF+ LV+ +I IN I K
Sbjct: 153 PCAEINWYGDFLKNLVLVIITNIINVITFIK 183
>gnl|CDD|162132 TIGR00958, 3a01208, Conjugate Transporter-2 (CT2) Family protein.
Length = 711
Score = 28.5 bits (64), Expect = 6.2
Identities = 34/156 (21%), Positives = 55/156 (35%), Gaps = 15/156 (9%)
Query: 198 GASKFVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALI 257
G ++ A ++ G+ G F Y M+ + +R + L+ Q
Sbjct: 196 GPPALASAIFFMCLLSIASSVSAGLRGGSFNYTMARINLR---IREDLFRSLLRQDLGFF 252
Query: 258 ISLSAAFLVSRTTSKGST-NTAIVEQLSHYPRAL---LISAFFMIVLSVMPNLPAFPFIM 313
L SR +S T + ++ ++ R L L FM+ LS P L I
Sbjct: 253 DENKTGELTSRLSSDTQTMSRSLSLNVNVLLRNLVMLLGLLGFMLWLS--PRLTMVTLIN 310
Query: 314 LGGFFACAGFYVPYKNELQRLAKVAQIQEA-SKQNQ 348
L F + L ++QEA +K NQ
Sbjct: 311 LPLVFLAEKVFGKRYQLLSE-----ELQEAVAKANQ 341
>gnl|CDD|152423 pfam11988, Dsl1_N, Retrograde transport protein Dsl1 N terminal.
Dsl1 is a peripheral membrane protein required for
transport between the Golgi and the endoplasmic
reticulum. It is localized to the ER membrane, and in
vitro it specifically binds to coatomer, the major
component of the protein coat of COPI vesicles. It is
comprised primarily of alpha helical bundles. It
complexes with another subunit of the Dsl1p complex
called Tip20 which forms heterodimers by pairing the N
termini of each protein. A central disorganized region
between the N and C termini of Dsl1 contains binding
sites for coatomer. The C terminus of Dsl1 contains a
binding site to the Sec39 subunit of the Dsl1p complex.
Length = 316
Score = 28.6 bits (64), Expect = 6.3
Identities = 18/60 (30%), Positives = 37/60 (61%), Gaps = 5/60 (8%)
Query: 483 VVLTHLSEVIRNNLSQLLSYKD--VKNLISRLDPEYQKLAEETCSSHISYSG--IQAVLK 538
++T +++++ N S++LS + +KNLIS ++ + KL+E + SS +Y G + +L
Sbjct: 257 AIVTEFTKLVKQNASEILSKEANQIKNLISHINDKLTKLSERSGSSW-TYDGNELDKLLN 315
>gnl|CDD|181140 PRK07843, PRK07843, 3-ketosteroid-delta-1-dehydrogenase; Reviewed.
Length = 557
Score = 28.5 bits (64), Expect = 6.5
Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 5/38 (13%)
Query: 141 KGATRIAEVGARFTLDAIPGKQM-----AIDADLSSGL 173
+G R +VGAR GK + A+ A L GL
Sbjct: 181 RGVLRALKVGARTLWAKATGKNLLGMGQALAAGLRIGL 218
>gnl|CDD|116866 pfam08282, Hydrolase_3, haloacid dehalogenase-like hydrolase. This
family contains haloacid dehalogenase-like hydrolase
enzymes.
Length = 254
Score = 28.0 bits (63), Expect = 8.3
Identities = 13/83 (15%), Positives = 26/83 (31%), Gaps = 25/83 (30%)
Query: 631 SENATNSIRQYIDKGIPLTIVT---LPEIRSYI----------------------RMILE 665
SE +I++ +KGI + I T + +++ +
Sbjct: 17 SERTKEAIKKLQEKGIKVVIATGRPYRGALPVLEELGLDLPVICFNGAYIYDENGKILYK 76
Query: 666 RNFPSLAVLSHMEIAKGLKVNIL 688
V +E K ++IL
Sbjct: 77 NPISKEDVKEIIEYLKENNLSIL 99
>gnl|CDD|162122 TIGR00939, 2a57, Equilibrative Nucleoside Transporter (ENT).
Length = 437
Score = 27.8 bits (62), Expect = 9.1
Identities = 26/112 (23%), Positives = 50/112 (44%), Gaps = 12/112 (10%)
Query: 26 IVLIICILFLPIPTVLLDVGLASSIALSILILMVALWIEKPL--EFSSFPTVLLIVTIIR 83
+L + I + + V L + + ++++++ + + K E F T + V II
Sbjct: 58 SLLFNSLNLFLIFRIPVTVRLLGGLVILLVVVILVMVLVKVQTSETGFFVTTMASVVIIN 117
Query: 84 LSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLV--VFMILIT 133
A+L +G AG + +SS VMSG + G++ + MIL+
Sbjct: 118 SG------MALLQGSL--FGLAGVFPSTYSSAVMSGQGLAGVLTSLAMILVK 161
>gnl|CDD|183259 PRK11652, emrD, multidrug resistance protein D; Provisional.
Length = 394
Score = 27.9 bits (63), Expect = 9.3
Identities = 28/146 (19%), Positives = 53/146 (36%), Gaps = 51/146 (34%)
Query: 28 LIICILF-LPIPTVLLDVGLASSIA-----------LSILILMVALWIEKPLEFSSFPTV 75
+ + ILF LPIP A + L+ + +WI P
Sbjct: 243 MTVSILFILPIPAAFFGAWFAGRPNKRFSTLMWQSVICCLLAGLLMWI---------PGW 293
Query: 76 LLIVTIIRLSLNI---------------ATTRAILSFGHEGYGAAG-----------GII 109
++ + +L + AT+ A+ F + G AG G+
Sbjct: 294 FGVMNV--WTLLVPAALFFFGAGMLFPLATSGAMEPFPYLA-GTAGALLGGLQNIGSGLA 350
Query: 110 AGFSSLV-MSGDFVIGLVVFMILITI 134
A S+++ +G F +GL++ ++ + I
Sbjct: 351 ALLSAMLPQTGQFSLGLLMTLMGLLI 376
>gnl|CDD|185539 PTZ00286, PTZ00286, 6-phospho-1-fructokinase; Provisional.
Length = 459
Score = 27.7 bits (62), Expect = 9.3
Identities = 16/73 (21%), Positives = 26/73 (35%), Gaps = 9/73 (12%)
Query: 193 FGAMDGASKFVRGDAI--ASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSV----- 245
+GA G + D I + I+ +GG ++G R D +R +
Sbjct: 123 YGAKYGYKGLYKEDWIKLDPKDVKTIHRLGGTILGSSRGGFDPKVMVDTLIRHGINILFT 182
Query: 246 --GDGLVSQVPAL 256
GDG A+
Sbjct: 183 LGGDGTHRGALAI 195
>gnl|CDD|162277 TIGR01271, CFTR_protein, cystic fibrosis transmembrane conductor
regulator (CFTR). The model describes the cystis
fibrosis transmembrane conductor regulator (CFTR) in
eukaryotes. The principal role of this protein is
chloride ion conductance. The protein is predicted to
consist of 12 transmembrane domains. Mutations or
lesions in the genetic loci have been linked to the
aetiology of asthma, bronchiectasis, chronic obstructive
pulmonary disease etc. Disease-causing mutations have
been studied by 36Cl efflux assays in vitro cell
cultures and electrophysiology, all of which point to
the impairment of chloride channel stability and not the
biosynthetic processing per se.
Length = 1490
Score = 28.0 bits (62), Expect = 9.9
Identities = 23/92 (25%), Positives = 39/92 (42%), Gaps = 16/92 (17%)
Query: 286 YPRALLISAFFMIVLSVMPNLPAF-PFIMLGGFFACAGFYVPYKNELQR----------- 333
Y A S FF++ LSV+P A I+L F + + + + R
Sbjct: 307 YSSAFFFSGFFVVFLSVVPY--ALIKGIILRRIFTTISYCIVLRMTVTRQFPGAIQTWYD 364
Query: 334 -LAKVAQIQE-ASKQNQHSAQLNFITSGVELV 363
L + +IQ+ K+ + + N T+ VE+V
Sbjct: 365 SLGAITKIQDFLCKEEYKTLEYNLTTTEVEMV 396
Database: CddB
Posted date: Feb 4, 2011 9:54 PM
Number of letters in database: 5,994,473
Number of sequences in database: 21,608
Lambda K H
0.324 0.139 0.392
Gapped
Lambda K H
0.267 0.0684 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21608
Number of Hits to DB: 11,279,725
Number of extensions: 771392
Number of successful extensions: 2724
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2675
Number of HSP's successfully gapped: 86
Length of query: 692
Length of database: 5,994,473
Length adjustment: 100
Effective length of query: 592
Effective length of database: 3,833,673
Effective search space: 2269534416
Effective search space used: 2269534416
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 61 (27.4 bits)