RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780512|ref|YP_003064925.1| flagellar biosynthesis protein FlhA [Candidatus Liberibacter asiaticus str. psy62] (692 letters) >gnl|CDD|183749 PRK12792, flhA, flagellar biosynthesis protein FlhA; Reviewed. Length = 694 Score = 978 bits (2531), Expect = 0.0 Identities = 409/678 (60%), Positives = 537/678 (79%), Gaps = 4/678 (0%) Query: 19 DFAFSFCIVLIICILFLPIPTVLLDVGLASSIALSILILMVALWIEKPLEFSSFPTVLLI 78 DF F+ IV I+ +LFLP+P VL+D+GLA SIALS+LILMVALWI++PLEFS+FPTVLLI Sbjct: 17 DFGFAVGIVAILAVLFLPVPAVLIDIGLAFSIALSVLILMVALWIQRPLEFSAFPTVLLI 76 Query: 79 VTIIRLSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLVVFMILITINFIV 138 T++RL+LNIATTR ILS G EG AAG +IAGFS VMSGDFVIGLVVF ILIT+NF+V Sbjct: 77 ATLLRLALNIATTRLILSNGQEGVDAAGHVIAGFSQFVMSGDFVIGLVVFAILITVNFLV 136 Query: 139 ITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMDG 198 ITKGATRIAEVGARFTLDAIPGKQMAIDADLS+GLI+++EA+RRR+ELEEESAFFG+MDG Sbjct: 137 ITKGATRIAEVGARFTLDAIPGKQMAIDADLSAGLIDDKEAQRRRRELEEESAFFGSMDG 196 Query: 199 ASKFVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALII 258 ASKFVRGDAIAS+II A+NI GGI+IG R+ M + AADVF +LSVGDGLVSQ+PALI+ Sbjct: 197 ASKFVRGDAIASLIIIAVNIFGGIIIGVTRHGMPLGQAADVFTKLSVGDGLVSQIPALIV 256 Query: 259 SLSAAFLVSRTTSKGSTNTAIVEQLSHYPRALLISAFFMIVLSVMPNLPAFPFIMLGGFF 318 SL+A LVS+ ++GS A++ QL YPRAL ++A M VL+++P LP PF +LGG Sbjct: 257 SLAAGLLVSKGGTRGSAEQAVLGQLGAYPRALSVAALLMFVLAIVPGLPFLPFALLGGVM 316 Query: 319 ACAGFYVPYKNELQRLAKVAQI----QEASKQNQHSAQLNFITSGVELVLGSLVSNRLLS 374 A + +P + + A+ A++ + A + + S + T+ +EL LG ++ +L Sbjct: 317 AFVAYTIPRRRAARAAAEAAKVKREEESAQAEAKDSVKEQLRTAEIELCLGKQLAAQLQG 376 Query: 375 SQEDLFLRVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIRVYGTTVAISELRVGEVL 434 + +L RV+K+RRKFA QYGF+VPEIK+T +SLP K Y I+++GT VA ELR GE+L Sbjct: 377 AHAELAHRVAKMRRKFAKQYGFVVPEIKLTDSLSLPPKTYQIKIHGTVVATQELRPGELL 436 Query: 435 VILGSGQKPTFPGDEVKEPAFGMPAFAIMESFSDDLRRQGFHPIDNLAVVLTHLSEVIRN 494 V++G G +P PG+EV+EPAFGM A + ++F++++RR GF P+DN +V+LTHLSEVIRN Sbjct: 437 VVVGDGPRPDVPGEEVREPAFGMKALWVPDAFANEVRRDGFEPVDNASVLLTHLSEVIRN 496 Query: 495 NLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLKLLLAEHVSIRNLPLIL 554 NL QLLSYKD++ L+ RLDPEY++L ++ C S ISYSG+QAVLKLLLAE VSIRNL LIL Sbjct: 497 NLPQLLSYKDMRALLDRLDPEYKRLIDDICPSQISYSGLQAVLKLLLAERVSIRNLHLIL 556 Query: 555 ESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAPTGILNILKLGNHWDMIFYQAIQRDSK 614 E++AE+APH+R+ I E VR+R+AQQICGDL+ G+L +L+LGN WD+ F+Q+++RD+K Sbjct: 557 EAVAEIAPHARRAEQIAEHVRMRIAQQICGDLSDNGVLKVLRLGNRWDLAFHQSLKRDAK 616 Query: 615 GECVDFNVEPRAVEMFSENATNSIRQYIDKGIPLTIVTLPEIRSYIRMILERNFPSLAVL 674 GE V+F+++PR VE F A+ +IR+ +D+G +VT PE R Y+RMI+ER FP+L VL Sbjct: 617 GEVVEFDIDPRLVEQFGTEASEAIRERMDQGHQFVLVTAPEARPYVRMIIERLFPTLPVL 676 Query: 675 SHMEIAKGLKVNILGTIS 692 SH+EIA+G+++ LGTIS Sbjct: 677 SHLEIARGVEIKSLGTIS 694 >gnl|CDD|180350 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated. Length = 697 Score = 880 bits (2277), Expect = 0.0 Identities = 285/696 (40%), Positives = 433/696 (62%), Gaps = 6/696 (0%) Query: 1 MIQSLIGISDSNDRNHLHDFAFSFCIVLIICILFLPIPTVLLDVGLASSIALSILILMVA 60 + +++ + + D A + ++ I+ ++ LP+P LLD+ L +IALS+LIL+VA Sbjct: 2 NLAAMLRLPGNLKLLKWRDLAVAILVLAILAMMILPLPPFLLDLLLTFNIALSVLILLVA 61 Query: 61 LWIEKPLEFSSFPTVLLIVTIIRLSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGD 120 L+I++PL+FS+FPT+LLI T++RL+LN+A+TR IL GHEG AAG +I F V+ G+ Sbjct: 62 LFIQRPLDFSAFPTLLLITTLLRLALNVASTRLILLEGHEGTDAAGKVIEAFGHFVVGGN 121 Query: 121 FVIGLVVFMILITINFIVITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAK 180 FV+G+VVF+IL+ INF+VITKGA RIAEV ARFTLDA+PGKQMAIDADL++GLI+EEEAK Sbjct: 122 FVVGIVVFIILVIINFVVITKGAGRIAEVAARFTLDAMPGKQMAIDADLNAGLIDEEEAK 181 Query: 181 RRRKELEEESAFFGAMDGASKFVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVF 240 +RRKEL++E+ F+GAMDGASKFV+GDAIA I+IT INI+GG++IG ++ MS AA+ + Sbjct: 182 KRRKELQQEADFYGAMDGASKFVKGDAIAGILITVINIIGGLIIGVVQHGMSFGEAAETY 241 Query: 241 VRLSVGDGLVSQVPALIISLSAAFLVSRTTSKGSTNTAIVEQLSHYPRALLISAFFMIVL 300 L++GDGLVSQ+PAL+IS +A +V+R +S G IV QL P+AL I+A + +L Sbjct: 242 TLLTIGDGLVSQIPALLISTAAGIIVTRVSSDGDVGEQIVGQLFANPKALYIAAGVLFLL 301 Query: 301 SVMPNLPAFPFIMLGGFFACAGFYVPYKNELQRLAKVAQIQEASKQNQHSAQLNFITSG- 359 ++P +P PF++L G + + + + + +E + + Sbjct: 302 GLVPGMPHLPFLLLAGLLGFLAYRLRKREKKAAELAAEEAEEEEAAEPEEESWDDVLPVD 361 Query: 360 -VELVLGSLVSNRLLSSQ-EDLFLRVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIR 417 +EL +G + + +Q +L R+ IR+K A + GF+VP +++ ++ LP Y I+ Sbjct: 362 PLELEVGYGLIPLVDENQGGELLDRIRSIRKKIAQELGFLVPPVRIRDNLQLPPNEYRIK 421 Query: 418 VYGTTVAISELRVGEVLVILGSGQKPTFPGDEVKEPAFGMPAFAIMESFSDDLRRQGFHP 477 + G V ELR G +L + G PG KEPAFG+PA I E+ + + G+ Sbjct: 422 IKGVEVGSGELRPGRLLAMNPGGVDGELPGIPTKEPAFGLPAVWIDEALREQAQLLGYTV 481 Query: 478 IDNLAVVLTHLSEVIRNNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVL 537 +D VV THL+EVI+N+ ++LL ++V+ L+ RL EY KL EE +S +Q VL Sbjct: 482 VDPSTVVATHLTEVIKNHAAELLGRQEVQQLLDRLAKEYPKLVEELVPKVLSLGTLQKVL 541 Query: 538 KLLLAEHVSIRNLPLILESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAP-TGILNILK 596 + LL E VSIR+L ILE++A+ AP ++ + E VR R+ +QI G L ++ Sbjct: 542 QNLLKERVSIRDLRTILETLADYAPITKDPDELTEHVRQRLGRQIVQQYKGEDGELPVIT 601 Query: 597 LGNHWDMIFYQAIQRDSKGECVDFNVEPRAVEMFSENATNSIRQYIDKGIPLTIVTLPEI 656 L + + Q++Q G +EP E ++ ++ + KG P ++ P + Sbjct: 602 LDPELEQLLLQSLQGTGGGS--YLALEPGLAERLLQSLQEALERQEMKGEPPVLLVSPAL 659 Query: 657 RSYIRMILERNFPSLAVLSHMEIAKGLKVNILGTIS 692 R Y+R +LER P L VLS+ EI +++ I+GT+ Sbjct: 660 RPYLRRLLERFLPQLPVLSYNEIPDNIEIRIVGTVG 695 >gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA. This model describes flagellar biosynthesis protein FlhA, one of a large number of genes associated with the biosynthesis of functional bacterial flagella. Homologs of many such proteins, including FlhA, function in type III protein secretion systems. A separate model describes InvA (Salmonella enterica), LcrD (Yersinia enterocolitica), HrcV (Xanthomonas), etc., all of which score below the noise cutoff for this model. Length = 678 Score = 796 bits (2059), Expect = 0.0 Identities = 318/679 (46%), Positives = 448/679 (65%), Gaps = 9/679 (1%) Query: 19 DFAFSFCIVLIICILFLPIPTVLLDVGLASSIALSILILMVALWIEKPLEFSSFPTVLLI 78 D + +V I+ ++ LP+P LLD+ LA +IALS+LIL+V L+I+KPL+FSSFPT+LLI Sbjct: 3 DLLLAIGVVAILAVMILPLPAFLLDILLALNIALSLLILLVTLFIQKPLDFSSFPTLLLI 62 Query: 79 VTIIRLSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLVVFMILITINFIV 138 T+ RLSLN+A+TR IL+ GHEG AAG +I F V+ G++VIGL+VF+ILI +NFIV Sbjct: 63 ATLFRLSLNVASTRLILTHGHEGPNAAGKVIEAFGQFVVGGNYVIGLIVFIILIIVNFIV 122 Query: 139 ITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMDG 198 ITKGATRIAEV ARFTLDA+PGKQMAIDADL++GLI EEEAK+RR+ELE+E+ F+GAMDG Sbjct: 123 ITKGATRIAEVAARFTLDAMPGKQMAIDADLNAGLITEEEAKKRREELEQEADFYGAMDG 182 Query: 199 ASKFVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALII 258 ASKFV+GDAIA IIIT INI+GG++IG ++ MS+ AA + L++GDGLV+Q+PALII Sbjct: 183 ASKFVKGDAIAGIIITLINIIGGLIIGVVQHGMSLSDAASTYTILTIGDGLVAQIPALII 242 Query: 259 SLSAAFLVSRTTSKGSTNTAIVEQLSHYPRALLISAFFMIVLSVMPNLPAFPFIMLGGFF 318 S + +V+R +S+GS AIV QL PRALLI A + +L+++P LP FPF+ L G Sbjct: 243 STATGLIVTRASSEGSFGKAIVTQLGANPRALLIVAAVLGLLALVPGLPTFPFLFLAGAL 302 Query: 319 ACAGFYVPY----KNELQRLAKVAQIQEASKQNQHSAQLNFITSGVELVLGSLVSNRLLS 374 A +Y+ + E A AQ +EA+++ + S +EL LG + + Sbjct: 303 AFLAWYLRRRSKQEEEAAAEAAKAQ-EEAAEEEEESINDILALDDLELELGYGLIPLVDD 361 Query: 375 SQ-EDLFLRVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIRVYGTTVAISELRVGEV 433 SQ DL R+ IR++ A +YGF++P I++ ++ LP Y I++ G VA ELR G+ Sbjct: 362 SQGGDLLDRIRSIRKQLAQEYGFVMPVIRIRDNLRLPPNEYRIKIKGVEVARGELRPGKY 421 Query: 434 LVILGSGQKPTFPGDEVKEPAFGMPAFAIMESFSDDLRRQGFHPIDNLAVVLTHLSEVIR 493 L + PG+E +EPAFG+PA+ I E ++ R G+ +D V+ THLSEVI+ Sbjct: 422 LAMNPGNADGEIPGEETREPAFGLPAYWISEKNKEEAERLGYTVVDPATVLATHLSEVIK 481 Query: 494 NNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLKLLLAEHVSIRNLPLI 553 NN ++LL+ ++V+NL+ RL EY KL EE + IQ VL+LLL E VSIRNLP I Sbjct: 482 NNAAELLTRQEVQNLLDRLKEEYPKLVEELIPDKVPLGTIQKVLQLLLRERVSIRNLPTI 541 Query: 554 LESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAPT-GILNILKLGNHWDMIFYQAIQRD 612 LE++A+ AP ++ +VE VR R+ +QI G+L ++ L + +A+QRD Sbjct: 542 LETLADYAPITKDPDLLVEHVRQRLGRQITQQYLDEDGVLPVITLDPDLEAALAEALQRD 601 Query: 613 SKGECVDFNVEPRAVEMFSENATNSIRQYIDKGIPLTIVTLPEIRSYIRMILERNFPSLA 672 +GE +D EP +E ++ + + G ++T P +R Y+R ILER FP L Sbjct: 602 GEGELLDL--EPALLEELVRAVRKAVEKLANNGERPVLLTSPRVRPYVRRILERFFPELP 659 Query: 673 VLSHMEIAKGLKVNILGTI 691 VLS+ EI ++V +G + Sbjct: 660 VLSYNEIPDNVRVETVGVV 678 >gnl|CDD|162339 TIGR01399, hrcV, type III secretion protein, HrcV family. Members of this family are closely homologous to the flagellar biosynthesis protein FlhA (TIGR01398) and should all participate in type III secretion systems. Examples include InvA (Salmonella enterica), LcrD (Yersinia enterocolitica), HrcV (Xanthomonas), etc. Type III secretion systems resemble flagellar biogenesis systems, and may share the property of translocating special classes of peptides through the membrane. Length = 677 Score = 454 bits (1169), Expect = e-128 Identities = 231/683 (33%), Positives = 370/683 (54%), Gaps = 17/683 (2%) Query: 18 HDFAFSFCIVLIICILFLPIPTVLLDVGLASSIALSILILMVALWIEKPLEFSSFPTVLL 77 D + ++ II ++ LP+PT+L+D+ +A +I +S+L+LM+A++I +PL S+FP+VLL Sbjct: 3 SDLVLALLLLAIISMMILPLPTLLVDILIAINITISVLLLMIAIYIPRPLALSTFPSVLL 62 Query: 78 IVTIIRLSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLVVFMILITINFI 137 I T+ RL+L+I+TTR IL + AG II F V+ G+ +GLV+F+I+ + FI Sbjct: 63 ITTLFRLALSISTTRLILL-----HADAGNIIEAFGQFVVGGNLAVGLVIFLIITIVQFI 117 Query: 138 VITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMD 197 VITKG+ R+AEV ARF+LDA+PGKQM+IDADL +G+I+ +EA+RRR LE+ES +GAMD Sbjct: 118 VITKGSERVAEVSARFSLDAMPGKQMSIDADLRAGVIDADEARRRRSTLEKESQLYGAMD 177 Query: 198 GASKFVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALI 257 GA KFV+GDAIA III INI+GGI IG ++ MS A ++ L++GDGLVSQ+PAL+ Sbjct: 178 GAMKFVKGDAIAGIIIVLINIIGGISIGVTQHGMSASEALHLYTVLTIGDGLVSQIPALL 237 Query: 258 ISLSAAFLVSRTTSKGSTNTA--IVEQLSHYPRALLISAFFMIVLSVMPNLPAFPFIMLG 315 IS++A +V+R + N I QL+ PRALL++A ++ +++P P F +L Sbjct: 238 ISVTAGIIVTRVPGEAERNLGREIGHQLTSQPRALLLAAVLLLGFALIPGFPLLVFALLA 297 Query: 316 GFFACAGFY--VPYKNELQRLAKVAQIQEASKQNQHSAQLNFITSGVELVLGSLVSNRLL 373 A AG+ ++ + A AS + N L +S L Sbjct: 298 VLLAAAGYLLSRRKRSRAKANKAQASGAVASAPGAAAPIKNLDPFAEACPLILRLSPDLQ 357 Query: 374 SS--QEDLFLRVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIRVYGTTVAISELRVG 431 SS ++ L + ++R G +P I + SLP+ + I +Y V + G Sbjct: 358 SSADKDTLDQEIERMRWALFEDLGIPLPGIIIRVGDSLPDNEFRILLYEVPVLRDTIPPG 417 Query: 432 EVLVILGSGQKPTF-PGDEVKEPAFGM-PAFAIMESFSDDLRRQGFHPIDNLAVVLTHLS 489 V + G + G + E ++ L+ G + V+ L Sbjct: 418 HVALNDGVDNIEVAGIPAISGKRWPGESQRVWVTEEGAEKLQGAGLGYFSDSQVITHRLK 477 Query: 490 EVIRNNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLKLLLAEHVSIRN 549 + N + + ++ + L+ +++ EY +L +E + I VL+ L++E VSIRN Sbjct: 478 ATLLRNAQEFIGIQETRYLLDQMEREYPELVKEV-QRVLPLQRIAEVLQRLVSEQVSIRN 536 Query: 550 LPLILESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAPTG-ILNILKLGNHWDMIFYQA 608 L LILE++ E A + + E VRI + + IC A G L+ + + + + A Sbjct: 537 LRLILETLIEWAQREKDVVMLTEYVRIALKRYICHRYANGGRQLSAVLIDPEIEELIRGA 596 Query: 609 IQRDSKGECVDFNVEPRAVEMFSENATNSIRQYIDKGIPLTIVTLPEIRSYIRMILERNF 668 I++ S G ++P E + ++ ++T +IR Y+R ++E F Sbjct: 597 IRQTSTGTY--LALDPDDSEQLLDQIRQAVGDLPRAPSQPVLLTSMDIRRYVRKMIESEF 654 Query: 669 PSLAVLSHMEIAKGLKVNILGTI 691 P L VLS+ E+ + ++V +LG I Sbjct: 655 PDLPVLSYQELGEEIEVQVLGRI 677 >gnl|CDD|185237 PRK15337, PRK15337, type III secretion system protein InvA; Provisional. Length = 686 Score = 406 bits (1046), Expect = e-114 Identities = 219/690 (31%), Positives = 367/690 (53%), Gaps = 48/690 (6%) Query: 26 IVLIICILFLPIPTVLLDVGLASSIALSILILMVALWIEKPLEFSSFPTVLLIVTIIRLS 85 +V+II +L +P+PT L+D + +I L+IL+ M + +I++ L FS+FP++LLI T+ RL+ Sbjct: 21 MVMIIAMLIIPLPTYLVDFLIGLNIVLAILVFMGSFYIDRILSFSTFPSILLITTLFRLA 80 Query: 86 LNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLVVFMILITINFIVITKGATR 145 L+I+T+R IL AG II F V+ V+G V+F I+ + FIVITKG+ R Sbjct: 81 LSISTSRLILLDAD-----AGEIITTFGQFVIGDSLVVGFVIFSIVTVVQFIVITKGSER 135 Query: 146 IAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMDGASKFVRG 205 +AEV ARF+LD +PGKQM+IDADL +G+I+ + K RR LE ES +G+ DGA KF++G Sbjct: 136 VAEVAARFSLDGMPGKQMSIDADLKAGIIDADGVKERRSVLERESQLYGSFDGAMKFIKG 195 Query: 206 DAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALIISLSAAFL 265 DAIA III +N++GGI +G ++ M + A + L++GDGLV+Q+PAL+IS+SA F+ Sbjct: 196 DAIAGIIIIFVNLIGGISVGMTQHGMDLSSALSTYTILTIGDGLVAQIPALLISISAGFI 255 Query: 266 VSRTTSKGSTNT---AIVEQLSHYPRALLISAFFMIVLSVMPNLPAFPF----IMLGGFF 318 V+R G ++ I+ QL P L+++A + + ++P P F ++LG F Sbjct: 256 VTRVN--GDSDNLGRNIMSQLLSNPFVLVVTAILALSIGLLPGFPLPVFLLLAVVLGVLF 313 Query: 319 ACAGFYVPYKNELQRLAKVAQIQ-EASKQNQHSAQL-----NFITSGVELVLGSLVSNRL 372 F K+ + + + ++ S L I V L+L + R Sbjct: 314 YFKKFRKKKKSAAEPDTSGGEAPLDIDEKAGSSLGLIGDLDKVIPETVPLILLVPEARRP 373 Query: 373 LSSQEDLFLRVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIRVYGTTVAISELRVGE 432 +E+L ++R +F + YG +P+I + L + + V I+E+R + Sbjct: 374 DLEKENL---AERLRSQFFIDYGVRLPDILLRYSEGLDDN----SI---VVLINEIRAAQ 423 Query: 433 VLVILGSGQKPTFPGDEVKEPAF--------GMPAFAIMESFSDDLRRQGFHPIDNLAVV 484 I + DE+ G + + ++ L + G+ + + Sbjct: 424 -FTIYFDLHRVVNYSDELVSLGINPTIIDSGGEQYYWVPHEDTEKLAKLGYVLRSAIDEL 482 Query: 485 LTHLSEVIRNNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLKLLLAEH 544 LS ++ +N+++ ++ K+L+ +L+ +Y L +E H + I VL+ LL+E Sbjct: 483 YHCLSVLLLHNINEFFGIQETKHLLDQLEKKYPDLLKE-VYRHATVQRISEVLQRLLSER 541 Query: 545 VSIRNLPLILESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAPTGILNILKLGNHWDMI 604 +SIRN+ LI+E++A AP + +VE VR +A+ IC A G L + L + Sbjct: 542 ISIRNMKLIMEALALWAPREKDVIMLVEHVRGALARYICHKFAAGGELRAVVLSAEVEDA 601 Query: 605 FYQAIQRDSKGECVDFNVEPRAVEMFSEN---ATNSIRQYIDKGIPLTIVTLPEIRSYIR 661 + I++ S G + N++P E + A + + + L V +IR +++ Sbjct: 602 IRKGIRQTSGGTFL--NLDPAESENLMDLLTLALDDLGIAHRDIVLLVSV---DIRRFVK 656 Query: 662 MILERNFPSLAVLSHMEIAKGLKVNILGTI 691 ++E FP L VLS EIA ++VN++ TI Sbjct: 657 KLIEGRFPELEVLSFGEIADSVEVNVIKTI 686 >gnl|CDD|183699 PRK12720, PRK12720, secretion system apparatus protein SsaV; Provisional. Length = 675 Score = 369 bits (950), Expect = e-102 Identities = 202/678 (29%), Positives = 354/678 (52%), Gaps = 27/678 (3%) Query: 18 HDFAFSFCIVLIICILFLPIPTVLLDVGLASSIALSILILMVALWIEKPLEFSSFPTVLL 77 D + +++ + ++ LP+PT ++D+ +A ++ S+++LM+A+++ PLEFS FP++LL Sbjct: 15 QDIVLAVMLLVAVFMMILPLPTWMVDILIAINLMFSVILLMIAIYLRDPLEFSVFPSLLL 74 Query: 78 IVTIIRLSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLVVFMILITINFI 137 I T+ RL+L I+T+R +L H+ AG I+ F V+ G+ +GL+VF I+ + FI Sbjct: 75 ITTLYRLALTISTSRLVL-LQHD----AGEIVDAFGKFVVGGNLAVGLIVFTIITIVQFI 129 Query: 138 VITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMD 197 VITKG+ R+AEV ARF+LD +PGKQM+ID D+ +G+I+ +EA+R R+ +++ES GAMD Sbjct: 130 VITKGSERVAEVSARFSLDGMPGKQMSIDGDMRAGVIDADEARRLRQHVQKESRLLGAMD 189 Query: 198 GASKFVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALI 257 GA KFV+GDAIA II+ +NI+GGI+IG ++DMS A + + LS+GDGL Q+P+L+ Sbjct: 190 GAMKFVKGDAIAGIIVILVNIIGGIIIGVMQHDMSASEAVNTYAVLSIGDGLCGQIPSLL 249 Query: 258 ISLSAAFLVSRTTSKGSTNTA--IVEQLSHYPRALLISAFFMIVLSVMPNLPAFPFIMLG 315 IS++A +V+R + N A + Q+ P+AL ++A +++ +++P P F L Sbjct: 250 ISITAGIIVTRVPGEKRQNLANELSSQIGRQPQALWLAAVVLMLFALIPGFPFITFAFLA 309 Query: 316 GFFACAGFYVPYKNELQRLAKVAQIQEASKQNQHSAQLNFITSGVELVLGSLVSNRLLSS 375 A + K + A EA + + + L+L + Sbjct: 310 ALVAAPAILLRRKKSVVS----ANGVEAGGSEEGPEGDSMVPGACPLMLRLAPTLHSADL 365 Query: 376 QEDLFLRVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIRVYGTTVAISELRVGEVLV 435 D+ +R G +PE+ + D L EK T+ +Y V L + + Sbjct: 366 IRDI----DALRWFLFEDLGVPLPEVNIEVDPELTEKTLTVLLYQEPVL--SLSLPPQAL 419 Query: 436 ILGSGQKPTFPGDEVKEPAFGMPAFA-IMESFSDDLRRQGFHPIDNLAVVLTHLSEVIRN 494 +L G + GD P GM + + ++ + G + L V+ Sbjct: 420 LLLIGPDASLVGDSQTLP-NGMGQICWLTKDQAEQAQGFGLDVFAGSQRISALLKCVLLR 478 Query: 495 NLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLKLLLAEHVSIRNLPLIL 554 + + + ++ + L+ ++ Y +L +E + I +L+ L++E VSIR+L I Sbjct: 479 YMGEFIGVQETRYLMDAMEKRYGELVKEL-QRQLPVGKIAEILQRLVSERVSIRDLRTIF 537 Query: 555 ESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAPTG-ILNILKLGNHWDMIFYQAIQRDS 613 ++ E AP + + E VRI + + I G L +L++G + + ++I++ S Sbjct: 538 GTLVEWAPREKDVVMLTEYVRIALRRHILRRFNHEGKWLPVLRIGEGIENLIRESIRQTS 597 Query: 614 KGECVDFNVEPRAVEMFSENATNSIRQYIDKGIPLTIVTLPEIRSYIRMILERNFPSLAV 673 G + S I Q + + L +VT ++R ++R I+ER L V Sbjct: 598 AGTYSALSSRH------STQILQLIEQALKQSQKLVLVTSVDVRRFLRKIIERTLFDLPV 651 Query: 674 LSHMEIAKGLKVNILGTI 691 LS E+ ++ ++G+I Sbjct: 652 LSWQELGDEAEIKVVGSI 669 >gnl|CDD|168293 PRK05910, PRK05910, type III secretion system protein; Validated. Length = 584 Score = 207 bits (529), Expect = 6e-54 Identities = 178/674 (26%), Positives = 288/674 (42%), Gaps = 114/674 (16%) Query: 26 IVLI-ICILF---LPIPTVLLDVGLASSIALSILILMVALWIEKPLEFSSFPTVLLIVTI 81 +V I +CILF P+P LLD GL + ALS+ I+ + FP++ L + + Sbjct: 12 MVAIPLCILFTLIFPLPQWLLDFGLCINFALSLSIVFWVFSLRSSASARLFPSLFLYLCL 71 Query: 82 IRLSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLVVFMILITINFIVITK 141 +RL LN+A+TR ILS G A +I + G L ++L +NF+VITK Sbjct: 72 LRLGLNLASTRWILS-----SGWASPLIFSLGNFFSLGSLPAALTACLLLFLVNFLVITK 126 Query: 142 GATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMDGASK 201 G+ RIAEV ARF+L+A+PGKQM++DADL SG +++ L EES FF AM+G + Sbjct: 127 GSERIAEVRARFSLEALPGKQMSLDADLVSGRASYSRVSKQKNSLLEESDFFSAMEGVFR 186 Query: 202 FVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALIISLS 261 FV+GDAI S I+ +NI+ +G + +A +GD LVSQVPAL+ S + Sbjct: 187 FVKGDAIMSCILLGVNILAATFLGR-----ATGYAVGDLWLTVLGDALVSQVPALLTSCA 241 Query: 262 AAFLVSRTTSKGSTNTAIVEQLSHYPRALLISAFFMIVLSVMPNLPAFPFIMLGGFFACA 321 AA L+S+ K S +++ + A + L+ +P P P + F+ Sbjct: 242 AATLISKVGEKESLLQHLLDYYEQSRQHFRFIALLLCSLACIPGAPKAPILG----FSVL 297 Query: 322 GFYVPYKNELQRLAKVAQIQEASKQNQHSAQLNFITSGVELVLGSLVSNRLLSSQEDLFL 381 ++ YKN + + L F E V +L + + + +L+ Sbjct: 298 -LFLAYKNP-------SSGET----------LLFQKERFEFVELAL-PDEGVGNPANLYR 338 Query: 382 RVSKIRRKFAVQYGFIVPEIKVTTDISLPEKGYTIRVYGTTVAISELRVGEVLVILGSGQ 441 R + + G + PE + E + G V + EL +L L + Sbjct: 339 AA---REEIFQELGVVFPE---EIVVRHVESSPRLIFSGQEVYLRELSCPAILPSLRNLA 392 Query: 442 KPTFPGDEVKEPAFGMPAFAIMESFSDDLRRQGFHPIDNLAVVLTHLSEVIRNNLSQLLS 501 VK ++ + I+ ++ +SE N+L LL Sbjct: 393 PEAISERFVK-------------RLVEEFQEVAGISIEE--IIPKKISE---NSLVFLL- 433 Query: 502 YKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLKLLLAEHVSIRNLPLILESIAEVA 561 + L E S H+ P ILE+IA Sbjct: 434 ---------------RALVRERVSLHL---------------------FPKILEAIAVYG 457 Query: 562 PHSRKTSHIVEQVRIRMAQQICGDLAPTGILNILKLGNHWDMIFYQAIQRDSKGECVDFN 621 + + +VE+VR + +QI L N D + + I DS E + Sbjct: 458 SQGKSSEELVEKVRKYLGKQIG-----------RSLWNRQDTL--EVITIDSHVEQFIRD 504 Query: 622 VEPRAVEMFSENATNSIRQYIDK---GIPLTIVTLPEIRSYIRMILERNFPSLAVLSHME 678 ++ +E ++ +++ G IVT E R ++ +++ FP L VLSH E Sbjct: 505 SYSKSNPDMNEKVVAQVKSLLERSGEGNFRAIVTGCETRFELKKMVDPYFPDLLVLSHSE 564 Query: 679 IAKGLKVNILGTIS 692 + + + +++LG +S Sbjct: 565 LPEEIPISLLGAVS 578 >gnl|CDD|177434 PHA02611, 51, baseplate hub assembly protein; Provisional. Length = 249 Score = 32.4 bits (74), Expect = 0.36 Identities = 13/57 (22%), Positives = 30/57 (52%), Gaps = 1/57 (1%) Query: 486 THLSEVIRNNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSH-ISYSGIQAVLKLLL 541 T LSE + L +S ++++ ++ ++P +L C+ + Y+ + + KLL+ Sbjct: 147 TELSEETQEALIDAISIEELEKIVKSMNPIKLELKISCCNRRDLKYTDLLELFKLLV 203 >gnl|CDD|147902 pfam05994, FragX_IP, Cytoplasmic Fragile-X interacting family. CYFIP1/2 (Cytoplasmic fragile X mental retardation interacting protein) like proteins for a highly conserved protein family. The function of CYFIPs is unclear, but CYFIP interaction with fragile X mental retardation interacting protein (FMRP) involves the domain of FMRP which also mediating homo- and heteromerization. Length = 855 Score = 32.0 bits (73), Expect = 0.58 Identities = 36/225 (16%), Positives = 78/225 (34%), Gaps = 45/225 (20%) Query: 477 PIDNLAVVLTHLSEVIRNNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAV 536 ID ++ L+E+ R NL +S + +L S E ++L + + Sbjct: 379 SIDLNRLIAQRLNELFRENLDAAISRFESNDLTSI--VELERLLD--------------I 422 Query: 537 LKL---LLAEHVSIRN-----LPLILESIAEVAPHSRKTSHIVEQVRIRMAQQICGDLAP 588 LKL LL+E + + + + +P SR H+ ++ D P Sbjct: 423 LKLTHQLLSEFLIPLDPFDDMFSEANHNDSPNSPSSRILLHV--------FSELNNDFLP 474 Query: 589 TGILNILKL-GNHWDMIFYQAIQRDSKGECVDFNVEPRAVEMFSENATNSIRQYIDKGIP 647 LN + ++F + + R+ + + ++ Y Sbjct: 475 NYFLNSTTMRFVRAKIVFSEELPREKPPIVQPI--YLLGTKDLNAAFATVLKLYT----- 527 Query: 648 LTIVTLPEIRSYIRMILERNFPSLAVLSHMEIAKGLKVNILGTIS 692 + + R+ R++ + +AVL E+ K ++ + + Sbjct: 528 -GFIGIEHFRALCRLL---GYGGIAVL-IQELLKIIESKLEEALL 567 >gnl|CDD|163153 TIGR03142, cytochro_ccmI, cytochrome c-type biogenesis protein CcmI. This TPR repeat-containing protein is the CcmI protein (also called CycH) of c-type cytochrome biogenesis. CcmI is thought to act as an apo-cytochrome c chaperone. This model describes the N-terminal region of the protein, Members of this protein family. Length = 117 Score = 31.8 bits (73), Expect = 0.61 Identities = 16/66 (24%), Positives = 34/66 (51%), Gaps = 3/66 (4%) Query: 125 LVVFMILITINFIVIT--KGATRIAEVGARFTLDAIPGKQMA-IDADLSSGLIEEEEAKR 181 + + L+ + F+++ + A V A+ ++A ++ DL+ GL++E EA+ Sbjct: 5 VAALLTLVALLFLLLPLLRRRRAAATVDRDELNLAVYRDRLAELERDLAEGLLDEAEAEA 64 Query: 182 RRKELE 187 R EL+ Sbjct: 65 ARAELQ 70 >gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA; Provisional. Length = 569 Score = 31.3 bits (72), Expect = 0.74 Identities = 15/78 (19%), Positives = 32/78 (41%), Gaps = 7/78 (8%) Query: 479 DNLAVVLTHLSEVIRNNLSQLLSYKDVKNLISRLDPEYQKLAEETCSSHISYSGIQAVLK 538 L + + + N S+L S + ++ + L+ +Y ++ E I+YS +Q L+ Sbjct: 327 KELKEEIDRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELE 386 Query: 539 LLL-------AEHVSIRN 549 +L E + Sbjct: 387 EILKQLEEIEKEQEKLSE 404 >gnl|CDD|149595 pfam08597, eIF3_subunit, Translation initiation factor eIF3 subunit. This is a family of proteins which are subunits of the eukaryotic translation initiation factor 3 (eIF3). In yeast it is called Hcr1. The Saccharomyces cerevisiae protein eIF3j has been shown to be required for processing of 20S pre-rRNA and binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p. Length = 242 Score = 30.8 bits (70), Expect = 1.3 Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 2/53 (3%) Query: 161 KQMAIDADLSSGLIEEEEAKRRRKELEEESAFFGAMD--GASKFVRGDAIASI 211 ++ A+ +E K R ++LEEES A D G + + S+ Sbjct: 77 EEKALRELEEITPEDELAEKLRLQKLEEESDLKNAADLFGGPAPTKDTDLESL 129 >gnl|CDD|184040 PRK13412, fkp, bifunctional fucokinase/L-fucose-1-P-guanylyltransferase; Provisional. Length = 974 Score = 29.8 bits (67), Expect = 2.5 Identities = 45/186 (24%), Positives = 65/186 (34%), Gaps = 48/186 (25%) Query: 513 DPEYQKLAEETCSSHI-SYSGIQAVLKLLLAEHVSIRNLPLILESIAEVAPHSRKTSHIV 571 PEY+ H+ Y+GI K +LAE V + L S A + +H + Sbjct: 821 QPEYRD-------CHLLYYTGITRTAKGILAEIVR----SMFLNSTAHLQLLHEMKAHAL 869 Query: 572 EQVRIRMAQQI-CGDLAPTGILNILKLGNHWDMIFYQAIQRDSKGECVDFNVEPRAVEMF 630 + M + I G+ G L +G W+ + +D P AVE Sbjct: 870 D-----MYEAIQRGEFEEFGRL----VGKTWEQ-----------NKALDSGTNPAAVEAI 909 Query: 631 SENATNSIRQYI---------DKGIPLTIVTLPEIRSYIRMILERNFP-SLAVLSHMEIA 680 E I+ Y G + P IR IL N P A M ++ Sbjct: 910 IE----LIKDYTLGYKLPGAGGGGYLYMVAKDPGAAERIRKILTENAPNPRARFVDMSLS 965 Query: 681 -KGLKV 685 KGL+V Sbjct: 966 DKGLQV 971 >gnl|CDD|150431 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of approximately 300 residues, found in plants and vertebrates. They contain a highly conserved DDRGK motif. Length = 189 Score = 28.9 bits (65), Expect = 4.4 Identities = 15/71 (21%), Positives = 27/71 (38%), Gaps = 20/71 (28%) Query: 134 INFIVITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIEEEEAKRRRKELEEESAFF 193 IN+I + K + E+ A F L E+ R ++LE + + Sbjct: 106 INYIKLKK-VVNLEELSAEFGLKT-------------------EDVIERIQDLEAQGSIT 145 Query: 194 GAMDGASKFVR 204 G +D K++ Sbjct: 146 GVIDDRGKYIY 156 >gnl|CDD|150924 pfam10328, 7TM_GPCR_Srx, Serpentine type 7TM GPCR chemoreceptor Srx. Chemoreception is mediated in Caenorhabditis elegans by members of the seven-transmembrane G-protein-coupled receptor class (7TM GPCRs) of proteins which are of the serpentine type. Srx is part of the Srg superfamily of chemoreceptors. Chemoperception is one of the central senses of soil nematodes like C. elegans which are otherwise 'blind' and 'deaf'. Length = 275 Score = 28.3 bits (64), Expect = 6.1 Identities = 34/151 (22%), Positives = 53/151 (35%), Gaps = 33/151 (21%) Query: 21 AFSFCIVLIICILFLPIPTVLLD------------VGLASSIALSI---LILMVAL---- 61 A S I+ I LF +P LLD GL I L+++L Sbjct: 36 AISNAIICTIF-LFYVVPMTLLDLSFLPSLLNSHIGGLILLGLYEISPLTHLLISLNRFC 94 Query: 62 ------WIEKPLEFSSFPTVLLIVTIIRLSLNIATTRAI-----LSFGHEGYGAAGGIIA 110 +K FS T ++I I +++ I+T + E + Sbjct: 95 AVFFPFKYKK--IFSIKNTKIIITFIWIIAIIISTLFYFPLGCHFYYSPETLTWSFDEDP 152 Query: 111 GFSSLVMSGDFVIGLVVFMILITINFIVITK 141 + + GDF+ LV+ +I IN I K Sbjct: 153 PCAEINWYGDFLKNLVLVIITNIINVITFIK 183 >gnl|CDD|162132 TIGR00958, 3a01208, Conjugate Transporter-2 (CT2) Family protein. Length = 711 Score = 28.5 bits (64), Expect = 6.2 Identities = 34/156 (21%), Positives = 55/156 (35%), Gaps = 15/156 (9%) Query: 198 GASKFVRGDAIASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSVGDGLVSQVPALI 257 G ++ A ++ G+ G F Y M+ + +R + L+ Q Sbjct: 196 GPPALASAIFFMCLLSIASSVSAGLRGGSFNYTMARINLR---IREDLFRSLLRQDLGFF 252 Query: 258 ISLSAAFLVSRTTSKGST-NTAIVEQLSHYPRAL---LISAFFMIVLSVMPNLPAFPFIM 313 L SR +S T + ++ ++ R L L FM+ LS P L I Sbjct: 253 DENKTGELTSRLSSDTQTMSRSLSLNVNVLLRNLVMLLGLLGFMLWLS--PRLTMVTLIN 310 Query: 314 LGGFFACAGFYVPYKNELQRLAKVAQIQEA-SKQNQ 348 L F + L ++QEA +K NQ Sbjct: 311 LPLVFLAEKVFGKRYQLLSE-----ELQEAVAKANQ 341 >gnl|CDD|152423 pfam11988, Dsl1_N, Retrograde transport protein Dsl1 N terminal. Dsl1 is a peripheral membrane protein required for transport between the Golgi and the endoplasmic reticulum. It is localized to the ER membrane, and in vitro it specifically binds to coatomer, the major component of the protein coat of COPI vesicles. It is comprised primarily of alpha helical bundles. It complexes with another subunit of the Dsl1p complex called Tip20 which forms heterodimers by pairing the N termini of each protein. A central disorganized region between the N and C termini of Dsl1 contains binding sites for coatomer. The C terminus of Dsl1 contains a binding site to the Sec39 subunit of the Dsl1p complex. Length = 316 Score = 28.6 bits (64), Expect = 6.3 Identities = 18/60 (30%), Positives = 37/60 (61%), Gaps = 5/60 (8%) Query: 483 VVLTHLSEVIRNNLSQLLSYKD--VKNLISRLDPEYQKLAEETCSSHISYSG--IQAVLK 538 ++T +++++ N S++LS + +KNLIS ++ + KL+E + SS +Y G + +L Sbjct: 257 AIVTEFTKLVKQNASEILSKEANQIKNLISHINDKLTKLSERSGSSW-TYDGNELDKLLN 315 >gnl|CDD|181140 PRK07843, PRK07843, 3-ketosteroid-delta-1-dehydrogenase; Reviewed. Length = 557 Score = 28.5 bits (64), Expect = 6.5 Identities = 13/38 (34%), Positives = 17/38 (44%), Gaps = 5/38 (13%) Query: 141 KGATRIAEVGARFTLDAIPGKQM-----AIDADLSSGL 173 +G R +VGAR GK + A+ A L GL Sbjct: 181 RGVLRALKVGARTLWAKATGKNLLGMGQALAAGLRIGL 218 >gnl|CDD|116866 pfam08282, Hydrolase_3, haloacid dehalogenase-like hydrolase. This family contains haloacid dehalogenase-like hydrolase enzymes. Length = 254 Score = 28.0 bits (63), Expect = 8.3 Identities = 13/83 (15%), Positives = 26/83 (31%), Gaps = 25/83 (30%) Query: 631 SENATNSIRQYIDKGIPLTIVT---LPEIRSYI----------------------RMILE 665 SE +I++ +KGI + I T + +++ + Sbjct: 17 SERTKEAIKKLQEKGIKVVIATGRPYRGALPVLEELGLDLPVICFNGAYIYDENGKILYK 76 Query: 666 RNFPSLAVLSHMEIAKGLKVNIL 688 V +E K ++IL Sbjct: 77 NPISKEDVKEIIEYLKENNLSIL 99 >gnl|CDD|162122 TIGR00939, 2a57, Equilibrative Nucleoside Transporter (ENT). Length = 437 Score = 27.8 bits (62), Expect = 9.1 Identities = 26/112 (23%), Positives = 50/112 (44%), Gaps = 12/112 (10%) Query: 26 IVLIICILFLPIPTVLLDVGLASSIALSILILMVALWIEKPL--EFSSFPTVLLIVTIIR 83 +L + I + + V L + + ++++++ + + K E F T + V II Sbjct: 58 SLLFNSLNLFLIFRIPVTVRLLGGLVILLVVVILVMVLVKVQTSETGFFVTTMASVVIIN 117 Query: 84 LSLNIATTRAILSFGHEGYGAAGGIIAGFSSLVMSGDFVIGLV--VFMILIT 133 A+L +G AG + +SS VMSG + G++ + MIL+ Sbjct: 118 SG------MALLQGSL--FGLAGVFPSTYSSAVMSGQGLAGVLTSLAMILVK 161 >gnl|CDD|183259 PRK11652, emrD, multidrug resistance protein D; Provisional. Length = 394 Score = 27.9 bits (63), Expect = 9.3 Identities = 28/146 (19%), Positives = 53/146 (36%), Gaps = 51/146 (34%) Query: 28 LIICILF-LPIPTVLLDVGLASSIA-----------LSILILMVALWIEKPLEFSSFPTV 75 + + ILF LPIP A + L+ + +WI P Sbjct: 243 MTVSILFILPIPAAFFGAWFAGRPNKRFSTLMWQSVICCLLAGLLMWI---------PGW 293 Query: 76 LLIVTIIRLSLNI---------------ATTRAILSFGHEGYGAAG-----------GII 109 ++ + +L + AT+ A+ F + G AG G+ Sbjct: 294 FGVMNV--WTLLVPAALFFFGAGMLFPLATSGAMEPFPYLA-GTAGALLGGLQNIGSGLA 350 Query: 110 AGFSSLV-MSGDFVIGLVVFMILITI 134 A S+++ +G F +GL++ ++ + I Sbjct: 351 ALLSAMLPQTGQFSLGLLMTLMGLLI 376 >gnl|CDD|185539 PTZ00286, PTZ00286, 6-phospho-1-fructokinase; Provisional. Length = 459 Score = 27.7 bits (62), Expect = 9.3 Identities = 16/73 (21%), Positives = 26/73 (35%), Gaps = 9/73 (12%) Query: 193 FGAMDGASKFVRGDAI--ASIIITAINIVGGIVIGCFRYDMSIHHAADVFVRLSV----- 245 +GA G + D I + I+ +GG ++G R D +R + Sbjct: 123 YGAKYGYKGLYKEDWIKLDPKDVKTIHRLGGTILGSSRGGFDPKVMVDTLIRHGINILFT 182 Query: 246 --GDGLVSQVPAL 256 GDG A+ Sbjct: 183 LGGDGTHRGALAI 195 >gnl|CDD|162277 TIGR01271, CFTR_protein, cystic fibrosis transmembrane conductor regulator (CFTR). The model describes the cystis fibrosis transmembrane conductor regulator (CFTR) in eukaryotes. The principal role of this protein is chloride ion conductance. The protein is predicted to consist of 12 transmembrane domains. Mutations or lesions in the genetic loci have been linked to the aetiology of asthma, bronchiectasis, chronic obstructive pulmonary disease etc. Disease-causing mutations have been studied by 36Cl efflux assays in vitro cell cultures and electrophysiology, all of which point to the impairment of chloride channel stability and not the biosynthetic processing per se. Length = 1490 Score = 28.0 bits (62), Expect = 9.9 Identities = 23/92 (25%), Positives = 39/92 (42%), Gaps = 16/92 (17%) Query: 286 YPRALLISAFFMIVLSVMPNLPAF-PFIMLGGFFACAGFYVPYKNELQR----------- 333 Y A S FF++ LSV+P A I+L F + + + + R Sbjct: 307 YSSAFFFSGFFVVFLSVVPY--ALIKGIILRRIFTTISYCIVLRMTVTRQFPGAIQTWYD 364 Query: 334 -LAKVAQIQE-ASKQNQHSAQLNFITSGVELV 363 L + +IQ+ K+ + + N T+ VE+V Sbjct: 365 SLGAITKIQDFLCKEEYKTLEYNLTTTEVEMV 396 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.324 0.139 0.392 Gapped Lambda K H 0.267 0.0684 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 11,279,725 Number of extensions: 771392 Number of successful extensions: 2724 Number of sequences better than 10.0: 1 Number of HSP's gapped: 2675 Number of HSP's successfully gapped: 86 Length of query: 692 Length of database: 5,994,473 Length adjustment: 100 Effective length of query: 592 Effective length of database: 3,833,673 Effective search space: 2269534416 Effective search space used: 2269534416 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.5 bits) S2: 61 (27.4 bits)