RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2380
(733 letters)
>gnl|CDD|234597 PRK00050, PRK00050, 16S rRNA m(4)C1402 methyltranserfase;
Provisional.
Length = 296
Score = 352 bits (907), Expect = e-117
Identities = 118/253 (46%), Positives = 169/253 (66%), Gaps = 6/253 (2%)
Query: 229 LNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSLGNK-- 286
L+E ++ L I + +GIY+D TFG GGHS ILERLG KGRLIAID+D ++++
Sbjct: 8 LDEVVDALAI---KPDGIYVDGTFGGGGHSRAILERLGPKGRLIAIDRDPDAIAAAKDRL 64
Query: 287 ITDSRFSIIHNCFTELDIILKKYNIKKIDGILFDLGISSNQINNELRGFSFLLDGPLDMR 346
RF+++H F+ L +L + + K+DGIL DLG+SS Q+++ RGFSF DGPLDMR
Sbjct: 65 KPFGRFTLVHGNFSNLKEVLAE-GLGKVDGILLDLGVSSPQLDDAERGFSFRKDGPLDMR 123
Query: 347 MDITRGISASKWLANATEFNIKKVIQDYGEERFAKKIAKEIVHYRSITPITRTKQLVEII 406
MD ++G++A++ + +E + ++ ++YGEERFA++IA+ IV R PIT T +L EII
Sbjct: 124 MDQSQGLTAAEVVNTYSEEELARIFKEYGEERFARRIARAIVEARPKKPITTTGELAEII 183
Query: 407 LKSIRGNKRYKNPATRTFQAIRIYINQELKNLSIALKIGFKKLNIKGRIVVISFHSLEDR 466
++ ++ +PATRTFQA+RI +N EL+ L AL+ L GR+ VISFHSLEDR
Sbjct: 184 KSAVPPRRKGIHPATRTFQALRIEVNDELEELERALEAALDLLKPGGRLAVISFHSLEDR 243
Query: 467 IVKNFINFNTKIP 479
IVK F +K
Sbjct: 244 IVKRFFRELSKGC 256
>gnl|CDD|223352 COG0275, COG0275, Predicted S-adenosylmethionine-dependent
methyltransferase involved in cell envelope biogenesis
[Cell envelope biogenesis, outer membrane].
Length = 314
Score = 344 bits (885), Expect = e-113
Identities = 130/281 (46%), Positives = 178/281 (63%), Gaps = 10/281 (3%)
Query: 229 LNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSLGNKIT 288
LNE + L + +GIYID T G GGHS ILE+L GRLI ID+D +++++ +
Sbjct: 12 LNEVVELLAP---KPDGIYIDGTLGAGGHSRAILEKLPDLGRLIGIDRDPQAIAIAKERL 68
Query: 289 ---DSRFSIIHNCFTELDIILKKYNIKKIDGILFDLGISSNQINNELRGFSFLLDGPLDM 345
D R +++H F L LK+ I K+DGIL DLG+SS Q+++ RGFSF DGPLDM
Sbjct: 69 KEFDGRVTLVHGNFANLAEALKELGIGKVDGILLDLGVSSPQLDDAERGFSFRKDGPLDM 128
Query: 346 RMDITRGISASKWLANATEFNIKKVIQDYGEERFAKKIAKEIVHYRSITPITRTKQLVEI 405
RMD T+G+SA++ + +E ++ ++ ++YGEERFAK+IA+ IV R PI TK+L EI
Sbjct: 129 RMDQTQGLSAAEVVNTYSEEDLARIFKEYGEERFAKRIARAIVERRKKKPIETTKELAEI 188
Query: 406 ILKSIRGNKRYK--NPATRTFQAIRIYINQELKNLSIALKIGFKKLNIKGRIVVISFHSL 463
I +I ++ K +PATRTFQAIRIY+N EL+ L AL+ L GR+ VISFHSL
Sbjct: 189 IKSAIPAKEKRKKIHPATRTFQAIRIYVNDELEELEEALEAALDLLKPGGRLAVISFHSL 248
Query: 464 EDRIVKNFINFNTKIPHIDRRLPIYNYLFEPKMKFLARCKP 504
EDRIVKNF +K + LP+ K K + + KP
Sbjct: 249 EDRIVKNFFKELSKPGV-PKGLPVTEEGPALKFKLITK-KP 287
>gnl|CDD|216706 pfam01795, Methyltransf_5, MraW methylase family. Members of this
family are probably SAM dependent methyltransferases
based on Escherichia coli rsmH. This family appears to
be related to pfam01596.
Length = 310
Score = 315 bits (809), Expect = e-102
Identities = 129/281 (45%), Positives = 181/281 (64%), Gaps = 9/281 (3%)
Query: 229 LNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSL-GNKI 287
L E + LNI+ +GIYID T G GGHS ILE+L ++GRLI ID+D ++++ ++
Sbjct: 9 LQEVVELLNIKP---DGIYIDCTLGGGGHSEAILEQLPEEGRLIGIDRDPQAIANAKERL 65
Query: 288 T--DSRFSIIHNCFTELDIILKKYNIKKIDGILFDLGISSNQINNELRGFSFLLDGPLDM 345
R ++IH+ F L LK+ + K+DGILFDLG+SS Q++ RGFSF D PLDM
Sbjct: 66 KPFKGRVTLIHSNFANLFAYLKELGVGKVDGILFDLGVSSPQLDGAERGFSFKHDAPLDM 125
Query: 346 RMDITRGISASKWLANATEFNIKKVIQDYGEERFAKKIAKEIVHYRSITPITRTKQLVEI 405
RMD + G++A++ L +E ++ ++ + YGEERF+K+IA+ IV R PI TK+L EI
Sbjct: 126 RMDQSAGLTAAEVLNTYSEEDLARIFKKYGEERFSKRIARAIVERRKKKPIETTKELAEI 185
Query: 406 ILKSIRGNKRYK--NPATRTFQAIRIYINQELKNLSIALKIGFKKLNIKGRIVVISFHSL 463
I K++ + K +PATRTFQAIRIY+N EL++L AL+ L GR+ VISFHSL
Sbjct: 186 IKKAVPAKAKRKKIHPATRTFQAIRIYVNDELESLKEALEQAPDLLAPGGRLSVISFHSL 245
Query: 464 EDRIVKNFINFNTKIPHIDRRLPIYNYLFEPKMKFLARCKP 504
EDRIVKNF +K + LP+ EP +K + + KP
Sbjct: 246 EDRIVKNFFREYSKGCQVPAGLPVILEGLEPLLKIITK-KP 285
>gnl|CDD|232778 TIGR00006, TIGR00006, 16S rRNA
(cytosine(1402)-N(4))-methyltransferase. This model
describes RsmH, a 16S rRNA methyltransferase.
Previously, this gene was designated MraW, known to be
essential in E. coli and widely conserved in bacteria
[Cell envelope, Biosynthesis and degradation of murein
sacculus and peptidoglycan].
Length = 307
Score = 283 bits (725), Expect = 1e-89
Identities = 131/279 (46%), Positives = 184/279 (65%), Gaps = 8/279 (2%)
Query: 229 LNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSLGNKI- 287
L+E + LNI + +GIYID T G+GGHS IL +LG+ GRLI ID+D ++++ +I
Sbjct: 9 LDEVVEGLNI---KPDGIYIDGTLGRGGHSKAILSQLGE-GRLIGIDRDPQAIAFAKEIL 64
Query: 288 TDSRFSIIHNCFTELDIILKKYNIKKIDGILFDLGISSNQINNELRGFSFLLDGPLDMRM 347
+D RF IIH+ F LK+ + KIDGIL DLG+SS Q+++ RGFSF+ DGPLDMRM
Sbjct: 65 SDLRFVIIHDNFANFFEYLKELLVTKIDGILVDLGVSSPQLDDPERGFSFMHDGPLDMRM 124
Query: 348 DITRGISASKWLANATEFNIKKVIQDYGEERFAKKIAKEIVHYRSITPITRTKQLVEIIL 407
D ++ +SA++ L +E +++ +++ YGEERF+K+IA+ IV R PI TK+L E+I
Sbjct: 125 DQSQKLSAAEILNTYSEEDLEWILKKYGEERFSKRIARAIVERRKKKPIQTTKELAEVIS 184
Query: 408 KSIRGNKRYK--NPATRTFQAIRIYINQELKNLSIALKIGFKKLNIKGRIVVISFHSLED 465
K++ G +YK +PATR FQAIRIY+N EL+ L AL+ L GR+ +ISFHSLED
Sbjct: 185 KAVPGFSKYKSIHPATRVFQAIRIYVNDELEELEEALQFAPNLLAPGGRLSIISFHSLED 244
Query: 466 RIVKNFINFNTKIPHIDRRLPIYNYLFEPKMKFLARCKP 504
RIVKNF +K P + LP+ K + + KP
Sbjct: 245 RIVKNFFRELSKGPQVPPGLPVKEEQLYALGKRITK-KP 282
>gnl|CDD|182741 PRK10799, PRK10799, metal-binding protein; Provisional.
Length = 247
Score = 237 bits (607), Expect = 2e-73
Identities = 103/220 (46%), Positives = 139/220 (63%), Gaps = 13/220 (5%)
Query: 507 KLNIKKYEDYCPNGLQVEGRSNINVIVTGVTASLDLIKTAVDMNADAILVHHGYFWKGEN 566
KLN DY PNGLQVEGR + IVTGVTAS L+ AV + ADA++VHHGYFWKGE+
Sbjct: 13 KLNSAAISDYAPNGLQVEGRETVQKIVTGVTASQALLDEAVRLQADAVIVHHGYFWKGES 72
Query: 567 SNIVGIKKKRLEQLIINKINLYAYHLPLDMHPKLGNNAQLAKILNFSCTRRFSKNNIGWI 626
I G+K+ RL+ L+ N INLY +HLPLD HP+LGNNAQLA +L I +
Sbjct: 73 PVIRGMKRNRLKTLLANDINLYGWHLPLDAHPELGNNAQLAALL-----------GITVM 121
Query: 627 GKIINLKRYN-FKKIITIKDLFHHITRKIGKKPIVIGDLN-KKIYEIGWCTGAAQNLLTD 684
G+I L + + +L I ++G+KP+ GD + + + WCTG Q+ +
Sbjct: 122 GEIEPLVPWGELTMPVPGLELASWIEARLGRKPLWCGDTGPEVVQRVAWCTGGGQSFIDS 181
Query: 685 AINEGVTAYISGEISESTVYISRESGVAYFAAGHHATERY 724
A GV A+I+GE+SE T++ +RE G+ ++AAGHHATER
Sbjct: 182 AARFGVDAFITGEVSEQTIHSAREQGLHFYAAGHHATERG 221
>gnl|CDD|223404 COG0327, COG0327, Uncharacterized conserved protein [Function
unknown].
Length = 250
Score = 193 bits (493), Expect = 9e-57
Identities = 79/222 (35%), Positives = 126/222 (56%), Gaps = 12/222 (5%)
Query: 506 KKLNIKKYEDYCPNGLQVEG-RSNINVIVTGVTASLDLIKTAVDMNADAILVHHG-YFWK 563
+ L + ED+ NGLQV G + + + A+L ++ A+++ AD ++VHH FW
Sbjct: 12 EFLPPELAEDWDNNGLQVGGPLEEVKKVAVALDATLAVLDEAIELGADLLIVHHPLIFWP 71
Query: 564 GENSNIVGIKKKRLEQLIINKINLYAYHLPLDMHPKLGNNAQLAKILNFSCTRRFSKNNI 623
++ I G K KR++ LI N INLYA H LD HP+ GN+A A +L F +
Sbjct: 72 VKSLAIDGPKGKRIKALIQNDINLYAAHTNLDAHPEGGNDALAAALLGAEELPPF-GEGL 130
Query: 624 GWIGKIINLKRYNFKKIITIKDLFHHITRKIGKKPI-VIGDLNKKIYEIGWCTGAAQNLL 682
G +G K+ T+++L I K+G+ P+ V+ D +++I + C+G+ Q L
Sbjct: 131 GRVG--------ELKEPTTLEELAERIKAKLGRPPLRVVKDGSEEIKRVAVCSGSGQGFL 182
Query: 683 TDAINEGVTAYISGEISESTVYISRESGVAYFAAGHHATERY 724
++A EGV AYI+G++S T + +RE G++ AGH+ATER
Sbjct: 183 SEAAAEGVDAYITGDLSHHTAHDARELGLSVIDAGHYATERP 224
>gnl|CDD|213534 TIGR00486, YbgI_SA1388, dinuclear metal center protein, YbgI/SA1388
family. The characterization of this family of
uncharacterized proteins as orthologous is tentative.
Members are found in all three domains of life. Several
members (from Bacillus subtilis, Listeria monocytogenes,
and Mycobacterium tuberculosis - all classified as
Firmicutes within the Eubacteria) share a long insert
relative to other members [Unknown function, General].
Length = 249
Score = 181 bits (462), Expect = 2e-52
Identities = 73/221 (33%), Positives = 111/221 (50%), Gaps = 11/221 (4%)
Query: 506 KKLNIKKYEDYCPNGLQVE-GRSNINVIVTGVTASLDLIKTAVDMNADAILVHHGYFWKG 564
+ L + ED NGLQV G + +V V AS + AV + AD I+ HH WK
Sbjct: 12 RFLPKELAEDGDNNGLQVGSGNEEVKKVVVAVDASESVADEAVRLGADLIITHHPLIWKP 71
Query: 565 ENSNIVGIKKKRLEQLIINKINLYAYHLPLDMHPKLGNNAQLAKILNFSCTRRFSKNNIG 624
I GIK RL+ L+ N I+LY+ H LD H GNN LA+ L + F +G
Sbjct: 72 LKRLIRGIKPGRLKILLQNDISLYSAHTNLDAHD-GGNNDALARALGLENPKEFEDYGLG 130
Query: 625 WIGKIINLKRYNFKKIITIKDLFHHITRKIGKKPIVIGDLNK-KIYEIGWCTGAAQNLLT 683
+G+ FK I + I + + KP+++ + ++ +G+ + +
Sbjct: 131 RVGE--------FKAPIESLEEVLEIKKVLNVKPLLVVKNGPEYVKKVAVVSGSGLSFIM 182
Query: 684 DAINEGVTAYISGEISESTVYISRESGVAYFAAGHHATERY 724
A+ EGV AYI+G++S T +++RE G+ AGH+ATER
Sbjct: 183 KALREGVDAYITGDLSHHTAHLARELGLNVIDAGHYATERG 223
>gnl|CDD|216697 pfam01784, NIF3, NIF3 (NGG1p interacting factor 3). This family
contains several NIF3 (NGG1p interacting factor 3)
protein homologues. NIF3 interacts with the yeast
transcriptional coactivator NGG1p which is part of the
ADA complex, the exact function of this interaction is
unknown.
Length = 237
Score = 180 bits (460), Expect = 3e-52
Identities = 63/227 (27%), Positives = 113/227 (49%), Gaps = 17/227 (7%)
Query: 506 KKLNIKKYEDYCPNGLQVEGRSN-INVIVTGVTASLDLIKTAVDMNADAILVHHGYFWKG 564
+ ED+ GLQV + I+ + A+ ++I+ A++ AD I+ HH +K
Sbjct: 8 ELAPPSLAEDWDNVGLQVGDPEGEVKKILVALDATEEVIEEAIENGADLIITHHPLIFKP 67
Query: 565 ENSNI-VGIKKKRLEQLIINKINLYAYHLPLDMHPKLGNNAQLAKILNFSCTRRFSKN-- 621
K +R+ +LI N I+LY+ H LD P G N LA++L +
Sbjct: 68 LKRITGDDPKGRRILKLIKNNISLYSAHTNLDAAPG-GVNDWLAELLGLESIEPLEPSVS 126
Query: 622 ---NIGWIGKIINLKRYNFKKIITIKDLFHHITRKIGKKPI-VIGDLNKKIYEIGWCTGA 677
+G IG++ + I++++L + ++G + + V+GDL+K I ++ C G+
Sbjct: 127 EGEGLGRIGEL--------PEPISLEELAKRVKERLGLEGVRVVGDLDKPIKKVAICGGS 178
Query: 678 AQNLLTDAINEGVTAYISGEISESTVYISRESGVAYFAAGHHATERY 724
+L+ +A +G I+GE+ ++E G+A AGH+ATER+
Sbjct: 179 GGSLIEEAKAKGADLLITGELKHHDALDAKEKGIALIDAGHYATERF 225
>gnl|CDD|240528 cd13123, MATE_MurJ_like, MurJ/MviN, a subfamily of the multidrug
and toxic compound extrusion (MATE)-like proteins.
Escherichia coli MurJ (MviN) has been identified as
essential for murein biosynthesis. It has been suggested
that MurJ functions as the peptidoglycan lipid II
flippase which is involved in translocation of
lipid-anchored peptidoglycan precursors across the
cytoplasmic membrane, though results obtained in
Bacillus subtilis seem to indicate that its MurJ
homologs are not essential for growth. Some MviN family
members (e.g. in Mycobacterium tuberculosis) possess an
extended C-terminal region that contains an
intracellular pseudo-kinase domain and an extracellular
domain resembling carbohydrate-binding proteins.
Proteins from the MATE family are involved in exporting
metabolites across the cell membrane and are often
responsible for multidrug resistance (MDR).
Length = 420
Score = 122 bits (308), Expect = 1e-29
Identities = 48/125 (38%), Positives = 79/125 (63%), Gaps = 4/125 (3%)
Query: 1 MAFAVVLGGALQIIIQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAAQ 60
+A+ V+LGG LQ+++Q+P+L ++G +L+ F++ VRR+LK M P++ V +Q
Sbjct: 177 LAWGVLLGGVLQLLVQLPALRRLGFRFRPRLD----FRDPGVRRVLKLMLPALLGVSVSQ 232
Query: 61 ISLMLNTNIASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIENNTEEYS 120
I+L+++T +AS + EGS+S L YADRL + P + GV T LLP LS+ + +E+
Sbjct: 233 INLLVDTILASFLPEGSISALYYADRLYQLPLGIFGVAIATALLPRLSRLAAAGDLDEFR 292
Query: 121 AILDW 125
L
Sbjct: 293 RTLSR 297
Score = 34.4 bits (80), Expect = 0.21
Identities = 12/31 (38%), Positives = 16/31 (51%)
Query: 126 IPIFAHSGLALSIGLGACLHASFLYWYLRHK 156
I H GLAL+ L A ++A L LR +
Sbjct: 390 IKPLGHVGLALATSLSAWVNALLLLVLLRRR 420
>gnl|CDD|223800 COG0728, MviN, Uncharacterized membrane protein, putative virulence
factor [General function prediction only].
Length = 518
Score = 121 bits (306), Expect = 4e-29
Identities = 49/124 (39%), Positives = 80/124 (64%), Gaps = 4/124 (3%)
Query: 2 AFAVVLGGALQIIIQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAAQI 61
A+ V++GG LQ+++Q+P+L K G+ + FK+ ++R LK M P++ V +QI
Sbjct: 195 AWGVLIGGLLQLLVQLPALRKAGLLIKPRFG----FKDPGLKRFLKLMLPALLGVSISQI 250
Query: 62 SLMLNTNIASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIENNTEEYSA 121
+L+++T IAS + EGS+S L YADRL + P + GV +T+LLP+LS+ + E+
Sbjct: 251 NLLIDTAIASFLAEGSVSWLYYADRLYQLPLGIFGVALSTVLLPSLSRHAANGDWPEFLK 310
Query: 122 ILDW 125
+LDW
Sbjct: 311 LLDW 314
Score = 66.2 bits (162), Expect = 3e-11
Identities = 34/110 (30%), Positives = 54/110 (49%), Gaps = 2/110 (1%)
Query: 126 IPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFI-RLVIALLLLVIVAL 184
IP H GLAL+ L A ++A LY+ LR + +Y+P GWG+F I +L++A ++ L
Sbjct: 407 IPPLGHVGLALATSLAAWVNALLLYYLLRKRLVYLPGRGWGLFLILKLLLASAIMAAALL 466
Query: 185 WGNSYFNWLGMQAHPIFRIVALLLIL-LFCGITYFLALRIMGFLFLNEAI 233
+ + + +LL+L L YF L ++GF A
Sbjct: 467 ALLHLAQREWLLGGLLLIRLGVLLLLVLLGAGVYFAMLLLLGFRVRGFAR 516
>gnl|CDD|217323 pfam03023, MVIN, MviN-like protein. Deletion of the mviN virulence
gene in Salmonella enterica serovar. Typhimurium greatly
reduces virulence in a mouse model of typhoid-like
disease. Open reading frames encoding homologues of MviN
have since been identified in a variety of bacteria,
including pathogens and non-pathogens and
plant-symbionts. In the nitrogen-fixing symbiont
Rhizobium tropici, mviN is required for motility. The
MviM protein is predicted to be membrane-associated.
Length = 452
Score = 98.5 bits (246), Expect = 9e-22
Identities = 50/152 (32%), Positives = 87/152 (57%), Gaps = 13/152 (8%)
Query: 2 AFAVVLGGALQIIIQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAAQI 61
A V++GG LQ+++Q+P L K G+ + N F++ V+R L P++ V +Q+
Sbjct: 161 AIGVLIGGVLQLLVQLPFLRKAGLLLKPRFN----FRDKGVKRFLALALPTLLGVSVSQL 216
Query: 62 SLMLNTNIASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIENNTEEYSA 121
+L+++T +AS + EGS+S L YADR+ + P + GV+ +T+LLP LS+ + + +E+
Sbjct: 217 NLLIDTALASFLAEGSISYLYYADRIYQLPLGIFGVSVSTVLLPRLSRHAKDGDWDEFLD 276
Query: 122 ILDWIPIFAHSGLALSIGLGACLHASFLYWYL 153
+LD A+ + L + ASF L
Sbjct: 277 LLDQ---------AIRLTLLLMIPASFGLLVL 299
Score = 51.9 bits (125), Expect = 8e-07
Identities = 26/87 (29%), Positives = 40/87 (45%), Gaps = 7/87 (8%)
Query: 126 IPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALW 185
+P GLAL+ L A ++ FLY+ LR + GW F LV+ L+ V L
Sbjct: 373 LPPLGVVGLALATSLSAWINLVFLYYILRKRLGGHSLRGWKTFLASLVVPTALMGGVILL 432
Query: 186 GNSYFNWLGMQAHPIFRIVALLLILLF 212
+S ++ + +LLLIL+
Sbjct: 433 LSSLT-------QGLWVVGSLLLILVG 452
>gnl|CDD|233536 TIGR01695, mviN, integral membrane protein MviN. This model
represents MviN, a family of integral membrane proteins
predicted to have ten or more transmembrane regions.
Although frequently listed as a virulence protein, it is
not restricted to pathogens and it is an essential
protein in Sinorhizobium meliloti. In a number of
species its gene is adjacent to that of the
uridylyltransferase GlnD , the signal-transducing enzyme
that performs the key modification to the nitrogen
regulatory protein PII [Unknown function, General].
Length = 502
Score = 96.6 bits (241), Expect = 6e-21
Identities = 47/142 (33%), Positives = 76/142 (53%), Gaps = 5/142 (3%)
Query: 2 AFAVVLGGALQIIIQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAAQI 61
A V++GG Q++IQ+P L K G + N ++R LK P+ A+QI
Sbjct: 186 AIGVLIGGVAQLLIQLPFLRKAGFLLKPRFNFRDP----GLKRFLKLFLPTTLGSSASQI 241
Query: 62 SLMLNTNIASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIENNTEEYSA 121
+L++NT +AS + GS+S L YA+R+ + P + G++ +T+LLP LS+ E N E
Sbjct: 242 TLLINTALASFLEIGSVSALYYANRIYQLPLGIFGISLSTVLLPKLSRHASEGNWNELRD 301
Query: 122 ILDW-IPIFAHSGLALSIGLGA 142
+L+ I + + S GL
Sbjct: 302 LLNQGIRLSLLLTIPSSFGLLI 323
Score = 65.0 bits (159), Expect = 6e-11
Identities = 24/108 (22%), Positives = 41/108 (37%), Gaps = 4/108 (3%)
Query: 126 IPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALW 185
I G+AL+ + + + LY L + I G +LVIA ++ V
Sbjct: 398 IFPLGLVGIALATSAASMVSSVLLYLMLNRRLKGILPFGVLKVLAKLVIASAIIGGVLYL 457
Query: 186 GNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFLFLNEAI 233
S + + + + L+ LL YFL L ++ LN +
Sbjct: 458 IVSVSLGGVLVKNLLGLLAIGLIGLLV----YFLGLALLKIEELNLLL 501
>gnl|CDD|218341 pfam04947, Pox_VLTF3, Poxvirus Late Transcription Factor VLTF3
like. Members of this family are approximately 26 KDa,
and are involved in trans-activator of late
transcription.
Length = 171
Score = 36.1 bits (84), Expect = 0.027
Identities = 27/137 (19%), Positives = 51/137 (37%), Gaps = 17/137 (12%)
Query: 381 KKIAKEIVHYRSITPITRTKQLVEIILKSIRGNKRYKNPATRTFQAIRIYINQELKNLSI 440
KK+ K + +T V LK + +K Y N F + + + NLS
Sbjct: 42 KKLKKYNIDTDDVT-----TTKVLDFLKKLGYSKYYDN----VFLILNMLTGKPPPNLSS 92
Query: 441 ALKIGFKKLNIKGRIVVISFHSLEDRIVKNFINFNTKIPHIDRRLPIYNYLFEPKMKFLA 500
L+ +++ + + F L R KNFI+++ + + L + Y L
Sbjct: 93 ELE---ERIKEIFKELQKFFDKLNTR--KNFISYSY---VLYKLLELLGYDENLPDFPLL 144
Query: 501 RCKPKKKLNIKKYEDYC 517
+ K + ++ C
Sbjct: 145 KSLTKLNSQDEIWKKLC 161
>gnl|CDD|225795 COG3256, NorB, Nitric oxide reductase large subunit [Inorganic ion
transport and metabolism].
Length = 717
Score = 36.3 bits (84), Expect = 0.064
Identities = 33/142 (23%), Positives = 44/142 (30%), Gaps = 14/142 (9%)
Query: 97 VTFNTILLPNLSKARIENNTEEYSAILDWIPIFAHSGLALSIGLGACLHASFLYWYLRHK 156
+ LLP L + E S L F G LG +W+
Sbjct: 315 LAAGLYLLPELGGPK---FQELGSPKLLIALFFVVVGSLAGAWLGYLQLLPAPFWFWFGH 371
Query: 157 RIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGIT 216
+ Y G G +L++ LLV VAL N R L+ ILL
Sbjct: 372 QGY-EYLGRG-RLWQLLLIKGLLVWVALMFR--ANV-TATKLKG-RKTTLIAILLLSLTG 425
Query: 217 YFLALRIMGFLFLNEAINWLNI 238
+ L F N W N+
Sbjct: 426 --IGL-FFLFGLYNP--EWTNL 442
>gnl|CDD|225546 COG3000, ERG3, Sterol desaturase [Lipid metabolism].
Length = 271
Score = 35.1 bits (81), Expect = 0.093
Identities = 27/114 (23%), Positives = 43/114 (37%), Gaps = 13/114 (11%)
Query: 123 LDWIPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGV----------FFIRL 172
L P+ LA+++ L A FL + W
Sbjct: 5 LWPAPLDFVLALAIALFLLAFGLHLFLLRLYLFPQE-PLRKAWPTRDWLLDLVLLTLSIS 63
Query: 173 VIALLLLVIVALW--GNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIM 224
V+ L+LL+ + LW Y LG+ P+ + LLL LF + Y+ A R++
Sbjct: 64 VVGLVLLLALILWALLGGYLPGLGVAPGPLPFALQLLLAFLFLDLGYYWAHRLL 117
>gnl|CDD|220569 pfam10101, DUF2339, Predicted membrane protein (DUF2339). This
domain, found in various hypothetical bacterial
proteins, has no known function.
Length = 738
Score = 35.8 bits (83), Expect = 0.098
Identities = 23/109 (21%), Positives = 33/109 (30%), Gaps = 27/109 (24%)
Query: 119 YSAILDWIPIFAHSGLALSIGLGACLHASFLYWYLRHKRI-YIPCA----GWGVFFIRLV 173
Y+ + L GL L A LR +R Y P A G ++ +
Sbjct: 24 YAIEAGLLGPELRVALGALAGLA--LLA--AGERLRRRRYAYFPAALQGGGLATLYLTVF 79
Query: 174 IALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALR 222
A L ++ VA L++L T LAL
Sbjct: 80 AAFHLYGLLPPT------------------VAFALLVLVTAGTVVLALL 110
Score = 33.5 bits (77), Expect = 0.48
Identities = 22/109 (20%), Positives = 37/109 (33%), Gaps = 7/109 (6%)
Query: 128 IFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGN 187
+ LAL++ A + Y + G+ + ++ALLLL L G
Sbjct: 526 LLTLLALALALVYLYRRRARWPLVYRAAAYLL-----LGLAPLAALLALLLLENPLLSGI 580
Query: 188 SYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFLFLNEAINWL 236
LG PI + L +LL + R+ + L +
Sbjct: 581 GGAAPLG--FLPILNPLLLAYLLLAPALLALWLRRLATPIRLKALRPAV 627
>gnl|CDD|223885 COG0815, Lnt, Apolipoprotein N-acyltransferase [Cell envelope
biogenesis, outer membrane].
Length = 518
Score = 33.5 bits (77), Expect = 0.38
Identities = 27/125 (21%), Positives = 41/125 (32%), Gaps = 28/125 (22%)
Query: 121 AILDWIPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLV 180
W+ L L++ GA +F P W + LL LV
Sbjct: 5 LARQWVLGRPARRLLLALVFGALFALAF------------PPPDWWWLAWFSLAPLLWLV 52
Query: 181 IVA------------LWGNSYF----NWLGMQAHPIFRIVALLLILLFCGITYFLALRIM 224
A L+G +F WLG ++A+ L LL + +LAL ++
Sbjct: 53 RGAPTSWEGLAKSGFLFGFGFFLAGFYWLGTSLGVGLGLLAVALPLLVLLLAAWLALFLL 112
Query: 225 GFLFL 229
L
Sbjct: 113 LVAVL 117
>gnl|CDD|218473 pfam05159, Capsule_synth, Capsule polysaccharide biosynthesis
protein. This family includes export proteins involved
in capsule polysaccharide biosynthesis, such as KpsS and
LipB.
Length = 268
Score = 33.1 bits (76), Expect = 0.45
Identities = 28/124 (22%), Positives = 45/124 (36%), Gaps = 15/124 (12%)
Query: 531 VIVTGVTASLDLIKTAVDMNADA--ILVHHGYFWKGENSNIVGIKKKRLEQLIINKINLY 588
V+V G I A I + G+ S +G L+++KI +Y
Sbjct: 3 VVVWGDKRPYHRIAAAAAKKKGIPVIRLEDGFL----RSLGLGSNGYPPYSLVLDKIGIY 58
Query: 589 AY-HLPLDM-----HPKLGNNAQLA---KILNFSCTRRFSKNNIGWIGKIINLKRYNFKK 639
LP D+ + + A LA ++N T + SK N+ L KK
Sbjct: 59 YDSSLPSDLEQLLSEKSVLDQAGLARAEYLMNRIVTHKLSKYNLRRAAPAFRLLLAKDKK 118
Query: 640 IITI 643
I+ +
Sbjct: 119 IVLV 122
>gnl|CDD|148050 pfam06212, GRIM-19, GRIM-19 protein. This family consists of
several eukaryotic gene associated with
retinoic-interferon-induced mortality 19 (GRIM-19)
proteins. GRIM-19, was reported to encode a small
protein primarily distributed in the nucleus and was
able to promote cell death induced by IFN-# and RA. A
bovine homologue of GRIM-19 was co-purified with
mitochondrial NADH:ubiquinone oxidoreductase (complex
I) in bovine heart. Therefore, its exact cellular
localisation and function are unclear. It has now been
discovered that GRIM-19 is a specific interacting
protein which negatively regulates Stat3 activity.
Length = 130
Score = 31.2 bits (71), Expect = 0.66
Identities = 10/33 (30%), Positives = 15/33 (45%)
Query: 33 PSHGFKNIAVRRILKKMGPSVFSVFAAQISLML 65
P G+ I +R L K G S +++FA
Sbjct: 11 PPGGYGPIPFKRNLPKTGFSGYAMFATGAGATA 43
>gnl|CDD|185281 PRK15383, PRK15383, type III secretion system protein; Provisional.
Length = 335
Score = 31.9 bits (72), Expect = 1.2
Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 5/69 (7%)
Query: 228 FLNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSLGNKI 287
+LN IN IE ERI GI++D F +G K L +L + R I + +++ L +
Sbjct: 75 YLNNIINAAIIEKERIIGIFVDGDFSKGQR--KALGKLEQNYRNIKVIYNSD---LNYSM 129
Query: 288 TDSRFSIIH 296
D + + I+
Sbjct: 130 YDKKLTTIY 138
>gnl|CDD|220473 pfam09925, DUF2157, Predicted membrane protein (DUF2157). This
domain, found in various hypothetical prokaryotic
proteins, has no known function.
Length = 144
Score = 30.7 bits (70), Expect = 1.2
Identities = 27/107 (25%), Positives = 35/107 (32%), Gaps = 22/107 (20%)
Query: 110 ARIENNTEEYSAILDWIPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVF- 168
I E A + I G AL +G G F+ W
Sbjct: 16 EAILARYGEDPARSSLVRILLWLG-ALLLGAGV---IFFVAAN------------WEEIP 59
Query: 169 -FIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCG 214
F++ + L LLV +AL G WL + P LLL L G
Sbjct: 60 RFVKFGLLLALLVGLALGGF----WLWRRGKPRLGEALLLLGALLFG 102
>gnl|CDD|225127 COG2217, ZntA, Cation transport ATPase [Inorganic ion transport and
metabolism].
Length = 713
Score = 32.2 bits (74), Expect = 1.3
Identities = 13/53 (24%), Positives = 21/53 (39%), Gaps = 2/53 (3%)
Query: 168 FFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLA 220
RL+IA LL + + L S LG P + +L + G ++
Sbjct: 87 LLRRLIIAGLLTLPLLLL--SLGLLLGAFLLPWVSFLLATPVLFYGGWPFYRG 137
>gnl|CDD|234462 TIGR04086, TIGR04086_membr, putative membrane protein, TIGR04086
family. Members of this family of strongly hydrophobic
putative transmembrane protein average about 125 amino
acids in length and occur mostly, but not exclusively,
in the Firmicutes. Members are quite diverse in
sequence. The function is unknown.
Length = 115
Score = 29.8 bits (68), Expect = 1.6
Identities = 19/94 (20%), Positives = 32/94 (34%), Gaps = 22/94 (23%)
Query: 164 GWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGIT------- 216
G+ ++ L+LL+I+AL YF L + ++ +L + G
Sbjct: 4 LKGLLVA-YILTLILLLILALL--LYFTDLSEGSLSWLILIIYILSVFIGGFIAGKKAGS 60
Query: 217 ------------YFLALRIMGFLFLNEAINWLNI 238
YFL L ++ LF L
Sbjct: 61 KGWLWGLLVGLLYFLILLLISLLFGPGPGFTLGQ 94
>gnl|CDD|237913 PRK15119, PRK15119, undecaprenyl-phosphate
alpha-N-acetylglucosaminyl 1-phosphate transferase;
Provisional.
Length = 365
Score = 31.6 bits (72), Expect = 1.7
Identities = 20/81 (24%), Positives = 33/81 (40%), Gaps = 17/81 (20%)
Query: 131 HSGL-----ALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFF------------IRLV 173
H GL +S+ G C +Y+ H +Y+ CAG VF IR
Sbjct: 43 HQGLIPLVGGISVYAGICFTFGITDYYIPHASLYLACAGVLVFVGALDDRFDISVKIRAT 102
Query: 174 IALLLLVIVALWGNSYFNWLG 194
I + +++ ++ Y + LG
Sbjct: 103 IQAAVGIVMMVFAGLYLSSLG 123
>gnl|CDD|240539 cd13134, MATE_like_8, Uncharacterized subfamily of the multidrug
and toxic compound extrusion (MATE) proteins. The
integral membrane proteins from the MATE family are
involved in exporting metabolites across the cell
membrane and are responsible for multidrug resistance
(MDR) in many bacteria and animals. A number of family
members are involved in the synthesis of peptidoglycan
components in bacteria.
Length = 438
Score = 31.3 bits (72), Expect = 1.7
Identities = 17/80 (21%), Positives = 33/80 (41%), Gaps = 10/80 (12%)
Query: 169 FIRLVIALLLLVIVALWGNSYFNWLGMQAHPI------FRIVALLLILLFCGITYFLALR 222
L++ L+ +++ +G +G+ + RIV L L +T LR
Sbjct: 87 LFNLLLGLVFSLLLLFFGAELLRLMGLPPELLALGSTYLRIVGGSLFLQALSLTLSAILR 146
Query: 223 IMGF----LFLNEAINWLNI 238
G ++++ +N LNI
Sbjct: 147 SHGRTKIPMYVSLIMNILNI 166
>gnl|CDD|212167 cd11653, rap1_RCT, C-terminal domain of RAP1 recruits proteins to
telomeres. The RAP1 (repressor activator protein 1)
C-terminal domain (RCT) mediates interactions with other
proteins such as TRF2 (human), Rif1, Rif2, Sir3, Sir4
(Saccharomyces cerevisiae), and Taz1
(Schizosaccharomyces pombe) at telomeres and other loci.
RAP1, identified in budding yeast as repressor/activator
protein 1, is a well-conserved telomere binding protein,
also found in fission yeast and mammals. In
Saccharomyces cerevisiae, RAP1 directly binds DNA and is
involved in transcriptional activation, gene silencing,
as well as binding at numerous sites at each telemore,
where it functions in telomere length regulation,
telomeric position effect gene silencing and telomere
end protection. Human RAP1 apparently does not bind
telomeric DNA directly, but binds telomere repeat
binding factor 2 (TRF2) via the RCT. RAP1 might act by
suppressing nonhomologous end-joining. Yeast RAP1 has
two myb-type DNA binding modules, and an RCT domain that
recruits Sir proteins 3 and 4 (Sir3, Sir4) for gene
silencing, and Rif1 and Rif2 for telomere length
maintenance. Schizosaccharomyces pombe RAP1 (spRap1),
like human RAP1, lacks direct DNA-binding activity and
is localized to telomeres via Taz1, an ortholog of TRF1
and TRF2. The S. pompe RCT resembles the first 3-helix
bundle of the yeast and human RCT forms, but is not
included in this larger model.
Length = 100
Score = 29.2 bits (66), Expect = 1.9
Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 16/60 (26%)
Query: 332 LRGFSFLLDGPLDMRMDITRGISASKW-------LANATEFNIKKVIQDYGEERFAKKIA 384
L +L+ P DM GI W L + I+++I+ +G E A++
Sbjct: 46 LNVLRYLVGIPPDMP-----GI----WTREDDECLQSDDPRTIERLIKKHGAENVARRWE 96
>gnl|CDD|224880 COG1969, HyaC, Ni,Fe-hydrogenase I cytochrome b subunit [Energy
production and conversion].
Length = 227
Score = 30.9 bits (70), Expect = 2.0
Identities = 23/91 (25%), Positives = 30/91 (32%), Gaps = 23/91 (25%)
Query: 151 WYL------RHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSY--------------F 190
WYL K + P A FF L+I ++L AL+G Y F
Sbjct: 113 WYLFLGKKPHTKGGHNPIAQVAYFFYFLMIVFMILTGFALYGEHYHYGLGAPLRYVFEWF 172
Query: 191 NWLG---MQAHPIFRIVALLLILLFCGITYF 218
W G H R+ L+I Y
Sbjct: 173 EWTGGGSADVHFWHRLGMWLIIAFVIVHVYM 203
>gnl|CDD|172166 PRK13603, PRK13603, fumarate reductase subunit C; Provisional.
Length = 126
Score = 29.6 bits (66), Expect = 2.2
Identities = 26/80 (32%), Positives = 37/80 (46%), Gaps = 17/80 (21%)
Query: 150 YW------YLRHKRIYIPC--AGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHP-- 199
YW YLR I C W V ++ +L+L V GNSY +L A+P
Sbjct: 11 YWWARRRSYLRFMLREISCIFVAWFVLYL-----VLVLRAVGAGGNSYQRFLDFSANPVV 65
Query: 200 -IFRIVALLLILLFCGITYF 218
+ +VAL +LL +T+F
Sbjct: 66 VVLNVVALSFLLLH-AVTWF 84
>gnl|CDD|237526 PRK13837, PRK13837, two-component VirA-like sensor kinase;
Provisional.
Length = 828
Score = 31.2 bits (71), Expect = 2.3
Identities = 11/55 (20%), Positives = 18/55 (32%)
Query: 225 GFLFLNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTE 279
GF L AI W++ ER + + +D + L +I
Sbjct: 726 GFSTLAAAIAWISKGPERFDLVLVDDRLLDEEQAAAALHAAAPTLPIILGGNSKT 780
>gnl|CDD|179400 PRK02261, PRK02261, methylaspartate mutase subunit S; Provisional.
Length = 137
Score = 29.9 bits (68), Expect = 2.4
Identities = 13/28 (46%), Positives = 17/28 (60%), Gaps = 1/28 (3%)
Query: 530 NVIVTGVTASL-DLIKTAVDMNADAILV 556
VI GV S + I A++ +ADAILV
Sbjct: 33 EVINLGVMTSQEEFIDAAIETDADAILV 60
>gnl|CDD|217153 pfam02628, COX15-CtaA, Cytochrome oxidase assembly protein. This
is a family of integral membrane proteins. CtaA is
required for cytochrome aa3 oxidase assembly in Bacillus
subtilis. COX15 is required for cytochrome c oxidase
assembly in yeast.
Length = 301
Score = 30.7 bits (70), Expect = 2.6
Identities = 21/121 (17%), Positives = 29/121 (23%), Gaps = 29/121 (23%)
Query: 130 AHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIV------- 182
AH LA + L L + L + LLLL I
Sbjct: 133 AHLLLAFLL-LALLLWLALRLREPDPAPAPAAGPRRLRGLALLALVLLLLQIALGALVAG 191
Query: 183 --------------------ALWGNSYFNWLGMQ-AHPIFRIVALLLILLFCGITYFLAL 221
W N + N +Q H + + LL+L
Sbjct: 192 LDAGLACNTFPLMGPDFPAEPDWRNFFENPATVQFLHRLGAYLTFLLLLALAVAAARRRR 251
Query: 222 R 222
R
Sbjct: 252 R 252
Score = 29.9 bits (68), Expect = 5.1
Identities = 27/118 (22%), Positives = 36/118 (30%), Gaps = 24/118 (20%)
Query: 125 WIPIFAHSGLALSI------GLGACLHASFLY-WYLRHKRIYIPCAGWGVFFI------- 170
+ A L L + L A L A + + W FF
Sbjct: 167 RLRGLALLALVLLLLQIALGALVAGLDAGLACNTFPLMGPDFPAEPDWRNFFENPATVQF 226
Query: 171 --RLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFC----GI-TYFLAL 221
RL L L+++AL + + R LLL LL GI T L L
Sbjct: 227 LHRLGAYLTFLLLLALA---VAAARRRRRPALRRAALLLLALLLLQVALGILTVLLGL 281
>gnl|CDD|222102 pfam13398, Peptidase_M50B, Peptidase M50B-like. This is a family
of bacterial and plant peptidases in the same family as
MEROPS:M50B.
Length = 201
Score = 30.2 bits (69), Expect = 2.6
Identities = 22/112 (19%), Positives = 36/112 (32%), Gaps = 35/112 (31%)
Query: 131 HSGLALSIG---------------LGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIA 175
SGL +S G LG L L AG L +
Sbjct: 52 GSGLTVSRGRPGGLGMFLTLAAGYLGPALWGLLLILLAF--------AGRARAL--LWLG 101
Query: 176 LLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFL 227
+ LL+++ LW N G ++ LL L + + +++ F+
Sbjct: 102 VALLLVLLLWVR---NLFG-------LLILLLTGALLLALWWLAPPQVLAFV 143
>gnl|CDD|205007 pfam12670, DUF3792, Protein of unknown function (DUF3792). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 130 amino acids in length.
These proteins are integral membrane proteins.
Length = 117
Score = 29.4 bits (67), Expect = 2.8
Identities = 18/86 (20%), Positives = 30/86 (34%), Gaps = 21/86 (24%)
Query: 172 LVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFC------------------ 213
++ L+LL+I+AL YF L ++ LL +
Sbjct: 16 YIVTLILLLILALL--LYFTDLSESVLSWLIVIITLLSVFLGGFVAGKKAGSKGWLWGLL 73
Query: 214 -GITYFLALRIMGFLFLNEAINWLNI 238
G+ YFL L ++ L + L
Sbjct: 74 VGLLYFLILLLISLLGGGGFFSGLEQ 99
>gnl|CDD|206279 pfam14110, DUF4282, Domain of unknown function (DUF4282). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 93 and 155
amino acids in length. There is a single completely
conserved residue E that may be functionally important.
Length = 89
Score = 28.7 bits (65), Expect = 2.9
Identities = 11/56 (19%), Positives = 24/56 (42%)
Query: 169 FIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIM 224
I+++ L L++I + F L A+ + LL+ + L +R++
Sbjct: 13 IIKVLYWLGLILIGLFGLSGLFGGLATMAYSPGGGLLALLLGPLGFLLGLLLVRVV 68
>gnl|CDD|219117 pfam06638, Strabismus, Strabismus protein. This family consists of
several strabismus (STB) or Van Gogh-like (VANGL)
proteins 1 and 2. The exact function of this family is
unknown. It is thought, however that STB1 gene and STB2
may be potent tumour suppressor gene candidates.
Length = 507
Score = 30.8 bits (70), Expect = 3.0
Identities = 30/109 (27%), Positives = 46/109 (42%), Gaps = 18/109 (16%)
Query: 128 IFAHSGLALSIGLGACLHA-SF-----LYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVI 181
+ L +GL A + +F + W R + C G I L LL+L+I
Sbjct: 92 RYLGLALGGLLGLLAFVTPLAFIVLPQVGWRDRLEPCGTACEG---LLISLAFKLLILLI 148
Query: 182 VALWGNSYFNWLGMQAHP---IFRIVALLLILLFCGITY--FLALRIMG 225
A W + F P +FR + L+L+ LF +Y F +RI+G
Sbjct: 149 -ASW--ALFLRPPRADMPRIFVFRALLLVLVFLFL-FSYWLFYGVRILG 193
>gnl|CDD|219354 pfam07264, EI24, Etoposide-induced protein 2.4 (EI24). This family
contains a number of eukaryotic etoposide-induced 2.4
(EI24) proteins approximately 350 residues long as well
as bacterial CysZ proteins (formerly known as DUF540).
In cells treated with the cytotoxic drug etoposide, EI24
is induced by p53. It has been suggested to play an
important role in negative cell growth control.
Length = 205
Score = 30.2 bits (69), Expect = 3.2
Identities = 16/92 (17%), Positives = 30/92 (32%), Gaps = 17/92 (18%)
Query: 152 YLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHP---------IFR 202
L R+ + + L++ LLL + + S L +
Sbjct: 7 DLLSPRL------RRLVLLPLLLNLLLFAGLLYFLFSLLGPLLDALLSLLWLGWLSWLLW 60
Query: 203 IVALLLILLFCGIT-YFLALRIMGFLFLNEAI 233
I+A+LL+L+ +A I F +
Sbjct: 61 ILAVLLLLVLLWFLFVVVASLIAAP-FWTPDL 91
>gnl|CDD|150723 pfam10081, DUF2319, Predicted membrane protein (DUF2319). Members
of this family of hypothetical bacterial proteins have
no known function.
Length = 534
Score = 30.7 bits (70), Expect = 3.3
Identities = 27/115 (23%), Positives = 41/115 (35%), Gaps = 23/115 (20%)
Query: 132 SGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVAL------- 184
SG++ +IG G + +L+ YL + RLV L + L
Sbjct: 38 SGVSAAIGYGIGVFLRWLWRYLELPEL----RPRPRRGARLVAVALGAAVTVLMLVQAAR 93
Query: 185 WGNSYFNWLGMQ----AHPIFRI-----VALLLILLFCGITY---FLALRIMGFL 227
W NS +GM A + + V L+ L + FLA R+ L
Sbjct: 94 WQNSVRALMGMPPLETAAYLRGVLVALLVFGALVALGRLLRRLIRFLARRLQRVL 148
>gnl|CDD|163613 cd00144, MPP_PPP_family, phosphoprotein phosphatases of the
metallophosphatase superfamily, metallophosphatase
domain. The PPP (phosphoprotein phosphatase) family is
one of two known protein phosphatase families specific
for serine and threonine. This family includes: PP1,
PP2A, PP2B (calcineurin), PP4, PP5, PP6, PP7, Bsu1,
RdgC, PrpE, PrpA/PrpB, and ApA4 hydrolase. The PPP
catalytic domain is defined by three conserved motifs
(-GDXHG-, -GDXVDRG- and -GNHE-). The PPP enzyme family
is ancient with members found in all eukaryotes, and in
most bacterial and archeal genomes. Dephosphorylation
of phosphoserines and phosphothreonines on target
proteins plays a central role in the regulation of many
cellular processes. PPPs belong to the
metallophosphatase (MPP) superfamily. MPPs are
functionally diverse, but all share a conserved domain
with an active site consisting of two metal ions
(usually manganese, iron, or zinc) coordinated with
octahedral geometry by a cage of histidine, aspartate,
and asparagine residues. The MPP superfamily includes:
Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
debranching enzymes, YfcE-like phosphodiesterases,
purple acid phosphatases (PAPs), YbbF-like
UDP-2,3-diacylglucosamine hydrolases, and acid
sphingomyelinases (ASMases). The conserved domain is a
double beta-sheet sandwich with a di-metal active site
made up of residues located at the C-terminal side of
the sheets. This domain is thought to allow for
productive metal coordination.
Length = 225
Score = 30.1 bits (68), Expect = 3.8
Identities = 10/28 (35%), Positives = 15/28 (53%), Gaps = 1/28 (3%)
Query: 295 IHNCFTELDIILKKYNIKKIDGILFDLG 322
IH C +L +L+K D ++F LG
Sbjct: 6 IHGCLDDLLRLLEKIGFPPNDKLIF-LG 32
>gnl|CDD|221252 pfam11833, DUF3353, Protein of unknown function (DUF3353). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 205 to 258
amino acids in length.
Length = 193
Score = 29.5 bits (67), Expect = 4.7
Identities = 25/85 (29%), Positives = 37/85 (43%), Gaps = 14/85 (16%)
Query: 133 GLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLL--LVIVALWGNSYF 190
L L++G GAC +Y+ R R + W + LV+ LLL L+ V L
Sbjct: 117 QLQLALGTGAC-----IYFLNRKGRRFGRALLWSLGG--LVVGLLLGSLLAVLLPPFILP 169
Query: 191 NWLG---MQAHPIFRIVALLLILLF 212
L +Q+ P + + L L LF
Sbjct: 170 GSLSPEQIQSLPAY--LLLWLGSLF 192
>gnl|CDD|133453 cd05311, NAD_bind_2_malic_enz, NAD(P) binding domain of malic
enzyme (ME), subgroup 2. Malic enzyme (ME), a member of
the amino acid dehydrogenase (DH)-like domain family,
catalyzes the oxidative decarboxylation of L-malate to
pyruvate in the presence of cations (typically Mg++ or
Mn++) with the concomitant reduction of cofactor NAD+ or
NADP+. ME has been found in all organisms, and plays
important roles in diverse metabolic pathways such as
photosynthesis and lipogenesis. This enzyme generally
forms homotetramers. The conversion of malate to
pyruvate by ME typically involves oxidation of malate to
produce oxaloacetate, followed by decarboxylation of
oxaloacetate to produce pyruvate and CO2. This
subfamily consists primarily of archaeal and bacterial
ME. Amino acid DH-like NAD(P)-binding domains are
members of the Rossmann fold superfamily and include
glutamate, leucine, and phenylalanine DHs, methylene
tetrahydrofolate DH, methylene-tetrahydromethanopterin
DH, methylene-tetrahydropholate DH/cyclohydrolase,
Shikimate DH-like proteins, malate oxidoreductases, and
glutamyl tRNA reductase. Amino acid DHs catalyze the
deamination of amino acids to keto acids with NAD(P)+ as
a cofactor. The NAD(P)-binding Rossmann fold superfamily
includes a wide variety of protein families including
NAD(P)- binding domains of alcohol DHs,
tyrosine-dependent oxidoreductases,
glyceraldehyde-3-phosphate DH, lactate/malate DHs,
formate/glycerate DHs, siroheme synthases,
6-phosphogluconate DH, amino acid DHs, repressor rex,
NAD-binding potassium channel domain, CoA-binding, and
ornithine cyclodeaminase-like domains. These domains
have an alpha-beta-alpha configuration. NAD binding
involves numerous hydrogen and van der Waals contacts.
Length = 226
Score = 29.5 bits (67), Expect = 5.1
Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 10/69 (14%)
Query: 326 NQINNELRGFSFLLDGPLDMRM-DITRG--ISASKWLANATEFNIKKVIQDYGEERFAKK 382
NQ+NN L GF + G LD+R IT ++A++ +A+ E + GEE
Sbjct: 156 NQVNNVL-GFPGIFRGALDVRATKITEEMKLAAAEAIADLAE------EEVLGEEYIIPT 208
Query: 383 IAKEIVHYR 391
V R
Sbjct: 209 PFDPRVVPR 217
>gnl|CDD|224596 COG1682, TagG, ABC-type polysaccharide/polyol phosphate export
systems, permease component [Carbohydrate transport and
metabolism / Cell envelope biogenesis, outer membrane].
Length = 263
Score = 29.5 bits (67), Expect = 5.2
Identities = 23/95 (24%), Positives = 41/95 (43%), Gaps = 8/95 (8%)
Query: 135 ALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLG 194
A+S G G+ + + L + + +P A +I L++++I + +W
Sbjct: 85 AISEGAGSVVANAALIKKINFPPLILPVARTLSRLFNFLIHLIIILIFLIILGVEPSW-- 142
Query: 195 MQAHPIFRIVAL-LLILLFCGITYFLALRIMGFLF 228
H + + AL LLIL G+ LA +G F
Sbjct: 143 ---HWLLLLPALLLLILFSVGLGLILA--SLGVRF 172
>gnl|CDD|148632 pfam07136, DUF1385, Protein of unknown function (DUF1385). This
family contains a number of hypothetical bacterial
proteins of unknown function approximately 300 residues
in length. Some family members are predicted to be
metal-dependent.
Length = 235
Score = 29.5 bits (67), Expect = 5.7
Identities = 11/43 (25%), Positives = 23/43 (53%)
Query: 164 GWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVAL 206
GW ++R++ +LLL +VA + G +P+ +I++
Sbjct: 160 GWDPLWLRILSRILLLPVVAGLSYEIIKFAGKSQNPLVKILSA 202
>gnl|CDD|215222 PLN02397, PLN02397, aspartate transaminase.
Length = 423
Score = 29.9 bits (68), Expect = 5.8
Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 2/33 (6%)
Query: 135 ALSIGLGAC-LHASFLYWYLRHKRIYIPCAGWG 166
LS G G+ L A FL + IYIP WG
Sbjct: 122 CLS-GTGSLRLGAEFLARFYPGSTIYIPNPTWG 153
>gnl|CDD|220626 pfam10192, GpcrRhopsn4, Rhodopsin-like GPCR transmembrane domain.
This region of 270 amino acids is the seven
transmembrane alpha-helical domains included within five
GPCRRHODOPSN4 motifs of a G-protein-coupled-receptor
(GPCR) protein, conserved from nematodes to humans.
GPCRs are integral membrane receptors whose
intracellular actions are mediated by signalling
pathways involving G proteins and downstream secondary
messengers.
Length = 256
Score = 29.5 bits (67), Expect = 6.2
Identities = 17/97 (17%), Positives = 37/97 (38%), Gaps = 16/97 (16%)
Query: 138 IGLGACLHASFLYWYLRHK-------RIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYF 190
I L + WY+++ +Y G+ + +R+V+A+ L+G +
Sbjct: 141 IVLIVVFQVILVIWYIKYFFDPASKFHLYESWPGYLLLALRIVLAIYF-----LYGL--Y 193
Query: 191 NWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFL 227
N + P + L G +FL+ ++ +
Sbjct: 194 NTYKKEKSPAKK--QFYLKFGIVGTLWFLSFPVIVII 228
>gnl|CDD|200943 pfam00032, Cytochrom_B_C, Cytochrome b(C-terminal)/b6/petD.
Length = 102
Score = 27.8 bits (63), Expect = 6.6
Identities = 19/72 (26%), Positives = 37/72 (51%), Gaps = 10/72 (13%)
Query: 148 FLYWY--LRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVA 205
FL+ Y LR IP GV I +V+++L+L ++ S ++ P+F+I+
Sbjct: 17 FLFAYAILRS----IPNKLGGV--IAMVLSILILFLLPFLDRSKVR--SLRFRPLFKILF 68
Query: 206 LLLILLFCGITY 217
+ ++ F +T+
Sbjct: 69 WIFVVNFLLLTW 80
>gnl|CDD|222372 pfam13777, DUF4173, Domain of unknown function (DUF4173). This
domain of unknown function contains multiple predicted
transmembrane domains.
Length = 188
Score = 29.0 bits (66), Expect = 7.0
Identities = 12/66 (18%), Positives = 24/66 (36%), Gaps = 13/66 (19%)
Query: 153 LRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLF 212
L + RI F+ L+ LLLV+ ++ WL R +++
Sbjct: 97 LTYLRIL---VLA---FMLLLAIGLLLVLAKIFRGKSNRWL-------LRRNLAAGVVVL 143
Query: 213 CGITYF 218
+++
Sbjct: 144 YILSFV 149
>gnl|CDD|239023 cd02072, Glm_B12_BD, B12 binding domain of glutamate mutase (Glm).
Glutamate mutase catalysis the conversion of
(S)-glutamate with (2S,3S)-3-methylaspartate. The
rearrangement reaction is initiated by the extraction of
a hydrogen from the protein-bound substrate by a
5'-desoxyadenosyl radical, which is generated by the
homolytic cleavage of the organometallic bond of the
cofactor B12. Glm is a heterotetrameric molecule
consisting of two alpha and two epsilon polypeptide
chains.
Length = 128
Score = 28.2 bits (63), Expect = 7.6
Identities = 13/35 (37%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Query: 527 SNINVIVTGV-TASLDLIKTAVDMNADAILVHHGY 560
+ NV+ GV + + I A++ +ADAILV Y
Sbjct: 26 AGFNVVNLGVLSPQEEFIDAAIETDADAILVSSLY 60
>gnl|CDD|216972 pfam02322, Cyto_ox_2, Cytochrome oxidase subunit II. This Family
consists of cytochrome bd type terminal oxidases that
catalyzes Quinol dependent, Na+ independent oxygen
uptake. Members of this family are integral membrane
proteins andi contain a protohaem IX centre B558. One
member of the family,cydB from Klebsiella pneumoniae, is
implicated in having an important role in micro-aerobic
nitrogen fixation in the enteric bacterium Klebsiella
pneumoniae.
Length = 327
Score = 29.4 bits (67), Expect = 7.9
Identities = 27/144 (18%), Positives = 46/144 (31%), Gaps = 16/144 (11%)
Query: 91 PTTLLGVTFNTILLPNLSKARIENNTEEYSA-ILDWIPIFA-HSGLALSIGLGACLHASF 148
L GV ++ + + Y+ D + FA GLA+ L +
Sbjct: 125 AAFLQGVALGNLVQG------VPFDAGNYAGGFFDLLNPFALLGGLAVV--ALYALLGAT 176
Query: 149 LYWYLRHK---RIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVA 205
+L K + +++ L LV V + + + + P I+
Sbjct: 177 ---WLILKTEGELQERARRLAKKLGLVLLVLFALVSVWVPLAAPALFANWFSSPWLLILP 233
Query: 206 LLLILLFCGITYFLALRIMGFLFL 229
L +L L LR G FL
Sbjct: 234 ALPVLAALLAVLLLRLRREGLAFL 257
>gnl|CDD|226960 COG4597, BatB, ABC-type amino acid transport system, permease
component [Amino acid transport and metabolism].
Length = 397
Score = 29.4 bits (66), Expect = 8.1
Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 6/76 (7%)
Query: 148 FLYWYLRHKRIYIPCAGWG----VFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRI 203
+L ++ +Y P WG F + LV+A++ V +A W + G Q
Sbjct: 166 PDLIFLSNRGLYFPSPQWGEGFIAFILALVMAIVASVFLARWAKTRQIATGQQFRTW--P 223
Query: 204 VALLLILLFCGITYFL 219
A +LI+ + +L
Sbjct: 224 TAAVLIIGLPLLAQWL 239
>gnl|CDD|224748 COG1835, COG1835, Predicted acyltransferases [Lipid metabolism].
Length = 386
Score = 29.4 bits (66), Expect = 8.2
Identities = 20/112 (17%), Positives = 32/112 (28%), Gaps = 4/112 (3%)
Query: 119 YSAILDWIPIFAHSGLAL-SIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALL 177
Y + + AL + L L + FL +G + L
Sbjct: 162 YLLFPLLLLLLRRVRFALALLLLLLALASFFLL-LALGPFATGAALYYGTA--TRLWEFL 218
Query: 178 LLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFLFL 229
L + ALW W + A + + LL+L I L +
Sbjct: 219 LGALAALWRPKRRPWPALLAAALALLGLALLLLAVALIPGSSPFFPGDLLLV 270
>gnl|CDD|238227 cd00386, Heme_Cu_Oxidase_III_like, Heme-copper oxidase subunit III.
Heme-copper oxidases are transmembrane protein
complexes in the respiratory chains of prokaryotes and
mitochondria which couple the reduction of molecular
oxygen to water to, proton pumping across the membrane.
The heme-copper oxidase superfamily is diverse in terms
of electron donors, subunit composition, and heme types.
This superfamily includes cytochrome c and ubiquinol
oxidases. Bacterial oxidases typically contain 3 or 4
subunits in contrast to the 13 subunit bovine cytochrome
c oxidase (CcO). Subunits I, II, and III of mammalian
CcO are encoded within the mitochondrial genome and the
remaining 10 subunits are encoded within the nuclear
genome. Subunits I, II and III of ubiquinol oxidase are
homologous to the corresponding subunits in CcO. This
group additionally contains proteins which are fusions
between subunits I and III, such as Sulfolobus
acidocaldarius SoxM, a subunit of the SoxM terminal
oxidase complex. It also includes NorE which has been
speculated to be a subunit of nitric oxide reductase.
Some archaebacterial cytochrome oxidases lack subunit
III. Although not required for catalytic activity,
subunit III is believed to play a role in assembly of
the multimer complex. Rhodobacter CcO subunit III
stabilizes the integrity of the binuclear center in
subunit I. It has been proposed that archaea acquired
heme-copper oxidases through gene transfer from
gram-positive bacteria.
Length = 183
Score = 28.7 bits (65), Expect = 8.2
Identities = 19/85 (22%), Positives = 28/85 (32%), Gaps = 8/85 (9%)
Query: 146 ASFL--YWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRI 203
SF Y++ R P +G L + LL + L G+S A
Sbjct: 25 GSFFWAYFHSRL----SPPVEFGAGLDPLDLPLLNTNTLLLSGSSVTWAHASLAARRGNR 80
Query: 204 VALLLILLFCGI--TYFLALRIMGF 226
L LL + FL L+ +
Sbjct: 81 KKARLWLLLTILLGLAFLGLQAYEY 105
>gnl|CDD|223674 COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport
systems, permease components [Amino acid transport and
metabolism / Inorganic ion transport and metabolism].
Length = 317
Score = 29.0 bits (66), Expect = 8.6
Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 7/51 (13%)
Query: 168 FFIRLVIALLLLVIVALWGNSYFNWLGM------QAHPIFRIVALLLILLF 212
+ +++ + L ++ AL S+ + L + P F + LLLILLF
Sbjct: 106 LILAVLLGIPLGILAALKRGSWIDRLLRVLSLLGISIPSF-WLGLLLILLF 155
>gnl|CDD|224055 COG1132, MdlB, ABC-type multidrug transport system, ATPase and
permease components [Defense mechanisms].
Length = 567
Score = 29.3 bits (66), Expect = 8.6
Identities = 13/85 (15%), Positives = 31/85 (36%), Gaps = 5/85 (5%)
Query: 148 FLYWYLRHKRIYI-PCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVAL 206
L YL++K + + + L++ LL+ I+ ++ LG + ++ L
Sbjct: 6 RLLKYLKYKLLLLAILLLLLSALLSLLLPLLIGRII----DALLADLGELLELLLLLLLL 61
Query: 207 LLILLFCGITYFLALRIMGFLFLNE 231
L+ +G + +
Sbjct: 62 ALLGGVLRALQSYLGSRLGQKIVAD 86
>gnl|CDD|223905 COG0835, CheW, Chemotaxis signal transduction protein [Cell
motility and secretion / Signal transduction
mechanisms].
Length = 165
Score = 28.4 bits (64), Expect = 9.2
Identities = 14/41 (34%), Positives = 21/41 (51%), Gaps = 3/41 (7%)
Query: 375 GEERFAKKIAK--EIVHYRSITPITRTKQLVEIILKSIRGN 413
G+E + I K EIV Y ITP+ V ++ ++RG
Sbjct: 26 GDEEYGIDILKVREIVGYPPITPVPNAPPYVLGVI-NLRGE 65
>gnl|CDD|224287 COG1368, MdoB, Phosphoglycerol transferase and related proteins,
alkaline phosphatase superfamily [Cell envelope
biogenesis, outer membrane].
Length = 650
Score = 29.3 bits (66), Expect = 9.9
Identities = 19/70 (27%), Positives = 25/70 (35%), Gaps = 7/70 (10%)
Query: 167 VFFIRLVIALLLL------VIVALWGNSYFNWLGMQAHPIFR-IVALLLILLFCGITYFL 219
F I LV L LL + L F L + IV+LLL+LL L
Sbjct: 20 GFGILLVFLLWLLTLLIYFLGFVLPILLLFQGLRLLFSLPILFIVSLLLLLLLFKGVDAL 79
Query: 220 ALRIMGFLFL 229
+ + L
Sbjct: 80 NIFRLILALL 89
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.325 0.141 0.422
Gapped
Lambda K H
0.267 0.0765 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 38,698,822
Number of extensions: 4044010
Number of successful extensions: 6597
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6486
Number of HSP's successfully gapped: 278
Length of query: 733
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 629
Effective length of database: 6,324,786
Effective search space: 3978290394
Effective search space used: 3978290394
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 63 (28.0 bits)