RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1769
(1662 letters)
>gnl|CDD|100107 cd02440, AdoMet_MTases, S-adenosylmethionine-dependent
methyltransferases (SAM or AdoMet-MTase), class I;
AdoMet-MTases are enzymes that use
S-adenosyl-L-methionine (SAM or AdoMet) as a substrate
for methyltransfer, creating the product
S-adenosyl-L-homocysteine (AdoHcy). There are at least
five structurally distinct families of AdoMet-MTases,
class I being the largest and most diverse. Within this
class enzymes can be classified by different substrate
specificities (small molecules, lipids, nucleic acids,
etc.) and different target atoms for methylation
(nitrogen, oxygen, carbon, sulfur, etc.).
Length = 107
Score = 62.1 bits (151), Expect = 2e-11
Identities = 29/107 (27%), Positives = 49/107 (45%), Gaps = 10/107 (9%)
Query: 1430 VLEVGCGMGLLSLFCAEAGAKHVISVDCS--VITQLTQEVVEENDCSDVITVICRRMEDI 1487
VL++GCG G L+L A V VD S + + +D + V+ E++
Sbjct: 2 VLDLGCGTGALALALASGPGARVTGVDISPVALELARK--AAAALLADNVEVLKGDAEEL 59
Query: 1488 DRLPHGIENVDIIVSNWMGHVLYLDSLINAVV-YARDRFLKPHGLIL 1533
P E+ D+I+S+ H +L + + AR LKP G+++
Sbjct: 60 --PPEADESFDVIISDPPLH--HLVEDLARFLEEARRL-LKPGGVLV 101
>gnl|CDD|222295 pfam13659, Methyltransf_26, Methyltransferase domain. This family
contains methyltransferase domains.
Length = 117
Score = 62.1 bits (151), Expect = 2e-11
Identities = 28/116 (24%), Positives = 45/116 (38%), Gaps = 12/116 (10%)
Query: 1430 VLEVGCGMGLLSLFCAEAGAK-HVISVDCS-VITQLTQEVVEENDCSDVITVICRRMEDI 1487
VL+ G G G L A AG V+ V+ L + + + + V+ ++
Sbjct: 4 VLDPGAGSGAFLLAAARAGPDARVVGVELDPEAAALARRRLALAGLAPRVRVVVGDAREL 63
Query: 1488 DRLPHGIENVDIIVSNW-----MGHVLYLDSLINAVVYARDRFLKPHG---LILPD 1535
LP G + D+++ N G L + + A R LKP G +I P
Sbjct: 64 LELPDG--SFDLVLGNPPYGPRAGDPKDNRDLYDRFLAAALRLLKPGGVLVVITPA 117
>gnl|CDD|234786 PRK00517, prmA, ribosomal protein L11 methyltransferase; Reviewed.
Length = 250
Score = 55.2 bits (134), Expect = 6e-08
Identities = 38/117 (32%), Positives = 54/117 (46%), Gaps = 25/117 (21%)
Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVI----TQLTQEVVEENDCSDVI 1477
+L K VL+VGCG G+L++ A+ GAK V++VD I + +E E N +
Sbjct: 115 KLVLPGKTVLDVGCGSGILAIAAAKLGAKKVLAVD---IDPQAVEAARENAELNGVELNV 171
Query: 1478 TVICRRMEDIDRLPHGIENVDIIVSNWMGHVLYLDSLINAVVYARDRFLKPHG-LIL 1533
LP G D+IV+N + + L L + R LKP G LIL
Sbjct: 172 Y-----------LPQGDLKADVIVANILANP--LLELAPDLA----RLLKPGGRLIL 211
>gnl|CDD|219759 pfam08241, Methyltransf_11, Methyltransferase domain. Members of
this family are SAM dependent methyltransferases.
Length = 92
Score = 50.0 bits (120), Expect = 2e-07
Identities = 28/104 (26%), Positives = 46/104 (44%), Gaps = 14/104 (13%)
Query: 1431 LEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITVICRRMEDIDRL 1490
L+VGCG GLL+ A G V VD L+ E++ + D + L
Sbjct: 1 LDVGCGTGLLAEALARRGGARVTGVD------LSPEMLALARKRAPRKFVV---GDAEDL 51
Query: 1491 PHGIENVDIIVSNWM-GHVLYLDSLINAVVYARDRFLKPHGLIL 1533
P E+ D++VS+ + H+ + + + R LKP G ++
Sbjct: 52 PFPDESFDVVVSSLVLHHLPDPERALREIA----RVLKPGGKLV 91
>gnl|CDD|218990 pfam06325, PrmA, Ribosomal protein L11 methyltransferase (PrmA).
This family consists of several Ribosomal protein L11
methyltransferase (EC:2.1.1.-) sequences.
Length = 294
Score = 54.2 bits (131), Expect = 2e-07
Identities = 37/157 (23%), Positives = 62/157 (39%), Gaps = 37/157 (23%)
Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCS-VITQLTQEVVEENDCSDVITVI 1480
S + + VL+VGCG G+L++ + GAK V+ VD V + +E E N + V
Sbjct: 156 SLVKPGETVLDVGCGSGILAIAALKLGAKKVVGVDIDPVAVRAAKENAELNGVEAQLEVY 215
Query: 1481 CRRMEDIDRLPHGI--ENVDIIVSNWMGHVLYLDSLINAVVYARDRFLKPHGL-----IL 1533
LP + D++V+N + L L + +KP G IL
Sbjct: 216 ---------LPGDLPEGKADVVVANILADP--LIELAPDIY----ALVKPGGYLILSGIL 260
Query: 1534 PDRAELYCVAANDTMAATKYSFWHDVYGFDMEPIQRD 1570
++A+ A YS F++ ++
Sbjct: 261 EEQAD---------DVAEAYSQG-----FELITVEER 283
>gnl|CDD|223574 COG0500, SmtA, SAM-dependent methyltransferases [Secondary
metabolites biosynthesis, transport, and catabolism /
General function prediction only].
Length = 257
Score = 53.4 bits (125), Expect = 3e-07
Identities = 38/284 (13%), Positives = 83/284 (29%), Gaps = 41/284 (14%)
Query: 1370 MDIDDMTSREIFEDVYADIKTHQTTYKDKQLIGCFHDSIINNSHLFKVRKPNSHLLKDKI 1429
+ ++Y + + D ++ L L
Sbjct: 1 DSLLSAELLSRILELYDRLAELLDAFLLLAEE--LLDLLLVLRLL-------RLLPGGLG 51
Query: 1430 VLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITVICRRMEDIDR 1489
VL++GCG G L+L A G + V + ++ + + + V + +
Sbjct: 52 VLDIGCGTGRLALL-ARLGGRGAYVVGVDLSPEMLALARARAEGAGLGLVDFVVADALGG 110
Query: 1490 LPH--GIENVDIIVSNWMGHVLYLDSLINAVVYARDRFLKPHGLILPDRAELYCVAANDT 1547
+ + D+++S + H+L + ++ R LKP G ++
Sbjct: 111 VLPFEDSASFDLVISLLVLHLLPPAKALRELL----RVLKPGGRLV-------------- 152
Query: 1548 MAATKYSFWHDVYGFDMEPIQRDLPNIAKFHPVPGDKVMTDSILIHSIDLNTCSVDDTSF 1607
G + L + +++L + L+ +
Sbjct: 153 ------LSDLLRDGLLEGRLAALLGFGDPVLERGDILLELEALLRLEL-LDLEELLGLLG 205
Query: 1608 NLEFAMVAKEGGFVNAFVLYFKVILKLILRVEPHKFQVLAQSRK 1651
LE GG ++ L L +E F++L +
Sbjct: 206 ELELE----RGGLARLLEELRLLLELLALGLEARGFRLLLLLKG 245
>gnl|CDD|225173 COG2264, PrmA, Ribosomal protein L11 methylase [Translation,
ribosomal structure and biogenesis].
Length = 300
Score = 53.5 bits (129), Expect = 4e-07
Identities = 38/116 (32%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDC-SVITQLTQEVVEENDCSDVITVI 1480
L K K VL+VGCG G+L++ A+ GAK V+ VD + +E N ++
Sbjct: 158 KLLKKGKTVLDVGCGSGILAIAAAKLGAKKVVGVDIDPQAVEAARENARLNGVELLVQAK 217
Query: 1481 CRRMEDIDRLPHGIEN--VDIIVSNWMGHVLYLDSLINAVVYARDRFLKPHG-LIL 1533
L EN D+IV+N + V L L + R LKP G LIL
Sbjct: 218 G------FLLLEVPENGPFDVIVANILAEV--LVELAPDIK----RLLKPGGRLIL 261
>gnl|CDD|222415 pfam13847, Methyltransf_31, Methyltransferase domain. This family
appears to be have methyltransferase activity.
Length = 151
Score = 49.7 bits (119), Expect = 1e-06
Identities = 28/112 (25%), Positives = 52/112 (46%), Gaps = 9/112 (8%)
Query: 1426 KDKIVLEVGCGMGLLSLFCAE--AGAKHVISVDCS--VITQLTQEVVEENDCSDVITVIC 1481
VL++GCG G L+ AE V+ +D S I + +E ++ + + I
Sbjct: 3 SGIKVLDLGCGTGYLTFILAEKLGPGAEVVGIDISEEAI-EKAKENAKKLG-YENVEFIQ 60
Query: 1482 RRMEDIDRLPHGIENVDIIVSNWMGHVLYLDSLINAVVYARDRFLKPHGLIL 1533
+E++ +L + D+++SN VL + V+ R LKP G+++
Sbjct: 61 GDIEELPQLQLEDNSFDVVISN---EVLNHLPDPDKVLEEIIRVLKPGGVLI 109
>gnl|CDD|221804 pfam12847, Methyltransf_18, Methyltransferase domain. Protein in
this family function as methyltransferases.
Length = 104
Score = 48.1 bits (115), Expect = 1e-06
Identities = 25/116 (21%), Positives = 40/116 (34%), Gaps = 24/116 (20%)
Query: 1426 KDKIVLEVGCGMGLLSLFCAEA-GAKHVISVD-----CSVITQLTQEVVEENDCSDVITV 1479
VL++GCG G L++ A V VD + + + + IT
Sbjct: 1 PGARVLDIGCGTGSLAIELARLFPGARVTGVDLSPEMLELARENAKLALGPR-----ITF 55
Query: 1480 ICRRMEDIDRLPHGIENVDIIVSNWMGHVL--YLDSLINAVVYARDRFLKPHGLIL 1533
+ D +E D + G L LD+L + LKP G ++
Sbjct: 56 VQ---GDAPDALDLLEGFDAVFIGGGGGDLLELLDALASL--------LKPGGRLV 100
>gnl|CDD|225139 COG2230, Cfa, Cyclopropane fatty acid synthase and related
methyltransferases [Cell envelope biogenesis, outer
membrane].
Length = 283
Score = 51.1 bits (123), Expect = 2e-06
Identities = 30/116 (25%), Positives = 45/116 (38%), Gaps = 16/116 (13%)
Query: 1423 HLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQL--TQEVVEENDCSDVITVI 1480
L +L++GCG G L+++ AE V+ V S QL ++ + D + V
Sbjct: 69 GLKPGMTLLDIGCGWGGLAIYAAEEYGVTVVGVTLS-EEQLAYAEKRIAARGLEDNVEVR 127
Query: 1481 CRRMEDIDRLPHGIENVDIIVSNWMGHVL---YLDSLINAVVYARDRFLKPHGLIL 1533
+ D + E D IVS M + D V LKP G +L
Sbjct: 128 LQDYRDFE------EPFDRIVSVGMFEHVGKENYDDFFKKV----YALLKPGGRML 173
>gnl|CDD|225443 COG2890, HemK, Methylase of polypeptide chain release factors
[Translation, ribosomal structure and biogenesis].
Length = 280
Score = 48.9 bits (117), Expect = 1e-05
Identities = 26/84 (30%), Positives = 37/84 (44%), Gaps = 8/84 (9%)
Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAKH-VISVDCSV-ITQLTQEVVEENDCSDVITV 1479
L DK +L++G G G +++ A+ G VI+VD S L +E E N V+ V
Sbjct: 106 LLLQLDKRILDLGTGSGAIAIALAKEGPDAEVIAVDISPDALALARENAERNGLVRVLVV 165
Query: 1480 ICRRMEDIDRLPHGIENVDIIVSN 1503
D D+IVSN
Sbjct: 166 QS------DLFEPLRGKFDLIVSN 183
>gnl|CDD|232960 TIGR00406, prmA, ribosomal protein L11 methyltransferase. Ribosomal
protein L11 methyltransferase is an
S-adenosyl-L-methionine-dependent methyltransferase
required for the modification of ribosomal protein L11.
This protein is found in bacteria and (with a probable
transit peptide) in Arabidopsis [Protein synthesis,
Ribosomal proteins: synthesis and modification].
Length = 288
Score = 47.5 bits (113), Expect = 3e-05
Identities = 37/132 (28%), Positives = 58/132 (43%), Gaps = 19/132 (14%)
Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDC-SVITQLTQEVVEENDCSDVITVICRR 1483
LKDK V++VGCG G+LS+ + GA V+ +D + + ++ E N SD + V
Sbjct: 158 LKDKNVIDVGCGSGILSIAALKLGAAKVVGIDIDPLAVESARKNAELNQVSDRLQVKLIY 217
Query: 1484 MEDIDRLPHGIENVDIIVSNWMGHVLY-LDSLINAVVYARDRFLKPHGL-----ILPDRA 1537
+E D+IV+N + V+ L + R +KP G IL +A
Sbjct: 218 LEQPIEGKA-----DVIVANILAEVIKELYPQFS-------RLVKPGGWLILSGILETQA 265
Query: 1538 ELYCVAANDTMA 1549
+ C A
Sbjct: 266 QSVCDAYEQGFT 277
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 47.4 bits (113), Expect = 6e-05
Identities = 36/183 (19%), Positives = 67/183 (36%), Gaps = 21/183 (11%)
Query: 938 NNMIKTIDFHQFQNNVMKNHPGNKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIA 997
N +++ I + ++ N K S G N + I RP E D+I
Sbjct: 761 NILLQEIAY-DVIVSLFNNPNAEKILIITEILSDGINNSD------INDRP-QELIDQII 812
Query: 998 KLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDK 1057
+ ++ K + ++ L +Q+N++ + +E +
Sbjct: 813 E-----------SEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAE 861
Query: 1058 IEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKENKPQQQ 1117
E + N N E E D +K AKD KL S+D IK + ++ +K+ K +
Sbjct: 862 EEIQNINENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIK--QDEKTTKKKKKDLE 919
Query: 1118 RRK 1120
+
Sbjct: 920 KTD 922
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part of
a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 47.0 bits (112), Expect = 8e-05
Identities = 24/154 (15%), Positives = 60/154 (38%), Gaps = 9/154 (5%)
Query: 983 KIMQRPV----DENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQK--ITKSTNNNLNKKT 1036
K MQR +EN EI +L L G+++ D++ +++ K+ N ++
Sbjct: 377 KFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAES 436
Query: 1037 NKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATK---L 1093
K + +K +E++ ++E ++ K + ++ ++ +
Sbjct: 437 KKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPW 496
Query: 1094 SNSADSIKLNKKDEGSKENKPQQQRRKFNNFHNS 1127
+ S+ + K + SK+ + + N +
Sbjct: 497 LKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKA 530
Score = 39.3 bits (92), Expect = 0.019
Identities = 20/122 (16%), Positives = 42/122 (34%), Gaps = 8/122 (6%)
Query: 1005 GDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENN 1064
+ + K + +K L + + K+ K + + E+EE
Sbjct: 432 KEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEE-- 489
Query: 1065 TNAKENDEKSTDDEK---NKLDDKAKDTATKLSNSADSIKLNKKDEGSKENKPQQQRRKF 1121
+EN T K D K +++KL +A+ I K+ K +++
Sbjct: 490 -LDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVK--VKKKKKKEKSIDL 546
Query: 1122 NN 1123
++
Sbjct: 547 DD 548
Score = 37.3 bits (87), Expect = 0.088
Identities = 29/157 (18%), Positives = 53/157 (33%), Gaps = 19/157 (12%)
Query: 950 QNNVMKNHPGNKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPIPLGGDKKD 1009
N K K K N+ K +E + ++ + + KL +K+
Sbjct: 427 PENGEKEAESKKLKKENKNEFKEKKESDEEE--ELEDEEEAKVEKVANKLLKRSEKAQKE 484
Query: 1010 -------------DKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEED 1056
+K Q K +++ L+K NK + KK ++
Sbjct: 485 EEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSI 544
Query: 1057 KIEK----EENNTNAKENDEKSTDDEKNKLDDKAKDT 1089
++ EE++ +DE+ DDE+ K KD
Sbjct: 545 DLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFKQKDL 581
Score = 31.2 bits (71), Expect = 6.7
Identities = 26/140 (18%), Positives = 46/140 (32%), Gaps = 20/140 (14%)
Query: 1207 NSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFKG 1266
N N K K++ + D+++ K ++ Q EEE ++
Sbjct: 440 KKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEE---------- 489
Query: 1267 PEILNETTEPTASSATNSNVTTVSNTADNSNNPSSILKKSKATEDCKVKTVQGAPKTEKS 1326
+ P T+S S +S SS K+ K +K
Sbjct: 490 ----LDEENP-WLKTTSSV--GKSAKKQDSKKKSS---SKLDKAANKISKAAVKVKKKKK 539
Query: 1327 QTKSGEYDNLDEREPSTIQL 1346
+ KS + D+ E +I+L
Sbjct: 540 KEKSIDLDDDLIDEEDSIKL 559
>gnl|CDD|222287 pfam13649, Methyltransf_25, Methyltransferase domain. This family
appears to be a methyltransferase domain.
Length = 97
Score = 42.3 bits (100), Expect = 1e-04
Identities = 27/105 (25%), Positives = 43/105 (40%), Gaps = 12/105 (11%)
Query: 1430 VLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITV--ICRRMEDI 1487
+L++GCG G + A AG V VD I++ E+ +E V + D
Sbjct: 1 ILDLGCGTGRVLRALARAGPSSVTGVD---ISKEALELAKERLRDKGPKVRFVV---ADA 54
Query: 1488 DRLPHGIENVDIIVSNWMGHVLYL--DSLINAVVYARDRFLKPHG 1530
LP + D+++ + YL L + A R L+P G
Sbjct: 55 RDLPFEEGSFDLVIC-AGLSLDYLSPKQLRALLREAA-RLLRPGG 97
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 43.4 bits (102), Expect = 0.001
Identities = 55/308 (17%), Positives = 107/308 (34%), Gaps = 25/308 (8%)
Query: 828 LPSSEKSRKPISHVSPPVVTSIITSTATSNVTTTTSSSETKPAESDNKLPSNNNPSEEGE 887
+ +E+S+ PI + N+ +++SSS + + + S +
Sbjct: 92 IDLNERSKTPIKNNDNVTTPIKANKKEKHNLDSSSSSSISSSLTNISFFSSPTSIYSCLS 151
Query: 888 DVSGRKNAKVVPGSPAKSNNMSQHYNEKGHKSGGERQRDRGDTGRHHQFQNNMIK----- 942
+ K++ V ++ N +K +Q + H + M +
Sbjct: 152 NSLSSKHSPKVIKENQSTHVNISSDNSPRNKEISNKQLKKQTNVTHTTCYDKMRRSPRNT 211
Query: 943 -TIDFHQFQNNVMKNHPGNKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPI 1001
TI + N KN +KN K + K +E + V + ++ L
Sbjct: 212 STIKNNTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSE-------KSKVQNSHFDVRILRS 264
Query: 1002 PLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKE 1061
+KKD+K V + + + K + L K + N Q K ++ I +
Sbjct: 265 YTKENKKDEKNV--VSGIRSSVLLKRKSQCLRKDSYVYSNHQ---KKAKTGDPKNIIHRN 319
Query: 1062 ENNTNAKENDEKSTDDEKNKL-------DDKAKDTATKLSNSADSIKLNKKDEGSKENKP 1114
++N+ +D S++ + K T TK +N+ + K NK K N P
Sbjct: 320 NGSSNSNNDDTSSSNHLGSNRISNRNPSSPYKKQTTTKHTNNTKNNKYNKTKTTQKFNHP 379
Query: 1115 QQQRRKFN 1122
+ N
Sbjct: 380 LRHHATIN 387
Score = 35.7 bits (82), Expect = 0.27
Identities = 56/316 (17%), Positives = 102/316 (32%), Gaps = 42/316 (13%)
Query: 1006 DKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNT 1065
K KV +N S I+ + + +NKQ Q DK+ + NT
Sbjct: 155 SSKHSPKVIKENQSTHVNISSDNSPRNKEISNKQLKKQ---TNVTHTTCYDKMRRSPRNT 211
Query: 1066 NAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKENKPQQQRRKFNNFH 1125
+ +N+ + EKNK DK K +G K+ K ++ K N H
Sbjct: 212 STIKNNTNDKNKEKNKEKDKNIK---------------KDRDGDKQTKRNSEKSKVQNSH 256
Query: 1126 NSSKYQRHPYHNNGGDFNYVPVYDGNINKFNKFGGGGNNRYPNHRYYDFDFIDPRSCFFP 1185
+ R N D V I + Y S
Sbjct: 257 FDVRILRSYTKENKKDEKNVV---SGIRSSVLLKRKSQCLRKDSYVY--------SNHQK 305
Query: 1186 GSGTPYPQYGGGGPPMFNNLYNSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDE-RKKKAT 1244
+ T P+ N+ + NN ++N + + S + + + + KK+ T
Sbjct: 306 KAKTGDPK----------NIIHRNNGSSNSNNDDTSSSNHLGSNRISNRNPSSPYKKQTT 355
Query: 1245 QKQQQSEEESKKTSNKKIKFKGPEILNETTEPTASS--ATNSNVTTVSNTADNSNNPSSI 1302
K + + +K K + + + T SS + ++ ++
Sbjct: 356 TKHTNNTKNNKYNKTKTTQKFNHPLRHHATINKRSSMLPMSEQKGRGASEKSEYIKEFTM 415
Query: 1303 LKKSKATEDCKVKTVQ 1318
+ +K T+D +K V+
Sbjct: 416 EEVAKLTKDTTIKLVE 431
Score = 32.7 bits (74), Expect = 2.3
Identities = 35/232 (15%), Positives = 81/232 (34%), Gaps = 9/232 (3%)
Query: 83 SISKKKKLKFVKVDPKELNIVIDDGRGRDTDFRNNKPNHVKSQESRKSSSHHQDKGLGSS 142
S+S K K +K + + D R+ + N +++ + + DK S
Sbjct: 153 SLSSKHSPKVIKENQSTHVNISSDNSPRNKE----ISNKQLKKQTNVTHTTCYDKMRRSP 208
Query: 143 TRHSNTAGGASSRNMSRGSGKHPPSNNNYKADFEVNEDSTQQHNKKHKRESSKVISVTNN 202
S + +N + K + N K D + ++ + + K + S + + +
Sbjct: 209 RNTSTIKNNTNDKNKEKNKEK----DKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRS 264
Query: 203 QQVNNDKDFKNNLMWDGDSAGAKKSRNQWKNLDVNNITNKTRERLMNKPASSNDSLLKDN 262
N KD + N++ S+ K ++Q D +N ++ P + +
Sbjct: 265 YTKENKKD-EKNVVSGIRSSVLLKRKSQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSS 323
Query: 263 GDVDSRTTTASILDSSLYQNTYDSKTRNFVDSGSQIFNAASTLNETPEHSTR 314
+ T++++ L S+ N S + N + + + +
Sbjct: 324 NSNNDDTSSSNHLGSNRISNRNPSSPYKKQTTTKHTNNTKNNKYNKTKTTQK 375
>gnl|CDD|202698 pfam03602, Cons_hypoth95, Conserved hypothetical protein 95.
Length = 183
Score = 40.7 bits (96), Expect = 0.002
Identities = 26/126 (20%), Positives = 43/126 (34%), Gaps = 25/126 (19%)
Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLT-QEVVEENDCSDVITVICRR 1483
+ VL++ G G L L GA V+ V+ T +E +E V+ R
Sbjct: 42 IGGARVLDLFAGSGALGLEALSRGAASVVFVEKDKKAVATLKENLEALGLEGETAVL--R 99
Query: 1484 M---EDIDRLPHGIENVDIIVSNWMGHVLYLD-----SLINAVVYARDRFLKPHGLILPD 1535
+ RL D++ +LD LI + L +G + P+
Sbjct: 100 NDAARALLRLAGKGPPFDLV---------FLDPPYAKGLIEEAL----ELLAENGWLNPN 146
Query: 1536 RAELYC 1541
A +
Sbjct: 147 -ALIVV 151
>gnl|CDD|237872 PRK14968, PRK14968, putative methyltransferase; Provisional.
Length = 188
Score = 40.7 bits (96), Expect = 0.003
Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 8/42 (19%)
Query: 1422 SHLL-------KDKIVLEVGCGMGLLSLFCAEAGAKHVISVD 1456
S LL K VLEVG G G++++ A+ G K V+ VD
Sbjct: 12 SFLLAENAVDKKGDRVLEVGTGSGIVAIVAAKNGKK-VVGVD 52
>gnl|CDD|226608 COG4123, COG4123, Predicted O-methyltransferase [General function
prediction only].
Length = 248
Score = 40.7 bits (96), Expect = 0.004
Identities = 21/86 (24%), Positives = 36/86 (41%), Gaps = 7/86 (8%)
Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAK-HVISVDCS-VITQLTQEVVEENDCSDVITV 1479
+ + K +L++G G G L L A+ K ++ V+ ++ Q V N + I V
Sbjct: 40 APVPKKGRILDLGAGNGALGLLLAQRTEKAKIVGVEIQEEAAEMAQRNVALNPLEERIQV 99
Query: 1480 ICRRMEDIDRL--PHGIENVDIIVSN 1503
I DI + D+I+ N
Sbjct: 100 IE---ADIKEFLKALVFASFDLIICN 122
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this family
are the chaperone DnaK, of the DnaK-DnaJ-GrpE chaperone
system. All members of the seed alignment were taken from
completely sequenced bacterial or archaeal genomes and
(except for Mycoplasma sequence) found clustered with
other genes of this systems. This model excludes DnaK
homologs that are not DnaK itself, such as the heat shock
cognate protein HscA (TIGR01991). However, it is not
designed to distinguish among DnaK paralogs in
eukaryotes. Note that a number of dnaK genes have shadow
ORFs in the same reverse (relative to dnaK) reading
frame, a few of which have been assigned glutamate
dehydrogenase activity. The significance of this
observation is unclear; lengths of such shadow ORFs are
highly variable as if the presumptive protein product is
not conserved [Protein fate, Protein folding and
stabilization].
Length = 595
Score = 41.5 bits (98), Expect = 0.004
Identities = 27/101 (26%), Positives = 45/101 (44%), Gaps = 19/101 (18%)
Query: 1015 DKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEE----NNTNA--- 1067
DK K+Q IT + ++ L++ + I+ + + N EEDK KEE NN ++
Sbjct: 481 DKGTGKEQSITITASSGLSE-----EEIERMVKEAEANAEEDKKRKEEIEARNNADSLAY 535
Query: 1068 ------KENDEKSTDDEKNKLDDKAKDTATKLSNSA-DSIK 1101
KE +K +EK K++ + L + IK
Sbjct: 536 QAEKTLKEAGDKLPAEEKEKIEKAVAELKEALKGEDVEEIK 576
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 41.4 bits (97), Expect = 0.004
Identities = 24/165 (14%), Positives = 55/165 (33%), Gaps = 20/165 (12%)
Query: 267 SRTTTASILDSSLYQNTYDSK------TRNFVDSGSQIFNAASTLNETPEHSTRDGKLFS 320
T+T ++ + T S + S ++ +
Sbjct: 223 GLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHHS-- 280
Query: 321 NEALYAPSSTTNY--PYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAAT 378
+ + T +S+ S P+ ++ + +N +S GS +
Sbjct: 281 -KHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAK----- 334
Query: 379 THGYSSISNTITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLN 423
H + SN ++ S +N SV SS+++ S +++S + +
Sbjct: 335 -HSNRNKSN---SSPKSHSSANGSVPSSSVSDNESKQKRASKSSS 375
Score = 36.1 bits (83), Expect = 0.21
Identities = 29/154 (18%), Positives = 47/154 (30%), Gaps = 8/154 (5%)
Query: 327 PSSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAATTHGYSSIS 386
+ T + S + +T+ S G S + S + + G
Sbjct: 205 ENHTLSVTDKEKSEASSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHH 264
Query: 387 NTITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSNSNTTGYSSQNLY 446
+ + S +N+ S Q V N S S+S+T G
Sbjct: 265 SH-NHQHHSIGINNHH-----SKHADSKLQTIEVIENHSN--KSRPSSSSTNGSKETTSN 316
Query: 447 QQQPSSVSYSSNSYVYPQTSIATSSNVSSTSHTS 480
++ S S S + S SN S SH+S
Sbjct: 317 SSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSS 350
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 41.2 bits (98), Expect = 0.004
Identities = 31/103 (30%), Positives = 43/103 (41%), Gaps = 32/103 (31%)
Query: 1034 KKTNKQQNIQI-----LSP---------AKKPNKEEDKIEKEE----NNTNA-------- 1067
K T K+Q+I I LS A+ N EEDK KE N ++
Sbjct: 484 KGTGKEQSITITASSGLSDEEIERMVKDAEA-NAEEDKKRKELVEARNQADSLIYQTEKT 542
Query: 1068 -KENDEKSTDDEKNKLDDKAKD--TATKLSNSADSIKLNKKDE 1107
KE +K DEK K++ K+ A K ++IK K +E
Sbjct: 543 LKELGDKVPADEKEKIEAAIKELKEALK-GEDKEAIK-AKTEE 583
>gnl|CDD|236059 PRK07580, PRK07580, Mg-protoporphyrin IX methyl transferase;
Validated.
Length = 230
Score = 39.8 bits (94), Expect = 0.007
Identities = 30/104 (28%), Positives = 42/104 (40%), Gaps = 30/104 (28%)
Query: 1420 PNSHLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITV 1479
P L +L+ GCG+G LS+ A GAK V++ D S ++VEE
Sbjct: 57 PADGDLTGLRILDAGCGVGSLSIPLARRGAK-VVASDIS------PQMVEE--------- 100
Query: 1480 ICRRMEDIDRLP--HGIENVDIIVSN---WMG---HVLYLDSLI 1515
R R P N+ V + +G V+ LD LI
Sbjct: 101 -ARE-----RAPEAGLAGNITFEVGDLESLLGRFDTVVCLDVLI 138
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 40.1 bits (94), Expect = 0.010
Identities = 21/104 (20%), Positives = 37/104 (35%)
Query: 1206 YNSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFK 1265
Y +N + T S ST D+++ K + +++SE++ KK KK
Sbjct: 26 YIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKS 85
Query: 1266 GPEILNETTEPTASSATNSNVTTVSNTADNSNNPSSILKKSKAT 1309
E P S T D N + I + ++ +
Sbjct: 86 EGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKS 129
Score = 33.5 bits (77), Expect = 0.86
Identities = 14/82 (17%), Positives = 31/82 (37%), Gaps = 5/82 (6%)
Query: 1202 FNNLYNSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKK 1261
L ++ + K ++ DKK+D + ++K + +K+++ ++E K K
Sbjct: 31 VLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETK 90
Query: 1262 IKFKGPEILNETTEPTASSATN 1283
+ FK P
Sbjct: 91 LGFKTP-----KKSKKTKKKPP 107
Score = 30.8 bits (70), Expect = 7.5
Identities = 21/95 (22%), Positives = 33/95 (34%), Gaps = 8/95 (8%)
Query: 1019 SKDQKITKSTNNNLN-KKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDD 1077
SK Q I+ +N L K + K K +KE+ N +K+ EK
Sbjct: 19 SKLQPISYIYSNVLVLSKEILSTFSEEE--NKVATTSTKKDKKEDKNNESKKKSEKKKKK 76
Query: 1078 EKNKLDDKAK-----DTATKLSNSADSIKLNKKDE 1107
+K K + K++ T + K K
Sbjct: 77 KKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111
>gnl|CDD|217086 pfam02527, GidB, rRNA small subunit methyltransferase G. This is a
family of bacterial glucose inhibited division proteins
these are probably involved in the regulation of cell
devision. GidB has been shown to be a methyltransferase G
specific to the rRNA small subunit. Previously identified
as a glucose-inhibited division protein B that appears to
be present and in a single copy in all complete
eubacterial genomes so far sequenced. GidB specifically
methylates the N7 position of a guanosine in 16S rRNA.
Length = 184
Score = 38.8 bits (91), Expect = 0.010
Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 43/145 (29%)
Query: 1413 HLFKVRKPNS----HLL-----------KDKIVLEVGCGMGL----LSLFCAEAGAKHVI 1453
+L + +PN HLL V +VG G G L++ + K +
Sbjct: 20 NLTSITEPNELLERHLLDSLVVLEYLDNIRIKVADVGSGAGFPGIPLAIAFPD---KKLT 76
Query: 1454 SVD-----CSVITQLTQEVVEENDCSDVITVICRRMEDIDRLPHGIENVDIIVSNWMGHV 1508
++ + + +L +E+ EN +T++ R E+ D+I S +
Sbjct: 77 LLESLLKKINFLEELKKELNLEN-----VTIVHARAEEYQHEEQ----YDVITSRAVA-- 125
Query: 1509 LYLDSLINAVVYARDRFLKPHGLIL 1533
SL +A LKP G L
Sbjct: 126 ----SLNELTEWAL-PLLKPGGYFL 145
>gnl|CDD|225172 COG2263, COG2263, Predicted RNA methylase [Translation, ribosomal
structure and biogenesis].
Length = 198
Score = 38.8 bits (91), Expect = 0.011
Identities = 19/81 (23%), Positives = 35/81 (43%), Gaps = 9/81 (11%)
Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVD-CSVITQLTQEVVEENDCSDVITVICR 1482
L+ K VL++G G G+L++ A GA V++VD ++ + EE + +
Sbjct: 43 DLEGKTVLDLGAGTGILAIGAALLGASRVLAVDIDPEALEIARANAEELL--GDVEFVVA 100
Query: 1483 RMEDIDRLPHGIENVDIIVSN 1503
+ D D ++ N
Sbjct: 101 DVSDFRGK------FDTVIMN 115
>gnl|CDD|216988 pfam02353, CMAS, Mycolic acid cyclopropane synthetase. This family
consist of Cyclopropane-fatty-acyl-phospholipid synthase
or CFA synthase EC:2.1.1.79 this enzyme catalyze the
reaction: S-adenosyl-L-methionine + phospholipid olefinic
fatty acid <=> S-adenosyl-L-homocysteine + phospholipid
cyclopropane fatty acid.
Length = 273
Score = 39.2 bits (92), Expect = 0.012
Identities = 32/120 (26%), Positives = 44/120 (36%), Gaps = 22/120 (18%)
Query: 1419 KPNSHLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQ-LTQEVVEENDCSDVI 1477
KP LL ++GCG G L AE +V+ + S ++ V +
Sbjct: 61 KPGMTLL------DIGCGWGGLMRRAAERYDVNVVGLTLSKNQYKHARQRVAAEGLQRKV 114
Query: 1478 TVICRRMEDIDRLPHGIENVDIIVSNWM----GHVLYLDSLINAVVYARDRFLKPHGLIL 1533
V+ + D D E D IVS M GH Y +Y L P GL+L
Sbjct: 115 EVLLQDYRDFD------EPFDRIVSVGMFEHVGHENYDTFF--KKLY---NLLPPGGLML 163
>gnl|CDD|233687 TIGR02021, BchM-ChlM, magnesium protoporphyrin O-methyltransferase.
This model represents the S-adenosylmethionine-dependent
O-methyltransferase responsible for methylation of
magnesium protoporphyrin IX. This step is essentiasl for
the biosynthesis of both chlorophyll and
bacteriochlorophyll. This model encompasses two closely
related clades, from cyanobacteria (and plants) where it
is called ChlM and other photosynthetic bacteria where it
is known as BchM [Biosynthesis of cofactors, prosthetic
groups, and carriers, Chlorophyll and
bacteriochlorphyll].
Length = 219
Score = 38.2 bits (89), Expect = 0.017
Identities = 29/105 (27%), Positives = 44/105 (41%), Gaps = 19/105 (18%)
Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCS-VITQLTQEVVEENDCSDVITVICR 1482
LK K VL+ GCG GLLS+ A+ GA V +VD S + Q+ + + D + +
Sbjct: 53 PLKGKRVLDAGCGTGLLSIELAKRGA-IVKAVDISEQMVQMARNRAQGRDVAGNVEFEVN 111
Query: 1483 RMEDIDRLPHGIENVDIIVSNWMGHVLYLDSLINAVVYARDRFLK 1527
+ + DI+V +D LI+ Y K
Sbjct: 112 DLLSLC------GEFDIVVC--------MDVLIH---YPASDMAK 139
>gnl|CDD|233708 TIGR02072, BioC, biotin biosynthesis protein BioC. This enzyme,
which is found in biotin biosynthetic gene clusters in
proteobacteria, firmicutes, green-sulfur bacteria,
fusobacterium and bacteroides, is believed to carry out
an enzymatic step prior to the formation of pimeloyl-CoA
(although attribution of this annotation is not
traceable). The enzyme appears related to
methyltransferases by homology [Biosynthesis of
cofactors, prosthetic groups, and carriers, Biotin].
Length = 240
Score = 38.4 bits (90), Expect = 0.019
Identities = 21/85 (24%), Positives = 35/85 (41%), Gaps = 8/85 (9%)
Query: 1420 PNSHLLKDKIVLEVGCGMGLLSLFCAEAGAK-HVISVDCSVITQLTQEVVEENDCSDVIT 1478
+ VL++GCG G L+ + + I+ D S + S+ +
Sbjct: 28 KEKGIFIPASVLDIGCGTGYLTRALLKRFPQAEFIANDIS----AGMLAQAKTKLSENVQ 83
Query: 1479 VICRRMEDIDRLPHGIENVDIIVSN 1503
IC D ++LP + D+IVSN
Sbjct: 84 FIC---GDAEKLPLEDSSFDLIVSN 105
>gnl|CDD|237852 PRK14896, ksgA, 16S ribosomal RNA methyltransferase KsgA/Dim1 family
protein; Provisional.
Length = 258
Score = 38.0 bits (89), Expect = 0.027
Identities = 13/58 (22%), Positives = 24/58 (41%), Gaps = 5/58 (8%)
Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAK-HVISVDCSVITQLTQEVVEENDCSDVITVI 1480
VLE+G G G L+ A+ K + I +D + L + + + + +I
Sbjct: 27 DTDGDPVLEIGPGKGALTDELAKRAKKVYAIELDPRLAEFLRDDEIAAGN----VEII 80
>gnl|CDD|237898 PRK15068, PRK15068, tRNA mo(5)U34 methyltransferase; Provisional.
Length = 322
Score = 38.3 bits (90), Expect = 0.029
Identities = 36/104 (34%), Positives = 46/104 (44%), Gaps = 36/104 (34%)
Query: 1423 HL--LKDKIVLEVGCGMGLLSLFC---AEAGAKHVISVDCSVITQLTQ-EVVEENDCSDV 1476
HL LK + VL+VGCG G AGAK V+ +D S + L Q E V
Sbjct: 117 HLSPLKGRTVLDVGCGNG---YHMWRMLGAGAKLVVGIDPSQLF-LCQFEAV-------- 164
Query: 1477 ITVICRRMEDIDR----LPHGIENV------DIIVSNWMGHVLY 1510
R++ D+ LP GIE + D + S MG VLY
Sbjct: 165 -----RKLLGNDQRAHLLPLGIEQLPALKAFDTVFS--MG-VLY 200
>gnl|CDD|222171 pfam13489, Methyltransf_23, Methyltransferase domain. This family
appears to be a methyltransferase domain.
Length = 154
Score = 35.9 bits (83), Expect = 0.053
Identities = 28/111 (25%), Positives = 41/111 (36%), Gaps = 23/111 (20%)
Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITVICRR 1483
L VL++GCG G+L E G V VD S L + + D
Sbjct: 20 LKPGGRVLDIGCGTGILLRLLRERG-FDVTGVDPSPAAVLIFSLFDAPD----------- 67
Query: 1484 MEDIDRLPHGIENVDIIVSNWMGHVL-YLDSLINAVVYARDRFLKPHGLIL 1533
+ G D+I + VL +L + R+ LKP G++L
Sbjct: 68 ----PAVLAG--KYDLITAF---EVLEHLPDPPALLQQLREL-LKPGGVLL 108
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 36.7 bits (85), Expect = 0.064
Identities = 23/96 (23%), Positives = 43/96 (44%), Gaps = 15/96 (15%)
Query: 1006 DKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNT 1065
+K+ K +N+ +D N NK+ +K + + +E +K E E++
Sbjct: 1 MEKECKDAKHENMEED-----CCKENENKEEDKGKEEDL------EFEEIEKEEIIEDSE 49
Query: 1066 NAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIK 1101
+ E + DE NKL ++ K KL N +++K
Sbjct: 50 ESNEVKIEELKDENNKLKEENK----KLENELEALK 81
>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases [Signal transduction
mechanisms].
Length = 1175
Score = 37.6 bits (87), Expect = 0.065
Identities = 51/256 (19%), Positives = 83/256 (32%), Gaps = 51/256 (19%)
Query: 232 KNLDVNNITNKTRERLMNKPASSNDSLLKDNGDVDS--------RTTTASILDSSLYQNT 283
+N N+I N +K + +L S R + I S ++
Sbjct: 47 RNGADNDIINSE-----SKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATSS 101
Query: 284 YDSKTRNFVDSGSQIFNAASTLNETPEHSTRDGKLFSNEALYAPSSTTNYPYSSSSTPTY 343
S N G Q A+ +L+ P S+ + S +P P SS+
Sbjct: 102 TSSLNSN---DGDQFSPASDSLSFNP--SSTQSRKDSGPGDGSPVQKRKNPLLPSSS--- 153
Query: 344 AAMTTASGNANNVASNGGGSNNGGVYSTNSFMAATTHGYSSISNTITNTQPSQQQSNNSV 403
G + ++N G ++ + S S ++ P +Q +S
Sbjct: 154 -----THGTHPPIVF----TDNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSS 204
Query: 404 YSSTI--------------NSYSSFPQQSSVNLNTY--GHQASSLSNSNTTGYSSQNLYQ 447
+S NS SFP S+ L G +SL +SN T SS
Sbjct: 205 DTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSSS----- 259
Query: 448 QQPSSVSYSSNSYVYP 463
+ + S Y+YP
Sbjct: 260 NSEAMSTSSKRPYIYP 275
Score = 36.8 bits (85), Expect = 0.14
Identities = 46/229 (20%), Positives = 84/229 (36%), Gaps = 30/229 (13%)
Query: 295 GSQIFNAASTLNETPEHSTRDGKLFSNEALYAPSSTTNYPYSSSSTPTYAAMTTASGNAN 354
G QIF++ S+ + + + ++PS+T++ +S+ + + S + N
Sbjct: 69 GHQIFSSFSSSPKLFQRRNS-----AGPITHSPSATSSTSSLNSNDGDQFSPASDSLSFN 123
Query: 355 NVASNGGGSNNGG----VYSTNSFM--AATTHGYSSISNTITNTQPSQQQSNNSVYSSTI 408
++ + G V + + +++THG T+ S + N+ I
Sbjct: 124 PSSTQSRKDSGPGDGSPVQKRKNPLLPSSSTHGTHPPIVF-TDNNGSHAGAPNARSRKEI 182
Query: 409 NSYSSFPQQSSVNLNTYGHQASSLSNSNTTGYSSQNLYQQQPSSVSYSSNSYVYPQTSIA 468
S S Q H S+S+T+ S ++ S S S+ + T++
Sbjct: 183 PSLGSQSMQLPSP-----HFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVL 237
Query: 469 TS--SNVSSTSHTSHKYPPYNVDMKNHLDSKSTLETKSSSEHPPYVDYP 515
S S S S P S S E S+S PY+ YP
Sbjct: 238 LKRHSGSSGASLISSNITP----------SSSNSEAMSTSSKRPYI-YP 275
>gnl|CDD|224110 COG1189, COG1189, Predicted rRNA methylase [Translation, ribosomal
structure and biogenesis].
Length = 245
Score = 36.8 bits (86), Expect = 0.066
Identities = 30/112 (26%), Positives = 48/112 (42%), Gaps = 15/112 (13%)
Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITVICRRM 1484
+K K+VL++G G + + GAKHV +VD QL ++ ND VI
Sbjct: 78 VKGKVVLDIGSSTGGFTDVLLQRGAKHVYAVDVG-YGQLHWKL--RNDPR----VIVLER 130
Query: 1485 EDIDRL--PHGIENVDIIVSNWMGHVLYLDSLINAVVYARDRFLKPHGLILP 1534
++ L E D+IV + + SL ++ A LK G ++
Sbjct: 131 TNVRYLTPEDFTEKPDLIVID-----VSFISLKL-ILPALLLLLKDGGDLVL 176
>gnl|CDD|219760 pfam08242, Methyltransf_12, Methyltransferase domain. Members of
this family are SAM dependent methyltransferases.
Length = 98
Score = 34.3 bits (79), Expect = 0.079
Identities = 23/107 (21%), Positives = 43/107 (40%), Gaps = 14/107 (13%)
Query: 1431 LEVGCGMGLLSLFCAEAGA-KHVISVDCS--VITQLTQEVVE-ENDCSDVITVICRRMED 1486
L++GCG G L EA VD S + + + + + + D
Sbjct: 1 LDIGCGTGTLLRALLEALPGLEYTGVDISPAALEAAAERLAALGLLDAVRVRLDVLDAID 60
Query: 1487 IDRLPHGIENVDIIV-SNWMGHVLYLDSLINAVVYARDRFLKPHGLI 1532
+D + D++V SN + H+ +++ + R LKP G++
Sbjct: 61 LD-----PGSFDVVVASNVLHHLADPRAVLRNLR----RLLKPGGVL 98
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 36.9 bits (86), Expect = 0.086
Identities = 20/147 (13%), Positives = 54/147 (36%), Gaps = 6/147 (4%)
Query: 976 KNEFKVTKIMQRPVDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNN-NLNK 1034
+ K + + + K K K+ K + N + +
Sbjct: 57 IFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDD 116
Query: 1035 KTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDE----KSTDDEKNKLDDKAKDTA 1090
N ++I +L+ A + ++D + ++++ + ++DE DD+ + D++ K+
Sbjct: 117 DLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAK 176
Query: 1091 TKLSNSADSIKLNKKDEGSKENKPQQQ 1117
S D ++ S+ + ++
Sbjct: 177 ELEKLSDDD-DFVWDEDDSEALRQARK 202
Score = 33.8 bits (78), Expect = 0.78
Identities = 18/117 (15%), Positives = 48/117 (41%), Gaps = 4/117 (3%)
Query: 988 PVDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSP 1047
+D +K K + D K +D N + D ++ + + + +
Sbjct: 99 ELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDE 158
Query: 1048 AKKPNK---EEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIK 1101
+ E+++ ++ + ++D+ D++ ++ +A+ A KL+ +AD +K
Sbjct: 159 DDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDA-KLTATADPVK 214
Score = 31.1 bits (71), Expect = 5.5
Identities = 19/95 (20%), Positives = 37/95 (38%), Gaps = 2/95 (2%)
Query: 1022 QKITKSTNNNLNKKTNKQQNIQ-ILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKN 1080
+K+ K + +K ++ I+ L KK ++ D++ + +D +D K
Sbjct: 18 KKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKK 77
Query: 1081 KLDDKAKDTATKLSNSAD-SIKLNKKDEGSKENKP 1114
K AK A K +L+ + K+N
Sbjct: 78 KTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNAL 112
>gnl|CDD|223813 COG0742, COG0742, N6-adenine-specific methylase [DNA replication,
recombination, and repair].
Length = 187
Score = 35.7 bits (83), Expect = 0.10
Identities = 22/126 (17%), Positives = 40/126 (31%), Gaps = 24/126 (19%)
Query: 1425 LKDKIVLEVGCGMGLLSLFCAEA---GAKHVISVDCSVITQLT-QEVVEENDCSDVITVI 1480
++ VL++ G G L L EA GA V+ V+ +E ++ V+
Sbjct: 42 IEGARVLDLFAGSGALGL---EALSRGAARVVFVEKDRKAVKILKENLKALGLEGEARVL 98
Query: 1481 CRRMEDIDRLPHGIENVDIIVSNWMGHVLYLD-----SLINAVVYARDRFLKPHGLILPD 1535
+ E D++ +LD L++ + L L
Sbjct: 99 RNDALRALKQLGTREPFDLV---------FLDPPYAKGLLDKELALL---LLEENGWLKP 146
Query: 1536 RAELYC 1541
A +
Sbjct: 147 GALIVV 152
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains separated
by a hinge in the middle. The eukaryotic SMC proteins
form two kind of heterodimers: the SMC1/SMC3 and the
SMC2/SMC4 types. These heterodimers constitute an
essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 36.9 bits (85), Expect = 0.11
Identities = 19/181 (10%), Positives = 53/181 (29%), Gaps = 18/181 (9%)
Query: 960 NKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPIPLGGDKKDDKKVDDKNVS 1019
N+ + + D+ + + +++ + + + K+++K+ +
Sbjct: 236 NEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKEN-------KEEEKEKKLQEEE 288
Query: 1020 KDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNT--------NAKEND 1071
+ +++ + K+ KE K+EKE +
Sbjct: 289 LKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKEL 348
Query: 1072 EKSTDDEKNKLDDKAKDT---ATKLSNSADSIKLNKKDEGSKENKPQQQRRKFNNFHNSS 1128
E + E+ + + K KL + S +++ N +
Sbjct: 349 EIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEA 408
Query: 1129 K 1129
K
Sbjct: 409 K 409
>gnl|CDD|226413 COG3897, COG3897, Predicted methyltransferase [General function
prediction only].
Length = 218
Score = 35.9 bits (83), Expect = 0.11
Identities = 13/33 (39%), Positives = 22/33 (66%)
Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVD 1456
++ K VL++G G GL+++ A AGA V++ D
Sbjct: 77 TVRGKRVLDLGAGSGLVAIAAARAGAAEVVAAD 109
>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein;
Provisional.
Length = 182
Score = 35.6 bits (82), Expect = 0.11
Identities = 17/94 (18%), Positives = 30/94 (31%), Gaps = 4/94 (4%)
Query: 396 QQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSNSNTTGYSSQNLYQQQPSSVSY 455
Q S ++ T S +S N Q +S + Y PS Y
Sbjct: 87 QNGQKRSKHTITAESMQMLDSKSD-NPQANAMQDNSFHENFNNNYPGN---YNNPSQDPY 142
Query: 456 SSNSYVYPQTSIATSSNVSSTSHTSHKYPPYNVD 489
+ + Y Q + A + + + P ++D
Sbjct: 143 MNQAQSYNQNAYAKENQQAQPPKYENSVPEIDID 176
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 36.7 bits (85), Expect = 0.12
Identities = 37/188 (19%), Positives = 75/188 (39%), Gaps = 27/188 (14%)
Query: 1200 PMFNNLYNSNNYN---NNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESK- 1255
+FN + ++Y N++ ++ ++ K D+ + ++ KA ++ S +K
Sbjct: 125 NLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKP 184
Query: 1256 KTSNKKIKFKGPEILNETTEPTASSATNSNVTTVSNTADNSNNPS-SILKKSKATEDCKV 1314
TSNK+ P N++ AS T + ++ + S++ SIL
Sbjct: 185 STSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILD---------- 234
Query: 1315 KTVQGAPKTEKSQTKSGEYDNLDEREPSTIQLFMGLGLDKPPSQKPLVY---PIRTFNMD 1371
+ + A KT+K + D + QL P+Q L + P ++F D
Sbjct: 235 QYSEDAKKTQKDYASQSKKDKTETSNTKNPQL---------PTQDELKHKSKPAQSFEND 285
Query: 1372 IDDMTSRE 1379
++ +R
Sbjct: 286 VNQSNTRS 293
Score = 32.1 bits (73), Expect = 3.1
Identities = 34/178 (19%), Positives = 70/178 (39%), Gaps = 5/178 (2%)
Query: 180 DSTQQHNKKHKRESSKVISVTNNQQVNNDKDFKNNLMWDGDSAGAKKSRNQWKNLDVNNI 239
D Q N + S S ++ + + + K + D A + + + N
Sbjct: 135 DYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKA-DNQKAPSSNNTKPSTSNKQPNS 193
Query: 240 TNKTRERLMNKPASSNDSLLKDNGDVDSRTTTASILDSSLYQNTYDSKTRNFVDSGSQIF 299
T+ N +S+D+ + + D+++ + S LDS L Q + D+K +
Sbjct: 194 PKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQSKK 253
Query: 300 NAASTLNET-PEHSTRDGKLFSNEALYAPSSTTNYPYSSS-STPTYAAMTTASGNANN 355
+ T N P+ T+D +++ A S + S++ ST + + S N ++
Sbjct: 254 DKTETSNTKNPQLPTQDE--LKHKSKPAQSFENDVNQSNTRSTSLFETGPSLSNNDDS 309
>gnl|CDD|233668 TIGR01983, UbiG, ubiquinone biosynthesis O-methyltransferase. This
model represents an O-methyltransferase believed to act
at two points in the ubiquinone biosynthetic pathway in
bacteria (UbiG) and fungi (COQ3). A separate methylase
(MenG/UbiE) catalyzes the single C-methylation step. The
most commonly used names for genes in this family do not
indicate whether this gene is an O-methyl, or C-methyl
transferase [Biosynthesis of cofactors, prosthetic
groups, and carriers, Menaquinone and ubiquinone].
Length = 224
Score = 35.7 bits (83), Expect = 0.14
Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 20/115 (17%)
Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVE------ENDCSDVIT 1478
L VL+VGCG GLLS A GA V +D ++E +E + D I
Sbjct: 44 LFGLRVLDVGCGGGLLSEPLARLGAN-VTGID------ASEENIEVAKLHAKKDPLLKIE 96
Query: 1479 VICRRMEDIDRLPHGIENVDIIV-SNWMGHVLYLDSLINAVVYARDRFLKPHGLI 1532
C +ED+ G ++ D++ + HV + I A + LKP G++
Sbjct: 97 YRCTSVEDLA--EKGAKSFDVVTCMEVLEHVPDPQAFIRACA----QLLKPGGIL 145
>gnl|CDD|181382 PRK08317, PRK08317, hypothetical protein; Provisional.
Length = 241
Score = 35.7 bits (83), Expect = 0.14
Identities = 23/109 (21%), Positives = 38/109 (34%), Gaps = 21/109 (19%)
Query: 1430 VLEVGCGMGLLSL-----FCAEAGAKHVISVDCS--VITQLTQEVVEENDCSDVITVICR 1482
VL+VGCG G + E V+ +D S ++ + +V
Sbjct: 23 VLDVGCGPGNDARELARRVGPEG---RVVGIDRSEAMLALAKERA--AGLGPNV----EF 73
Query: 1483 RMEDIDRLPHGIENVDIIVS-NWMGHVLYLDSLINAVVYARDRFLKPHG 1530
D D LP + D + S + H+ + + R L+P G
Sbjct: 74 VRGDADGLPFPDGSFDAVRSDRVLQHLEDPARALAEIA----RVLRPGG 118
>gnl|CDD|225137 COG2227, UbiG,
2-polyprenyl-3-methyl-5-hydroxy-6-metoxy-1,4-benzoquinol
methylase [Coenzyme metabolism].
Length = 243
Score = 35.8 bits (83), Expect = 0.15
Identities = 25/91 (27%), Positives = 35/91 (38%), Gaps = 12/91 (13%)
Query: 1373 DDMTSREIFE-DVYADIKTH--QTTYKDKQLIGCFHDSIIN--NSHLFKVRKPNSHLLKD 1427
DD + + E D + + + + K L H IN + L
Sbjct: 7 DDTQNVDYKELDKFEALASRWWDPEGEFKPL----H--KINPLRLDYIREVARLRFDLPG 60
Query: 1428 KIVLEVGCGMGLLSLFCAEAGAKHVISVDCS 1458
VL+VGCG G+LS A GA V +D S
Sbjct: 61 LRVLDVGCGGGILSEPLARLGA-SVTGIDAS 90
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 36.1 bits (83), Expect = 0.20
Identities = 22/127 (17%), Positives = 49/127 (38%), Gaps = 15/127 (11%)
Query: 1011 KKVDDKNVSK-----DQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENN- 1064
+K + + + N LN+ + ++ A + + +E + + E++
Sbjct: 3805 EKSNSGELESGTGLGSGVGAEDITNTLNEDDDLEELANEEDTANQSDLDESEARELESDM 3864
Query: 1065 ------TNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKL--NKKDEGSKENKPQQ 1116
+ EN+ +++E LD++ D LSNS + KL +E E + +
Sbjct: 3865 NGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNE-KLWDEPNEEDLLETEQKS 3923
Query: 1117 QRRKFNN 1123
+ N
Sbjct: 3924 NEQSAAN 3930
Score = 35.4 bits (81), Expect = 0.36
Identities = 27/125 (21%), Positives = 49/125 (39%), Gaps = 27/125 (21%)
Query: 990 DENKDEIAKLPIPLGGDKKD------------------DKKVDDKNV----SKDQKITKS 1027
D ++ E +L + G KD D++V+D S ++K+
Sbjct: 3851 DLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDE 3910
Query: 1028 TNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAK 1087
N +T ++ N Q N E D + KE++N ++ D + +DE+ DD
Sbjct: 3911 PNEEDLLETEQKSNEQ-----SAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGI 3965
Query: 1088 DTATK 1092
D +
Sbjct: 3966 DDEIQ 3970
Score = 33.0 bits (75), Expect = 2.1
Identities = 25/149 (16%), Positives = 60/149 (40%), Gaps = 20/149 (13%)
Query: 970 SYGDKNKNEFKVTKIMQRPVDENKDEIAK-LPIPLGGDKKDDKKVDDKNVSKDQKITKST 1028
S +N+N ++ + +DE ++I + L L D+ +D ++ + +S
Sbjct: 3871 SVVSENENS--DSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSA 3928
Query: 1029 NNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKD 1088
NN + +K+ + EDK +E+ + +D D+ + D ++
Sbjct: 3929 ANNESDLVSKEDD---------NKALEDKDRQEKEDEEEMSDDVGIDDEIQ---PDIQEN 3976
Query: 1089 TATKLSNSAD-----SIKLNKKDEGSKEN 1112
+ + D +KL++K+ ++
Sbjct: 3977 NSQPPPENEDLDLPEDLKLDEKEGDVSKD 4005
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold many
proteins. Hsp70 assisted folding involves repeated cycles
of substrate binding and release. Hsp70 activity is ATP
dependent. Hsp70 proteins are made up of two regions: the
amino terminus is the ATPase domain and the carboxyl
terminus is the substrate binding region.
Length = 598
Score = 35.7 bits (83), Expect = 0.21
Identities = 19/91 (20%), Positives = 38/91 (41%), Gaps = 11/91 (12%)
Query: 1034 KKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKL 1093
K T K+Q I I + + + E +++ K+ A++ K + KN+ ++ L
Sbjct: 485 KGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSL 544
Query: 1094 SNSADSI------KLNK-----KDEGSKENK 1113
D + K+ + K+E E+K
Sbjct: 545 KEEGDKLPEADKKKVEEAIEWLKEELEGEDK 575
>gnl|CDD|203189 pfam05175, MTS, Methyltransferase small domain. This domain is found
in ribosomal RNA small subunit methyltransferase C as
well as other methyltransferases .
Length = 170
Score = 34.1 bits (79), Expect = 0.25
Identities = 30/131 (22%), Positives = 45/131 (34%), Gaps = 24/131 (18%)
Query: 1422 SHLLKDK--IVLEVGCGMGLLSLFCAEAGAKHVISVDCSVI----TQLTQEVVEENDCSD 1475
SHL K VL++GCG G+L A+ + V I + + + N +
Sbjct: 25 SHLPKPLGGKVLDLGCGYGVLGAALAKRSPD--LEVTMVDINARALESARANLAANGLEN 82
Query: 1476 VITVICRRMEDIDRLPHGIENVDIIVSN---WMGHVLYLDSLINAVVYARDRFLKPHGLI 1532
++ D+I+SN G D + A LKP G
Sbjct: 83 GEVFWSDLYSAVEP-----GKFDLIISNPPFHAGKATDYDVAQRFIAGAARH-LKPGG-- 134
Query: 1533 LPDRAELYCVA 1543
EL+ VA
Sbjct: 135 -----ELWIVA 140
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 34.3 bits (79), Expect = 0.25
Identities = 9/50 (18%), Positives = 23/50 (46%)
Query: 1212 NNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKK 1261
+K K KK D ++ +K+D +++ + T+ ++ + +K
Sbjct: 94 KKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKPRK 143
Score = 33.5 bits (77), Expect = 0.47
Identities = 13/61 (21%), Positives = 27/61 (44%), Gaps = 4/61 (6%)
Query: 1048 AKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNK----LDDKAKDTATKLSNSADSIKLN 1103
K+ K + K K++ + + + D+K D + K +DK +D S + ++
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSEL 139
Query: 1104 K 1104
K
Sbjct: 140 K 140
Score = 30.0 bits (68), Expect = 6.7
Identities = 12/59 (20%), Positives = 27/59 (45%)
Query: 1037 NKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSN 1095
KQ+ +KK ++ + ++ + +++ DEK +D+ L +T + LS
Sbjct: 80 EKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSE 138
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and 214
amino acids in length.
Length = 125
Score = 33.4 bits (77), Expect = 0.27
Identities = 18/88 (20%), Positives = 37/88 (42%), Gaps = 9/88 (10%)
Query: 1006 DKKDDKKVDDKNVSKD-QKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENN 1064
+ +++K DK V K K +K ++ + + +S +K E +K+EKE
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKE--- 57
Query: 1065 TNAKENDEKSTDDEKNKLDDKAKDTATK 1092
+ E + +L +AK ++
Sbjct: 58 -LEELEAEL----ARRELKAEAKKMLSE 80
>gnl|CDD|234708 PRK00274, ksgA, 16S ribosomal RNA methyltransferase KsgA/Dim1 family
protein; Reviewed.
Length = 272
Score = 34.7 bits (81), Expect = 0.31
Identities = 16/61 (26%), Positives = 27/61 (44%), Gaps = 6/61 (9%)
Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAK-HVISVDCSVITQLTQEVVEENDCSDVITVI 1480
+ VLE+G G+G L+ E AK + +D + L + E+N +T+I
Sbjct: 38 AGPQPGDNVLEIGPGLGALTEPLLERAAKVTAVEIDRDLAPILAETFAEDN-----LTII 92
Query: 1481 C 1481
Sbjct: 93 E 93
>gnl|CDD|184931 PRK14967, PRK14967, putative methyltransferase; Provisional.
Length = 223
Score = 34.3 bits (79), Expect = 0.33
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 1430 VLEVGCGMGLLSLFCAEAGAKHVISVDCS 1458
VL++ G G L++ A AGA V +VD S
Sbjct: 40 VLDLCTGSGALAVAAAAAGAGSVTAVDIS 68
>gnl|CDD|224017 COG1092, COG1092, Predicted SAM-dependent methyltransferases [General
function prediction only].
Length = 393
Score = 35.0 bits (81), Expect = 0.36
Identities = 14/35 (40%), Positives = 16/35 (45%)
Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCS 1458
L K VL + G S+ A GA V SVD S
Sbjct: 215 LAAGKRVLNLFSYTGGFSVHAALGGASEVTSVDLS 249
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 35.2 bits (81), Expect = 0.39
Identities = 24/201 (11%), Positives = 71/201 (35%), Gaps = 7/201 (3%)
Query: 937 QNNMIKTIDFHQFQNNVMKNHPGNKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEI 996
N + ++ +N + + ++ + +N EF+ +I + ++
Sbjct: 409 DNKNKNSSFINKTENILTNSPLKDELLEKTTE-IINIENPQEFEFGQIGNDIISTEIAQL 467
Query: 997 A--KLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKT-NKQQNIQILSPAKKPNK 1053
+ I G ++ + N KI ++ N + T + + + A+ N
Sbjct: 468 DENQNLIDTGEFDLENNFSNSFNPENGNKIDENINETFDTSTISANLSENKTNFAQSFNN 527
Query: 1054 EEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATK---LSNSADSIKLNKKDEGSK 1110
++ + E + ++ + +KN + KD + ++ +IK + +++
Sbjct: 528 KDTNLINSEIPIDLIKDTITINNSQKNVKKNGNKDYLSVEEVINLIMLAIKFHSQNQVEY 587
Query: 1111 ENKPQQQRRKFNNFHNSSKYQ 1131
+ Q + F ++
Sbjct: 588 KKLVQNWNKNLPLFEYDVEFM 608
Score = 34.0 bits (78), Expect = 0.84
Identities = 24/153 (15%), Positives = 47/153 (30%), Gaps = 7/153 (4%)
Query: 965 NFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQKI 1024
+ D+N+ +FK + + K++ + + G + ++
Sbjct: 354 ALSELEEEDENEIKFKKIEENSIDNLDIKEKKIENENDIEGKSDTKNLEEGFETKDNKNK 413
Query: 1025 TKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDD 1084
S N + + +K + + +E ND ST+ + +
Sbjct: 414 NSSFINKT--ENILTNSPLKDELLEKTTEIINIENPQEFEFGQIGNDIISTEIAQLDENQ 471
Query: 1085 KAKDTAT-----KLSNSADSIKLNKKDEGSKEN 1112
DT SNS + NK DE E
Sbjct: 472 NLIDTGEFDLENNFSNSFNPENGNKIDENINET 504
>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
Length = 234
Score = 34.5 bits (78), Expect = 0.39
Identities = 31/150 (20%), Positives = 64/150 (42%), Gaps = 10/150 (6%)
Query: 328 SSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAATTHGYSSISN 387
+S+ + S+ + A+TT S +A+ G +N +T S TT S+ +
Sbjct: 25 TSSGSSTASAGNVTGTTAVTTPSPSAS-----GPSTNQSTTLTTTSAPITTTAILSTNTT 79
Query: 388 TITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSNSNTTGYSSQNLYQ 447
T+T+T + + +STIN + Q+ ++ ++++ TT SS
Sbjct: 80 TVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSAT 139
Query: 448 QQPSSVSYSSNSYVYPQTSIATSSNVSSTS 477
+ ++ + + P S +SN + T+
Sbjct: 140 TRITNATTLA-----PTLSSKGTSNATKTT 164
>gnl|CDD|129627 TIGR00536, hemK_fam, HemK family putative methylases. The gene hemK
from E. coli was found to contribute to heme biosynthesis
and originally suggested to be protoporphyrinogen oxidase
(Medline 95189105). Functional analysis of the nearest
homolog in Saccharomyces cerevisiae, YNL063w, finds it is
not protoporphyrinogen oxidase and sequence analysis
suggests that HemK homologs have
S-adenosyl-methionine-dependent methyltransferase
activity (Medline 99237242). Homologs are found, usually
in a single copy, in nearly all completed genomes, but
varying somewhat in apparent domain architecture. Both E.
coli and H. influenzae have two members rather than one.
The members from the Mycoplasmas have an additional
C-terminal domain [Protein fate, Protein modification and
repair].
Length = 284
Score = 34.3 bits (79), Expect = 0.42
Identities = 22/76 (28%), Positives = 36/76 (47%), Gaps = 6/76 (7%)
Query: 1430 VLEVGCGMGLLSLFCAEAGA-KHVISVDCSV-ITQLTQEVVEENDCSDVITVICRRMEDI 1487
+L++G G G ++L A VI+VD S + +E E+N + I +
Sbjct: 118 ILDLGTGSGCIALALAYEFPNAEVIAVDISPDALAVAEENAEKNQLEHRVEFIQSNL--F 175
Query: 1488 DRLPHGIENVDIIVSN 1503
+ L + +DIIVSN
Sbjct: 176 EPLAG--QKIDIIVSN 189
>gnl|CDD|226947 COG4581, COG4581, Superfamily II RNA helicase [DNA replication,
recombination, and repair].
Length = 1041
Score = 35.1 bits (81), Expect = 0.45
Identities = 13/68 (19%), Positives = 22/68 (32%), Gaps = 2/68 (2%)
Query: 1050 KPNKEEDKIEK--EENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDE 1107
K + E K EE+ N +D + NK +A D + ++ LN
Sbjct: 19 KEKRVERKAATAEEESLKNIDISDLFERSVDMNKDPKEATDQVKAVDKHSEIESLNVGMR 78
Query: 1108 GSKENKPQ 1115
+
Sbjct: 79 LDVRLSSK 86
>gnl|CDD|176193 cd08231, MDR_TM0436_like, Hypothetical enzyme TM0436 resembles the
zinc-dependent alcohol dehydrogenases (ADH). This group
contains the hypothetical TM0436 alcohol dehydrogenase
from Thermotoga maritima, proteins annotated as
5-exo-alcohol dehydrogenase, and other members of the
medium chain dehydrogenases/reductase
(MDR)/zinc-dependent alcohol dehydrogenase-like family.
MDR, which contains the zinc-dependent alcohol
dehydrogenase (ADH-Zn) and related proteins, is a diverse
group of proteins related to the first identified member,
class I mammalian ADH. MDRs display a broad range of
activities and are distinguished from the smaller short
chain dehydrogenases (~ 250 amino acids vs. the ~ 350
amino acids of the MDR). The MDR proteins have 2
domains: a C-terminal NAD(P) binding-Rossmann fold domain
of a beta-alpha form and an N-terminal catalytic domain
with distant homology to GroES. The MDR group contains a
host of activities, including the founding alcohol
dehydrogenase (ADH), quinone reductase, sorbitol
dehydrogenase, formaldehyde dehydrogenase, butanediol DH,
ketose reductase, cinnamyl reductase, and numerous
others. The zinc-dependent alcohol dehydrogenases (ADHs)
catalyze the NAD(P)(H)-dependent interconversion of
alcohols to aldehydes or ketones. Active site zinc has a
catalytic role, while structural zinc aids in stability.
Length = 361
Score = 34.5 bits (80), Expect = 0.46
Identities = 11/32 (34%), Positives = 16/32 (50%), Gaps = 2/32 (6%)
Query: 1427 DKIVLEVGCGM-GLLSL-FCAEAGAKHVISVD 1456
V+ G G GL ++ AGA+ VI +D
Sbjct: 178 GDTVVVQGAGPLGLYAVAAAKLAGARRVIVID 209
>gnl|CDD|149213 pfam08003, Methyltransf_9, Protein of unknown function (DUF1698).
This family contains many hypothetical proteins. It also
includes two putative methyltransferase proteins.
Length = 315
Score = 34.3 bits (79), Expect = 0.49
Identities = 34/110 (30%), Positives = 46/110 (41%), Gaps = 33/110 (30%)
Query: 1423 HL--LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITVI 1480
HL LK + +L+VGCG G GA V+ +D S + E V
Sbjct: 110 HLSPLKGRTILDVGCGNGYHMWRMLGEGAALVVGIDPSELFLCQFEAV------------ 157
Query: 1481 CRRMEDIDR----LPHGIENV------DIIVSNWMGHVLY-----LDSLI 1515
R++ D+ LP GIE + D + S MG VLY LD L+
Sbjct: 158 -RKLLGNDQRAHLLPLGIEQLPALEAFDTVFS--MG-VLYHRRSPLDHLL 203
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 34.8 bits (80), Expect = 0.50
Identities = 31/182 (17%), Positives = 64/182 (35%), Gaps = 35/182 (19%)
Query: 935 QFQNNMIKTIDFHQFQNNVMKNHPGNKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKD 994
+ + ++ + +F + ++ +P K +K I + ++ +
Sbjct: 342 RLKASVNQVFEFEIYLYKIINANPDKLENKKIEISIKPEKKT-INNSQIITTKKINI-VE 399
Query: 995 EIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKE 1054
E K V + K + N + NK+Q+I K+
Sbjct: 400 ESNK------------NSVHFDTLYKTKIFYHKKKIN---QNNKEQDI----------KK 434
Query: 1055 EDKIEKEENNTNAK--ENDE--KSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSK 1110
E+ +EKE + K +NDE + + K K +K + + + + K+E S
Sbjct: 435 EELLEKEFVKKSEKIPKNDELLDNLELAKQKFFNKDIELSKNMLQKFN----KFKNEESA 490
Query: 1111 EN 1112
EN
Sbjct: 491 EN 492
>gnl|CDD|129544 TIGR00452, TIGR00452, tRNA (mo5U34)-methyltransferase. This model
describes a set of proteins with homology in its central
region to a large number of methyltransferases active on
a variety of substrates. The similarity includes a motif
resembling ILDVGCG[SG]G. Known examples to date are
restricted to the proteobacteria [Unknown function,
Enzymes of unknown specificity].
Length = 314
Score = 34.1 bits (78), Expect = 0.55
Identities = 30/101 (29%), Positives = 44/101 (43%), Gaps = 26/101 (25%)
Query: 1420 PNSHLLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITV 1479
P+ LK + +L+VGCG G GAK ++ +D +V+ E V
Sbjct: 115 PHLSPLKGRTILDVGCGSGYHMWRMLGHGAKSLVGIDPTVLFLCQFEAV----------- 163
Query: 1480 ICRRMEDIDR----LPHGIENV------DIIVSNWMGHVLY 1510
R++ D D+ P GIE + D + S MG VLY
Sbjct: 164 --RKLLDNDKRAILEPLGIEQLHELYAFDTVFS--MG-VLY 199
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 34.3 bits (79), Expect = 0.61
Identities = 15/86 (17%), Positives = 30/86 (34%), Gaps = 2/86 (2%)
Query: 1028 TNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAK 1087
N K KQQN Q + KK K + ++ ++ ++ + ++++N K K
Sbjct: 4 ENGENEVKQQKQQNKQKGTK-KKNKKSKKDVDDDDAFLAELISENQEAENKQNN-KKKKK 61
Query: 1088 DTATKLSNSADSIKLNKKDEGSKENK 1113
K + D +
Sbjct: 62 KKKKKKKKNLGEAYDLAYDLPVVWSS 87
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 34.4 bits (79), Expect = 0.65
Identities = 24/122 (19%), Positives = 43/122 (35%), Gaps = 12/122 (9%)
Query: 1006 DKKDDKKVDDKN-------VSKDQKI---TKSTNNNLNKKTNKQQNIQILSPAKKPNKEE 1055
+ +D KN KI + ++ LN NK+ + ++ KK NK +
Sbjct: 8 SISSENTIDLKNPLFIKNLKHSSYKIRLESDIKDSLLNLDINKKLHEKLDKKNKKFNKTD 67
Query: 1056 DKIE--KEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKENK 1113
D + K + K + DD+ + D ++ A S+ K + K K
Sbjct: 68 DLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKK 127
Query: 1114 PQ 1115
Sbjct: 128 KI 129
Score = 32.9 bits (75), Expect = 1.8
Identities = 25/131 (19%), Positives = 48/131 (36%), Gaps = 6/131 (4%)
Query: 1200 PMFNNLYNSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSN 1259
P+F ++Y + + DS N+D + ++K K K ++SKKT
Sbjct: 20 PLFIKNLKHSSYKI-RLESDIKDSLLNLDINKKLHEKLDKKNKKFNK-TDDLKDSKKTKL 77
Query: 1260 KKIKFKGPEILNETTEPTASSATNSNVTTVSNTADNSNNPSSILKKSKATEDCKVKTVQG 1319
K+ K ++ + + N++ A + K K + K TV
Sbjct: 78 KQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRK----PKPKTEKLKKKITVNK 133
Query: 1320 APKTEKSQTKS 1330
+ +K + S
Sbjct: 134 STNKKKKKVLS 144
Score = 32.5 bits (74), Expect = 2.3
Identities = 22/113 (19%), Positives = 38/113 (33%), Gaps = 4/113 (3%)
Query: 1001 IPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEK 1060
I L D KD D N +K+ K N NK + + + + KK K + K+
Sbjct: 33 IRLESDIKDSLLNLDINKKLHEKLDK-KNKKFNKTDDLKDSKKTKLKQKK--KIKKKLHI 89
Query: 1061 EENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKENK 1113
+++ N ++ S + K + K K+ K
Sbjct: 90 DDDYDNFFDSKNNSKQFAGPLAISLMRKPKPK-TEKLKKKITVNKSTNKKKKK 141
>gnl|CDD|130009 TIGR00934, 2a38euk, potassium uptake protein, Trk family. The
proteins of the Trk family are derived from Gram-negative
and Gram-positive bacteria, yeast and wheat. The proteins
of E. coli K12 TrkH and TrkG as well as several yeast
proteins have been functionally characterized.The E. coli
TrkH and TrkG proteins are complexed to two peripheral
membrane proteins, TrkA, an NAD-binding protein, and
TrkE, an ATP-binding protein. This complex forms the
potassium uptake system. This family is specific for the
eukaryotic Trk system [Transport and binding proteins,
Cations and iron carrying compounds].
Length = 800
Score = 34.2 bits (78), Expect = 0.66
Identities = 29/226 (12%), Positives = 66/226 (29%), Gaps = 8/226 (3%)
Query: 858 VTTTTSSSETKPAESDNKLPSNNNPSEEGEDVSGRKNAKVVPGSPAKSNNMSQHYNEKGH 917
V + +SD + + + E + R + V P +S M Q +E+
Sbjct: 192 VKRDEPDQNSPDVKSDTRADESISDLEFEKFAKRRGSRDVDPEDLYRSIMMLQGIHERIR 251
Query: 918 KSGGERQRDRGDTGRHHQFQNNMIKTIDFHQFQNNVMKNHPGNKNFKNFNNKSYGDKNKN 977
+ R + Q Q + + ++ + + + +++KS+ +
Sbjct: 252 EKSSANSRSDERSSESIQEQVERRPS-------TSDIERNSQSLT-RRYDDKSFDKAVRL 303
Query: 978 EFKVTKIMQRPVDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTN 1037
T D + + AK + D KN K + K +
Sbjct: 304 RRSKTIDRAEACDLEELDRAKDFEKMTYDNWKAHHRKKKNFRPRGWNLKFRKASRFPKDS 363
Query: 1038 KQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLD 1083
+ + + + + N K +D ++ L
Sbjct: 364 DRNYEDNGNHLSASSSFGSEEPSLSSEENLYPTYNKKREDSRHTLS 409
>gnl|CDD|234456 TIGR04074, bacter_Hen1, 3' terminal RNA ribose 2'-O-methyltransferase
Hen1. Members of this protein family are bacterial Hen1,
a 3' terminal RNA ribose 2'-O-methyltransferase that acts
in bacterial RNA repair. All members of the seed
alignment belong to a cassette with the RNA repair enzyme
polynucleotide kinase-phosphatase (Pnkp). Chemically
similar Hen1 in eukaryotes acts instead on small
regulatory RNAs [Transcription, RNA processing, Protein
synthesis, tRNA and rRNA base modification].
Length = 462
Score = 33.8 bits (78), Expect = 0.73
Identities = 14/31 (45%), Positives = 18/31 (58%), Gaps = 1/31 (3%)
Query: 1430 VLEVGCGMG-LLSLFCAEAGAKHVISVDCSV 1459
VL++GCG G LL L AE + + VD S
Sbjct: 286 VLDLGCGEGKLLRLLLAEKQFERIAGVDVSA 316
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 34.1 bits (78), Expect = 0.76
Identities = 22/103 (21%), Positives = 44/103 (42%), Gaps = 2/103 (1%)
Query: 1006 DKKDDKKVDDK-NVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENN 1064
D+ +D DD N DQ + +S N + +K +++ Q + K N D E +
Sbjct: 137 DQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENTFKFYNFGNDGEEAAAKD 196
Query: 1065 TN-AKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKD 1106
+K +D +D + DD ++A + ++++ K
Sbjct: 197 GGKSKSSDPGPLNDSDGQGDDGDPESAEEDKAASNTRAAYTKA 239
>gnl|CDD|128898 smart00650, rADc, Ribosomal RNA adenine dimethylases.
Length = 169
Score = 32.5 bits (75), Expect = 0.92
Identities = 17/75 (22%), Positives = 30/75 (40%), Gaps = 8/75 (10%)
Query: 1430 VLEVGCGMGLLSLFCAEAGAK-HVISVDCSVITQLTQEVVEENDCSDVITVICRRMEDID 1488
VLE+G G G L+ E + I +D + +L ++ ++ +TVI D
Sbjct: 17 VLEIGPGKGALTEELLERAKRVTAIEIDPRLAPRLREKFAAADN----LTVI---HGDAL 69
Query: 1489 RLPHGIENVDIIVSN 1503
+ +V N
Sbjct: 70 KFDLPKLQPYKVVGN 84
>gnl|CDD|235350 PRK05134, PRK05134, bifunctional 3-demethylubiquinone-9
3-methyltransferase/ 2-octaprenyl-6-hydroxy phenol
methylase; Provisional.
Length = 233
Score = 33.2 bits (77), Expect = 0.97
Identities = 16/32 (50%), Positives = 19/32 (59%), Gaps = 1/32 (3%)
Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVD 1456
L K VL+VGCG G+LS A GA V +D
Sbjct: 47 LFGKRVLDVGCGGGILSESMARLGAD-VTGID 77
>gnl|CDD|151322 pfam10873, DUF2668, Protein of unknown function (DUF2668). Members
in this family of proteins are annotated as Cysteine and
tyrosine-rich protein 1, however currently no function
is known.
Length = 154
Score = 32.1 bits (73), Expect = 0.97
Identities = 13/34 (38%), Positives = 19/34 (55%), Gaps = 3/34 (8%)
Query: 564 PGPPPQSTSDYKMLYWNNVPMPNMLHAPPPQAPH 597
P PP T D++M Y ++P P ++P PQA
Sbjct: 110 PMAPPPYTYDHEMEYPTDLPPP---YSPAPQASA 140
>gnl|CDD|129628 TIGR00537, hemK_rel_arch, HemK-related putative methylase. The gene
hemK from E. coli was found to contribute to heme
biosynthesis and originally suggested to be
protoporphyrinogen oxidase (Medline 95189105). Functional
analysis of the nearest homolog in Saccharomyces
cerevisiae, YNL063w, finds it is not protoporphyrinogen
oxidase and sequence analysis suggests that HemK homologs
have S-adenosyl-methionine-dependent methyltransferase
activity (Medline 99237242). Homologs are found, usually
in a single copy, in nearly all completed genomes, but
varying somewhat in apparent domain architecture. This
model represents an archaeal and eukaryotic protein
family that lacks an N-terminal domain found in HemK and
its eubacterial homologs. It is found in a single copy in
the first six completed archaeal and eukaryotic genomes
[Unknown function, Enzymes of unknown specificity].
Length = 179
Score = 32.5 bits (74), Expect = 1.1
Identities = 11/34 (32%), Positives = 19/34 (55%), Gaps = 1/34 (2%)
Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCS 1458
LK VLE+G G GL+++ G +++ D +
Sbjct: 18 LKPDDVLEIGAGTGLVAIRLKGKGKC-ILTTDIN 50
>gnl|CDD|182340 PRK10258, PRK10258, biotin biosynthesis protein BioC; Provisional.
Length = 251
Score = 32.8 bits (75), Expect = 1.1
Identities = 19/74 (25%), Positives = 33/74 (44%), Gaps = 9/74 (12%)
Query: 1430 VLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEENDCSDVITVICRRMEDIDR 1489
VL+ GCG G +S + E G+ V ++D S + + +++ + DI+
Sbjct: 46 VLDAGCGPGWMSRYWRERGS-QVTALDLS--PPMLAQARQKDAADHYLA------GDIES 96
Query: 1490 LPHGIENVDIIVSN 1503
LP D+ SN
Sbjct: 97 LPLATATFDLAWSN 110
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 33.5 bits (77), Expect = 1.2
Identities = 34/262 (12%), Positives = 76/262 (29%), Gaps = 32/262 (12%)
Query: 823 IPPTVLPSSEKSRKPISHVSPPVVTSIITSTATSNVTTTTSSSETKPAESDNKLPSNNNP 882
T +S+ + + S + ++V + ++ + P N
Sbjct: 1159 KSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKS 1218
Query: 883 SEEGEDVSGRKNAKVVPGSPAKSNNMSQHYNEKGHKSGGERQRDRGDTGRHHQFQNNMIK 942
+ G D + K P + S+ N +
Sbjct: 1219 NSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSD--------------- 1263
Query: 943 TIDFHQFQNNVMKNHPGNKNFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPIP 1002
+ KN P + ++ +K + +P K ++ K
Sbjct: 1264 ----DLSKEGKPKNAPKRVSAVQYSPPPP---SKRPDGESNGGSKPSSPTKKKVKKRLEG 1316
Query: 1003 LGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEE 1062
K KK + K K KS ++ + + K+ D +++
Sbjct: 1317 SLAALKKKKKSEKKTARKK----KSKTRVKQASASQSSRLL----RRPRKKKSDSSSEDD 1368
Query: 1063 NNTNAKENDEKSTDDEKNKLDD 1084
+++ + +D + DDE ++ DD
Sbjct: 1369 DDS--EVDDSEDEDDEDDEDDD 1388
>gnl|CDD|177970 PLN02336, PLN02336, phosphoethanolamine N-methyltransferase.
Length = 475
Score = 33.2 bits (76), Expect = 1.2
Identities = 31/121 (25%), Positives = 56/121 (46%), Gaps = 31/121 (25%)
Query: 1425 LKDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCSVITQLTQEVVEEND------------ 1472
+ K VLE+G G+G + A+ A VI++D + V+++N+
Sbjct: 36 YEGKSVLELGAGIGRFTGELAKK-AGQVIALD------FIESVIKKNESINGHYKNVKFM 88
Query: 1473 CSDVITVICRRMEDIDRLPHGIENVDIIVSNWMGHVLYL-DSLINAVVYARDRFLKPHGL 1531
C+DV + D++ +VD+I SNW+ ++YL D + + ++LK G
Sbjct: 89 CADVTS------PDLNISD---GSVDLIFSNWL--LMYLSDKEVENLAERMVKWLKVGGY 137
Query: 1532 I 1532
I
Sbjct: 138 I 138
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 33.6 bits (77), Expect = 1.3
Identities = 21/156 (13%), Positives = 61/156 (39%), Gaps = 11/156 (7%)
Query: 968 NKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQK-ITK 1026
+ + ++ + + ++E+ +L ++ + + + + ++ I +
Sbjct: 248 KERLEELKARLLEIESLELEALKIREEELRELE-----RLLEELEEKIERLEELEREIEE 302
Query: 1027 STNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKA 1086
+ ++ ++L K E+++EK E E++ + +EKN+L
Sbjct: 303 LEEELEGLRALLEELEELLEKLKS---LEERLEKLEEKLEKLESELEELAEEKNELAKLL 359
Query: 1087 KDTATKLSNSADSI--KLNKKDEGSKENKPQQQRRK 1120
++ +L + + +L K E K+ + Q K
Sbjct: 360 EERLKELEERLEELEKELEKALERLKQLEEAIQELK 395
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 33.2 bits (76), Expect = 1.3
Identities = 19/84 (22%), Positives = 40/84 (47%), Gaps = 9/84 (10%)
Query: 1008 KDDKKVDDKNVSKDQKITKSTNNNLN---KKTNKQQNIQILSPAKKPNKEEDKIEKEENN 1064
K + + K Q++ K N L KK + ++ S + +E+K+++EEN
Sbjct: 357 KSNTDLSGD---KRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENE 413
Query: 1065 TNAKENDEKSTDDEKNKLDDKAKD 1088
K+ E++ +D++ + D+ K
Sbjct: 414 ---KKQKEQADEDKEKRQKDERKK 434
>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
Length = 158
Score = 32.0 bits (73), Expect = 1.4
Identities = 19/96 (19%), Positives = 35/96 (36%), Gaps = 6/96 (6%)
Query: 1020 KDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEK 1079
K +K +S +L KK + L K + E K ++ K +E + +
Sbjct: 65 KYKKEFESAVESLKKKLK----FEELDDLKITAENEIKSIQDLLQDYKKSLEEDTIPNHL 120
Query: 1080 NKLDDKAKDTATKL--SNSADSIKLNKKDEGSKENK 1113
N+ + S +KL K++ K +K
Sbjct: 121 NEEVSNETKLTQESSSDESPKEVKLATKNKTKKHDK 156
>gnl|CDD|221315 pfam11921, DUF3439, Domain of unknown function (DUF3439). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 46 to 94 amino acids in length. This domain is
found associated with pfam01462, pfam00560.
Length = 122
Score = 31.5 bits (71), Expect = 1.5
Identities = 19/60 (31%), Positives = 28/60 (46%), Gaps = 3/60 (5%)
Query: 812 YVNHNKYEPQYIPPTVLPSSEK---SRKPISHVSPPVVTSIITSTATSNVTTTTSSSETK 868
++ N + V P S K S KP+ + P T+ T+T T+ TTTT + TK
Sbjct: 11 WIQKNSGIVKDSSGYVDPDSAKCSGSGKPVRSIICPTTTTTTTTTTTTMPTTTTLPTTTK 70
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 33.2 bits (75), Expect = 1.5
Identities = 48/254 (18%), Positives = 90/254 (35%), Gaps = 39/254 (15%)
Query: 864 SSETKPAESDNKLPSNNNPSEEGEDVSGRKNAKVVPGSPAKSNNMSQHYNEKGHKSGGER 923
+ E K AE +NK+ + E ED + AK K ++ + +K + E+
Sbjct: 1718 AEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEK 1777
Query: 924 QR------DRGDTGRHHQFQNNMIKTIDFHQFQNNVMKNHPGNKNFKNFNN-KSYGDKNK 976
+ D D R + + F N G + N+ K D
Sbjct: 1778 EAVIEEELDEEDEKRRMEVDKKIKDI-----FDNFANIIEGGKEGNLVINDSKEMEDSAI 1832
Query: 977 NEFKVTKIMQRPVDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLN--- 1033
E +K MQ ++ + +K+ + + N +
Sbjct: 1833 KEVADSKNMQL--------------------EEADAFEKHKFNKNNENGEDGNKEADFNK 1872
Query: 1034 KKTNKQQNIQILSPAKKPNK-EEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATK 1092
+K K+ + + + A + K ++D IE+E N N + DD+ +K + +D
Sbjct: 1873 EKDLKEDDEEEIEEADEIEKIDKDDIEREIPNNNMAGKNNDIIDDKLDKDEYIKRDAE-- 1930
Query: 1093 LSNSADSIKLNKKD 1106
+ IK++KKD
Sbjct: 1931 -ETREEIIKISKKD 1943
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 32.0 bits (73), Expect = 1.7
Identities = 20/85 (23%), Positives = 34/85 (40%), Gaps = 2/85 (2%)
Query: 994 DEIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNK 1053
DEI + + +K D K DK V ++ + + K ++ + + ++
Sbjct: 85 DEIDEEIQAM--TEKKDIKAKDKEVDAFEEGDEDELDYDEDKEEEEDDEVDSLDDENDDE 142
Query: 1054 EEDKIEKEENNTNAKENDEKSTDDE 1078
+ED E E E DE DDE
Sbjct: 143 DEDDDEIVEILIEDDEVDEDEDDDE 167
>gnl|CDD|181159 PRK07890, PRK07890, short chain dehydrogenase; Provisional.
Length = 258
Score = 32.2 bits (74), Expect = 1.7
Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 3/33 (9%)
Query: 1424 LLKDKIVLEVGCGMGL---LSLFCAEAGAKHVI 1453
LLK K+V+ G G GL L++ A AGA V+
Sbjct: 2 LLKGKVVVVSGVGPGLGRTLAVRAARAGADVVL 34
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that may
also include truncated forms and/or pseudogenes. Members
are predicted lipoproteins with a conserved signal
peptidase II processing and lipid attachment site. Note
that the name for certain characterized members, p72,
reflects an anomalous apparent molecular weight, given a
theoretical MW of about 61 kDa.
Length = 541
Score = 32.9 bits (75), Expect = 1.8
Identities = 15/99 (15%), Positives = 35/99 (35%), Gaps = 1/99 (1%)
Query: 1006 DKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNT 1065
K + K ++ + K + N N K+N + + +P+ +++ K +
Sbjct: 36 KKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDE-ENPSSTNPEKKPDPSKNKEEI 94
Query: 1066 NAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNK 1104
+++ K D + +N D L+K
Sbjct: 95 EKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDFSDLDK 133
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 32.5 bits (73), Expect = 1.9
Identities = 21/81 (25%), Positives = 34/81 (41%)
Query: 1217 KQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFKGPEILNETTEP 1276
K+ +T +KK+ +T K T+ + + S T+ K N T
Sbjct: 272 KENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTS 331
Query: 1277 TASSATNSNVTTVSNTADNSN 1297
T S TN+N + +NT N+N
Sbjct: 332 TPSKNTNTNTNSNTNTNSNTN 352
Score = 31.3 bits (70), Expect = 5.1
Identities = 29/138 (21%), Positives = 54/138 (39%), Gaps = 5/138 (3%)
Query: 1202 FNNLYNSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKK 1261
+NNL +S+ Y K K+T + K + T +K+A +++ + T+ KK
Sbjct: 227 WNNLSSSSIYVGQKLAIKQT-ANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKK 285
Query: 1262 ----IKFKGPEILNETTEPTASSATNSNVTTVSNTADNSNNPSSILKKSKATEDCKVKTV 1317
+ P+ E +P + +TN+N + + + N ++ SK T
Sbjct: 286 ETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNT 345
Query: 1318 QGAPKTEKSQTKSGEYDN 1335
T +Q S N
Sbjct: 346 NTNSNTNANQGSSNNNSN 363
>gnl|CDD|225319 COG2521, COG2521, Predicted archaeal methyltransferase [General
function prediction only].
Length = 287
Score = 32.4 bits (74), Expect = 2.0
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 1430 VLEVGCGMGLLSLFCAEAGAKHVISVD 1456
VL+ G+G ++ E GA HVI+V+
Sbjct: 138 VLDTCTGLGYTAIEALERGAIHVITVE 164
>gnl|CDD|227938 COG5651, COG5651, PPE-repeat proteins [Cell motility and
secretion].
Length = 490
Score = 32.6 bits (74), Expect = 2.1
Identities = 19/162 (11%), Positives = 42/162 (25%), Gaps = 8/162 (4%)
Query: 321 NEALYAPS-----STTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFM 375
EA YA + Y+++S +A+T + S+G + V +
Sbjct: 136 TEAQYAEMWAQDAAAMYGYYAAASAAA-SALTPFNEPPPTTNSSGLAAQASAVQALGDLA 194
Query: 376 AATTH-GYSSISNTITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSN 434
+ T ++S + N + Q +G+ S
Sbjct: 195 SGITLASQVNLSLLELINPATLSGLANGGTGNLGIGALQQAQNLGFGNVGFGNLGSGNPG 254
Query: 435 SNTTGYSSQNLYQQQPSSVSYSSNSYVYPQTSIATSSNVSST 476
+ + T+I ++ +
Sbjct: 255 APGLASQFSATNLGTLLGSLNPYLGNI-GATNIGLAAAGTGN 295
Score = 32.2 bits (73), Expect = 2.7
Identities = 23/130 (17%), Positives = 33/130 (25%), Gaps = 10/130 (7%)
Query: 318 LFSNEALYAPSSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAA 377
S A S + P+ + A SG N N G GG +
Sbjct: 334 AGSGNLGVANSGSAAAPFGIAGANQAALGGANSGAGNFGLGNNPGGGLGGKPLGGTGNGG 393
Query: 378 TTHGYSSISNT----ITNTQPSQQQSNNSVYSS------TINSYSSFPQQSSVNLNTYGH 427
+ I N S SNN+ + T N G
Sbjct: 394 IGASGIGNTGYGNSGIANAGLSNAGSNNAGGENAGNANNTGGGNVGLWNAGDFNAGAAGT 453
Query: 428 QASSLSNSNT 437
++ + NT
Sbjct: 454 GFTNNGSYNT 463
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1, of
polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 32.5 bits (74), Expect = 2.3
Identities = 34/187 (18%), Positives = 71/187 (37%), Gaps = 28/187 (14%)
Query: 960 NKNFKNFNNKSYGDK-----NKNEFKVTKIMQRPVDENKDEIAKLPIP------------ 1002
N + + N+ YG + N N + T + PV K
Sbjct: 112 NLHNCSKNSLEYGKQAGPITNPNVKRRTGVGLPPVAPAASPALKPTANGKRPSSKPPKSI 171
Query: 1003 LGGDKKDDKKVDDKNVSKDQKI------TKSTNNNLNKKTNKQQNIQILSPAKKPNKEED 1056
+ + K ++ SK+ T +L + K+ NI + S KK KE+
Sbjct: 172 MSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNI-MSSFFKKKTKEKK 230
Query: 1057 KIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTAT----KLSNSADSIKLNKKDEGSKEN 1112
+ ++ +T +E++E+S + D+ A+ T S + + +E ++E
Sbjct: 231 EKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEK 290
Query: 1113 KPQQQRR 1119
+ ++++R
Sbjct: 291 EKEKRKR 297
>gnl|CDD|184826 PRK14799, thrS, threonyl-tRNA synthetase; Provisional.
Length = 545
Score = 32.5 bits (74), Expect = 2.4
Identities = 39/194 (20%), Positives = 77/194 (39%), Gaps = 34/194 (17%)
Query: 1227 VDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFKGPEILNETTEPTASSATNSNV 1286
+D K D++ T + KKA + + + N +I +KG ++ + + S+ N
Sbjct: 38 IDVKSDTSITLDEVKKAINENVLANVSIE---NNQIVYKGNKVSIIEDKVSISTNLNPKY 94
Query: 1287 TTVSNTADNSNNPSSILKKSKATEDCKVKTVQGAPKTEKSQ-----TKSGEYDN--LDER 1339
+ N + + NP+ +++ V + + K+ E D+ + E+
Sbjct: 95 FEILNISTHHPNPNEQY--------VRIRGVAFETEEQLKDYLTWLEKAEETDHRLIGEK 146
Query: 1340 EPSTIQLFM-------GLGLDKPPSQ---KPLVYPIRTFN--MDIDDMTSREIFEDVYAD 1387
+ LF GL L P Q L+ +R N M ++ + +F+
Sbjct: 147 ----LDLFSFHEEAGSGLVLFHPKGQTIRNELIAFMREINDSMGYQEVYTSHVFKTDIWK 202
Query: 1388 IKTHQTTYKDKQLI 1401
I H T Y+DK ++
Sbjct: 203 ISGHYTLYRDKLIV 216
>gnl|CDD|234029 TIGR02830, spore_III_AG, stage III sporulation protein AG. CC A
comparative genome analysis of all sequenced genomes of
shows a number of proteins conserved strictly among the
endospore-forming subset of the Firmicutes. This protein,
a member of this panel, is found in a spore formation
operon and is designated stage III sporulation protein AG
[Cellular processes, Sporulation and germination].
Length = 186
Score = 31.6 bits (72), Expect = 2.5
Identities = 23/123 (18%), Positives = 46/123 (37%), Gaps = 26/123 (21%)
Query: 963 FKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKLPIPLGGDKK-------------- 1008
F + + D NE + +Q V +DEI+ + K
Sbjct: 23 FSSSEDIEESDTPNNEKTEPEFVQGEVQ-KEDEISDYEKQYENELKEILEKIEGVGDVTV 81
Query: 1009 ------DDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQ---ILSPAKKPNKEEDKIE 1059
++KV KN SK Q+ T+ T+ K++ + ++ ++ + ++E +
Sbjct: 82 MVNLDSSEEKVYAKNTSKGQQTTEETDKEGGKRSVEDESDGEEVVII--RNGDQETPVVL 139
Query: 1060 KEE 1062
K E
Sbjct: 140 KTE 142
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This protein,
which interacts with both microtubules and TRAF3 (tumour
necrosis factor receptor-associated factor 3), is
conserved from worms to humans. The N-terminal region is
the microtubule binding domain and is well-conserved; the
C-terminal 100 residues, also well-conserved, constitute
the coiled-coil region which binds to TRAF3. The central
region of the protein is rich in lysine and glutamic acid
and carries KKE motifs which may also be necessary for
tubulin-binding, but this region is the least
well-conserved.
Length = 506
Score = 32.2 bits (73), Expect = 2.5
Identities = 17/134 (12%), Positives = 41/134 (30%), Gaps = 4/134 (2%)
Query: 1215 KSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFKGPEILNETT 1274
K K+ + +KK++ R KK +K+ ++++ K+ + + +
Sbjct: 150 KEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPE 209
Query: 1275 EPT----ASSATNSNVTTVSNTADNSNNPSSILKKSKATEDCKVKTVQGAPKTEKSQTKS 1330
EP + + T+ + S + +K +P +T+
Sbjct: 210 EPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRE 269
Query: 1331 GEYDNLDEREPSTI 1344
S
Sbjct: 270 SSKRTETRPRTSLR 283
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 32.2 bits (73), Expect = 2.6
Identities = 23/93 (24%), Positives = 38/93 (40%), Gaps = 15/93 (16%)
Query: 1020 KDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKEND-EKSTDDE 1078
K K TK N K T ++ K K+ +N + EN E T+D
Sbjct: 3 KASKATKKFTKNHLKNTIDRR---------KQLARSKKVYGTKNRNSHTENKMESGTNDN 53
Query: 1079 KNKLDDKAKDTATKLSNSADSIKLNKKDEGSKE 1111
+K K+ +KL + DS ++++GS+
Sbjct: 54 -----NKNKEKLSKLYSDVDSSSSEEEEDGSES 81
>gnl|CDD|220679 pfam10294, Methyltransf_16, Putative methyltransferase.
Length = 170
Score = 31.1 bits (71), Expect = 2.8
Identities = 26/130 (20%), Positives = 49/130 (37%), Gaps = 7/130 (5%)
Query: 1407 SIINNSHLFKVRKPNSHLLKDKIVLEVGCGMGLLSLFCAEAGAKH--VISVDCSVITQLT 1464
+++ + +L P + L VLE+G G GL+ + V D +L
Sbjct: 25 AVVLSKYLEMKITPGGNNLSGLNVLELGSGTGLVGI-AVALLLPGASVTITDLEEAIELM 83
Query: 1465 QEVVEENDCSDVITV-ICRRMEDIDRLPHGIENVDIIVSNWMGHVLYLDSLINAVVYARD 1523
++ +E N S +T + ED+ VD+I+ +Y + + +
Sbjct: 84 KKNIELNGLSSKVTAKVLDWGEDLPDDVFDPHPVDLIL---AADCVYNEDSFPLLEKTLN 140
Query: 1524 RFLKPHGLIL 1533
L +IL
Sbjct: 141 DLLGKETVIL 150
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 31.9 bits (72), Expect = 2.9
Identities = 20/97 (20%), Positives = 30/97 (30%), Gaps = 9/97 (9%)
Query: 789 SRHLSYEDKDLAKPPVPHTSGNIYVNHNKYEPQYIPPTVLPSSEKSRKPISHVSPPVVTS 848
R S + S + ++ + P++ R P S +PP T
Sbjct: 240 QRGPSGSPQHPPSTTSQDQSTT--GDGQEHTQR---RKTPPATSNRRSPHSTATPPPTTK 294
Query: 849 II-TSTATSNVTTTTSSSETKPAES---DNKLPSNNN 881
T T T TT S + P S P+ N
Sbjct: 295 RQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQN 331
>gnl|CDD|240410 PTZ00418, PTZ00418, Poly(A) polymerase; Provisional.
Length = 593
Score = 32.1 bits (73), Expect = 3.0
Identities = 18/83 (21%), Positives = 33/83 (39%), Gaps = 2/83 (2%)
Query: 811 IYVNHNKYEPQYIPPTVLPSSEKSRKPISHVSPPVVTSIITSTATSNVTTTTSSSETKPA 870
Y + +Y+ + LP+ S+ P V T TS+AT++ + +S S +
Sbjct: 510 KYPDQIDINIKYLKKSQLPAFVLSQTPEEPVKTKANTKTNTSSATTSGQSGSSGSTSNSN 569
Query: 871 --ESDNKLPSNNNPSEEGEDVSG 891
ES + S + +G
Sbjct: 570 SNESSPTMSSTELLNVSSTSTTG 592
>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118). This is
the highly conserved family of the major tail subunit
protein.
Length = 121
Score = 30.1 bits (67), Expect = 3.4
Identities = 17/60 (28%), Positives = 33/60 (55%), Gaps = 2/60 (3%)
Query: 1043 QILSPAKKPNKEEDKIEKEENNTNAKENDEK--STDDEKNKLDDKAKDTATKLSNSADSI 1100
+I+ K +++ + E+E+ + +KE +K ST +E+ +L DK D + S DS+
Sbjct: 42 KIVRVEMKEERDKMETEREKRDKESKEERDKFISTMNEQQRLMDKQNDMMGQQQQSIDSL 101
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is a
family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 31.7 bits (72), Expect = 3.6
Identities = 13/106 (12%), Positives = 34/106 (32%), Gaps = 11/106 (10%)
Query: 1201 MFNNLYNSNNYNNNKSKQKKTDSTNNVDKKQD---------STSTDERKKKATQKQQQSE 1251
++L + K +K S ++ + +D + + KK + +
Sbjct: 4 FLDSLPDEKA--PTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGT 61
Query: 1252 EESKKTSNKKIKFKGPEILNETTEPTASSATNSNVTTVSNTADNSN 1297
+ K S+K + + +P S+ + + S +
Sbjct: 62 PRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTES 107
Score = 30.5 bits (69), Expect = 7.4
Identities = 20/87 (22%), Positives = 37/87 (42%), Gaps = 10/87 (11%)
Query: 1212 NNNKSKQKKTDSTNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFKGPEILN 1271
++ +K + T S + +K+ + + S +SK S + + + PE
Sbjct: 60 GTPRNPKKSSKPTE--SSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEEEEPE--- 114
Query: 1272 ETTEPTAS-----SATNSNVTTVSNTA 1293
ET +P AS S S +T ++TA
Sbjct: 115 ETPDPIASIGGWWSLWGSITSTATSTA 141
>gnl|CDD|221309 pfam11912, DUF3430, Protein of unknown function (DUF3430). This
family of proteins are functionally uncharacterized.
This protein is found in eukaryotes. Proteins in this
family are typically between 209 to 265 amino acids in
length.
Length = 204
Score = 30.9 bits (70), Expect = 3.7
Identities = 25/115 (21%), Positives = 38/115 (33%), Gaps = 6/115 (5%)
Query: 363 SNNGGVYSTNSF--MAATTHGYSSISNTI-TNTQPSQQQSNNSVYSSTINSYSSFPQQSS 419
S++G + N + + S+ + I + S + Y S+ P S
Sbjct: 69 SSDGNIIKYNYYEDSNCSGKPISTETFEIGSCVYSSSYSGSYYSYISSCTDSPEIPSNSL 128
Query: 420 VNLNTYGHQASSLSNSNTTGYSSQNL--YQQQPSSVSYSSNSYVYPQTSIATSSN 472
V Y S N Y + N S +Y NS P T+ SSN
Sbjct: 129 VT-VYYNGSDCSNDNFLYIEYFTNNTCINNNNGFSFTYYCNSTNQPYTTYCKSSN 182
>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
Length = 673
Score = 31.7 bits (72), Expect = 3.7
Identities = 19/68 (27%), Positives = 29/68 (42%), Gaps = 15/68 (22%)
Query: 1033 NKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATK 1092
+K T K+Q+I I + P E +++ +E EK +DK K A
Sbjct: 522 DKGTGKKQDITITGASTLPKDEVERMVQEA---------------EKFAKEDKEKRDAVD 566
Query: 1093 LSNSADSI 1100
N ADS+
Sbjct: 567 TKNQADSV 574
>gnl|CDD|215437 PLN02816, PLN02816, mannosyltransferase.
Length = 546
Score = 31.9 bits (72), Expect = 3.7
Identities = 32/155 (20%), Positives = 56/155 (36%), Gaps = 18/155 (11%)
Query: 537 IGHFEFRNTLDIAPM-IPMDSYMTNYMQPGPPPQSTSDYKMLYWNNVPMPNMLHAPPPQA 595
+GH EFR L + P+ + Y M+ S+S K VP N P +
Sbjct: 335 LGHKEFRFVLPVLPIALIFSGYAFAQMEVSGSSSSSSVTKK---KQVPRQNHTKWSP-KL 390
Query: 596 PHSAMYKYTSHIPYDMPTSQATQKPFMKLNDSTSGTDTSAYISPTHMRVHTHQI----PA 651
S + ++IP + +M L D Y+S + I P
Sbjct: 391 RLSVYFLLATNIPMAL---------YMSLFHQRGTEDAMNYLSDEAYKGRVKSILFLMPC 441
Query: 652 HRYPVENTTTNKMPLAYTEPQPPNLQASLNDYQRY 686
H P +T +P+ + + P + L++ ++
Sbjct: 442 HSTPYYSTLHRNIPMQFLDCTPSAEKGELDESDQF 476
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this family
are typically between 146 and 237 amino acids in length.
There are two conserved sequence motifs: QYGP and PKLY.
Length = 155
Score = 30.5 bits (69), Expect = 3.8
Identities = 20/83 (24%), Positives = 40/83 (48%), Gaps = 12/83 (14%)
Query: 1193 QYGGGGPPMFNNL------YNSNNYNNNKSKQKKTDSTNNVDKKQDSTSTDE--RKKKAT 1244
QYG P+ NL + + ++++ ++ + +ST+ +++ + E KKK
Sbjct: 73 QYG----PLVRNLPAMWKIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKRE 128
Query: 1245 QKQQQSEEESKKTSNKKIKFKGP 1267
+ ++E+E KT KK K P
Sbjct: 129 VPKPKTEKEKPKTEPKKPKPSKP 151
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 31.7 bits (71), Expect = 3.8
Identities = 60/364 (16%), Positives = 115/364 (31%), Gaps = 51/364 (14%)
Query: 326 APSSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAATTHGYS-- 383
AP +T + + + TTA ++ G T+ TT
Sbjct: 425 APDTTKSVIFVYTLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPE 484
Query: 384 -----SISNTITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSNSNTT 438
S + + T S + + +++ + + ++ + ++ + S T
Sbjct: 485 DTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPT 544
Query: 439 GYSSQNLYQQ-QPSSVSYSSNSYVYPQTSIATSSNVSSTSHTSHKYPPYNVDMKNHLDSK 497
G +S Q + S +N+ TS S ++S T
Sbjct: 545 GTTSVPNATSPQVTEESPVNNTNTPVVTS--APSVLTSAVTTGQHGT------------- 589
Query: 498 STLETKSSSEHPPYVDYPNYMGTQYQDSLMKDVKTAADPIGHFEFRNTLDIAPMIPMDSY 557
S P + + + + T+A P G N + P +P ++
Sbjct: 590 -GSSPTSQQPGIPSSSHST---PRSNSTSTTPLLTSAHPTGG---ENITEETPSVPSTTH 642
Query: 558 MTNYMQPGPPPQSTSDYKMLYWNNV-PMPNMLH----APPPQAPHSAMYKYTSHIPYDMP 612
++ + PGP P +TS ++ P +H P P A + P
Sbjct: 643 VST-LSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPS-----------AP 690
Query: 613 TSQATQKP---FMKLNDSTSGTDTSAYISPTHMRVHTHQIPAHRYPVENTTTNKMPLAYT 669
+ Q T P +++ +TS HT++ A R N TT P +
Sbjct: 691 SGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNE-GAFRTTPYNATTYLPPSTSS 749
Query: 670 EPQP 673
+ +P
Sbjct: 750 KLRP 753
>gnl|CDD|227002 COG4655, COG4655, Predicted membrane protein [Function unknown].
Length = 565
Score = 31.8 bits (72), Expect = 4.1
Identities = 19/89 (21%), Positives = 31/89 (34%), Gaps = 1/89 (1%)
Query: 347 TTASGNANNVASNGGGSNNGGVYSTNSFMAATTHGYSSISNTITNTQPSQQQSNNS-VYS 405
A GN N S S + GV + + S + TN S S +
Sbjct: 414 EIALGNVANPDSVASFSLDCGVTRADLLDSPLISIGGSAVSIYTNPPASSLTFFQSDIPY 473
Query: 406 STINSYSSFPQQSSVNLNTYGHQASSLSN 434
TI S S+ ++ + G+ S++
Sbjct: 474 RTIKSVSTRDTSVTLTGSLLGNLLLSINL 502
>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein. The
proteins in this family are poorly characterized, but an
investigation has indicated that the immediate early
protein is required the down-regulation of MHC class I
expression in dendritic cells. Human herpesvirus 6
immediate early protein is also referred to as U90.
Length = 993
Score = 31.6 bits (71), Expect = 4.5
Identities = 33/128 (25%), Positives = 59/128 (46%), Gaps = 22/128 (17%)
Query: 1202 FNNL-YNSNNYNNNKSKQKKTDS-TNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSN 1259
NN N+ + KS + T S T K+ ST+ + KK T K++ ++ K+
Sbjct: 619 MNNTNENTPFSKSGKSPPEVTPSKTFYKRDKKKDISTNRKVKKRTAKRKTVGYKTDKS-- 676
Query: 1260 KKIKFKGPEILNETTEPTASSATNSNVTTVSNTADNSNNPSSILKKSKATEDCKVKTVQG 1319
KKIK + S T++NV +S+ +++ + +I+KKS+ K ++
Sbjct: 677 KKIK-------------SDSLPTDTNVIVISSESEDEEDGFNIIKKSQL-----KKKIKS 718
Query: 1320 APKTEKSQ 1327
K+E S
Sbjct: 719 ELKSESSS 726
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 31.2 bits (71), Expect = 4.6
Identities = 20/85 (23%), Positives = 34/85 (40%), Gaps = 7/85 (8%)
Query: 1048 AKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDE 1107
K K+ ++E+ E K E S DE ++ + D + + +S + E
Sbjct: 206 EVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESE 265
Query: 1108 GSKENKPQQQRRKFNNFHNSSKYQR 1132
NKP + +RK +K QR
Sbjct: 266 YEPINKPVRPKRK-------TKAQR 283
>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
transcription initiation/nucleotide excision repair
factor TFIIH, subunit TFB3 [Cell division and chromosome
partitioning / Transcription / DNA replication,
recombination, and repair].
Length = 314
Score = 31.1 bits (70), Expect = 4.7
Identities = 28/155 (18%), Positives = 62/155 (40%), Gaps = 10/155 (6%)
Query: 1007 KKDDKKVDDKNVSKDQKITK-STNNNLNKKTNK----QQNIQILSPAKKPNKEEDKIEKE 1061
D ++K V K +++ + S NNL + T + +Q +L K K + + + E
Sbjct: 123 LIDVSLTEEK-VKKYEEMNQDSILNNLERPTPEVMPGKQK-NVLQKRMKLKKRQLERQIE 180
Query: 1062 ENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKENKPQQQRRKF 1121
E + N E + + N+L ++D + S K + ++ +P + + +
Sbjct: 181 EEE---RMNKEWTKKEIGNRLGTASEDGNKTIKIGIISDKFDPEELPRIVVEPTRNNKIY 237
Query: 1122 NNFHNSSKYQRHPYHNNGGDFNYVPVYDGNINKFN 1156
+ P+ GG +N ++ I+ +
Sbjct: 238 LSNVQGFLKDLQPFAPGGGSYNAHTIFSLAISSYG 272
>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
Length = 1467
Score = 31.6 bits (71), Expect = 4.7
Identities = 31/170 (18%), Positives = 53/170 (31%), Gaps = 38/170 (22%)
Query: 333 YPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAAT-------------- 378
PY + T T N + G+ V S +M +T
Sbjct: 1099 LPYFMNQTSALVGNTALPKGLNVFSGYMFGAGT--VASAFLYMNSTPQSPVLALLLAPYI 1156
Query: 379 THGYSSISNTITNTQPSQQQSN----------------NSVYSSTINSYSSFPQQSSVNL 422
++ ++++S + T + S + +Y I +Y
Sbjct: 1157 SYKFNALSLGFSITADAAIFSLFGIPAPQLLSSYIPTGSVLYQDPIFTYIPPGIIGMSGT 1216
Query: 423 NTYGHQASSLSNSNTTGYSSQNLYQQQPSSVSYSSNSYVYPQTSIATSSN 472
NT+ +A+ L S + + PSS S SS SI+TS
Sbjct: 1217 NTFTFKAAQLQLSAASSPPAATTPTPPPSSSSSSSAQ------SISTSPG 1260
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region [General
function prediction only].
Length = 303
Score = 31.2 bits (70), Expect = 5.2
Identities = 17/66 (25%), Positives = 29/66 (43%), Gaps = 2/66 (3%)
Query: 1052 NKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKE 1111
++ + E EE ++ E+DE +D K K+ + D A K + I + + E E
Sbjct: 238 DQSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKK--SRKPHIHIEYEQERENE 295
Query: 1112 NKPQQQ 1117
P Q
Sbjct: 296 KIPAVQ 301
>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein. This family consists
of the p50 and variable adherence-associated antigen
(Vaa) adhesins from Mycoplasma hominis. M. hominis is a
mycoplasma associated with human urogenital diseases,
pneumonia, and septic arthritis. An adhesin is a cell
surface molecule that mediates adhesion to other cells or
to the surrounding surface or substrate. The Vaa antigen
is a 50-kDa surface lipoprotein that has four tandem
repetitive DNA sequences encoding a periodic peptide
structure, and is highly immunogenic in the human host.
p50 is also a 50-kDa lipoprotein, having three repeats
A,B and C, that may be a tetramer of 191-kDa in its
native environment.
Length = 353
Score = 31.2 bits (70), Expect = 5.2
Identities = 31/98 (31%), Positives = 46/98 (46%), Gaps = 11/98 (11%)
Query: 996 IAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEE 1055
I+KL + K + K +DDKN +++ + N K Q NI+ L KK N E
Sbjct: 92 ISKLSAAVENAKNEKKAIDDKNAQIAKELAER-----NAKI--QSNIEEL---KKINNEA 141
Query: 1056 DKIEKEENNTNAK-ENDEKSTDDEKNKLDDKAKDTATK 1092
++ K N T A+ E K D K +L++ A D K
Sbjct: 142 FELSKTVNKTIAEVEKKFKIPKDFKEQLENFADDLLDK 179
>gnl|CDD|221391 pfam12042, RP1-2, Tubuliform egg casing silk strands structural
domain. Spiders use fibroins to make silk strands. This
family includes tubuliform silk fibroins which are used
to protect egg cases. This domain is a structural domain
which is found in repeats of up to 20 in many
individuals (although this is not necessarily the case).
RP1 makes up structural domains in the N terminal while
RP2 makes up structural domains in the C terminal.
Length = 167
Score = 30.2 bits (68), Expect = 5.3
Identities = 25/134 (18%), Positives = 50/134 (37%), Gaps = 15/134 (11%)
Query: 337 SSSTPTYAAMTTASGNANNVASNGGGSNNGG--------VYSTNSFMAATTHGY---SSI 385
SS+T AA + A + A + G SN V S +A+ + ++I
Sbjct: 38 SSATSASAAGSLAYALGQSAARSLGLSNASALASAVAQAVSSVGVGASASAYANAISNAI 97
Query: 386 SNTITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSNSNTTGYSSQNL 445
+ + +S+ SS ++ S+ S+ +++ + + + +S
Sbjct: 98 GQFLAGQGVLNASNASSLASSFASALSASAASSAAQ---AASASAAAAAAQSQAAASAF- 153
Query: 446 YQQQPSSVSYSSNS 459
Q S S S+ S
Sbjct: 154 SQAASQSSSQSAAS 167
>gnl|CDD|233115 TIGR00755, ksgA, dimethyladenosine transferase. In both E. coli and
Saccharomyces cerevisiae, this protein is responsible for
the dimethylation of two adjacent adenosine residues in a
conserved hairpin of 16S rRNA in bacteria, 18S rRNA in
eukaryotes. This adjacent dimethylation is the only rRNA
modification shared by bacteria and eukaryotes. A single
member of this family is present in each of the first 20
completed microbial genomes. This protein is essential in
yeast, but not in E. coli, where its deletion leads to
resistance to the antibiotic kasugamycin. Alternate name:
S-adenosylmethionine--6-N',N'-adenosyl (rRNA)
dimethyltransferase [Protein synthesis, tRNA and rRNA
base modification].
Length = 253
Score = 30.7 bits (70), Expect = 5.5
Identities = 17/82 (20%), Positives = 35/82 (42%), Gaps = 9/82 (10%)
Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAK-HVISVDCSVITQLTQEVVEENDCSDVITVICR 1482
+L+ +VLE+G G+G L+ + K I +D + +L + + + + +I
Sbjct: 27 VLEGDVVLEIGPGLGALTEPLLKRAKKVTAIEIDRRLAERLRKLLSLYEN----LEII-- 80
Query: 1483 RMEDIDRLPHGIEN-VDIIVSN 1503
D ++ +VSN
Sbjct: 81 -EGDALKVDLNEFPKQLKVVSN 101
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 30.9 bits (70), Expect = 5.6
Identities = 21/158 (13%), Positives = 33/158 (20%), Gaps = 8/158 (5%)
Query: 579 WNNVPMPNMLHAPPPQAPHSAMYKYTSHIPYDMPTSQATQKPFMKLNDSTSGTDTS-AYI 637
W+ P A PQ Q + + +
Sbjct: 49 WDPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNT------PTGSPAANPPATFA 102
Query: 638 SPTHMR-VHTHQIPAHRYPVENTTTNKMPLAYTEPQPPNLQASLNDYQRYQNERHHFTDF 696
P P YPV+ A + P Q +L Q+
Sbjct: 103 LPAGPAGPTIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLP 162
Query: 697 GNFQTPPLGNSVQHNYQTNDSFGTNDLSTRIHQQPTAV 734
Q+ + Q Q N + A+
Sbjct: 163 SQQQSAQKNDESQLQQQPNGETPPQQTDGAGDDESEAL 200
>gnl|CDD|218108 pfam04487, CITED, CITED. CITED, CBP/p300-interacting
transactivator with ED-rich tail, are characterized by a
conserved 32-amino acid sequence at the C-terminus.
CITED proteins do not bind DNA directly and are thought
to function as transcriptional co-activators.
Length = 206
Score = 30.6 bits (69), Expect = 5.7
Identities = 14/57 (24%), Positives = 19/57 (33%)
Query: 558 MTNYMQPGPPPQSTSDYKMLYWNNVPMPNMLHAPPPQAPHSAMYKYTSHIPYDMPTS 614
M+ M G P QS Y M ++ P P ML P + K +
Sbjct: 63 MSGPMGGGHPHQSMPAYMMFNPSSKPQPFMLVPGPQLMASMQLQKLNTQYQGHAGAP 119
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit; Provisional.
Length = 482
Score = 31.0 bits (71), Expect = 5.9
Identities = 12/70 (17%), Positives = 29/70 (41%)
Query: 1046 SPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKK 1105
KK K +K EK+ ++ + +K + +++ K+ + + +K
Sbjct: 407 KATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
Query: 1106 DEGSKENKPQ 1115
+E ++ K Q
Sbjct: 467 EEEEEKKKKQ 476
>gnl|CDD|219481 pfam07596, SBP_bac_10, Protein of unknown function (DUF1559). A
large family of paralogous proteins apparently unique to
planctomycetes.
Length = 255
Score = 30.5 bits (69), Expect = 5.9
Identities = 24/138 (17%), Positives = 43/138 (31%), Gaps = 11/138 (7%)
Query: 320 SNEALYAPSSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSF--MAA 377
S+ + +ST S + A + G ++ + GG S+ G++ NS
Sbjct: 99 SDPSGDTGASTGPSTGSGGAGAGAGAAGGSGGGTDSGTNAGGPSSTNGMFGPNSKTRFRD 158
Query: 378 TTHGYSSISNTI---TNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQASSLSN 434
T G SNTI + + + ++ +T G S+ +
Sbjct: 159 ITDG---TSNTILVGERAGGLADSGSGGTGGDAADGGGGWGGGTTTTTSTNG---STPAT 212
Query: 435 SNTTGYSSQNLYQQQPSS 452
S G S
Sbjct: 213 SGPPGGGDNGAANNGFGS 230
>gnl|CDD|217023 pfam02414, Borrelia_orfA, Borrelia ORF-A. This protein is encoded by
an open reading frame in plasmid borne DNA repeats of
Borrelia species. This protein is known as ORF-A. The
function of this putative protein is unknown.
Length = 285
Score = 30.8 bits (70), Expect = 6.0
Identities = 23/108 (21%), Positives = 38/108 (35%), Gaps = 4/108 (3%)
Query: 1020 KDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEK 1079
K ++ K N KK+NK +++ N +K KEE+ +N EK +
Sbjct: 104 KIERFKKRVNAYFKKKSNKNGSVEKNECINNKNNVINKKIKEESKKKNSKNIEKLKIKKY 163
Query: 1080 ----NKLDDKAKDTATKLSNSADSIKLNKKDEGSKENKPQQQRRKFNN 1123
N + KD KL + +NK + + N
Sbjct: 164 FKKCNFKTNLKKDIKIKLLKIIKIEENFLIKNKKLKNKQKTLKNILYN 211
>gnl|CDD|234548 TIGR04331, o_ant_LIC12162, putative transferase, LIC12162 family.
This protein family shows C-terminal sequence similarity
to various surface carbohydrate biosynthesis enzymes:
spore coat polysaccharide biosynthesis protein SpsB,
UDP-N-acetyl-D-glucosamine 2-epimerase, lipid A
disaccharide synthetase LpxB, etc. It may occur in
O-antigen biosythesis regions.
Length = 585
Score = 31.2 bits (71), Expect = 6.0
Identities = 18/96 (18%), Positives = 33/96 (34%), Gaps = 15/96 (15%)
Query: 1473 CSDVITVICRRMEDIDRLPHGIENVDIIVSNWMGHVLYLDSLINAVVYARDRF--LKPHG 1530
++ + +R+ I H + I++ W+ H + + +Y DR+ LK
Sbjct: 61 YERLLPSLAKRLNAIHGTQHSLRYWRILLGPWLIHFIPI-------LY--DRYLSLKLAL 111
Query: 1531 LILPDRA----ELYCVAANDTMAATKYSFWHDVYGF 1562
L +PD E + D Y F
Sbjct: 112 LKVPDVKALVLEEKDFRPRGDFDDFNHLLSDDDYNF 147
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 31.1 bits (70), Expect = 6.0
Identities = 23/112 (20%), Positives = 45/112 (40%), Gaps = 1/112 (0%)
Query: 1007 KKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTN 1066
++ DKK D + ++ + N + + +Q Q KP E ++ N
Sbjct: 220 EELDKKQIDADKAQQKADFAQDNADKQRDE-VRQKQQEAKNLPKPADTSSPKEDKQVAEN 278
Query: 1067 AKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEGSKENKPQQQR 1118
K EK+ + K ++ K K + K ++K+ KE + Q++R
Sbjct: 279 QKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKR 330
Score = 30.7 bits (69), Expect = 8.4
Identities = 22/111 (19%), Positives = 45/111 (40%), Gaps = 9/111 (8%)
Query: 984 IMQRPVDENKDEIAKLPIPLG-GDKKDDKKVDD---KNVSKDQKITKSTNNNLNKKTNKQ 1039
+ V + + E LP P K+DK+V + + + K Q K + K +
Sbjct: 245 KQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKD-- 302
Query: 1040 QNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTA 1090
K+ +K +K +++ K+ + + D +K K +A+ T+
Sbjct: 303 ---HKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTS 350
>gnl|CDD|185490 PTZ00163, PTZ00163, hypothetical protein; Provisional.
Length = 230
Score = 30.5 bits (68), Expect = 6.1
Identities = 26/90 (28%), Positives = 38/90 (42%), Gaps = 1/90 (1%)
Query: 1007 KKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTN 1066
KKD + +K++ ++ N K K I S K P K E K E+E N
Sbjct: 27 KKDLQDKFNKSIITINIASRILTENNKKWYKKYIYTSIFSGNKNPQKRERKNEEENQKDN 86
Query: 1067 AK-ENDEKSTDDEKNKLDDKAKDTATKLSN 1095
K +ND ++ +N +DD D L N
Sbjct: 87 TKVDNDNNMENEMENHIDDSIDDPMDDLMN 116
>gnl|CDD|129570 TIGR00478, tly, hemolysin TlyA family protein. Hemolysins are
exotoxins that attack blood cell membranes and cause cell
rupture, often by forming a pore in the membrane. At
least two members of this protein family have been
characterized indirectly as pore-forming hemolysins, one
from the spirochete Serpula (Treponema) hyodysenteriae
and one from Mycobacterium tuberculosis. However,
homology domains in this protein suggest
methyltransferase activity (pfam01728) and RNA-binding
activity (pfam01479) [Unknown function, General].
Length = 228
Score = 30.5 bits (69), Expect = 6.1
Identities = 14/33 (42%), Positives = 18/33 (54%)
Query: 1426 KDKIVLEVGCGMGLLSLFCAEAGAKHVISVDCS 1458
K+KIVL+VG G + + GAK V VD
Sbjct: 75 KNKIVLDVGSSTGGFTDCALQKGAKEVYGVDVG 107
>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2). All proteins in
this family for which functions are known are flap
endonucleases that generate the 3' incision next to DNA
damage as part of nucleotide excision repair. This family
is related to many other flap endonuclease families
including the fen1 family. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1034
Score = 31.0 bits (70), Expect = 6.3
Identities = 35/166 (21%), Positives = 64/166 (38%), Gaps = 21/166 (12%)
Query: 967 NNKSYGDKNKNEFKVTKIMQRPVDENKDE----IAKLPIPLGGDKKDDKKVDDKNVSKD- 1021
K F T I V E+ D P+ +K++ + D + D
Sbjct: 607 TTNPSNAKEVRNFAETGIQTTNVGESADLLLISNPMEVEPMESEKEESES-DGSFIEVDS 665
Query: 1022 -------QKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKS 1074
Q +KS + +++ + + I +K ++ E EE+N +EK
Sbjct: 666 VSSTLELQVPSKSQPTDESEENAENKVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKD 725
Query: 1075 TDDEKNKLDDKAKDTATKLSNSADSIKLNKKDE-GSKENKPQQQRR 1119
DD KN+ D + +L ++++ N E S + + QQQ+R
Sbjct: 726 ADDFKNEWQDI---SLEEL----EALEANLLAEQNSLKAQKQQQKR 764
>gnl|CDD|240430 PTZ00473, PTZ00473, Plasmodium Vir superfamily; Provisional.
Length = 420
Score = 31.0 bits (70), Expect = 6.4
Identities = 23/97 (23%), Positives = 34/97 (35%), Gaps = 5/97 (5%)
Query: 328 SSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMAATTHGYSSISN 387
S S TY + TT G++ + S Y + S ++T G S S
Sbjct: 329 RSGRTGSSESIRGFTYDSSTTYGGSSYGTSQTDSTST----YGSRSTFDSSTGG-GSQSG 383
Query: 388 TITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNT 424
+ S +S SS S F Q +V +T
Sbjct: 384 GGSTYGGSSTFDGSSRGSSDSFGVSYFGPQQTVGFST 420
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This family
consists of several bovine specific leukaemia virus
receptors which are thought to function as transmembrane
proteins, although their exact function is unknown.
Length = 561
Score = 31.2 bits (70), Expect = 6.4
Identities = 16/95 (16%), Positives = 43/95 (45%), Gaps = 4/95 (4%)
Query: 1235 STDERKKKATQKQQQSEEESKKTSNKKIKFKGPEILNETTEPTASSATNSNVTTVSNTAD 1294
+K K +K+++ +E K + FK + + + +A+S ++ +++NT
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVS 255
Query: 1295 NSNNPSSILKKSKATEDCKVKTVQGAPKTEKSQTK 1329
+ S + +D + + + +PK +K + +
Sbjct: 256 GTAPDS----EPDEPKDAEAEETKKSPKHKKKKQR 286
>gnl|CDD|223109 COG0030, KsgA, Dimethyladenosine transferase (rRNA methylation)
[Translation, ribosomal structure and biogenesis].
Length = 259
Score = 30.7 bits (70), Expect = 6.4
Identities = 18/84 (21%), Positives = 35/84 (41%), Gaps = 9/84 (10%)
Query: 1422 SHLLKDKIVLEVGCGMGLLSLFCAEAGAK-HVISVDCSVITQLTQEVVEENDCSDVITVI 1480
+++ VLE+G G+G L+ E A+ I +D + L + ++ +TVI
Sbjct: 26 ANISPGDNVLEIGPGLGALTEPLLERAARVTAIEIDRRLAEVLKERFAPYDN----LTVI 81
Query: 1481 CRRMEDIDRLPHG-IENVDIIVSN 1503
D + + +V+N
Sbjct: 82 ---NGDALKFDFPSLAQPYKVVAN 102
>gnl|CDD|219680 pfam07980, SusD, SusD family. This family includes several
hypothetical proteins. It also contains RagB, a protein
involved in signalling and SusD, an outer membrane
protein involved in nutrient binding.
Length = 257
Score = 30.6 bits (69), Expect = 6.5
Identities = 23/114 (20%), Positives = 30/114 (26%), Gaps = 21/114 (18%)
Query: 1135 YHNNGGDFNYVPVYDGNINKFNKFGGGGNNRYPNHRYYD--------FDFIDPR------ 1180
N G F Y G+ GGG P D ++ DPR
Sbjct: 14 TGNGGNSFGYGSGPAGSG-----GGGGWGGVTPTQELVDLFLMKENPYENRDPRLYATIL 68
Query: 1181 --SCFFPGSGTPYPQYGGGGPPMFNNLYNSNNYNNNKSKQKKTDSTNNVDKKQD 1232
G+G GGG + N Y K + N + D
Sbjct: 69 YDGDTNKGNGADTGGIYGGGGGESSGGGNPTGYYLRKYVDADDANANGGNSGID 122
>gnl|CDD|181572 PRK08871, flgK, flagellar hook-associated protein FlgK; Validated.
Length = 626
Score = 31.0 bits (70), Expect = 6.6
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 3/39 (7%)
Query: 422 LNTYGHQASSLSNSNTTGYSSQNLYQQQPSSVSYSSNSY 460
LNT GH ++SN NT GYS Q++ Q + ++Y
Sbjct: 20 LNTTGH---NISNVNTEGYSRQSVIQGTNDPRQFGGSTY 55
>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
Length = 621
Score = 30.9 bits (70), Expect = 6.8
Identities = 23/76 (30%), Positives = 35/76 (46%), Gaps = 4/76 (5%)
Query: 1034 KKTNKQQNIQILSPAKKPNKEEDKI--EKEENNTNAKENDEKSTDDEKNKLDDKAKDTAT 1091
K T K+Q+I I + P E +++ E E+N KE EK D KN+ +
Sbjct: 486 KGTGKEQSITIQGASTLPKDEVERMVKEAEKNAAEDKEKREKI--DLKNQAESLCYQAEK 543
Query: 1092 KLSNSADSIKLNKKDE 1107
+L D I KK++
Sbjct: 544 QLKELKDKISEEKKEK 559
>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
translational repressor [Translation, ribosomal
structure and biogenesis].
Length = 777
Score = 30.9 bits (70), Expect = 7.0
Identities = 38/235 (16%), Positives = 81/235 (34%), Gaps = 34/235 (14%)
Query: 363 SNNGGVYSTNSFMAATTHGYSSISNTITNTQPSQQQSNNSVYSS---TINSYSSFPQQSS 419
+ +T+ + +S S P ++++S + IN+ + SS
Sbjct: 21 KKSPPSSTTSQELMNGNSTPNSFS-------PIPSKASSSATFTLNLPINNSVNHKITSS 73
Query: 420 VNLNTYGHQASSLSNSNTTGYSSQNLYQQQPSSVSYSSNS-----YVYPQT-SIATSSNV 473
+ + S++ S++T S L + SS + S++S T +S+V
Sbjct: 74 SSSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNSALSSTQQGNANSSV 133
Query: 474 SSTSHTSHKYPPYNVDMKNHLDSKSTLETKSSSEHPPYVDYPNY--MGTQYQDSLMKDVK 531
+ +S T+ N + + ++ T S + T S +K
Sbjct: 134 TLSSSTASSMFNSNKLPLPNPNHSNSATTNQSGSSFINTPASSSSQPLTNLVVSSIKRFP 193
Query: 532 --TAADPIGHFE-------------FRNTLDIAP-MIPMDSYMTNYMQPGPPPQS 570
T+ P ++ + + P + P + + T+ + P P QS
Sbjct: 194 YLTSLSPFFNYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDLSPLPDTQS 248
Score = 30.5 bits (69), Expect = 9.9
Identities = 43/182 (23%), Positives = 66/182 (36%), Gaps = 27/182 (14%)
Query: 309 PEHSTRDGKLFSNEALYAPSSTTNYPYSSSSTPTYAAMTTASGNANN-------VASNGG 361
ST + E PSS+ N SS + A +T GNAN+ AS+
Sbjct: 88 ISSSTSGSQSLLMEL---PSSSFNPSTSSRNKSNSALSSTQQGNANSSVTLSSSTASSMF 144
Query: 362 GSNNGGVYSTNSFMAATTHGYSSISNTITNTQPSQQQSN-NSVYSSTI---------NSY 411
SN + + N +ATT+ S S+ I S Q N V SS +
Sbjct: 145 NSNKLPLPNPNHSNSATTN--QSGSSFINTPASSSSQPLTNLVVSSIKRFPYLTSLSPFF 202
Query: 412 SSFPQQSSVNLNTYGHQASSL----SNSNTTGYSSQNLYQQQPSSVSYSSNSYVYPQTSI 467
+ SS + + S + S +S+ +L P + S +N + +SI
Sbjct: 203 NYLIDPSSDSATASADTSPSFNPPPNLSPNNLFSTSDL-SPLPDTQSVENNIILNSSSSI 261
Query: 468 AT 469
Sbjct: 262 NE 263
>gnl|CDD|225174 COG2265, TrmA, SAM-dependent methyltransferases related to tRNA
(uracil-5-)-methyltransferase [Translation, ribosomal
structure and biogenesis].
Length = 432
Score = 30.7 bits (70), Expect = 7.1
Identities = 11/33 (33%), Positives = 17/33 (51%), Gaps = 1/33 (3%)
Query: 1424 LLKDKIVLEVGCGMGLLSLFCAEAGAKHVISVD 1456
L + VL++ CG+G L A+ K V V+
Sbjct: 291 LAGGERVLDLYCGVGTFGLPLAK-RVKKVHGVE 322
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 30.8 bits (69), Expect = 7.1
Identities = 23/94 (24%), Positives = 40/94 (42%), Gaps = 9/94 (9%)
Query: 1021 DQKITKSTNNNLNKKTNKQQNIQ-ILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDDEK 1079
DQ K+ + N + T+K+ ++ S + EE E E ++++ +DE S
Sbjct: 134 DQP-EKNESGNNDHATDKENLLESDASSSNDSESEESDSESEIESSDSDHDDENSDSKLD 192
Query: 1080 N-------KLDDKAKDTATKLSNSADSIKLNKKD 1106
N D+A + S+ DSI+ K D
Sbjct: 193 NLRNYIVSLKKDEADAESVLSSDDNDSIEEIKYD 226
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 30.1 bits (68), Expect = 7.1
Identities = 19/76 (25%), Positives = 39/76 (51%), Gaps = 5/76 (6%)
Query: 1006 DKKDDKKVDDK--NVSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEEN 1063
D+++ KK DD+ ++ K + N K +K + +++ N+EED+ E +
Sbjct: 47 DEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDE---ESS 103
Query: 1064 NTNAKENDEKSTDDEK 1079
+ N KE +EK+ + +
Sbjct: 104 DENEKETEEKTESNVE 119
>gnl|CDD|201997 pfam01829, Peptidase_A6, Peptidase A6 family.
Length = 358
Score = 30.6 bits (69), Expect = 7.4
Identities = 17/56 (30%), Positives = 31/56 (55%), Gaps = 3/56 (5%)
Query: 322 EALYAPSSTTNYPYSSSSTPTYAAM--TTASGNANNVASNGGGSNNGGVYSTNSFM 375
A SSTT +++ + P + ++ TTA+ A+ V++ S N G+Y T++ M
Sbjct: 79 PAGTFASSTTV-TFNAVNFPGFTSLFGTTATSRADQVSAFRYASMNVGIYPTSNLM 133
>gnl|CDD|222862 PHA02539, 18, tail sheath protein; Provisional.
Length = 648
Score = 30.8 bits (70), Expect = 8.0
Identities = 16/99 (16%), Positives = 35/99 (35%), Gaps = 13/99 (13%)
Query: 323 ALYAPSSTTNYP------YSSSSTPTYAAMTTASGNANNVASNGGGSNNGGVYSTNSFMA 376
A++ PS+ Y + + +++ASG + GG + G+ TN
Sbjct: 142 AVFIPSAKIIAKAKSVGVYPTLGSNWTTEISSASGGVAATLTLGGIVTDSGITLTN---- 197
Query: 377 ATTHGYSSISNTITNTQPSQQQSNNSV--YSSTINSYSS 413
++I+ ++ + V Y + S +
Sbjct: 198 -AETAEAAITAVAFQANIAKYGVPSVVALYPGEVGSTTE 235
>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion
protein; Provisional.
Length = 230
Score = 30.0 bits (67), Expect = 8.2
Identities = 29/141 (20%), Positives = 51/141 (36%), Gaps = 12/141 (8%)
Query: 989 VDENKDEIAKLPIPLGGDKKDDKKVDDKNVSKDQKITKSTNNNLNKKTNKQQNIQILSPA 1048
VD N+ ++ L ++ K +S+ Q T K K ++L
Sbjct: 88 VDINRKVLSDLAHLQPEVFVQYVELAKKFLSQFQSTTI-------TKPKKVLINEVLQEK 140
Query: 1049 KKPNKEEDKIEKEENNTNAKENDEKSTDDEKNKLDDKAKDTATKLSNSADSIKLNKKDEG 1108
EE + E + + ++EKS + + + K + S + I L+K
Sbjct: 141 TINQNEEKTSLQNEKVLSPELSEEKSDSELETQPQ---KTQLKEKKPSIEHIDLSKML-- 195
Query: 1109 SKENKPQQQRRKFNNFHNSSK 1129
KE K + K NF+ K
Sbjct: 196 LKELKKLAKEHKIPNFNKLKK 216
>gnl|CDD|225953 COG3419, PilY1, Tfp pilus assembly protein, tip-associated adhesin
PilY1 [Cell motility and secretion / Intracellular
trafficking and secretion].
Length = 1036
Score = 30.6 bits (69), Expect = 8.6
Identities = 42/213 (19%), Positives = 64/213 (30%), Gaps = 38/213 (17%)
Query: 295 GSQIFNAASTLNETPEHSTRDGKLFSNEALYAPSSTTNYPYSSSSTP----TYAAMTTAS 350
G T +S +G + + A A S+S P TY T +
Sbjct: 214 GLGATGIGDTNTINDSYSQGNGDAWESPARNAAVYND----WSNSVPTRIGTYTISLTDA 269
Query: 351 GNANNVASNGGGSNNGGVYSTNSFMAATTHGYSSISNTITNTQPSQQQSNNSVYSSTINS 410
N + VA +NN ST + + + + +
Sbjct: 270 ANPDTVALRASDANNIQFGSTYVAQLGDSAHTYNARPDTFGDV--------QLTTYWLAV 321
Query: 411 YSSFPQQSSVNLNTY-------GHQASSLSNSNTTGYS-----SQNLYQQQPSSVS---- 454
S S V LN Y G + N NT S + Y P++ S
Sbjct: 322 VSKPNGTSPVGLNQYYLTTFYGGRCVAPRPNKNTKQPSVASQSTNAEYGADPTASSGAAN 381
Query: 455 -YSSNSYVYPQTSI-----ATSSNVSSTSHTSH 481
+S+ S SI A S SS++ T+
Sbjct: 382 FFSAPSAESMVASIKRIFSAISGYASSSAGTAL 414
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 30.7 bits (70), Expect = 8.6
Identities = 23/141 (16%), Positives = 48/141 (34%), Gaps = 22/141 (15%)
Query: 962 NFKNFNNKSYGDKNKNEFKVTKIMQRPVDENKDEIAKL----PIPLGGDKKDDKKVDDKN 1017
N + N S K ++ ++ ++ N+D+I L + + ++ V+
Sbjct: 736 NLRAEMNLSPSAPLKVVLVGSEELEDRLEANEDDIKGLANLEELEILSPDPEEPPVEVTA 795
Query: 1018 VSKDQKITKSTNNNLNKKTNKQQNIQILSPAKKPNKEEDKIEKEENNTNAKENDEKSTDD 1077
V ++ ++ + KE +K+EKE + K ++E
Sbjct: 796 VVGGAELFLPLAGLIDLAAEL----------ARLEKELEKLEKEIDRIEKKLSNEGFVAK 845
Query: 1078 --------EKNKLDDKAKDTA 1090
EK KL + A
Sbjct: 846 APEEVVEKEKEKLAEYQVKLA 866
>gnl|CDD|185753 cd09214, GH64-like, glycosyl hydrolase 64 family. This family is
represented by the laminaripentaose-producing,
beta-1,3-glucanase (LPHase) of Streptomyces matensis and
related bacterial and ascomycete proteins. LPHase is a
member of glycoside hydrolase family 64 (GH64), it is an
inverting enzyme involved in the cleavage of long-chain
polysaccharide beta-1,3-glucans, into specific
pentasaccharide oligomers. LPHase is a two-domain
crescent fold structure: one domain is composed of 10
beta-strands, eight coming from the N-terminus of the
protein and two from the C-terminal region, and the
protein has a second inserted domain; this cd includes
both domains. This protein has an electronegative,
substrate-binding cleft, and conserved Glu and Asp
residues involved in the cleavage of the
beta-1,3-glucan, laminarin, a plant and fungal cell wall
component. Among bacteria, many beta-1,3-glucanases are
implicated in fungal cell wall degradation. Also
included in this family is GluB , the beta-1,3-glucanase
B from Lysobacter enzymogenes Strain N4-7. Recombinant
GluB demonstrated higher relative activity toward the
branched-chain beta-1,3 glucan substrate zymosan A than
toward linear beta-1,3 glucan substrates. Sometimes
these two domains are found associated with other
domains such as in the Catenulispora acidiphila DSM
44928 carbohydrate binding family 6 protein in which
they are positioned N-terminal of a carbohydrate binding
module, family 6 (CBM_6) domain. In the
Cellulosimicrobium cellulans, glucan
endo-1,3-beta-glucosidase, they are positioned
N-terminal of a RICIN, carbohydrate-binding domain, and
in the Salinispora tropica CNB-440, coagulation factor
5/8 C-terminal domain (FA58C) protein, they are
positioned C-terminal of two FA58C domains which are
proposed to function as cell surface-attached,
carbohydrate-binding domain. This FA58C-containing
protein has an internal peptide deletion (of approx. 44
residues) in the LPHase domain II.
Length = 319
Score = 30.4 bits (69), Expect = 9.1
Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 9/55 (16%)
Query: 394 PSQQQSNNSVY--SSTINSYSSFPQQSSVNLNTYG-------HQASSLSNSNTTG 439
P+ Q S+ S Y ++ N Y+ F S+N YG Q+S+LS ++ T
Sbjct: 259 PAAQWSDASNYYQAAPANYYAQFWHAHSINGLAYGFPYDDVNGQSSTLSTTDPTH 313
>gnl|CDD|226973 COG4625, COG4625, Uncharacterized protein with a C-terminal OMP
(outer membrane protein) domain [Function unknown].
Length = 577
Score = 30.5 bits (69), Expect = 9.1
Identities = 37/171 (21%), Positives = 60/171 (35%), Gaps = 14/171 (8%)
Query: 261 DNGDVDSRTTTASILDSSLYQNTYDSKTRNFVDSGSQIFNAASTLNETPEHSTRDGKLFS 320
+G+ + T S+L S Y T V SG+ + A S ++T G + +
Sbjct: 101 YSGNSIVKGGTLSVLGSLRYVVT--------VTSGATLGGAGSLGGLIAGNTT-GGAVVN 151
Query: 321 NEALYAPSSTTNYPYSSSSTPTYAAMTTASGNANNVASNGGGSN-NGGVYSTNSFMAATT 379
E + S +P A GN + V NG + G
Sbjct: 152 GEVRLT--YGSTVAVESLQSPVVNPAAIALGNVDTVTQNGAENVEQVGQAYDILQADGGG 209
Query: 380 HGYSSISNTITNTQPSQQQSNNSVYSSTINSYSSFPQQSSVNLNTYGHQAS 430
G++ ++N T+T P+ + +V SS S F S+ Q S
Sbjct: 210 TGFTKVANAATST-PNLLAAGAAVQSSGSGS-VLFELASTAAAQAAFQQLS 258
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 30.6 bits (69), Expect = 9.5
Identities = 13/43 (30%), Positives = 23/43 (53%)
Query: 1223 STNNVDKKQDSTSTDERKKKATQKQQQSEEESKKTSNKKIKFK 1265
+ + +KK + ERKKK +K ++ E + K + K+ K K
Sbjct: 4 TESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAK 46
>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
biogenesis, outer membrane].
Length = 284
Score = 30.0 bits (68), Expect = 9.8
Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 6/69 (8%)
Query: 1239 RKKKATQKQQQSEEESKKTSNKKIKFKGPEILNETTEPTASSATNSNVTTVSNTADNSNN 1298
+K+ A +Q E ES + NK++K E+L+ + + V + S D +
Sbjct: 79 KKELAELEQLLEEVESLEEENKRLK----ELLDFKESSSDYDPIAARVISRS--PDPWSQ 132
Query: 1299 PSSILKKSK 1307
I K S
Sbjct: 133 TIVIDKGSN 141
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.309 0.126 0.363
Gapped
Lambda K H
0.267 0.0829 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 79,035,503
Number of extensions: 7461121
Number of successful extensions: 7488
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7038
Number of HSP's successfully gapped: 378
Length of query: 1662
Length of database: 10,937,602
Length adjustment: 110
Effective length of query: 1552
Effective length of database: 6,058,662
Effective search space: 9403043424
Effective search space used: 9403043424
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 66 (29.0 bits)