PSIBLAST alignment of GI: 254780142 and protein with PDB id: 3lu0
gi|307776498|pdb|3LU0|D Chain D, Molecular Model Of Escherichia Coli Core Rna Polymerase Length = 1407
Score = 1727 bits (4473), Expect = 0.0, Method: Composition-based stats.
Identities = 743/1372 (54%), Positives = 982/1372 (71%), Gaps = 14/1372 (1%)
Query: 3 QEVMRFFNPWTDDRGFDSVRISIASPAKIASLSYGEIKKPETINYRTFKPERDGLFCARI 62
+++++F T FD+++I++ASP I S S+GE+KKPETINYRTFKPERDGLFCARI
Sbjct: 2 KDLLKFLKAQTKTEEFDAIKIALASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCARI 61
Query: 63 FGPIKDYECICGKYKRMKYKGIICEKCGVEVTLSSVRRDRMAHIDLASPVAHPWFLKSLP 122
FGP+KDYEC+CGKYKR+K++G+ICEKCGVEVT + VRR+RM HI+LASP AH WFLKSLP
Sbjct: 62 FGPVKDYECLCGKYKRLKHRGVICEKCGVEVTQTKVRRERMGHIELASPTAHIWFLKSLP 121
Query: 123 SRISTLLGMSLRDVERVLYFESYVVVDPGLSSLEKYQILTEEEYVEAVSQFGQDQFIAMM 182
SRI LL M LRD+ERVLYFESYVV++ G+++LE+ QILTEE+Y++A+ +FG D+F A M
Sbjct: 122 SRIGLLLDMPLRDIERVLYFESYVVIEGGMTNLERQQILTEEQYLDALEEFG-DEFDAKM 180
Query: 183 GADAIYELLIALDLQNLALTLRDQLSKSSSIFRRKKIIKRLKIVDSFITSGNNPGWMIIR 242
GA+AI LL ++DL+ LR++L++++S +RKK+ KR+K++++F+ SGN P WMI+
Sbjct: 181 GAEAIQALLKSMDLEQECEQLREELNETNSETKRKKLTKRIKLLEAFVQSGNKPEWMILT 240
Query: 243 KLPVLPPDLRPLVALDFGRFAASDLNDLYRRVIGRNNRLMRLKSLHAPEIIIRNEKRMLQ 302
LPVLPPDLRPLV LD GRFA SDLNDLYRRVI RNNRL RL L AP+II+RNEKRMLQ
Sbjct: 241 VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRMLQ 300
Query: 303 ESVDALFDNGRHKRVVTGANRRPLKSLSDMLKGKQGRFRTNLLGKRVDYSGRSVIVAGPE 362
E+VDAL DNGR R +TG+N+RPLKSL+DM+KGKQGRFR NLLGKRVDYSGRSVI GP
Sbjct: 301 EAVDALLDNGRRGRAITGSNKRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPY 360
Query: 363 LQLHQCGLPKLMALELFKPFLYAQLEKKGYVSTVKQAKKFVEKERPEVWDVLAEVVHQHV 422
L+LHQCGLPK MALELFKPF+Y +LE +G +T+K AKK VE+E VWD+L EV+ +H
Sbjct: 361 LRLHQCGLPKKMALELFKPFIYGKLELRGLATTIKAAKKMVEREEAVVWDILDEVIREHP 420
Query: 423 VLLNRAPSLHRLSMQAFEPKIISGKAIQLHPLVCAGYNADFDGDQMAVYAVISPEAQLEA 482
VLLNRAP+LHRL +QAFEP +I GKAIQLHPLVCA YNADFDGDQMAV+ ++ EAQLEA
Sbjct: 421 VLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLEAQLEA 480
Query: 483 RVLMLSTNNLLHPASGAPVTVPSQDMVLGLCYLSTVHEGDPGEGMLFADMGEVYHALENK 542
R LM+STNN+L PA+G P+ VPSQD+VLGL Y++ GEGM+ E +
Sbjct: 481 RALMMSTNNILSPANGEPIIVPSQDVVLGLYYMTRDCVNAKGEGMVLTGPKEAERLYRSG 540
Query: 543 IVTLHSKIRGRYKSVDKDGNS----ISKIYDTTPGRMIIGEILPRHHEISFDICNQEMIK 598
+ +LH++++ R +KD N + + DTT GR I+ I+P+ + + I NQ + K
Sbjct: 541 LASLHARVKVRITEYEKDANGELVAKTSLKDTTVGRAILWMIVPKG--LPYSIVNQALGK 598
Query: 599 KNISAMVDTIYRHCGQKSTVAFCDDLMRLGFRYACSSGISFGKDDIIVPESKEKIIAEAD 658
K IS M++T YR G K TV F D +M GF YA SG S G DD+++PE K +II+EA+
Sbjct: 599 KAISKMLNTCYRILGLKPTVIFADQIMYTGFAYAARSGASVGIDDMVIPEKKHEIISEAE 658
Query: 659 KMVKEYENQYNDGLITRGEKYNKVVDLWGKTTDKVTEEMMARIKRV---EFDPETGRQKK 715
V E + Q+ GL+T GE+YNKV+D+W D+V++ MM ++ D + +Q
Sbjct: 659 AEVAEIQEQFQSGLVTAGERYNKVIDIWAAANDRVSKAMMDNLQTETVINRDGQEEKQVS 718
Query: 716 MNSIFMMSHSGARGSIHQMRQLGGMRGLIAKPSGEIIESPIRSHFKGGLCGFEFFQSCVG 775
NSI+MM+ SGARGS Q+RQL GMRGL+AKP G IIE+PI ++F+ GL ++F S G
Sbjct: 719 FNSIYMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLNVLQYFISTHG 778
Query: 776 GRKGLLDVVMRTASSGYLSRRLVDVAQNCVVNQVDCNTKKGLTITHIVDSGQVVYSLGSR 835
RKGL D ++TA+SGYL+RRLVDVAQ+ VV + DC T +G+ +T +++ G V L R
Sbjct: 779 ARKGLADTALKTANSGYLTRRLVDVAQDLVVTEDDCGTHEGIMMTPVIEGGDVKEPLRDR 838
Query: 836 VLGRTALDDIINPLTNECIVKAGQLILESHVNEIEKCGIRSVRIRSALTCESSRGVCVLC 895
VLGR +D++ P T + +V L+ E + +E+ + +V++RS ++C++ GVC C
Sbjct: 839 VLGRVTAEDVLKPGTADILVPRNTLLHEQWCDLLEENSVDAVKVRSVVSCDTDFGVCAHC 898
Query: 896 YGRDLARGSLVNVGEAVGVIAAQSIGEPGTQLTMRTFHLGG-AVTVMDRSFIESPCDGIV 954
YGRDLARG ++N GEA+GVIAAQSIGEPGTQLTMRTFH+GG A S I+ G +
Sbjct: 899 YGRDLARGHIINKGEAIGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAESSIQVKNKGSI 958
Query: 955 KIKNRNVCRNSTNDLISMGRNTTLQILDMSGQEQYSHRIMYGAKLFVDDGGVIECGQRIS 1014
K+ N NS+ L+ RNT L+++D G+ + S+++ YGA L DG + G+ ++
Sbjct: 959 KLSNVKSVVNSSGKLVITSRNTELKLIDEFGRTKESYKVPYGAVLAKGDGEQVAGGETVA 1018
Query: 1015 EWDPHTFPIITEVSGTVGFEDLVDGISVIESIGESTGIAKRKVIDW-RFASRSQNLKPAI 1073
WDPHT P+ITEVSG V F D++DG ++ E TG++ V+D + ++L+PA+
Sbjct: 1019 NWDPHTMPVITEVSGFVRFTDMIDGQTITRQTDELTGLSSLVVLDSAERTAGGKDLRPAL 1078
Query: 1074 VVTDENGV-VLKSARGTDARWFLPVDALLSVSPGQKVSTGDVLARLPISSAKTKDITSGL 1132
+ D G VL A++FLP A++ + G ++S+GD LAR+P S TKDIT GL
Sbjct: 1079 KIVDAQGNDVLIPGTDMPAQYFLPGKAIVQLEDGVQISSGDTLARIPQESGGTKDITGGL 1138
Query: 1133 PRVAELFEARRPKNHAILAEISGTIRIKRNYKNKSRVVIEPFEDGVEPAEYFIPKNKHFY 1192
PRVA+LFEARRPK AILAEISG + + K K R+VI P DG +P E IPK +
Sbjct: 1139 PRVADLFEARRPKEPAILAEISGIVSFGKETKGKRRLVITPV-DGSDPYEEMIPKWRQLN 1197
Query: 1193 LQDGDHVEKGDYILDGNPVPQDILRIKGVEALASYLINEVQEVYRLEGVAINHKHIEVVV 1252
+ +G+ VE+GD I DG P DILR++GV A+ Y++NEVQ+VYRL+GV IN KHIEV+V
Sbjct: 1198 VFEGERVERGDVISDGPEAPHDILRLRGVHAVTRYIVNEVQDVYRLQGVKINDKHIEVIV 1257
Query: 1253 RHMLQKVEITDPADTEYILGDNVDRIEVEELNRSLAQQGKKLVSFSPILQGITKASLQTK 1312
R ML+K I + ++++ G+ V+ V+ NR L GK ++S L GITKASL T+
Sbjct: 1258 RQMLRKATIVNAGSSDFLEGEQVEYSRVKIANRELEANGKVGATYSRDLLGITKASLATE 1317
Query: 1313 SFISAASFQETTKVLTEAAIAGKVDTLDGFKENVIVGRSIPAGTGAILHEKR 1364
SFISAASFQETT+VLTEAA+AGK D L G KENVIVGR IPAGTG H+ R
Sbjct: 1318 SFISAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAGTGYAYHQDR 1369