RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6409
(1832 letters)
>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication,
recombination, and repair / Transcription / Translation,
ribosomal structure and biogenesis].
Length = 513
Score = 381 bits (980), Expect = e-117
Identities = 163/410 (39%), Positives = 237/410 (57%), Gaps = 37/410 (9%)
Query: 716 ITIKGGKVPDPVRNWKEASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETG 775
+ L E+L+ ++ +G+ EPTPIQ AIP+ L RD++G A+TG
Sbjct: 17 NVALSRGEEKTPPEFASLGLSPELLQALKDLGFEEPTPIQLAAIPLILAGRDVLGQAQTG 76
Query: 776 SGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPL-GI 834
+GKT AFLLPLL KI + + A+I+APTRELA QI EE K G L G+
Sbjct: 77 TGKTAAFLLPLLQ------KILKSVERKYVS-ALILAPTRELAVQIAEELRKLGKNLGGL 129
Query: 835 RTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDM 894
R +V GG+S +Q L+ G +IV+ATPGRL+D+++ L L+ +VLDEADRM+DM
Sbjct: 130 RVAVVYGGVSIRKQIEALKRGVDIVVATPGRLLDLIKRGKLDLSGVETLVLDEADRMLDM 189
Query: 895 GFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLAR 954
GF D++KIL+ +P RQT++F+ATMP + LAR
Sbjct: 190 GFIDDIEKILKALP-------------------------PDRQTLLFSATMPDDIRELAR 224
Query: 955 SYLRRPATVYIGS--VGKPTERIEQIVY-ILSEQDKRKKLMEVLNRGVKKPVIIFVNQKK 1011
YL P + + + + ++I+Q + SE++K + L+++L + VI+FV K+
Sbjct: 225 RYLNDPVEIEVSVEKLERTLKKIKQFYLEVESEEEKLELLLKLLKDEDEGRVIVFVRTKR 284
Query: 1012 GADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSM 1071
+ LA+ L K G+ LHG QE+R+ AL K G +LVATDVA RG+DI DVS
Sbjct: 285 LVEELAESLRKRGFKVAALHGDLPQEERDRALEKFKDGELRVLVATDVAARGLDIPDVSH 344
Query: 1072 VINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKD-DSHLFYDLKQMMI 1120
VINYD+ EDY HRIGRTGRAG++G+A+SF T++ + +++ +
Sbjct: 345 VINYDLPLDPEDYVHRIGRTGRAGRKGVAISFVTEEEEVKKLKRIEKRLE 394
Score = 381 bits (980), Expect = e-117
Identities = 163/410 (39%), Positives = 237/410 (57%), Gaps = 37/410 (9%)
Query: 1391 ITIKGGKVPDPVRNWKEASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETG 1450
+ L E+L+ ++ +G+ EPTPIQ AIP+ L RD++G A+TG
Sbjct: 17 NVALSRGEEKTPPEFASLGLSPELLQALKDLGFEEPTPIQLAAIPLILAGRDVLGQAQTG 76
Query: 1451 SGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPL-GI 1509
+GKT AFLLPLL KI + + A+I+APTRELA QI EE K G L G+
Sbjct: 77 TGKTAAFLLPLLQ------KILKSVERKYVS-ALILAPTRELAVQIAEELRKLGKNLGGL 129
Query: 1510 RTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDM 1569
R +V GG+S +Q L+ G +IV+ATPGRL+D+++ L L+ +VLDEADRM+DM
Sbjct: 130 RVAVVYGGVSIRKQIEALKRGVDIVVATPGRLLDLIKRGKLDLSGVETLVLDEADRMLDM 189
Query: 1570 GFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLAR 1629
GF D++KIL+ +P RQT++F+ATMP + LAR
Sbjct: 190 GFIDDIEKILKALP-------------------------PDRQTLLFSATMPDDIRELAR 224
Query: 1630 SYLRRPATVYIGS--VGKPTERIEQIVY-ILSEQDKRKKLMEVLNRGVKKPVIIFVNQKK 1686
YL P + + + + ++I+Q + SE++K + L+++L + VI+FV K+
Sbjct: 225 RYLNDPVEIEVSVEKLERTLKKIKQFYLEVESEEEKLELLLKLLKDEDEGRVIVFVRTKR 284
Query: 1687 GADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSM 1746
+ LA+ L K G+ LHG QE+R+ AL K G +LVATDVA RG+DI DVS
Sbjct: 285 LVEELAESLRKRGFKVAALHGDLPQEERDRALEKFKDGELRVLVATDVAARGLDIPDVSH 344
Query: 1747 VINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKD-DSHLFYDLKQMMI 1795
VINYD+ EDY HRIGRTGRAG++G+A+SF T++ + +++ +
Sbjct: 345 VINYDLPLDPEDYVHRIGRTGRAGRKGVAISFVTEEEEVKKLKRIEKRLE 394
Score = 272 bits (696), Expect = 2e-79
Identities = 145/548 (26%), Positives = 217/548 (39%), Gaps = 98/548 (17%)
Query: 2 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDA 61
++ +G+ EPTPIQ AIP+ L RD++G A+TG+GKT AFLLPLL KI + +
Sbjct: 44 LKDLGFEEPTPIQLAAIPLILAGRDVLGQAQTGTGKTAAFLLPLLQ------KILKSVER 97
Query: 62 DQGPYAIIMAPTRELAQQIEEETNKFGTPL-GIRTVLVVGGLSREEQGFRLRLGCEIVIA 120
A+I+APTRELA QI EE K G L G+R +V GG+S +Q L+ G +IV+A
Sbjct: 98 KYVS-ALILAPTRELAVQIAEELRKLGKNLGGLRVAVVYGGVSIRKQIEALKRGVDIVVA 156
Query: 121 TPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAED 180
TPGRL+D+++ L L+ +VLDEADRM+DMGF D++KIL+ +P
Sbjct: 157 TPGRLLDLIKRGKLDLSGVETLVLDEADRMLDMGFIDDIEKILKALP------------- 203
Query: 181 ENKLLANYNSKKKYRQTVMFTATMPPAKILEYMPVTNLKPDTEDAEDENKLLANYNSKKK 240
RQT++F+A
Sbjct: 204 ------------PDRQTLLFSA-------------------------------------- 213
Query: 241 YRQTVMFTATMPPAVERLARSYLRRPATVYIGS--VGKPTERIEQIVY-ILSEQDKRKKL 297
TMP + LAR YL P + + + + ++I+Q + SE++K + L
Sbjct: 214 ---------TMPDDIRELARRYLNDPVEIEVSVEKLERTLKKIKQFYLEVESEEEKLELL 264
Query: 298 MEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSK 357
+++L + VI+FV K+ + LA+ L K G+ LHG QE+R+ AL K G
Sbjct: 265 LKLLKDEDEGRVIVFVRTKRLVEELAESLRKRGFKVAALHGDLPQEERDRALEKFKDGEL 324
Query: 358 DILMAGDRRSRSRSPPRKRR-----SRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERD 412
+L+A D +R P DY R R + E
Sbjct: 325 RVLVATDVAARGLDIPDVSHVINYDLPLDPEDYVHRIGRTGRAGRKGVAISFVTEEEEVK 384
Query: 413 RSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVIS 472
+ + E+ + + R S K+
Sbjct: 385 KLKRIEKRLERKLPSAVLLPLDEPEDAKLLKTTRPGLEEESDISDEIKK----------L 434
Query: 473 DEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKK 532
+ +L +LLA E + E E
Sbjct: 435 KSSKKALLRGLGVRFTLSKLLANLGKEIPGAGDAVTIDPELERRSPNSADDIEYILKGLS 494
Query: 533 MEEERKKR 540
E +
Sbjct: 495 YRAEERTA 502
>gnl|CDD|240273 PTZ00110, PTZ00110, helicase; Provisional.
Length = 545
Score = 382 bits (982), Expect = e-117
Identities = 196/437 (44%), Positives = 266/437 (60%), Gaps = 39/437 (8%)
Query: 1377 MTERDWRIFREDYSITIKGGK-VPDPVRNWKEASLPTEILEIIEKIGYAEPTPIQRQAIP 1435
++ ++ R++ ITI G+ VP PV +++ S P IL+ ++ G+ EPTPIQ Q P
Sbjct: 103 LSSKEVDEIRKEKEITIIAGENVPKPVVSFEYTSFPDYILKSLKNAGFTEPTPIQVQGWP 162
Query: 1436 IGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQ 1495
I L RD+IG+AETGSGKTLAFLLP +V I + P + R D GP +++APTRELA+Q
Sbjct: 163 IALSGRDMIGIAETGSGKTLAFLLPAIVHINAQPLL-RYGD---GPIVLVLAPTRELAEQ 218
Query: 1496 IEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQC 1555
I E+ NKFG IR + GG+ + Q + LR G EI+IA PGRLID LE+ L +
Sbjct: 219 IREQCNKFGASSKIRNTVAYGGVPKRGQIYALRRGVEILIACPGRLIDFLESNVTNLRRV 278
Query: 1556 TYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVM 1615
TY+VLDEADRM+DMGFEP ++KI V+ ++PD RQT+M
Sbjct: 279 TYLVLDEADRMLDMGFEPQIRKI-----VSQIRPD--------------------RQTLM 313
Query: 1616 FTATMPPAVERLARSYLRR-PATVYIGSVG-KPTERIEQIVYILSEQDKRKKLMEVLNRG 1673
++AT P V+ LAR + P V +GS+ I+Q V+++ E +KR KL +L R
Sbjct: 314 WSATWPKEVQSLARDLCKEEPVHVNVGSLDLTACHNIKQEVFVVEEHEKRGKLKMLLQRI 373
Query: 1674 VKK--PVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVA 1731
++ ++IFV KKGAD L K L G+ A +HG K QE+R LN K G I++A
Sbjct: 374 MRDGDKILIFVETKKGADFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433
Query: 1732 TDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKDDSHLFYDLK 1791
TDVA RG+D+KDV VIN+D IEDY HRIGRTGRAG +G + +F T D L DL
Sbjct: 434 TDVASRGLDVKDVKYVINFDFPNQIEDYVHRIGRTGRAGAKGASYTFLTPDKYRLARDLV 493
Query: 1792 QMMISS--PVSTCPPEL 1806
+++ + PV PPEL
Sbjct: 494 KVLREAKQPV---PPEL 507
Score = 379 bits (975), Expect = e-116
Identities = 190/423 (44%), Positives = 259/423 (61%), Gaps = 34/423 (8%)
Query: 702 MTERDWRIFREDYSITIKGGK-VPDPVRNWKEASLPTEILEIIEKIGYAEPTPIQRQAIP 760
++ ++ R++ ITI G+ VP PV +++ S P IL+ ++ G+ EPTPIQ Q P
Sbjct: 103 LSSKEVDEIRKEKEITIIAGENVPKPVVSFEYTSFPDYILKSLKNAGFTEPTPIQVQGWP 162
Query: 761 IGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQ 820
I L RD+IG+AETGSGKTLAFLLP +V I + P + R D GP +++APTRELA+Q
Sbjct: 163 IALSGRDMIGIAETGSGKTLAFLLPAIVHINAQPLL-RYGD---GPIVLVLAPTRELAEQ 218
Query: 821 IEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQC 880
I E+ NKFG IR + GG+ + Q + LR G EI+IA PGRLID LE+ L +
Sbjct: 219 IREQCNKFGASSKIRNTVAYGGVPKRGQIYALRRGVEILIACPGRLIDFLESNVTNLRRV 278
Query: 881 TYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVM 940
TY+VLDEADRM+DMGFEP ++KI V+ ++PD RQT+M
Sbjct: 279 TYLVLDEADRMLDMGFEPQIRKI-----VSQIRPD--------------------RQTLM 313
Query: 941 FTATMPPAVERLARSYLRR-PATVYIGSVG-KPTERIEQIVYILSEQDKRKKLMEVLNRG 998
++AT P V+ LAR + P V +GS+ I+Q V+++ E +KR KL +L R
Sbjct: 314 WSATWPKEVQSLARDLCKEEPVHVNVGSLDLTACHNIKQEVFVVEEHEKRGKLKMLLQRI 373
Query: 999 VKK--PVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVA 1056
++ ++IFV KKGAD L K L G+ A +HG K QE+R LN K G I++A
Sbjct: 374 MRDGDKILIFVETKKGADFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433
Query: 1057 TDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKDDSHLFYDLK 1116
TDVA RG+D+KDV VIN+D IEDY HRIGRTGRAG +G + +F T D L DL
Sbjct: 434 TDVASRGLDVKDVKYVINFDFPNQIEDYVHRIGRTGRAGAKGASYTFLTPDKYRLARDLV 493
Query: 1117 QMM 1119
+++
Sbjct: 494 KVL 496
Score = 257 bits (657), Expect = 8e-74
Identities = 145/371 (39%), Positives = 194/371 (52%), Gaps = 80/371 (21%)
Query: 2 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDA 61
++ G+ EPTPIQ Q PI L RD+IG+AETGSGKTLAFLLP +V I + P + R D
Sbjct: 145 LKNAGFTEPTPIQVQGWPIALSGRDMIGIAETGSGKTLAFLLPAIVHINAQPLL-RYGD- 202
Query: 62 DQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIAT 121
GP +++APTRELA+QI E+ NKFG IR + GG+ + Q + LR G EI+IA
Sbjct: 203 --GPIVLVLAPTRELAEQIREQCNKFGASSKIRNTVAYGGVPKRGQIYALRRGVEILIAC 260
Query: 122 PGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDE 181
PGRLID LE+ L + TY+VLDEADRM+DMGFEP ++KI V+ ++PD
Sbjct: 261 PGRLIDFLESNVTNLRRVTYLVLDEADRMLDMGFEPQIRKI-----VSQIRPD------- 308
Query: 182 NKLLANYNSKKKYRQTVMFTATMPPAKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKY 241
RQT+M
Sbjct: 309 -------------RQTLM------------------------------------------ 313
Query: 242 RQTVMFTATMPPAVERLARSYLRR-PATVYIGSVG-KPTERIEQIVYILSEQDKRKKLME 299
++AT P V+ LAR + P V +GS+ I+Q V+++ E +KR KL
Sbjct: 314 -----WSATWPKEVQSLARDLCKEEPVHVNVGSLDLTACHNIKQEVFVVEEHEKRGKLKM 368
Query: 300 VLNRGVKK--PVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSK 357
+L R ++ ++IFV KKGAD L K L G+ A +HG K QE+R LN K G
Sbjct: 369 LLQRIMRDGDKILIFVETKKGADFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKS 428
Query: 358 DILMAGDRRSR 368
I++A D SR
Sbjct: 429 PIMIATDVASR 439
>gnl|CDD|236977 PRK11776, PRK11776, ATP-dependent RNA helicase DbpA; Provisional.
Length = 460
Score = 297 bits (764), Expect = 6e-89
Identities = 139/414 (33%), Positives = 206/414 (49%), Gaps = 46/414 (11%)
Query: 1409 SLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSL 1468
LP +L + ++GY E TPIQ Q++P L +D+I A+TGSGKT AF L LL Q L
Sbjct: 10 PLPPALLANLNELGYTEMTPIQAQSLPAILAGKDVIAQAKTGSGKTAAFGLGLL---QKL 66
Query: 1469 PKIARMEDADQ-GPYAIIMAPTRELAQQIEEETNKFGTPL-GIRTVLVVGGLSREEQGFR 1526
D + A+++ PTRELA Q+ +E + + I+ + + GG+ Q
Sbjct: 67 -------DVKRFRVQALVLCPTRELADQVAKEIRRLARFIPNIKVLTLCGGVPMGPQIDS 119
Query: 1527 LRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTN 1586
L G I++ TPGR++D L L L+ +VLDEADRM+DMGF+ + I+ P
Sbjct: 120 LEHGAHIIVGTPGRILDHLRKGTLDLDALNTLVLDEADRMLDMGFQDAIDAIIRQAP--- 176
Query: 1587 LKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSV-GK 1645
RQT++F+AT P + +++ + R P V + S
Sbjct: 177 ----------------------ARRQTLLFSATYPEGIAAISQRFQRDPVEVKVESTHDL 214
Query: 1646 PTERIEQIVYILSEQDKRKKLMEVLNRGVKKPV--IIFVNQKKGADVLAKGLEKLGYNAC 1703
P IEQ Y +S ++ L +L +P ++F N KK +A L G++A
Sbjct: 215 PA--IEQRFYEVSPDERLPALQRLLLH--HQPESCVVFCNTKKECQEVADALNAQGFSAL 270
Query: 1704 TLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRI 1763
LHG Q R+ L S +LVATDVA RG+DIK + VINY++A+ E + HRI
Sbjct: 271 ALHGDLEQRDRDQVLVRFANRSCSVLVATDVAARGLDIKALEAVINYELARDPEVHVHRI 330
Query: 1764 GRTGRAGKEGLAVSFCTKDDSHLFYDLKQMMISSPVSTCP-PELLNHPDAQHKP 1816
GRTGRAG +GLA+S ++ ++ + ++ P P L P
Sbjct: 331 GRTGRAGSKGLALSLVAPEEMQRANAIEDYL-GRKLNWEPLPSLSPLSGVPLLP 383
Score = 296 bits (761), Expect = 1e-88
Identities = 135/391 (34%), Positives = 200/391 (51%), Gaps = 44/391 (11%)
Query: 734 SLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSL 793
LP +L + ++GY E TPIQ Q++P L +D+I A+TGSGKT AF L LL Q L
Sbjct: 10 PLPPALLANLNELGYTEMTPIQAQSLPAILAGKDVIAQAKTGSGKTAAFGLGLL---QKL 66
Query: 794 PKIARMEDADQ-GPYAIIMAPTRELAQQIEEETNKFGTPL-GIRTVLVVGGLSREEQGFR 851
D + A+++ PTRELA Q+ +E + + I+ + + GG+ Q
Sbjct: 67 -------DVKRFRVQALVLCPTRELADQVAKEIRRLARFIPNIKVLTLCGGVPMGPQIDS 119
Query: 852 LRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTN 911
L G I++ TPGR++D L L L+ +VLDEADRM+DMGF+ + I+ P
Sbjct: 120 LEHGAHIIVGTPGRILDHLRKGTLDLDALNTLVLDEADRMLDMGFQDAIDAIIRQAP--- 176
Query: 912 LKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSV-GK 970
RQT++F+AT P + +++ + R P V + S
Sbjct: 177 ----------------------ARRQTLLFSATYPEGIAAISQRFQRDPVEVKVESTHDL 214
Query: 971 PTERIEQIVYILSEQDKRKKLMEVLNRGVKKPV--IIFVNQKKGADVLAKGLEKLGYNAC 1028
P IEQ Y +S ++ L +L +P ++F N KK +A L G++A
Sbjct: 215 PA--IEQRFYEVSPDERLPALQRLLLH--HQPESCVVFCNTKKECQEVADALNAQGFSAL 270
Query: 1029 TLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRI 1088
LHG Q R+ L S +LVATDVA RG+DIK + VINY++A+ E + HRI
Sbjct: 271 ALHGDLEQRDRDQVLVRFANRSCSVLVATDVAARGLDIKALEAVINYELARDPEVHVHRI 330
Query: 1089 GRTGRAGKEGLAVSFCTKDDSHLFYDLKQMM 1119
GRTGRAG +GLA+S ++ ++ +
Sbjct: 331 GRTGRAGSKGLALSLVAPEEMQRANAIEDYL 361
Score = 171 bits (435), Expect = 3e-45
Identities = 103/368 (27%), Positives = 155/368 (42%), Gaps = 91/368 (24%)
Query: 2 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDA 61
+ ++GY E TPIQ Q++P L +D+I A+TGSGKT AF L LL Q L D
Sbjct: 19 LNELGYTEMTPIQAQSLPAILAGKDVIAQAKTGSGKTAAFGLGLL---QKL-------DV 68
Query: 62 DQ-GPYAIIMAPTRELAQQIEEETNKFGTPL-GIRTVLVVGGLSREEQGFRLRLGCEIVI 119
+ A+++ PTRELA Q+ +E + + I+ + + GG+ Q L G I++
Sbjct: 69 KRFRVQALVLCPTRELADQVAKEIRRLARFIPNIKVLTLCGGVPMGPQIDSLEHGAHIIV 128
Query: 120 ATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAE 179
TPGR++D L L L+ +VLDEADRM+DMGF+ + I+ P
Sbjct: 129 GTPGRILDHLRKGTLDLDALNTLVLDEADRMLDMGFQDAIDAIIRQAP------------ 176
Query: 180 DENKLLANYNSKKKYRQTVMFTATMPPAKILEYMPVTNLKPDTEDAEDENKLLANYNSKK 239
RQT++F
Sbjct: 177 -------------ARRQTLLF--------------------------------------- 184
Query: 240 KYRQTVMFTATMPPAVERLARSYLRRPATVYIGSV-GKPTERIEQIVYILSEQDKRKKLM 298
+AT P + +++ + R P V + S P IEQ Y +S ++ L
Sbjct: 185 --------SATYPEGIAAISQRFQRDPVEVKVESTHDLPA--IEQRFYEVSPDERLPALQ 234
Query: 299 EVLNRGVKKPV--IIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGS 356
+L +P ++F N KK +A L G++A LHG Q R+ L S
Sbjct: 235 RLLLH--HQPESCVVFCNTKKECQEVADALNAQGFSALALHGDLEQRDRDQVLVRFANRS 292
Query: 357 KDILMAGD 364
+L+A D
Sbjct: 293 CSVLVATD 300
>gnl|CDD|215103 PLN00206, PLN00206, DEAD-box ATP-dependent RNA helicase; Provisional.
Length = 518
Score = 291 bits (747), Expect = 5e-86
Identities = 149/454 (32%), Positives = 244/454 (53%), Gaps = 38/454 (8%)
Query: 1377 MTERDWRIFREDYSITIKGGKVPDPVRNWKEASLPTEILEIIEKIGYAEPTPIQRQAIPI 1436
++ + R I +KG VP P+ ++ LP ++L +E GY PTPIQ QAIP
Sbjct: 95 LSSSQAELLRRKLEIHVKGEAVPPPILSFSSCGLPPKLLLNLETAGYEFPTPIQMQAIPA 154
Query: 1437 GLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARME-DADQGPYAIIMAPTRELAQQ 1495
L R ++ A+TGSGKT +FL+P+ I I + P A+++ PTREL Q
Sbjct: 155 ALSGRSLLVSADTGSGKTASFLVPI---ISRCCTIRSGHPSEQRNPLAMVLTPTRELCVQ 211
Query: 1496 IEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQC 1555
+E++ G L +T LVVGG + +Q +R++ G E+++ TPGRLID+L + L+
Sbjct: 212 VEDQAKVLGKGLPFKTALVVGGDAMPQQLYRIQQGVELIVGTPGRLIDLLSKHDIELDNV 271
Query: 1556 TYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVM 1615
+ +VLDE D M++ GF V +I + + +P Q ++
Sbjct: 272 SVLVLDEVDCMLERGFRDQVMQIFQALS----QP----------------------QVLL 305
Query: 1616 FTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNRGV- 1674
F+AT+ P VE+ A S + + IG+ +P + ++Q+ + + K++KL ++L
Sbjct: 306 FSATVSPEVEKFASSLAKDIILISIGNPNRPNKAVKQLAIWVETKQKKQKLFDILKSKQH 365
Query: 1675 -KKPVIIFVNQKKGADVLAKGLEKL-GYNACTLHGGKGQEQRELALNSLKGGSKDILVAT 1732
K P ++FV+ + GAD+LA + + G A ++HG K ++R + S G ++VAT
Sbjct: 366 FKPPAVVFVSSRLGADLLANAITVVTGLKALSIHGEKSMKERREVMKSFLVGEVPVIVAT 425
Query: 1733 DVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKDDSHLFYDLKQ 1792
V GRG+D+ V VI +DM +I++Y H+IGR R G++G A+ F ++D +LF +L
Sbjct: 426 GVLGRGVDLLRVRQVIIFDMPNTIKEYIHQIGRASRMGEKGTAIVFVNEEDRNLFPELVA 485
Query: 1793 MMISSPVSTCPPELLNHPDAQHKPGTVMVPKKRR 1826
++ SS + P EL N G+ KK+R
Sbjct: 486 LLKSSGAAI-PRELANSRYL----GSGRKRKKKR 514
Score = 287 bits (736), Expect = 1e-84
Identities = 141/425 (33%), Positives = 233/425 (54%), Gaps = 33/425 (7%)
Query: 702 MTERDWRIFREDYSITIKGGKVPDPVRNWKEASLPTEILEIIEKIGYAEPTPIQRQAIPI 761
++ + R I +KG VP P+ ++ LP ++L +E GY PTPIQ QAIP
Sbjct: 95 LSSSQAELLRRKLEIHVKGEAVPPPILSFSSCGLPPKLLLNLETAGYEFPTPIQMQAIPA 154
Query: 762 GLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARME-DADQGPYAIIMAPTRELAQQ 820
L R ++ A+TGSGKT +FL+P+ I I + P A+++ PTREL Q
Sbjct: 155 ALSGRSLLVSADTGSGKTASFLVPI---ISRCCTIRSGHPSEQRNPLAMVLTPTRELCVQ 211
Query: 821 IEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQC 880
+E++ G L +T LVVGG + +Q +R++ G E+++ TPGRLID+L + L+
Sbjct: 212 VEDQAKVLGKGLPFKTALVVGGDAMPQQLYRIQQGVELIVGTPGRLIDLLSKHDIELDNV 271
Query: 881 TYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVM 940
+ +VLDE D M++ GF V +I + + +P Q ++
Sbjct: 272 SVLVLDEVDCMLERGFRDQVMQIFQALS----QP----------------------QVLL 305
Query: 941 FTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNRGV- 999
F+AT+ P VE+ A S + + IG+ +P + ++Q+ + + K++KL ++L
Sbjct: 306 FSATVSPEVEKFASSLAKDIILISIGNPNRPNKAVKQLAIWVETKQKKQKLFDILKSKQH 365
Query: 1000 -KKPVIIFVNQKKGADVLAKGLEKL-GYNACTLHGGKGQEQRELALNSLKGGSKDILVAT 1057
K P ++FV+ + GAD+LA + + G A ++HG K ++R + S G ++VAT
Sbjct: 366 FKPPAVVFVSSRLGADLLANAITVVTGLKALSIHGEKSMKERREVMKSFLVGEVPVIVAT 425
Query: 1058 DVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKDDSHLFYDLKQ 1117
V GRG+D+ V VI +DM +I++Y H+IGR R G++G A+ F ++D +LF +L
Sbjct: 426 GVLGRGVDLLRVRQVIIFDMPNTIKEYIHQIGRASRMGEKGTAIVFVNEEDRNLFPELVA 485
Query: 1118 MMISS 1122
++ SS
Sbjct: 486 LLKSS 490
Score = 181 bits (460), Expect = 4e-48
Identities = 102/365 (27%), Positives = 170/365 (46%), Gaps = 80/365 (21%)
Query: 2 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARME-D 60
+E GY PTPIQ QAIP L R ++ A+TGSGKT +FL+P+ I I
Sbjct: 136 LETAGYEFPTPIQMQAIPAALSGRSLLVSADTGSGKTASFLVPI---ISRCCTIRSGHPS 192
Query: 61 ADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIA 120
+ P A+++ PTREL Q+E++ G L +T LVVGG + +Q +R++ G E+++
Sbjct: 193 EQRNPLAMVLTPTRELCVQVEDQAKVLGKGLPFKTALVVGGDAMPQQLYRIQQGVELIVG 252
Query: 121 TPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAED 180
TPGRLID+L + L+ + +VLDE D M++ GF V +I + + +P
Sbjct: 253 TPGRLIDLLSKHDIELDNVSVLVLDEVDCMLERGFRDQVMQIFQALS----QP------- 301
Query: 181 ENKLLANYNSKKKYRQTVMFTATMPPAKILEYMPVTNLKPDTEDAEDENKLLANYNSKKK 240
Q ++F+AT+
Sbjct: 302 ---------------QVLLFSATVS----------------------------------- 311
Query: 241 YRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEV 300
P VE+ A S + + IG+ +P + ++Q+ + + K++KL ++
Sbjct: 312 ------------PEVEKFASSLAKDIILISIGNPNRPNKAVKQLAIWVETKQKKQKLFDI 359
Query: 301 LNRGV--KKPVIIFVNQKKGADVLAKGLEKL-GYNACTLHGGKGQEQRELALNSLKGGSK 357
L K P ++FV+ + GAD+LA + + G A ++HG K ++R + S G
Sbjct: 360 LKSKQHFKPPAVVFVSSRLGADLLANAITVVTGLKALSIHGEKSMKERREVMKSFLVGEV 419
Query: 358 DILMA 362
+++A
Sbjct: 420 PVIVA 424
>gnl|CDD|236722 PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional.
Length = 456
Score = 273 bits (699), Expect = 2e-80
Identities = 143/385 (37%), Positives = 218/385 (56%), Gaps = 28/385 (7%)
Query: 735 LPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLP 794
L +IL + + GY EPTPIQ+QAIP L+ RD++ A+TG+GKT F LPLL Q L
Sbjct: 8 LSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTGKTAGFTLPLL---QHLI 64
Query: 795 KIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRL 854
+ A+I+ PTRELA QI E + L IR+++V GG+S Q +LR
Sbjct: 65 TRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSKYLNIRSLVVFGGVSINPQMMKLRG 124
Query: 855 GCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKP 914
G ++++ATPGRL+D+ + L+Q +VLDEADRM+DMGF D++++L +P
Sbjct: 125 GVDVLVATPGRLLDLEHQNAVKLDQVEILVLDEADRMLDMGFIHDIRRVLAKLPAK---- 180
Query: 915 DTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTER 974
RQ ++F+AT ++ LA L P + + +E+
Sbjct: 181 ---------------------RQNLLFSATFSDDIKALAEKLLHNPLEIEVARRNTASEQ 219
Query: 975 IEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGK 1034
+ Q V+ + ++ KR+ L +++ +G + V++F K GA+ LA+ L K G + +HG K
Sbjct: 220 VTQHVHFVDKKRKRELLSQMIGKGNWQQVLVFTRTKHGANHLAEQLNKDGIRSAAIHGNK 279
Query: 1035 GQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRA 1094
Q R AL K G +LVATD+A RG+DI+++ V+NY++ EDY HRIGRTGRA
Sbjct: 280 SQGARTRALADFKSGDIRVLVATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRA 339
Query: 1095 GKEGLAVSFCTKDDSHLFYDLKQMM 1119
G A+S D+ L D+++++
Sbjct: 340 AATGEALSLVCVDEHKLLRDIEKLL 364
Score = 273 bits (699), Expect = 2e-80
Identities = 143/385 (37%), Positives = 218/385 (56%), Gaps = 28/385 (7%)
Query: 1410 LPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLP 1469
L +IL + + GY EPTPIQ+QAIP L+ RD++ A+TG+GKT F LPLL Q L
Sbjct: 8 LSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTGKTAGFTLPLL---QHLI 64
Query: 1470 KIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRL 1529
+ A+I+ PTRELA QI E + L IR+++V GG+S Q +LR
Sbjct: 65 TRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSKYLNIRSLVVFGGVSINPQMMKLRG 124
Query: 1530 GCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKP 1589
G ++++ATPGRL+D+ + L+Q +VLDEADRM+DMGF D++++L +P
Sbjct: 125 GVDVLVATPGRLLDLEHQNAVKLDQVEILVLDEADRMLDMGFIHDIRRVLAKLPAK---- 180
Query: 1590 DTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTER 1649
RQ ++F+AT ++ LA L P + + +E+
Sbjct: 181 ---------------------RQNLLFSATFSDDIKALAEKLLHNPLEIEVARRNTASEQ 219
Query: 1650 IEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGK 1709
+ Q V+ + ++ KR+ L +++ +G + V++F K GA+ LA+ L K G + +HG K
Sbjct: 220 VTQHVHFVDKKRKRELLSQMIGKGNWQQVLVFTRTKHGANHLAEQLNKDGIRSAAIHGNK 279
Query: 1710 GQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRA 1769
Q R AL K G +LVATD+A RG+DI+++ V+NY++ EDY HRIGRTGRA
Sbjct: 280 SQGARTRALADFKSGDIRVLVATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRA 339
Query: 1770 GKEGLAVSFCTKDDSHLFYDLKQMM 1794
G A+S D+ L D+++++
Sbjct: 340 AATGEALSLVCVDEHKLLRDIEKLL 364
Score = 188 bits (478), Expect = 3e-51
Identities = 115/367 (31%), Positives = 175/367 (47%), Gaps = 75/367 (20%)
Query: 2 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDA 61
+ + GY EPTPIQ+QAIP L+ RD++ A+TG+GKT F LPLL Q L
Sbjct: 16 VAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTGKTAGFTLPLL---QHLITRQPHAKG 72
Query: 62 DQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIAT 121
+ A+I+ PTRELA QI E + L IR+++V GG+S Q +LR G ++++AT
Sbjct: 73 RRPVRALILTPTRELAAQIGENVRDYSKYLNIRSLVVFGGVSINPQMMKLRGGVDVLVAT 132
Query: 122 PGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDE 181
PGRL+D+ + L+Q +VLDEADRM+DMGF D++++L +P
Sbjct: 133 PGRLLDLEHQNAVKLDQVEILVLDEADRMLDMGFIHDIRRVLAKLPAK------------ 180
Query: 182 NKLLANYNSKKKYRQTVMFTATMPPAKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKY 241
RQ ++F+AT D+ K LA
Sbjct: 181 -------------RQNLLFSATF---------------------SDDIKALAE------- 199
Query: 242 RQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEVL 301
L P + + +E++ Q V+ + ++ KR+ L +++
Sbjct: 200 -------------------KLLHNPLEIEVARRNTASEQVTQHVHFVDKKRKRELLSQMI 240
Query: 302 NRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILM 361
+G + V++F K GA+ LA+ L K G + +HG K Q R AL K G +L+
Sbjct: 241 GKGNWQQVLVFTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSGDIRVLV 300
Query: 362 AGDRRSR 368
A D +R
Sbjct: 301 ATDIAAR 307
>gnl|CDD|238167 cd00268, DEADc, DEAD-box helicases. A diverse family of proteins
involved in ATP-dependent RNA unwinding, needed in a
variety of cellular processes including splicing,
ribosome biogenesis and RNA degradation. The name
derives from the sequence of the Walker B motif (motif
II). This domain contains the ATP- binding region.
Length = 203
Score = 258 bits (661), Expect = 1e-78
Identities = 102/234 (43%), Positives = 141/234 (60%), Gaps = 32/234 (13%)
Query: 730 WKEASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVW 789
++E L E+L I +G+ +PTPIQ +AIP L RD+IG A+TGSGKT AFL+P+L
Sbjct: 1 FEELGLSPELLRGIYALGFEKPTPIQARAIPPLLSGRDVIGQAQTGSGKTAAFLIPILEK 60
Query: 790 IQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQG 849
+ PK GP A+I+APTRELA QI E K G ++ V++ GG S ++Q
Sbjct: 61 LDPSPK-------KDGPQALILAPTRELALQIAEVARKLGKHTNLKVVVIYGGTSIDKQI 113
Query: 850 FRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPV 909
+L+ G IV+ATPGRL+D+LE L L++ Y+VLDEADRM+DMGFE +++IL+ +P
Sbjct: 114 RKLKRGPHIVVATPGRLLDLLERGKLDLSKVKYLVLDEADRMLDMGFEDQIREILKLLP- 172
Query: 910 TNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATV 963
K RQT++F+ATMP V LAR +LR P +
Sbjct: 173 ------------------------KDRQTLLFSATMPKEVRDLARKFLRNPVRI 202
Score = 258 bits (661), Expect = 1e-78
Identities = 102/234 (43%), Positives = 141/234 (60%), Gaps = 32/234 (13%)
Query: 1405 WKEASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVW 1464
++E L E+L I +G+ +PTPIQ +AIP L RD+IG A+TGSGKT AFL+P+L
Sbjct: 1 FEELGLSPELLRGIYALGFEKPTPIQARAIPPLLSGRDVIGQAQTGSGKTAAFLIPILEK 60
Query: 1465 IQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQG 1524
+ PK GP A+I+APTRELA QI E K G ++ V++ GG S ++Q
Sbjct: 61 LDPSPK-------KDGPQALILAPTRELALQIAEVARKLGKHTNLKVVVIYGGTSIDKQI 113
Query: 1525 FRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPV 1584
+L+ G IV+ATPGRL+D+LE L L++ Y+VLDEADRM+DMGFE +++IL+ +P
Sbjct: 114 RKLKRGPHIVVATPGRLLDLLERGKLDLSKVKYLVLDEADRMLDMGFEDQIREILKLLP- 172
Query: 1585 TNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATV 1638
K RQT++F+ATMP V LAR +LR P +
Sbjct: 173 ------------------------KDRQTLLFSATMPKEVRDLARKFLRNPVRI 202
Score = 236 bits (605), Expect = 3e-71
Identities = 91/212 (42%), Positives = 128/212 (60%), Gaps = 33/212 (15%)
Query: 2 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDA 61
I +G+ +PTPIQ +AIP L RD+IG A+TGSGKT AFL+P+L + PK
Sbjct: 14 IYALGFEKPTPIQARAIPPLLSGRDVIGQAQTGSGKTAAFLIPILEKLDPSPK------- 66
Query: 62 DQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIAT 121
GP A+I+APTRELA QI E K G ++ V++ GG S ++Q +L+ G IV+AT
Sbjct: 67 KDGPQALILAPTRELALQIAEVARKLGKHTNLKVVVIYGGTSIDKQIRKLKRGPHIVVAT 126
Query: 122 PGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDE 181
PGRL+D+LE L L++ Y+VLDEADRM+DMGFE +++IL+ +P
Sbjct: 127 PGRLLDLLERGKLDLSKVKYLVLDEADRMLDMGFEDQIREILKLLP-------------- 172
Query: 182 NKLLANYNSKKKYRQTVMFTATMPPAKILEYM 213
K RQT++F+ATMP ++ +
Sbjct: 173 -----------KDRQTLLFSATMPK-EVRDLA 192
Score = 43.6 bits (104), Expect = 3e-04
Identities = 16/30 (53%), Positives = 21/30 (70%)
Query: 240 KYRQTVMFTATMPPAVERLARSYLRRPATV 269
K RQT++F+ATMP V LAR +LR P +
Sbjct: 173 KDRQTLLFSATMPKEVRDLARKFLRNPVRI 202
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 247 bits (632), Expect = 2e-71
Identities = 144/431 (33%), Positives = 221/431 (51%), Gaps = 42/431 (9%)
Query: 1397 KVPDPVRNWK----------------EASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQN 1440
+ P P WK + +L E++ I +G+ TPIQ Q + L
Sbjct: 65 RKPKPASLWKLEDFVVEPQEGKTRFHDFNLAPELMHAIHDLGFPYCTPIQAQVLGYTLAG 124
Query: 1441 RDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEET 1500
D IG A+TG+GKT AFL+ ++ + P + P A+I+APTREL QI ++
Sbjct: 125 HDAIGRAQTGTGKTAAFLISIINQLLQTPPPKERYMGE--PRALIIAPTRELVVQIAKDA 182
Query: 1501 NKFGTPLGIRTVLVVGGLSREEQGFRLRLG-CEIVIATPGRLIDVLENRYLVLNQCTYIV 1559
G+ + VGG+ ++Q +L C+I++ATPGRL+D + + L+ +V
Sbjct: 183 AALTKYTGLNVMTFVGGMDFDKQLKQLEARFCDILVATPGRLLDFNQRGEVHLDMVEVMV 242
Query: 1560 LDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTAT 1619
LDEADRM+DMGF P V++I+ P +K+ RQT++F+AT
Sbjct: 243 LDEADRMLDMGFIPQVRQIIRQTP-----------------------RKEERQTLLFSAT 279
Query: 1620 MPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNRGVKKPVI 1679
V LA+ + PA V I ++ +EQ VY ++ DK K L ++ + + V+
Sbjct: 280 FTDDVMNLAKQWTTDPAIVEIEPENVASDTVEQHVYAVAGSDKYKLLYNLVTQNPWERVM 339
Query: 1680 IFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGI 1739
+F N+K + + L K G NA L G Q +R L + G +LVATDVAGRGI
Sbjct: 340 VFANRKDEVRRIEERLVKDGINAAQLSGDVPQHKRIKTLEGFREGKIRVLVATDVAGRGI 399
Query: 1740 DIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKDDSHLFYDLKQMMISSPV 1799
I +S VIN+ + + +DY HRIGRTGRAG G+++SF +DD+ ++++++
Sbjct: 400 HIDGISHVINFTLPEDPDDYVHRIGRTGRAGASGVSISFAGEDDAFQLPEIEELLGRKIS 459
Query: 1800 STCPPELLNHP 1810
PP L P
Sbjct: 460 CEMPPAELLKP 470
Score = 246 bits (629), Expect = 7e-71
Identities = 140/406 (34%), Positives = 211/406 (51%), Gaps = 42/406 (10%)
Query: 722 KVPDPVRNWK----------------EASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQN 765
+ P P WK + +L E++ I +G+ TPIQ Q + L
Sbjct: 65 RKPKPASLWKLEDFVVEPQEGKTRFHDFNLAPELMHAIHDLGFPYCTPIQAQVLGYTLAG 124
Query: 766 RDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEET 825
D IG A+TG+GKT AFL+ ++ + P + P A+I+APTREL QI ++
Sbjct: 125 HDAIGRAQTGTGKTAAFLISIINQLLQTPPPKERYMGE--PRALIIAPTRELVVQIAKDA 182
Query: 826 NKFGTPLGIRTVLVVGGLSREEQGFRLRLG-CEIVIATPGRLIDVLENRYLVLNQCTYIV 884
G+ + VGG+ ++Q +L C+I++ATPGRL+D + + L+ +V
Sbjct: 183 AALTKYTGLNVMTFVGGMDFDKQLKQLEARFCDILVATPGRLLDFNQRGEVHLDMVEVMV 242
Query: 885 LDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTAT 944
LDEADRM+DMGF P V++I+ P +K+ RQT++F+AT
Sbjct: 243 LDEADRMLDMGFIPQVRQIIRQTP-----------------------RKEERQTLLFSAT 279
Query: 945 MPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNRGVKKPVI 1004
V LA+ + PA V I ++ +EQ VY ++ DK K L ++ + + V+
Sbjct: 280 FTDDVMNLAKQWTTDPAIVEIEPENVASDTVEQHVYAVAGSDKYKLLYNLVTQNPWERVM 339
Query: 1005 IFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGI 1064
+F N+K + + L K G NA L G Q +R L + G +LVATDVAGRGI
Sbjct: 340 VFANRKDEVRRIEERLVKDGINAAQLSGDVPQHKRIKTLEGFREGKIRVLVATDVAGRGI 399
Query: 1065 DIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKDDSH 1110
I +S VIN+ + + +DY HRIGRTGRAG G+++SF +DD+
Sbjct: 400 HIDGISHVINFTLPEDPDDYVHRIGRTGRAGASGVSISFAGEDDAF 445
Score = 155 bits (393), Expect = 7e-40
Identities = 107/368 (29%), Positives = 165/368 (44%), Gaps = 73/368 (19%)
Query: 2 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDA 61
I +G+ TPIQ Q + L D IG A+TG+GKT AFL+ ++ + P
Sbjct: 102 IHDLGFPYCTPIQAQVLGYTLAGHDAIGRAQTGTGKTAAFLISIINQLLQTPPPKERYMG 161
Query: 62 DQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLG-CEIVIA 120
+ P A+I+APTREL QI ++ G+ + VGG+ ++Q +L C+I++A
Sbjct: 162 E--PRALIIAPTRELVVQIAKDAAALTKYTGLNVMTFVGGMDFDKQLKQLEARFCDILVA 219
Query: 121 TPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAED 180
TPGRL+D + + L+ +VLDEADRM+DMGF P V++I+ P
Sbjct: 220 TPGRLLDFNQRGEVHLDMVEVMVLDEADRMLDMGFIPQVRQIIRQTP------------- 266
Query: 181 ENKLLANYNSKKKYRQTVMFTATMPPAKILEYMPVTNLKPDTEDAEDENKLLANYNSKKK 240
+K+ RQT++F+AT T+D
Sbjct: 267 ----------RKEERQTLLFSATF-----------------TDD---------------- 283
Query: 241 YRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEV 300
V LA+ + PA V I ++ +EQ VY ++ DK K L +
Sbjct: 284 --------------VMNLAKQWTTDPAIVEIEPENVASDTVEQHVYAVAGSDKYKLLYNL 329
Query: 301 LNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDIL 360
+ + + V++F N+K + + L K G NA L G Q +R L + G +L
Sbjct: 330 VTQNPWERVMVFANRKDEVRRIEERLVKDGINAAQLSGDVPQHKRIKTLEGFREGKIRVL 389
Query: 361 MAGDRRSR 368
+A D R
Sbjct: 390 VATDVAGR 397
>gnl|CDD|235307 PRK04537, PRK04537, ATP-dependent RNA helicase RhlB; Provisional.
Length = 572
Score = 248 bits (635), Expect = 1e-70
Identities = 167/531 (31%), Positives = 258/531 (48%), Gaps = 47/531 (8%)
Query: 743 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDA 802
+E G+ TPIQ +P+ L D+ G A+TG+GKTLAFL+ ++ + S P +A +
Sbjct: 24 LESAGFTRCTPIQALTLPVALPGGDVAGQAQTGTGKTLAFLVAVMNRLLSRPALADRKPE 83
Query: 803 DQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIAT 862
D P A+I+APTRELA QI ++ KFG LG+R LV GG+ ++Q L+ G +++IAT
Sbjct: 84 D--PRALILAPTRELAIQIHKDAVKFGADLGLRFALVYGGVDYDKQRELLQQGVDVIIAT 141
Query: 863 PGRLIDVLENRYLV-LNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAED 921
PGRLID ++ +V L+ C VLDEADRM D+GF D++ +L MP
Sbjct: 142 PGRLIDYVKQHKVVSLHACEICVLDEADRMFDLGFIKDIRFLLRRMP------------- 188
Query: 922 ENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYI 981
++ RQT++F+AT+ V LA ++ P + + + R+ Q +Y
Sbjct: 189 ----------ERGTRQTLLFSATLSHRVLELAYEHMNEPEKLVVETETITAARVRQRIYF 238
Query: 982 LSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQREL 1041
++++K+ L+ +L+R ++FVN K + +A+ LE+ GY L G Q++RE
Sbjct: 239 PADEEKQTLLLGLLSRSEGARTMVFVNTKAFVERVARTLERHGYRVGVLSGDVPQKKRES 298
Query: 1042 ALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAV 1101
LN + G +ILVATDVA RG+ I V V NYD+ EDY HRIGRT R G+EG A+
Sbjct: 299 LLNRFQKGQLEILVATDVAARGLHIDGVKYVYNYDLPFDAEDYVHRIGRTARLGEEGDAI 358
Query: 1102 SFCTKDDSHLFYD----LKQMMISSPVTGR-----AGKEGLAVSFCTKDDSHLFYDLKQM 1152
SF + + D ++Q + PVT + V DD
Sbjct: 359 SFACERYAMSLPDIEAYIEQKIPVEPVTAELLTPLPRPPRVPVEGEEADDEA-------- 410
Query: 1153 MISSPVSTCPPELLNHPDAQHKPGTVMMAGDRRSRSRSPARKRRSRSRDRDYDRRFKRKK 1212
V T E A+ + +G SRS + R D + + ++
Sbjct: 411 --GDSVGTIFREAREQRAAEEQRRGGGRSGPGG-GSRSGSVGGGGRRDGAGADGKPRPRR 467
Query: 1213 SPQCEA-QSSRFSACSLPRSHKSSSLLSRYSEQDPEEKELNKDKEREGEAI 1262
P+ E + + P +++ D E + + R G +
Sbjct: 468 KPRVEGEADAAAAGAETPVVAAAAAQAPGVVAADGERAPRKRRRRRNGRPV 518
Score = 247 bits (633), Expect = 2e-70
Identities = 142/362 (39%), Positives = 210/362 (58%), Gaps = 26/362 (7%)
Query: 1418 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDA 1477
+E G+ TPIQ +P+ L D+ G A+TG+GKTLAFL+ ++ + S P +A +
Sbjct: 24 LESAGFTRCTPIQALTLPVALPGGDVAGQAQTGTGKTLAFLVAVMNRLLSRPALADRKPE 83
Query: 1478 DQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIAT 1537
D P A+I+APTRELA QI ++ KFG LG+R LV GG+ ++Q L+ G +++IAT
Sbjct: 84 D--PRALILAPTRELAIQIHKDAVKFGADLGLRFALVYGGVDYDKQRELLQQGVDVIIAT 141
Query: 1538 PGRLIDVLENRYLV-LNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAED 1596
PGRLID ++ +V L+ C VLDEADRM D+GF D++ +L MP
Sbjct: 142 PGRLIDYVKQHKVVSLHACEICVLDEADRMFDLGFIKDIRFLLRRMP------------- 188
Query: 1597 ENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYI 1656
++ RQT++F+AT+ V LA ++ P + + + R+ Q +Y
Sbjct: 189 ----------ERGTRQTLLFSATLSHRVLELAYEHMNEPEKLVVETETITAARVRQRIYF 238
Query: 1657 LSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQREL 1716
++++K+ L+ +L+R ++FVN K + +A+ LE+ GY L G Q++RE
Sbjct: 239 PADEEKQTLLLGLLSRSEGARTMVFVNTKAFVERVARTLERHGYRVGVLSGDVPQKKRES 298
Query: 1717 ALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAV 1776
LN + G +ILVATDVA RG+ I V V NYD+ EDY HRIGRT R G+EG A+
Sbjct: 299 LLNRFQKGQLEILVATDVAARGLHIDGVKYVYNYDLPFDAEDYVHRIGRTARLGEEGDAI 358
Query: 1777 SF 1778
SF
Sbjct: 359 SF 360
Score = 172 bits (436), Expect = 8e-45
Identities = 116/368 (31%), Positives = 184/368 (50%), Gaps = 73/368 (19%)
Query: 2 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDA 61
+E G+ TPIQ +P+ L D+ G A+TG+GKTLAFL+ ++ + S P +A +
Sbjct: 24 LESAGFTRCTPIQALTLPVALPGGDVAGQAQTGTGKTLAFLVAVMNRLLSRPALADRKPE 83
Query: 62 DQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIAT 121
D P A+I+APTRELA QI ++ KFG LG+R LV GG+ ++Q L+ G +++IAT
Sbjct: 84 D--PRALILAPTRELAIQIHKDAVKFGADLGLRFALVYGGVDYDKQRELLQQGVDVIIAT 141
Query: 122 PGRLIDVLENRYLV-LNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAED 180
PGRLID ++ +V L+ C VLDEADRM D+GF D++ +L MP
Sbjct: 142 PGRLIDYVKQHKVVSLHACEICVLDEADRMFDLGFIKDIRFLLRRMP------------- 188
Query: 181 ENKLLANYNSKKKYRQTVMFTATMPPAKILEYMPVTNLKPDTEDAEDENKLLANYNSKKK 240
++ RQT++F+AT+ ++LE
Sbjct: 189 ----------ERGTRQTLLFSATL-SHRVLE----------------------------- 208
Query: 241 YRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEV 300
LA ++ P + + + R+ Q +Y ++++K+ L+ +
Sbjct: 209 -----------------LAYEHMNEPEKLVVETETITAARVRQRIYFPADEEKQTLLLGL 251
Query: 301 LNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDIL 360
L+R ++FVN K + +A+ LE+ GY L G Q++RE LN + G +IL
Sbjct: 252 LSRSEGARTMVFVNTKAFVERVARTLERHGYRVGVLSGDVPQKKRESLLNRFQKGQLEIL 311
Query: 361 MAGDRRSR 368
+A D +R
Sbjct: 312 VATDVAAR 319
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 239 bits (613), Expect = 3e-69
Identities = 135/379 (35%), Positives = 191/379 (50%), Gaps = 34/379 (8%)
Query: 728 RNWKEASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLL 787
+ E L +LE ++ GY PT IQ +AIP L RD++G A TG+GKT AFLLP L
Sbjct: 1 TTFSELELDESLLEALQDKGYTRPTAIQAEAIPPALDGRDVLGSAPTGTGKTAAFLLPAL 60
Query: 788 VWIQSLPKIARMEDADQGPYAI-IMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSRE 846
+ P+ GP I I+ PTRELA Q+ ++ + + + GG++
Sbjct: 61 QHLLDFPR------RKSGPPRILILTPTRELAMQVADQARELAKHTHLDIATITGGVAYM 114
Query: 847 EQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEY 906
+IV+ATPGRL+ ++ ++LDEADRM+DMGF D++ I
Sbjct: 115 NHAEVFSENQDIVVATPGRLLQYIKEENFDCRAVETLILDEADRMLDMGFAQDIETI--- 171
Query: 907 MPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMP-PAVERLARSYLRRPATVYI 965
AE + +QT++F+AT+ AV+ A L P V
Sbjct: 172 -----------AAETRWR-----------KQTLLFSATLEGDAVQDFAERLLNDPVEVEA 209
Query: 966 GSVGKPTERIEQIVYILSEQD-KRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLG 1024
+ ++I Q Y + + K L +L + I+FV ++ LA L K G
Sbjct: 210 EPSRRERKKIHQWYYRADDLEHKTALLCHLLKQPEVTRSIVFVRTRERVHELAGWLRKAG 269
Query: 1025 YNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDY 1084
N C L G Q +R A+ L G ++LVATDVA RGIDI DVS VIN+DM +S + Y
Sbjct: 270 INCCYLEGEMVQAKRNEAIKRLTDGRVNVLVATDVAARGIDIDDVSHVINFDMPRSADTY 329
Query: 1085 THRIGRTGRAGKEGLAVSF 1103
HRIGRTGRAG++G A+S
Sbjct: 330 LHRIGRTGRAGRKGTAISL 348
Score = 239 bits (613), Expect = 3e-69
Identities = 135/379 (35%), Positives = 191/379 (50%), Gaps = 34/379 (8%)
Query: 1403 RNWKEASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLL 1462
+ E L +LE ++ GY PT IQ +AIP L RD++G A TG+GKT AFLLP L
Sbjct: 1 TTFSELELDESLLEALQDKGYTRPTAIQAEAIPPALDGRDVLGSAPTGTGKTAAFLLPAL 60
Query: 1463 VWIQSLPKIARMEDADQGPYAI-IMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSRE 1521
+ P+ GP I I+ PTRELA Q+ ++ + + + GG++
Sbjct: 61 QHLLDFPR------RKSGPPRILILTPTRELAMQVADQARELAKHTHLDIATITGGVAYM 114
Query: 1522 EQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEY 1581
+IV+ATPGRL+ ++ ++LDEADRM+DMGF D++ I
Sbjct: 115 NHAEVFSENQDIVVATPGRLLQYIKEENFDCRAVETLILDEADRMLDMGFAQDIETI--- 171
Query: 1582 MPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMP-PAVERLARSYLRRPATVYI 1640
AE + +QT++F+AT+ AV+ A L P V
Sbjct: 172 -----------AAETRWR-----------KQTLLFSATLEGDAVQDFAERLLNDPVEVEA 209
Query: 1641 GSVGKPTERIEQIVYILSEQD-KRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLG 1699
+ ++I Q Y + + K L +L + I+FV ++ LA L K G
Sbjct: 210 EPSRRERKKIHQWYYRADDLEHKTALLCHLLKQPEVTRSIVFVRTRERVHELAGWLRKAG 269
Query: 1700 YNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDY 1759
N C L G Q +R A+ L G ++LVATDVA RGIDI DVS VIN+DM +S + Y
Sbjct: 270 INCCYLEGEMVQAKRNEAIKRLTDGRVNVLVATDVAARGIDIDDVSHVINFDMPRSADTY 329
Query: 1760 THRIGRTGRAGKEGLAVSF 1778
HRIGRTGRAG++G A+S
Sbjct: 330 LHRIGRTGRAGRKGTAISL 348
Score = 128 bits (323), Expect = 4e-31
Identities = 60/162 (37%), Positives = 87/162 (53%), Gaps = 7/162 (4%)
Query: 2 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDA 61
++ GY PT IQ +AIP L RD++G A TG+GKT AFLLP L + P+
Sbjct: 16 LQDKGYTRPTAIQAEAIPPALDGRDVLGSAPTGTGKTAAFLLPALQHLLDFPR------R 69
Query: 62 DQGPYAI-IMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIA 120
GP I I+ PTRELA Q+ ++ + + + GG++ +IV+A
Sbjct: 70 KSGPPRILILTPTRELAMQVADQARELAKHTHLDIATITGGVAYMNHAEVFSENQDIVVA 129
Query: 121 TPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKI 162
TPGRL+ ++ ++LDEADRM+DMGF D++ I
Sbjct: 130 TPGRLLQYIKEENFDCRAVETLILDEADRMLDMGFAQDIETI 171
Score = 48.8 bits (117), Expect = 2e-05
Identities = 38/129 (29%), Positives = 60/129 (46%), Gaps = 2/129 (1%)
Query: 242 RQTVMFTATMP-PAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQD-KRKKLME 299
+QT++F+AT+ AV+ A L P V + ++I Q Y + + K L
Sbjct: 179 KQTLLFSATLEGDAVQDFAERLLNDPVEVEAEPSRRERKKIHQWYYRADDLEHKTALLCH 238
Query: 300 VLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDI 359
+L + I+FV ++ LA L K G N C L G Q +R A+ L G ++
Sbjct: 239 LLKQPEVTRSIVFVRTRERVHELAGWLRKAGINCCYLEGEMVQAKRNEAIKRLTDGRVNV 298
Query: 360 LMAGDRRSR 368
L+A D +R
Sbjct: 299 LVATDVAAR 307
>gnl|CDD|235314 PRK04837, PRK04837, ATP-dependent RNA helicase RhlB; Provisional.
Length = 423
Score = 234 bits (598), Expect = 3e-67
Identities = 129/370 (34%), Positives = 186/370 (50%), Gaps = 25/370 (6%)
Query: 734 SLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSL 793
+L +++E +EK G+ TPIQ A+P+ L RD+ G A+TG+GKT+AFL ++ S
Sbjct: 14 ALHPQVVEALEKKGFHNCTPIQALALPLTLAGRDVAGQAQTGTGKTMAFLTATFHYLLSH 73
Query: 794 PKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLR 853
P + P A+IMAPTRELA QI + G++ L GG ++Q L
Sbjct: 74 PAPE--DRKVNQPRALIMAPTRELAVQIHADAEPLAQATGLKLGLAYGGDGYDKQLKVLE 131
Query: 854 LGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLK 913
G +I+I T GRLID + ++ L +VLDEADRM D+GF D++ + MP
Sbjct: 132 SGVDILIGTTGRLIDYAKQNHINLGAIQVVVLDEADRMFDLGFIKDIRWLFRRMPPA--- 188
Query: 914 PDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTE 973
R ++F+AT+ V LA ++ P V + K
Sbjct: 189 --------------------NQRLNMLFSATLSYRVRELAFEHMNNPEYVEVEPEQKTGH 228
Query: 974 RIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGG 1033
RI++ ++ S ++K + L ++ IIF N K + + L G+ L G
Sbjct: 229 RIKEELFYPSNEEKMRLLQTLIEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGD 288
Query: 1034 KGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGR 1093
Q++R L G DILVATDVA RG+ I V+ V NYD+ EDY HRIGRTGR
Sbjct: 289 VAQKKRLRILEEFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGR 348
Query: 1094 AGKEGLAVSF 1103
AG G ++S
Sbjct: 349 AGASGHSISL 358
Score = 234 bits (598), Expect = 3e-67
Identities = 129/370 (34%), Positives = 186/370 (50%), Gaps = 25/370 (6%)
Query: 1409 SLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSL 1468
+L +++E +EK G+ TPIQ A+P+ L RD+ G A+TG+GKT+AFL ++ S
Sbjct: 14 ALHPQVVEALEKKGFHNCTPIQALALPLTLAGRDVAGQAQTGTGKTMAFLTATFHYLLSH 73
Query: 1469 PKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLR 1528
P + P A+IMAPTRELA QI + G++ L GG ++Q L
Sbjct: 74 PAPE--DRKVNQPRALIMAPTRELAVQIHADAEPLAQATGLKLGLAYGGDGYDKQLKVLE 131
Query: 1529 LGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLK 1588
G +I+I T GRLID + ++ L +VLDEADRM D+GF D++ + MP
Sbjct: 132 SGVDILIGTTGRLIDYAKQNHINLGAIQVVVLDEADRMFDLGFIKDIRWLFRRMPPA--- 188
Query: 1589 PDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTE 1648
R ++F+AT+ V LA ++ P V + K
Sbjct: 189 --------------------NQRLNMLFSATLSYRVRELAFEHMNNPEYVEVEPEQKTGH 228
Query: 1649 RIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGG 1708
RI++ ++ S ++K + L ++ IIF N K + + L G+ L G
Sbjct: 229 RIKEELFYPSNEEKMRLLQTLIEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGD 288
Query: 1709 KGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGR 1768
Q++R L G DILVATDVA RG+ I V+ V NYD+ EDY HRIGRTGR
Sbjct: 289 VAQKKRLRILEEFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGR 348
Query: 1769 AGKEGLAVSF 1778
AG G ++S
Sbjct: 349 AGASGHSISL 358
Score = 135 bits (343), Expect = 9e-34
Identities = 68/169 (40%), Positives = 95/169 (56%), Gaps = 2/169 (1%)
Query: 2 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDA 61
+EK G+ TPIQ A+P+ L RD+ G A+TG+GKT+AFL ++ S P +
Sbjct: 23 LEKKGFHNCTPIQALALPLTLAGRDVAGQAQTGTGKTMAFLTATFHYLLSHPAPE--DRK 80
Query: 62 DQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIAT 121
P A+IMAPTRELA QI + G++ L GG ++Q L G +I+I T
Sbjct: 81 VNQPRALIMAPTRELAVQIHADAEPLAQATGLKLGLAYGGDGYDKQLKVLESGVDILIGT 140
Query: 122 PGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTN 170
GRLID + ++ L +VLDEADRM D+GF D++ + MP N
Sbjct: 141 TGRLIDYAKQNHINLGAIQVVVLDEADRMFDLGFIKDIRWLFRRMPPAN 189
Score = 36.9 bits (86), Expect = 0.100
Identities = 34/129 (26%), Positives = 57/129 (44%)
Query: 240 KYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLME 299
R ++F+AT+ V LA ++ P V + K RI++ ++ S ++K + L
Sbjct: 189 NQRLNMLFSATLSYRVRELAFEHMNNPEYVEVEPEQKTGHRIKEELFYPSNEEKMRLLQT 248
Query: 300 VLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDI 359
++ IIF N K + + L G+ L G Q++R L G DI
Sbjct: 249 LIEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLRILEEFTRGDLDI 308
Query: 360 LMAGDRRSR 368
L+A D +R
Sbjct: 309 LVATDVAAR 317
>gnl|CDD|236941 PRK11634, PRK11634, ATP-dependent RNA helicase DeaD; Provisional.
Length = 629
Score = 240 bits (613), Expect = 3e-67
Identities = 146/386 (37%), Positives = 208/386 (53%), Gaps = 35/386 (9%)
Query: 735 LPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLP 794
L ILE + +GY +P+PIQ + IP L RD++G+A+TGSGKT AF LPLL +
Sbjct: 13 LKAPILEALNDLGYEKPSPIQAECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLHNLDPEL 72
Query: 795 KIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPL-GIRTVLVVGGLSREEQGFRLR 853
K P +++APTRELA Q+ E F + G+ V + GG + Q LR
Sbjct: 73 K---------APQILVLAPTRELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALR 123
Query: 854 LGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLK 913
G +IV+ TPGRL+D L+ L L++ + +VLDEAD M+ MGF DV+ I+ +P +
Sbjct: 124 QGPQIVVGTPGRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGH-- 181
Query: 914 PDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTE 973
QT +F+ATMP A+ R+ R +++ P V I S
Sbjct: 182 -----------------------QTALFSATMPEAIRRITRRFMKEPQEVRIQSSVTTRP 218
Query: 974 RIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGG 1033
I Q + + K + L+ L IIFV K +A+ LE+ GYN+ L+G
Sbjct: 219 DISQSYWTVWGMRKNEALVRFLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGD 278
Query: 1034 KGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGR 1093
Q RE L LK G DIL+ATDVA RG+D++ +S+V+NYD+ E Y HRIGRTGR
Sbjct: 279 MNQALREQTLERLKDGRLDILIATDVAARGLDVERISLVVNYDIPMDSESYVHRIGRTGR 338
Query: 1094 AGKEGLAVSFCTKDDSHLFYDLKQMM 1119
AG+ G A+ F + L ++++ M
Sbjct: 339 AGRAGRALLFVENRERRLLRNIERTM 364
Score = 240 bits (613), Expect = 3e-67
Identities = 146/386 (37%), Positives = 208/386 (53%), Gaps = 35/386 (9%)
Query: 1410 LPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLP 1469
L ILE + +GY +P+PIQ + IP L RD++G+A+TGSGKT AF LPLL +
Sbjct: 13 LKAPILEALNDLGYEKPSPIQAECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLHNLDPEL 72
Query: 1470 KIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPL-GIRTVLVVGGLSREEQGFRLR 1528
K P +++APTRELA Q+ E F + G+ V + GG + Q LR
Sbjct: 73 K---------APQILVLAPTRELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALR 123
Query: 1529 LGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLK 1588
G +IV+ TPGRL+D L+ L L++ + +VLDEAD M+ MGF DV+ I+ +P +
Sbjct: 124 QGPQIVVGTPGRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGH-- 181
Query: 1589 PDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTE 1648
QT +F+ATMP A+ R+ R +++ P V I S
Sbjct: 182 -----------------------QTALFSATMPEAIRRITRRFMKEPQEVRIQSSVTTRP 218
Query: 1649 RIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGG 1708
I Q + + K + L+ L IIFV K +A+ LE+ GYN+ L+G
Sbjct: 219 DISQSYWTVWGMRKNEALVRFLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGD 278
Query: 1709 KGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGR 1768
Q RE L LK G DIL+ATDVA RG+D++ +S+V+NYD+ E Y HRIGRTGR
Sbjct: 279 MNQALREQTLERLKDGRLDILIATDVAARGLDVERISLVVNYDIPMDSESYVHRIGRTGR 338
Query: 1769 AGKEGLAVSFCTKDDSHLFYDLKQMM 1794
AG+ G A+ F + L ++++ M
Sbjct: 339 AGRAGRALLFVENRERRLLRNIERTM 364
Score = 151 bits (384), Expect = 1e-37
Identities = 115/368 (31%), Positives = 164/368 (44%), Gaps = 82/368 (22%)
Query: 2 IEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDA 61
+ +GY +P+PIQ + IP L RD++G+A+TGSGKT AF LPLL + K
Sbjct: 21 LNDLGYEKPSPIQAECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLHNLDPELK------- 73
Query: 62 DQGPYAIIMAPTRELAQQIEEETNKFGTPL-GIRTVLVVGGLSREEQGFRLRLGCEIVIA 120
P +++APTRELA Q+ E F + G+ V + GG + Q LR G +IV+
Sbjct: 74 --APQILVLAPTRELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQIVVG 131
Query: 121 TPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAED 180
TPGRL+D L+ L L++ + +VLDEAD M+ MGF DV+ I+ +P +
Sbjct: 132 TPGRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGH---------- 181
Query: 181 ENKLLANYNSKKKYRQTVMFTATMPPAKILEYMPVTNLKPDTEDAEDENKLLANYNSKKK 240
QT +F+ATM
Sbjct: 182 ---------------QTALFSATM------------------------------------ 190
Query: 241 YRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEV 300
P A+ R+ R +++ P V I S I Q + + K + L+
Sbjct: 191 -----------PEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYWTVWGMRKNEALVRF 239
Query: 301 LNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDIL 360
L IIFV K +A+ LE+ GYN+ L+G Q RE L LK G DIL
Sbjct: 240 LEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQALREQTLERLKDGRLDIL 299
Query: 361 MAGDRRSR 368
+A D +R
Sbjct: 300 IATDVAAR 307
Score = 32.9 bits (75), Expect = 1.9
Identities = 13/35 (37%), Positives = 17/35 (48%)
Query: 388 VRERDTDRSRDDRGRDRDRDRLERDRSREREREKR 422
+R + R RDDRG DR R +R R +R
Sbjct: 446 MRPKREFRDRDDRGPRDRNDRGPRGDREDRPRRER 480
Score = 31.7 bits (72), Expect = 4.5
Identities = 14/47 (29%), Positives = 21/47 (44%), Gaps = 3/47 (6%)
Query: 372 PPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSRERE 418
P + + R+RD R+R+ R DR DR R ER + +
Sbjct: 444 APMRPKREFRDRDDRGPRDRNDRGPRGDRE---DRPRRERRDVGDMQ 487
>gnl|CDD|185609 PTZ00424, PTZ00424, helicase 45; Provisional.
Length = 401
Score = 214 bits (546), Expect = 1e-60
Identities = 123/386 (31%), Positives = 201/386 (52%), Gaps = 37/386 (9%)
Query: 725 DPVRNWKEASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLL 784
+ V ++ L ++L I G+ +P+ IQ++ I L D IG A++G+GKT F+
Sbjct: 25 EIVDSFDALKLNEDLLRGIYSYGFEKPSAIQQRGIKPILDGYDTIGQAQSGTGKTATFV- 83
Query: 785 PLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLS 844
I +L I +A Q A+I+APTRELAQQI++ G L +R VGG
Sbjct: 84 -----IAALQLIDYDLNACQ---ALILAPTRELAQQIQKVVLALGDYLKVRCHACVGGTV 135
Query: 845 REEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKIL 904
+ +L+ G +V+ TPGR+ D+++ R+L ++ +LDEAD M+ GF+ + +
Sbjct: 136 VRDDINKLKAGVHMVVGTPGRVYDMIDKRHLRVDDLKLFILDEADEMLSRGFKGQIYDVF 195
Query: 905 EYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVY 964
+ +P PD Q +F+ATMP + L ++R P +
Sbjct: 196 KKLP-----PDV--------------------QVALFSATMPNEILELTTKFMRDPKRIL 230
Query: 965 IGSVGKPTERIEQIVYILSEQDKRK--KLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEK 1022
+ E I Q Y+ E+++ K L ++ II+ N ++ D L K + +
Sbjct: 231 VKKDELTLEGIRQF-YVAVEKEEWKFDTLCDLYETLTITQAIIYCNTRRKVDYLTKKMHE 289
Query: 1023 LGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIE 1082
+ +HG Q+ R+L + + GS +L+ TD+ RGID++ VS+VINYD+ S E
Sbjct: 290 RDFTVSCMHGDMDQKDRDLIMREFRSGSTRVLITTDLLARGIDVQQVSLVINYDLPASPE 349
Query: 1083 DYTHRIGRTGRAGKEGLAVSFCTKDD 1108
+Y HRIGR+GR G++G+A++F T DD
Sbjct: 350 NYIHRIGRSGRFGRKGVAINFVTPDD 375
Score = 214 bits (546), Expect = 1e-60
Identities = 123/386 (31%), Positives = 201/386 (52%), Gaps = 37/386 (9%)
Query: 1400 DPVRNWKEASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLL 1459
+ V ++ L ++L I G+ +P+ IQ++ I L D IG A++G+GKT F+
Sbjct: 25 EIVDSFDALKLNEDLLRGIYSYGFEKPSAIQQRGIKPILDGYDTIGQAQSGTGKTATFV- 83
Query: 1460 PLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLS 1519
I +L I +A Q A+I+APTRELAQQI++ G L +R VGG
Sbjct: 84 -----IAALQLIDYDLNACQ---ALILAPTRELAQQIQKVVLALGDYLKVRCHACVGGTV 135
Query: 1520 REEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKIL 1579
+ +L+ G +V+ TPGR+ D+++ R+L ++ +LDEAD M+ GF+ + +
Sbjct: 136 VRDDINKLKAGVHMVVGTPGRVYDMIDKRHLRVDDLKLFILDEADEMLSRGFKGQIYDVF 195
Query: 1580 EYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVY 1639
+ +P PD Q +F+ATMP + L ++R P +
Sbjct: 196 KKLP-----PDV--------------------QVALFSATMPNEILELTTKFMRDPKRIL 230
Query: 1640 IGSVGKPTERIEQIVYILSEQDKRK--KLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEK 1697
+ E I Q Y+ E+++ K L ++ II+ N ++ D L K + +
Sbjct: 231 VKKDELTLEGIRQF-YVAVEKEEWKFDTLCDLYETLTITQAIIYCNTRRKVDYLTKKMHE 289
Query: 1698 LGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIE 1757
+ +HG Q+ R+L + + GS +L+ TD+ RGID++ VS+VINYD+ S E
Sbjct: 290 RDFTVSCMHGDMDQKDRDLIMREFRSGSTRVLITTDLLARGIDVQQVSLVINYDLPASPE 349
Query: 1758 DYTHRIGRTGRAGKEGLAVSFCTKDD 1783
+Y HRIGR+GR G++G+A++F T DD
Sbjct: 350 NYIHRIGRSGRFGRKGVAINFVTPDD 375
Score = 124 bits (313), Expect = 5e-30
Identities = 94/365 (25%), Positives = 156/365 (42%), Gaps = 84/365 (23%)
Query: 6 GYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGP 65
G+ +P+ IQ++ I L D IG A++G+GKT F+ I +L I +A Q
Sbjct: 47 GFEKPSAIQQRGIKPILDGYDTIGQAQSGTGKTATFV------IAALQLIDYDLNACQ-- 98
Query: 66 YAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRL 125
A+I+APTRELAQQI++ G L +R VGG + +L+ G +V+ TPGR+
Sbjct: 99 -ALILAPTRELAQQIQKVVLALGDYLKVRCHACVGGTVVRDDINKLKAGVHMVVGTPGRV 157
Query: 126 IDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLL 185
D+++ R+L ++ +LDEAD M+ GF+ + + + +P PD
Sbjct: 158 YDMIDKRHLRVDDLKLFILDEADEMLSRGFKGQIYDVFKKLP-----PDV---------- 202
Query: 186 ANYNSKKKYRQTVMFTATMPPAKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTV 245
Q +F+ATM P +ILE
Sbjct: 203 ----------QVALFSATM-PNEILE---------------------------------- 217
Query: 246 MFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRK--KLMEVLNR 303
L ++R P + + E I Q Y+ E+++ K L ++
Sbjct: 218 ------------LTTKFMRDPKRILVKKDELTLEGIRQF-YVAVEKEEWKFDTLCDLYET 264
Query: 304 GVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILMAG 363
II+ N ++ D L K + + + +HG Q+ R+L + + GS +L+
Sbjct: 265 LTITQAIIYCNTRRKVDYLTKKMHERDFTVSCMHGDMDQKDRDLIMREFRSGSTRVLITT 324
Query: 364 DRRSR 368
D +R
Sbjct: 325 DLLAR 329
>gnl|CDD|215832 pfam00270, DEAD, DEAD/DEAH box helicase. Members of this family
include the DEAD and DEAH box helicases. Helicases are
involved in unwinding nucleic acids. The DEAD box
helicases are involved in various aspects of RNA
metabolism, including nuclear transcription, pre mRNA
splicing, ribosome biogenesis, nucleocytoplasmic
transport, translation, RNA decay and organellar gene
expression.
Length = 169
Score = 187 bits (478), Expect = 1e-54
Identities = 81/203 (39%), Positives = 113/203 (55%), Gaps = 36/203 (17%)
Query: 752 TPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIM 811
TPIQ QAIP L +D++ A TGSGKTLAFLLP+L + GP A+++
Sbjct: 1 TPIQAQAIPAILSGKDVLVQAPTGSGKTLAFLLPIL---------QALLPKKGGPQALVL 51
Query: 812 APTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLG-CEIVIATPGRLIDVL 870
APTRELA+QI EE K LG+R L+ GG S +EQ +L+ G +I++ TPGRL+D+L
Sbjct: 52 APTRELAEQIYEELKKLFKILGLRVALLTGGTSLKEQARKLKKGKADILVGTPGRLLDLL 111
Query: 871 ENRYL-VLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANY 929
L +L +VLDEA R++DMGF D+++IL +P
Sbjct: 112 RRGKLKLLKNLKLLVLDEAHRLLDMGFGDDLEEILSRLP--------------------- 150
Query: 930 NSKKKYRQTVMFTATMPPAVERL 952
RQ ++ +AT+P +E L
Sbjct: 151 ----PDRQILLLSATLPRNLEDL 169
Score = 187 bits (478), Expect = 1e-54
Identities = 81/203 (39%), Positives = 113/203 (55%), Gaps = 36/203 (17%)
Query: 1427 TPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIM 1486
TPIQ QAIP L +D++ A TGSGKTLAFLLP+L + GP A+++
Sbjct: 1 TPIQAQAIPAILSGKDVLVQAPTGSGKTLAFLLPIL---------QALLPKKGGPQALVL 51
Query: 1487 APTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLG-CEIVIATPGRLIDVL 1545
APTRELA+QI EE K LG+R L+ GG S +EQ +L+ G +I++ TPGRL+D+L
Sbjct: 52 APTRELAEQIYEELKKLFKILGLRVALLTGGTSLKEQARKLKKGKADILVGTPGRLLDLL 111
Query: 1546 ENRYL-VLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANY 1604
L +L +VLDEA R++DMGF D+++IL +P
Sbjct: 112 RRGKLKLLKNLKLLVLDEAHRLLDMGFGDDLEEILSRLP--------------------- 150
Query: 1605 NSKKKYRQTVMFTATMPPAVERL 1627
RQ ++ +AT+P +E L
Sbjct: 151 ----PDRQILLLSATLPRNLEDL 169
Score = 183 bits (467), Expect = 4e-53
Identities = 79/197 (40%), Positives = 110/197 (55%), Gaps = 36/197 (18%)
Query: 11 TPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIM 70
TPIQ QAIP L +D++ A TGSGKTLAFLLP+L + GP A+++
Sbjct: 1 TPIQAQAIPAILSGKDVLVQAPTGSGKTLAFLLPIL---------QALLPKKGGPQALVL 51
Query: 71 APTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLG-CEIVIATPGRLIDVL 129
APTRELA+QI EE K LG+R L+ GG S +EQ +L+ G +I++ TPGRL+D+L
Sbjct: 52 APTRELAEQIYEELKKLFKILGLRVALLTGGTSLKEQARKLKKGKADILVGTPGRLLDLL 111
Query: 130 ENRYL-VLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANY 188
L +L +VLDEA R++DMGF D+++IL +P
Sbjct: 112 RRGKLKLLKNLKLLVLDEAHRLLDMGFGDDLEEILSRLP--------------------- 150
Query: 189 NSKKKYRQTVMFTATMP 205
RQ ++ +AT+P
Sbjct: 151 ----PDRQILLLSATLP 163
Score = 29.5 bits (67), Expect = 9.3
Identities = 7/17 (41%), Positives = 12/17 (70%)
Query: 242 RQTVMFTATMPPAVERL 258
RQ ++ +AT+P +E L
Sbjct: 153 RQILLLSATLPRNLEDL 169
>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily.
Length = 201
Score = 174 bits (443), Expect = 2e-49
Identities = 85/238 (35%), Positives = 122/238 (51%), Gaps = 39/238 (16%)
Query: 743 IEKIGYAEPTPIQRQAIPIGLQN-RDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMED 801
IEK G+ P Q++AI L RD+I A TGSGKTLA LLP L ++
Sbjct: 1 IEKFGFEPLRPYQKEAIEALLSGLRDVILAAPTGSGKTLAALLPALEALKRGK------- 53
Query: 802 ADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGC-EIVI 860
G +++ PTRELA+Q EE K G LG++ V + GG S+ EQ +L G +I++
Sbjct: 54 ---GGRVLVLVPTRELAEQWAEELKKLGPSLGLKVVGLYGGDSKREQLRKLESGKTDILV 110
Query: 861 ATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAE 920
TPGRL+D+LEN L L+ ++LDEA R++D GF ++K+L+ +P
Sbjct: 111 TTPGRLLDLLENDKLSLSNVDLVILDEAHRLLDGGFGDQLEKLLKLLP------------ 158
Query: 921 DENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQI 978
K Q ++ +AT P +E L +L P + +G P E IEQ
Sbjct: 159 -------------KNVQLLLLSATPPEEIENLLELFLNDPVFIDVGF--TPLEPIEQF 201
Score = 174 bits (443), Expect = 2e-49
Identities = 85/238 (35%), Positives = 122/238 (51%), Gaps = 39/238 (16%)
Query: 1418 IEKIGYAEPTPIQRQAIPIGLQN-RDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMED 1476
IEK G+ P Q++AI L RD+I A TGSGKTLA LLP L ++
Sbjct: 1 IEKFGFEPLRPYQKEAIEALLSGLRDVILAAPTGSGKTLAALLPALEALKRGK------- 53
Query: 1477 ADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGC-EIVI 1535
G +++ PTRELA+Q EE K G LG++ V + GG S+ EQ +L G +I++
Sbjct: 54 ---GGRVLVLVPTRELAEQWAEELKKLGPSLGLKVVGLYGGDSKREQLRKLESGKTDILV 110
Query: 1536 ATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAE 1595
TPGRL+D+LEN L L+ ++LDEA R++D GF ++K+L+ +P
Sbjct: 111 TTPGRLLDLLENDKLSLSNVDLVILDEAHRLLDGGFGDQLEKLLKLLP------------ 158
Query: 1596 DENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQI 1653
K Q ++ +AT P +E L +L P + +G P E IEQ
Sbjct: 159 -------------KNVQLLLLSATPPEEIENLLELFLNDPVFIDVGF--TPLEPIEQF 201
Score = 159 bits (404), Expect = 3e-44
Identities = 75/208 (36%), Positives = 108/208 (51%), Gaps = 37/208 (17%)
Query: 2 IEKIGYAEPTPIQRQAIPIGLQN-RDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMED 60
IEK G+ P Q++AI L RD+I A TGSGKTLA LLP L ++
Sbjct: 1 IEKFGFEPLRPYQKEAIEALLSGLRDVILAAPTGSGKTLAALLPALEALKRGK------- 53
Query: 61 ADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGC-EIVI 119
G +++ PTRELA+Q EE K G LG++ V + GG S+ EQ +L G +I++
Sbjct: 54 ---GGRVLVLVPTRELAEQWAEELKKLGPSLGLKVVGLYGGDSKREQLRKLESGKTDILV 110
Query: 120 ATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAE 179
TPGRL+D+LEN L L+ ++LDEA R++D GF ++K+L+ +P
Sbjct: 111 TTPGRLLDLLENDKLSLSNVDLVILDEAHRLLDGGFGDQLEKLLKLLP------------ 158
Query: 180 DENKLLANYNSKKKYRQTVMFTATMPPA 207
K Q ++ +AT P
Sbjct: 159 -------------KNVQLLLLSATPPEE 173
Score = 37.9 bits (88), Expect = 0.025
Identities = 15/46 (32%), Positives = 22/46 (47%), Gaps = 2/46 (4%)
Query: 239 KKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQI 284
K Q ++ +AT P +E L +L P + +G P E IEQ
Sbjct: 158 PKNVQLLLLSATPPEEIENLLELFLNDPVFIDVGF--TPLEPIEQF 201
>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
of proteins involved in ATP-dependent RNA or DNA
unwinding. This domain contains the ATP-binding region.
Length = 144
Score = 135 bits (342), Expect = 1e-36
Identities = 60/180 (33%), Positives = 88/180 (48%), Gaps = 36/180 (20%)
Query: 25 RDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEET 84
RD++ A TGSGKTLA LLP+L + D+ +G +++APTRELA Q+ E
Sbjct: 1 RDVLLAAPTGSGKTLAALLPIL----------ELLDSLKGGQVLVLAPTRELANQVAERL 50
Query: 85 NKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVL 144
+ GI+ ++GG S ++Q L +IV+ TPGRL+D LE L L + ++L
Sbjct: 51 KELF-GEGIKVGYLIGGTSIKQQEKLLSGKTDIVVGTPGRLLDELERLKLSLKKLDLLIL 109
Query: 145 DEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATM 204
DEA R+++ GF KIL K RQ ++ +AT
Sbjct: 110 DEAHRLLNQGFGLLGLKILLK-------------------------LPKDRQVLLLSATP 144
Score = 135 bits (342), Expect = 1e-36
Identities = 60/180 (33%), Positives = 88/180 (48%), Gaps = 36/180 (20%)
Query: 766 RDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEET 825
RD++ A TGSGKTLA LLP+L + D+ +G +++APTRELA Q+ E
Sbjct: 1 RDVLLAAPTGSGKTLAALLPIL----------ELLDSLKGGQVLVLAPTRELANQVAERL 50
Query: 826 NKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVL 885
+ GI+ ++GG S ++Q L +IV+ TPGRL+D LE L L + ++L
Sbjct: 51 KELF-GEGIKVGYLIGGTSIKQQEKLLSGKTDIVVGTPGRLLDELERLKLSLKKLDLLIL 109
Query: 886 DEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATM 945
DEA R+++ GF KIL K RQ ++ +AT
Sbjct: 110 DEAHRLLNQGFGLLGLKILLK-------------------------LPKDRQVLLLSATP 144
Score = 135 bits (342), Expect = 1e-36
Identities = 60/180 (33%), Positives = 88/180 (48%), Gaps = 36/180 (20%)
Query: 1441 RDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEET 1500
RD++ A TGSGKTLA LLP+L + D+ +G +++APTRELA Q+ E
Sbjct: 1 RDVLLAAPTGSGKTLAALLPIL----------ELLDSLKGGQVLVLAPTRELANQVAERL 50
Query: 1501 NKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVL 1560
+ GI+ ++GG S ++Q L +IV+ TPGRL+D LE L L + ++L
Sbjct: 51 KELF-GEGIKVGYLIGGTSIKQQEKLLSGKTDIVVGTPGRLLDELERLKLSLKKLDLLIL 109
Query: 1561 DEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATM 1620
DEA R+++ GF KIL K RQ ++ +AT
Sbjct: 110 DEAHRLLNQGFGLLGLKILLK-------------------------LPKDRQVLLLSATP 144
>gnl|CDD|238034 cd00079, HELICc, Helicase superfamily c-terminal domain; associated
with DEXDc-, DEAD-, and DEAH-box proteins, yeast
initiation factor 4A, Ski2p, and Hepatitis C virus NS3
helicases; this domain is found in a wide variety of
helicases and helicase related proteins; may not be an
autonomously folding unit, but an integral part of the
helicase; 4 helicase superfamilies at present according
to the organization of their signature motifs; all
helicases share the ability to unwind nucleic acid
duplexes with a distinct directional polarity; they
utilize the free energy from nucleoside triphosphate
hydrolysis to fuel their translocation along DNA,
unwinding the duplex in the process.
Length = 131
Score = 133 bits (336), Expect = 5e-36
Identities = 56/129 (43%), Positives = 75/129 (58%), Gaps = 2/129 (1%)
Query: 977 QIVYILSEQDKRKKLMEVLNRGVKKP--VIIFVNQKKGADVLAKGLEKLGYNACTLHGGK 1034
+ + E +K + L+E+L +KK V+IF KK D LA+ L K G LHG
Sbjct: 3 KQYVLPVEDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGDG 62
Query: 1035 GQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRA 1094
QE+RE L + G +LVATDV RGID+ +VS+VINYD+ S Y RIGR GRA
Sbjct: 63 SQEEREEVLKDFREGEIVVLVATDVIARGIDLPNVSVVINYDLPWSPSSYLQRIGRAGRA 122
Query: 1095 GKEGLAVSF 1103
G++G A+
Sbjct: 123 GQKGTAILL 131
Score = 133 bits (336), Expect = 5e-36
Identities = 56/129 (43%), Positives = 75/129 (58%), Gaps = 2/129 (1%)
Query: 1652 QIVYILSEQDKRKKLMEVLNRGVKKP--VIIFVNQKKGADVLAKGLEKLGYNACTLHGGK 1709
+ + E +K + L+E+L +KK V+IF KK D LA+ L K G LHG
Sbjct: 3 KQYVLPVEDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGDG 62
Query: 1710 GQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRA 1769
QE+RE L + G +LVATDV RGID+ +VS+VINYD+ S Y RIGR GRA
Sbjct: 63 SQEEREEVLKDFREGEIVVLVATDVIARGIDLPNVSVVINYDLPWSPSSYLQRIGRAGRA 122
Query: 1770 GKEGLAVSF 1778
G++G A+
Sbjct: 123 GQKGTAILL 131
Score = 58.8 bits (143), Expect = 5e-10
Identities = 29/82 (35%), Positives = 42/82 (51%), Gaps = 2/82 (2%)
Query: 283 QIVYILSEQDKRKKLMEVLNRGVKKP--VIIFVNQKKGADVLAKGLEKLGYNACTLHGGK 340
+ + E +K + L+E+L +KK V+IF KK D LA+ L K G LHG
Sbjct: 3 KQYVLPVEDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGDG 62
Query: 341 GQEQRELALNSLKGGSKDILMA 362
QE+RE L + G +L+A
Sbjct: 63 SQEEREEVLKDFREGEIVVLVA 84
>gnl|CDD|197757 smart00490, HELICc, helicase superfamily c-terminal domain.
Length = 82
Score = 107 bits (269), Expect = 1e-27
Identities = 40/82 (48%), Positives = 51/82 (62%)
Query: 1014 DVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVI 1073
+ LA+ L++LG LHGG QE+RE L+ G +LVATDVA RG+D+ V +VI
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVATDVAERGLDLPGVDLVI 60
Query: 1074 NYDMAKSIEDYTHRIGRTGRAG 1095
YD+ S Y RIGR GRAG
Sbjct: 61 IYDLPWSPASYIQRIGRAGRAG 82
Score = 107 bits (269), Expect = 1e-27
Identities = 40/82 (48%), Positives = 51/82 (62%)
Query: 1689 DVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVI 1748
+ LA+ L++LG LHGG QE+RE L+ G +LVATDVA RG+D+ V +VI
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVATDVAERGLDLPGVDLVI 60
Query: 1749 NYDMAKSIEDYTHRIGRTGRAG 1770
YD+ S Y RIGR GRAG
Sbjct: 61 IYDLPWSPASYIQRIGRAGRAG 82
Score = 43.0 bits (102), Expect = 6e-05
Identities = 17/43 (39%), Positives = 25/43 (58%)
Query: 320 DVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILMA 362
+ LA+ L++LG LHGG QE+RE L+ G +L+A
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43
>gnl|CDD|201125 pfam00271, Helicase_C, Helicase conserved C-terminal domain. The
Prosite family is restricted to DEAD/H helicases, whereas
this domain family is found in a wide variety of
helicases and helicase related proteins. It may be that
this is not an autonomously folding unit, but an integral
part of the helicase.
Length = 78
Score = 106 bits (268), Expect = 2e-27
Identities = 41/76 (53%), Positives = 49/76 (64%)
Query: 1020 LEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAK 1079
L K G LHGG QE+RE L + G +LVATDVAGRGID+ DV++VINYD+
Sbjct: 3 LRKPGIKVARLHGGLSQEEREEILEDFRNGKSKVLVATDVAGRGIDLPDVNLVINYDLPW 62
Query: 1080 SIEDYTHRIGRTGRAG 1095
+ Y RIGR GRAG
Sbjct: 63 NPASYIQRIGRAGRAG 78
Score = 106 bits (268), Expect = 2e-27
Identities = 41/76 (53%), Positives = 49/76 (64%)
Query: 1695 LEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAK 1754
L K G LHGG QE+RE L + G +LVATDVAGRGID+ DV++VINYD+
Sbjct: 3 LRKPGIKVARLHGGLSQEEREEILEDFRNGKSKVLVATDVAGRGIDLPDVNLVINYDLPW 62
Query: 1755 SIEDYTHRIGRTGRAG 1770
+ Y RIGR GRAG
Sbjct: 63 NPASYIQRIGRAGRAG 78
Score = 37.1 bits (87), Expect = 0.006
Identities = 15/37 (40%), Positives = 19/37 (51%)
Query: 326 LEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILMA 362
L K G LHGG QE+RE L + G +L+A
Sbjct: 3 LRKPGIKVARLHGGLSQEEREEILEDFRNGKSKVLVA 39
>gnl|CDD|223588 COG0514, RecQ, Superfamily II DNA helicase [DNA replication,
recombination, and repair].
Length = 590
Score = 100 bits (250), Expect = 3e-21
Identities = 98/379 (25%), Positives = 161/379 (42%), Gaps = 63/379 (16%)
Query: 744 EKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDAD 803
+ GYA P Q++ I L +D + V TG GK+L + +P A + +
Sbjct: 11 QVFGYASFRPGQQEIIDALLSGKDTLVVMPTGGGKSLCYQIP-----------ALLLE-- 57
Query: 804 QGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQG---FRLRLG-CEIV 859
G ++++P L + + + + GIR + LSREE+ +L+ G +++
Sbjct: 58 -GL-TLVVSPLISL-MKDQVDQLE---AAGIRAAYLNSTLSREERQQVLNQLKSGQLKLL 111
Query: 860 IATPGRL-----IDVLENRYLVLNQCTYIVLDEADRMIDMG--FEPDVQKILEYMPVTNL 912
+P RL +++L+ + L + +DEA + G F PD +++
Sbjct: 112 YISPERLMSPRFLELLKRLPISL-----VAIDEAHCISQWGHDFRPDYRRLG-------- 158
Query: 913 KPDTEDAEDENKLLANYNSKKKYRQTVM-FTATMPPAVERLARSYLR-RPATVYIGSVGK 970
+L A V+ TAT P V R L + A ++ GS +
Sbjct: 159 -----------RLRAGLP-----NPPVLALTATATPRVRDDIREQLGLQDANIFRGSFDR 202
Query: 971 PTERIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTL 1030
P ++ + D+ L VL + K II+ +K + LA+ L K G +A
Sbjct: 203 PNLALKVVEK-GEPSDQLAFLATVLPQL-SKSGIIYCLTRKKVEELAEWLRKNGISAGAY 260
Query: 1031 HGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGR 1090
H G E+RE + ++VAT+ G GID DV VI+YD+ SIE Y GR
Sbjct: 261 HAGLSNEERERVQQAFLNDEIKVMVATNAFGMGIDKPDVRFVIHYDLPGSIESYYQETGR 320
Query: 1091 TGRAGKEGLAVSFCTKDDS 1109
GR G A+ + +D
Sbjct: 321 AGRDGLPAEAILLYSPEDI 339
Score = 100 bits (250), Expect = 3e-21
Identities = 98/379 (25%), Positives = 161/379 (42%), Gaps = 63/379 (16%)
Query: 1419 EKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDAD 1478
+ GYA P Q++ I L +D + V TG GK+L + +P A + +
Sbjct: 11 QVFGYASFRPGQQEIIDALLSGKDTLVVMPTGGGKSLCYQIP-----------ALLLE-- 57
Query: 1479 QGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQG---FRLRLG-CEIV 1534
G ++++P L + + + + GIR + LSREE+ +L+ G +++
Sbjct: 58 -GL-TLVVSPLISL-MKDQVDQLE---AAGIRAAYLNSTLSREERQQVLNQLKSGQLKLL 111
Query: 1535 IATPGRL-----IDVLENRYLVLNQCTYIVLDEADRMIDMG--FEPDVQKILEYMPVTNL 1587
+P RL +++L+ + L + +DEA + G F PD +++
Sbjct: 112 YISPERLMSPRFLELLKRLPISL-----VAIDEAHCISQWGHDFRPDYRRLG-------- 158
Query: 1588 KPDTEDAEDENKLLANYNSKKKYRQTVM-FTATMPPAVERLARSYLR-RPATVYIGSVGK 1645
+L A V+ TAT P V R L + A ++ GS +
Sbjct: 159 -----------RLRAGLP-----NPPVLALTATATPRVRDDIREQLGLQDANIFRGSFDR 202
Query: 1646 PTERIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTL 1705
P ++ + D+ L VL + K II+ +K + LA+ L K G +A
Sbjct: 203 PNLALKVVEK-GEPSDQLAFLATVLPQL-SKSGIIYCLTRKKVEELAEWLRKNGISAGAY 260
Query: 1706 HGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGR 1765
H G E+RE + ++VAT+ G GID DV VI+YD+ SIE Y GR
Sbjct: 261 HAGLSNEERERVQQAFLNDEIKVMVATNAFGMGIDKPDVRFVIHYDLPGSIESYYQETGR 320
Query: 1766 TGRAGKEGLAVSFCTKDDS 1784
GR G A+ + +D
Sbjct: 321 AGRDGLPAEAILLYSPEDI 339
Score = 44.6 bits (106), Expect = 6e-04
Identities = 30/101 (29%), Positives = 45/101 (44%), Gaps = 3/101 (2%)
Query: 247 FTATMPPAVERLARSYLR-RPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNRGV 305
TAT P V R L + A ++ GS +P ++ + D+ L VL +
Sbjct: 172 LTATATPRVRDDIREQLGLQDANIFRGSFDRPNLALKVVEK-GEPSDQLAFLATVLPQL- 229
Query: 306 KKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRE 346
K II+ +K + LA+ L K G +A H G E+RE
Sbjct: 230 SKSGIIYCLTRKKVEELAEWLRKNGISAGAYHAGLSNEERE 270
Score = 30.7 bits (70), Expect = 8.2
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 3 EKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLV 47
+ GYA P Q++ I L +D + V TG GK+L + +P L+
Sbjct: 11 QVFGYASFRPGQQEIIDALLSGKDTLVVMPTGGGKSLCYQIPALL 55
>gnl|CDD|224122 COG1201, Lhr, Lhr-like helicases [General function prediction only].
Length = 814
Score = 85.8 bits (213), Expect = 1e-16
Identities = 101/375 (26%), Positives = 160/375 (42%), Gaps = 49/375 (13%)
Query: 734 SLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSL 793
L + E K + TP QR AIP +++ +A TGSGKT A LP++ L
Sbjct: 7 ILDPRVREWF-KRKFTSLTPPQRYAIPEIHSGENVLIIAPTGSGKTEAAFLPVI---NEL 62
Query: 794 PKIARMEDADQGPYAIIMAPTRELAQQIE----EETNKFGTPLGIRTVLVVGGLSREEQG 849
+ + + D G YA+ ++P + L I E + G + +R G + E+
Sbjct: 63 LSLGKGKLED-GIYALYISPLKALNNDIRRRLEEPLRELGIEVAVRH----GDTPQSEKQ 117
Query: 850 FRLRLGCEIVIATPGRLIDVLENRYL--VLNQCTYIVLDEADRMIDMGFEPDVQKILEYM 907
L+ I+I TP L +L + +L Y+++DE + + + VQ L
Sbjct: 118 KMLKNPPHILITTPESLAILLNSPKFRELLRDVRYVIVDEIHALAES--KRGVQLALSLE 175
Query: 908 PVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYL---RRPATVY 964
+ L D Q + +AT+ P E +A+ +L P +
Sbjct: 176 RLRELAGDF--------------------QRIGLSATVGP-PEEVAK-FLVGFGDPCEIV 213
Query: 965 IGSVGKPTERIEQIV---YILSEQDKRKKLMEVLNRGVK--KPVIIFVNQKKGADVLAKG 1019
S K E I+ I ++ +++ L E + VK + +IF N + GA+ LA
Sbjct: 214 DVSAAKKLE-IKVISPVEDLIYDEELWAALYERIAELVKKHRTTLIFTNTRSGAERLAFR 272
Query: 1020 LEKLGYNACTLHGGK-GQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMA 1078
L+KLG + +H G +E R LK G +VAT GIDI D+ +VI
Sbjct: 273 LKKLGPDIIEVHHGSLSRELRLEVEERLKEGELKAVVATSSLELGIDIGDIDLVIQLGSP 332
Query: 1079 KSIEDYTHRIGRTGR 1093
KS+ + RIGR G
Sbjct: 333 KSVNRFLQRIGRAGH 347
Score = 85.8 bits (213), Expect = 1e-16
Identities = 101/375 (26%), Positives = 160/375 (42%), Gaps = 49/375 (13%)
Query: 1409 SLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSL 1468
L + E K + TP QR AIP +++ +A TGSGKT A LP++ L
Sbjct: 7 ILDPRVREWF-KRKFTSLTPPQRYAIPEIHSGENVLIIAPTGSGKTEAAFLPVI---NEL 62
Query: 1469 PKIARMEDADQGPYAIIMAPTRELAQQIE----EETNKFGTPLGIRTVLVVGGLSREEQG 1524
+ + + D G YA+ ++P + L I E + G + +R G + E+
Sbjct: 63 LSLGKGKLED-GIYALYISPLKALNNDIRRRLEEPLRELGIEVAVRH----GDTPQSEKQ 117
Query: 1525 FRLRLGCEIVIATPGRLIDVLENRYL--VLNQCTYIVLDEADRMIDMGFEPDVQKILEYM 1582
L+ I+I TP L +L + +L Y+++DE + + + VQ L
Sbjct: 118 KMLKNPPHILITTPESLAILLNSPKFRELLRDVRYVIVDEIHALAES--KRGVQLALSLE 175
Query: 1583 PVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYL---RRPATVY 1639
+ L D Q + +AT+ P E +A+ +L P +
Sbjct: 176 RLRELAGDF--------------------QRIGLSATVGP-PEEVAK-FLVGFGDPCEIV 213
Query: 1640 IGSVGKPTERIEQIV---YILSEQDKRKKLMEVLNRGVK--KPVIIFVNQKKGADVLAKG 1694
S K E I+ I ++ +++ L E + VK + +IF N + GA+ LA
Sbjct: 214 DVSAAKKLE-IKVISPVEDLIYDEELWAALYERIAELVKKHRTTLIFTNTRSGAERLAFR 272
Query: 1695 LEKLGYNACTLHGGK-GQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMA 1753
L+KLG + +H G +E R LK G +VAT GIDI D+ +VI
Sbjct: 273 LKKLGPDIIEVHHGSLSRELRLEVEERLKEGELKAVVATSSLELGIDIGDIDLVIQLGSP 332
Query: 1754 KSIEDYTHRIGRTGR 1768
KS+ + RIGR G
Sbjct: 333 KSVNRFLQRIGRAGH 347
Score = 54.2 bits (131), Expect = 6e-07
Identities = 42/155 (27%), Positives = 71/155 (45%), Gaps = 14/155 (9%)
Query: 4 KIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQ 63
K + TP QR AIP +++ +A TGSGKT A LP++ L + + + D
Sbjct: 17 KRKFTSLTPPQRYAIPEIHSGENVLIIAPTGSGKTEAAFLPVI---NELLSLGKGKLED- 72
Query: 64 GPYAIIMAPTRELAQQIE----EETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVI 119
G YA+ ++P + L I E + G + +R G + E+ L+ I+I
Sbjct: 73 GIYALYISPLKALNNDIRRRLEEPLRELGIEVAVRH----GDTPQSEKQKMLKNPPHILI 128
Query: 120 ATPGRLIDVLENRYL--VLNQCTYIVLDEADRMID 152
TP L +L + +L Y+++DE + +
Sbjct: 129 TTPESLAILLNSPKFRELLRDVRYVIVDEIHALAE 163
Score = 31.8 bits (73), Expect = 4.1
Identities = 23/71 (32%), Positives = 34/71 (47%), Gaps = 3/71 (4%)
Query: 295 KKLMEVLNRGVK--KPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGK-GQEQRELALNS 351
L E + VK + +IF N + GA+ LA L+KLG + +H G +E R
Sbjct: 240 AALYERIAELVKKHRTTLIFTNTRSGAERLAFRLKKLGPDIIEVHHGSLSRELRLEVEER 299
Query: 352 LKGGSKDILMA 362
LK G ++A
Sbjct: 300 LKEGELKAVVA 310
>gnl|CDD|129701 TIGR00614, recQ_fam, ATP-dependent DNA helicase, RecQ family. All
proteins in this family for which functions are known are
3'-5' DNA-DNA helicases. These proteins are used for
recombination, recombinational repair, and possibly
maintenance of chromosome stability. This family is based
on the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 470
Score = 82.9 bits (205), Expect = 4e-16
Identities = 98/408 (24%), Positives = 158/408 (38%), Gaps = 88/408 (21%)
Query: 744 EKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDAD 803
G + P+Q + I L RD V TG GK+L + LP L I
Sbjct: 5 TVFGLSSFRPVQLEVINAVLLGRDCFVVMPTGGGKSLCYQLPALC----SDGIT------ 54
Query: 804 QGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGF---RLRLG-CEIV 859
++++P L +E++ + GI + S+E+Q L+ G +++
Sbjct: 55 -----LVISPLISL---MEDQVLQL-KASGIPATFLNSSQSKEQQKNVLTDLKDGKIKLL 105
Query: 860 IATP------GRLIDVLENRYLVLNQCTYIVLDEADRMIDMG--FEPDVQKILEYMPVTN 911
TP RL+ LE R + T I +DEA + G F PD Y + +
Sbjct: 106 YVTPEKCSASNRLLQTLEERKGI----TLIAVDEAHCISQWGHDFRPD------YKALGS 155
Query: 912 LKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYL--RRPATVYIGSVG 969
LK + + TAT P+V L + P ++ S
Sbjct: 156 LKQKFPNVP-----------------IMALTATASPSVREDILRQLNLKNP-QIFCTSFD 197
Query: 970 KPTERIEQIVYILSEQDKRKKLMEVLNRGVKK-----PVIIFVNQKKGADVLAKGLEKLG 1024
+P + Y + K K++E L R ++K II+ +K ++ + L+ LG
Sbjct: 198 RP-----NLYYEVRR--KTPKILEDLLRFIRKEFKGKSGIIYCPSRKKSEQVTASLQNLG 250
Query: 1025 YNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDY 1084
A H G R+ + + ++VAT G GI+ DV VI+Y + KS+E Y
Sbjct: 251 IAAGAYHAGLEISARDDVHHKFQRDEIQVVVATVAFGMGINKPDVRFVIHYSLPKSMESY 310
Query: 1085 THRIGRTGRAGKEGLAVSFCTKDDSHLFYD------LKQMMISSPVTG 1126
GR GR G + HLFY L+++++ P
Sbjct: 311 YQESGRAGRDG---------LPSECHLFYAPADINRLRRLLMEEPDGQ 349
Score = 81.7 bits (202), Expect = 1e-15
Identities = 98/406 (24%), Positives = 158/406 (38%), Gaps = 88/406 (21%)
Query: 1419 EKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDAD 1478
G + P+Q + I L RD V TG GK+L + LP L I
Sbjct: 5 TVFGLSSFRPVQLEVINAVLLGRDCFVVMPTGGGKSLCYQLPALC----SDGIT------ 54
Query: 1479 QGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGF---RLRLG-CEIV 1534
++++P L +E++ + GI + S+E+Q L+ G +++
Sbjct: 55 -----LVISPLISL---MEDQVLQL-KASGIPATFLNSSQSKEQQKNVLTDLKDGKIKLL 105
Query: 1535 IATP------GRLIDVLENRYLVLNQCTYIVLDEADRMIDMG--FEPDVQKILEYMPVTN 1586
TP RL+ LE R + T I +DEA + G F PD Y + +
Sbjct: 106 YVTPEKCSASNRLLQTLEERKGI----TLIAVDEAHCISQWGHDFRPD------YKALGS 155
Query: 1587 LKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYL--RRPATVYIGSVG 1644
LK + + TAT P+V L + P ++ S
Sbjct: 156 LKQKFPNVP-----------------IMALTATASPSVREDILRQLNLKNP-QIFCTSFD 197
Query: 1645 KPTERIEQIVYILSEQDKRKKLMEVLNRGVKK-----PVIIFVNQKKGADVLAKGLEKLG 1699
+P + Y + K K++E L R ++K II+ +K ++ + L+ LG
Sbjct: 198 RP-----NLYYEVRR--KTPKILEDLLRFIRKEFKGKSGIIYCPSRKKSEQVTASLQNLG 250
Query: 1700 YNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDY 1759
A H G R+ + + ++VAT G GI+ DV VI+Y + KS+E Y
Sbjct: 251 IAAGAYHAGLEISARDDVHHKFQRDEIQVVVATVAFGMGINKPDVRFVIHYSLPKSMESY 310
Query: 1760 THRIGRTGRAGKEGLAVSFCTKDDSHLFYD------LKQMMISSPV 1799
GR GR G + HLFY L+++++ P
Sbjct: 311 YQESGRAGRDG---------LPSECHLFYAPADINRLRRLLMEEPD 347
Score = 35.9 bits (83), Expect = 0.19
Identities = 39/155 (25%), Positives = 62/155 (40%), Gaps = 33/155 (21%)
Query: 3 EKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDAD 62
G + P+Q + I L RD V TG GK+L + LP L I
Sbjct: 5 TVFGLSSFRPVQLEVINAVLLGRDCFVVMPTGGGKSLCYQLPALC----SDGI------- 53
Query: 63 QGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGF---RLRLG-CEIV 118
++++P L +E++ + GI + S+E+Q L+ G +++
Sbjct: 54 ----TLVISPLISL---MEDQVLQL-KASGIPATFLNSSQSKEQQKNVLTDLKDGKIKLL 105
Query: 119 IATP------GRLIDVLENRYLVLNQCTYIVLDEA 147
TP RL+ LE R + T I +DEA
Sbjct: 106 YVTPEKCSASNRLLQTLEERKGI----TLIAVDEA 136
>gnl|CDD|130456 TIGR01389, recQ, ATP-dependent DNA helicase RecQ. The ATP-dependent
DNA helicase RecQ of E. coli is about 600 residues long.
This model represents bacterial proteins with a high
degree of similarity in domain architecture and in
primary sequence to E. coli RecQ. The model excludes
eukaryotic and archaeal proteins with RecQ-like regions,
as well as more distantly related bacterial helicases
related to RecQ [DNA metabolism, DNA replication,
recombination, and repair].
Length = 591
Score = 80.1 bits (198), Expect = 5e-15
Identities = 88/387 (22%), Positives = 148/387 (38%), Gaps = 74/387 (19%)
Query: 747 GYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLL----VWIQSLPKIARMEDA 802
GY + P Q + I L RD++ V TG GK+L + +P L + + P I+ M
Sbjct: 10 GYDDFRPGQEEIISHVLDGRDVLVVMPTGGGKSLCYQVPALLLKGLTVVISPLISLM--K 67
Query: 803 DQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLG----CEI 858
DQ Q+ G+ + LS +EQ + ++
Sbjct: 68 DQ-------------VDQLRA--------AGVAAAYLNSTLSAKEQQDIEKALVNGELKL 106
Query: 859 VIATPGRLIDVLENRYLVLNQCTYIVL---DEADRMIDMG--FEPDVQKILEYMPVTNLK 913
+ P RL ++ +L + Q I L DEA + G F P+ Q++ +
Sbjct: 107 LYVAPERL---EQDYFLNMLQRIPIALVAVDEAHCVSQWGHDFRPEYQRL---GSLAERF 160
Query: 914 PDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTE 973
P + TAT + R LR + +
Sbjct: 161 PQ--------------------VPRIALTATADAETRQDIRELLRLADANEFIT---SFD 197
Query: 974 RIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGG 1033
R ++ + +K+K L++ L + + II+ + +K + LA+ LE G +A H G
Sbjct: 198 RPNLRFSVVKKNNKQKFLLDYLKKHRGQSGIIYASSRKKVEELAERLESQGISALAYHAG 257
Query: 1034 KGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGR 1093
+ R ++VAT+ G GID +V VI+YDM ++E Y GR
Sbjct: 258 LSNKVRAENQEDFLYDDVKVMVATNAFGMGIDKPNVRFVIHYDMPGNLESYYQ---EAGR 314
Query: 1094 AGKEGLAVSFCTKDDSHLFYDLKQMMI 1120
AG++GL ++ L Y + +
Sbjct: 315 AGRDGL------PAEAILLYSPADIAL 335
Score = 80.1 bits (198), Expect = 5e-15
Identities = 88/387 (22%), Positives = 148/387 (38%), Gaps = 74/387 (19%)
Query: 1422 GYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLL----VWIQSLPKIARMEDA 1477
GY + P Q + I L RD++ V TG GK+L + +P L + + P I+ M
Sbjct: 10 GYDDFRPGQEEIISHVLDGRDVLVVMPTGGGKSLCYQVPALLLKGLTVVISPLISLM--K 67
Query: 1478 DQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLG----CEI 1533
DQ Q+ G+ + LS +EQ + ++
Sbjct: 68 DQ-------------VDQLRA--------AGVAAAYLNSTLSAKEQQDIEKALVNGELKL 106
Query: 1534 VIATPGRLIDVLENRYLVLNQCTYIVL---DEADRMIDMG--FEPDVQKILEYMPVTNLK 1588
+ P RL ++ +L + Q I L DEA + G F P+ Q++ +
Sbjct: 107 LYVAPERL---EQDYFLNMLQRIPIALVAVDEAHCVSQWGHDFRPEYQRL---GSLAERF 160
Query: 1589 PDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTE 1648
P + TAT + R LR + +
Sbjct: 161 PQ--------------------VPRIALTATADAETRQDIRELLRLADANEFIT---SFD 197
Query: 1649 RIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGG 1708
R ++ + +K+K L++ L + + II+ + +K + LA+ LE G +A H G
Sbjct: 198 RPNLRFSVVKKNNKQKFLLDYLKKHRGQSGIIYASSRKKVEELAERLESQGISALAYHAG 257
Query: 1709 KGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGR 1768
+ R ++VAT+ G GID +V VI+YDM ++E Y GR
Sbjct: 258 LSNKVRAENQEDFLYDDVKVMVATNAFGMGIDKPNVRFVIHYDMPGNLESYYQ---EAGR 314
Query: 1769 AGKEGLAVSFCTKDDSHLFYDLKQMMI 1795
AG++GL ++ L Y + +
Sbjct: 315 AGRDGL------PAEAILLYSPADIAL 335
Score = 39.7 bits (93), Expect = 0.016
Identities = 23/104 (22%), Positives = 42/104 (40%), Gaps = 3/104 (2%)
Query: 247 FTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNRGVK 306
TAT + R LR + +R ++ + +K+K L++ L +
Sbjct: 168 LTATADAETRQDIRELLRLADANEFIT---SFDRPNLRFSVVKKNNKQKFLLDYLKKHRG 224
Query: 307 KPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALN 350
+ II+ + +K + LA+ LE G +A H G + R
Sbjct: 225 QSGIIYASSRKKVEELAERLESQGISALAYHAGLSNKVRAENQE 268
>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777). This is
a family of eukaryotic proteins of unknown function.
Some of the proteins in this family are putative nucleic
acid binding proteins.
Length = 158
Score = 67.6 bits (165), Expect = 8e-13
Identities = 45/96 (46%), Positives = 54/96 (56%), Gaps = 7/96 (7%)
Query: 366 RSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHR 425
RSRSRSP R RR R R R DR RER +RSR R+RDR R R RS R R R R
Sbjct: 3 RSRSRSPRRSRR-RGRSRSRDR-RERRRERSRS---RERDRRRRSRSRSPHRSRRSRSPR 57
Query: 426 RDRSKERDGKDRREGYRRERREEEASGSKHKSRDKE 461
R RS+ R RR+ R+ R+++A K + R K
Sbjct: 58 RHRSRSRSPSRRRD--RKRERDKDAREPKKRERQKL 91
Score = 55.7 bits (134), Expect = 1e-08
Identities = 35/98 (35%), Positives = 42/98 (42%), Gaps = 6/98 (6%)
Query: 365 RRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHH 424
RR RSRS R+ R R R R R+ DR R R R R R R R R R +
Sbjct: 14 RRGRSRSRDRRERRRERSRS------RERDRRRRSRSRSPHRSRRSRSPRRHRSRSRSPS 67
Query: 425 RRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEG 462
RR K KD RE +RER++ D+E
Sbjct: 68 RRRDRKRERDKDAREPKKRERQKLIKEEDLEGKSDEEV 105
Score = 42.6 bits (100), Expect = 3e-04
Identities = 29/58 (50%), Positives = 35/58 (60%), Gaps = 6/58 (10%)
Query: 364 DRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREK 421
RRSRSRSP R RRSRS R R R R R RD R R+RD+ + ++RER+K
Sbjct: 39 RRRSRSRSPHRSRRSRSPRR--HRSRSRSPSRRRD---RKRERDK-DAREPKKRERQK 90
Score = 41.8 bits (98), Expect = 8e-04
Identities = 34/109 (31%), Positives = 49/109 (44%), Gaps = 5/109 (4%)
Query: 401 GRDRDRDR---LERDRSREREREKRHHRRDRSKERDGKDRREGYRRERRE-EEASGSKHK 456
GR R R R RSR R+R +R R RS+ERD + RR R R S +H+
Sbjct: 2 GRSRSRSPRRSRRRGRSRSRDRRERRRERSRSRERD-RRRRSRSRSPHRSRRSRSPRRHR 60
Query: 457 SRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSK 505
SR + ++ D++ + +++ L EE L K EE K
Sbjct: 61 SRSRSPSRRRDRKRERDKDAREPKKRERQKLIKEEDLEGKSDEEVEMMK 109
Score = 37.5 bits (87), Expect = 0.018
Identities = 25/81 (30%), Positives = 40/81 (49%), Gaps = 5/81 (6%)
Query: 1184 RRSRSRSPARKRRSRSRDRDYDRRFKRKKSPQCEAQSSRFSACSLPRSHKSS--SLLSRY 1241
RR RSRS R+ R R R R +R +R+ + +S R + PR H+S S R
Sbjct: 14 RRGRSRSRDRRERRRERSRSRERDRRRRSRSRSPHRSRRSRS---PRRHRSRSRSPSRRR 70
Query: 1242 SEQDPEEKELNKDKEREGEAI 1262
+ +K+ + K+RE + +
Sbjct: 71 DRKRERDKDAREPKKRERQKL 91
Score = 33.3 bits (76), Expect = 0.49
Identities = 21/46 (45%), Positives = 25/46 (54%), Gaps = 6/46 (13%)
Query: 1183 DRRSRSRSPARKRRS------RSRDRDYDRRFKRKKSPQCEAQSSR 1222
RRSRSRSP R RRS RSR R RR RK+ +A+ +
Sbjct: 39 RRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRDRKRERDKDAREPK 84
Score = 33.3 bits (76), Expect = 0.52
Identities = 27/94 (28%), Positives = 41/94 (43%), Gaps = 12/94 (12%)
Query: 1182 GDRRSRSRSPARKRRSRSRDRDYDR--------RFKRKKSPQCEAQSSRFSACSLPR--- 1230
G RSR R R+ RSRSR+RD R R +R +SP+ SR + R
Sbjct: 16 GRSRSRDRRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHRSRSRSPSRRRDRKRE 75
Query: 1231 -SHKSSSLLSRYSEQDPEEKELNKDKEREGEAIK 1263
+ R ++ +E++L + E E +K
Sbjct: 76 RDKDAREPKKRERQKLIKEEDLEGKSDEEVEMMK 109
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 68.4 bits (167), Expect = 2e-11
Identities = 31/104 (29%), Positives = 38/104 (36%), Gaps = 9/104 (8%)
Query: 366 RSRSRSPPRKR-RSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSRERER----- 419
R R R R R RS ER R R+R R R R R+R R R R R
Sbjct: 5 PDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPR 64
Query: 420 --EKRHHRRDRSK-ERDGKDRREGYRRERREEEASGSKHKSRDK 460
RR R + R + R + RR + S S +D
Sbjct: 65 SLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDD 108
Score = 56.8 bits (137), Expect = 7e-08
Identities = 38/103 (36%), Positives = 42/103 (40%), Gaps = 9/103 (8%)
Query: 365 RRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHH 424
RRSR RS R R RSRER Y R R DR R D R RSR+R R
Sbjct: 26 RRSRDRSRFRDRHRRSRERSY-REDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPR----- 79
Query: 425 RRDRSKERDGKDRREGYRRERREEEASGSKHKS---RDKEGYE 464
RR RS + RR R + K +S GYE
Sbjct: 80 RRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSLWDIKPPGYE 122
Score = 54.9 bits (132), Expect = 3e-07
Identities = 34/139 (24%), Positives = 51/139 (36%), Gaps = 9/139 (6%)
Query: 381 RERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREG 440
R+ + DR RE+ R RD R +R R R R R R R++ R+RS D + R
Sbjct: 1 RDEEPDREREKSRGRDRD-RSSERPRRR---SRDRSRFRDRHRRSRERSYREDSRPRDRR 56
Query: 441 YRRERREEEASGSKH-KSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAE 499
R S +SRD+ + E+ + + P S + KK
Sbjct: 57 RYDSRSPRSLRYSSVRRSRDR---PRRRSRSVRSIEQHRRRLRDRSP-SNQWRKDDKKRS 112
Query: 500 EEARSKPKFLTKEERAAEA 518
P + A+A
Sbjct: 113 LWDIKPPGYELVTADQAKA 131
Score = 34.5 bits (79), Expect = 0.66
Identities = 28/105 (26%), Positives = 38/105 (36%), Gaps = 20/105 (19%)
Query: 1184 RRSRSRSPARKRRSRSR-----DRDYDRRFKRKKSPQCEAQSSRFSACSLPRSHKSSSLL 1238
R RSR R RRSR R R DRR +SP+ SS + PR
Sbjct: 28 SRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRR------ 81
Query: 1239 SRYSEQDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRK 1283
SR + + +D+ + +K + R L D K
Sbjct: 82 SRSVRSIEQHRRRLRDRSPSNQW---------RKDDKKRSLWDIK 117
Score = 33.7 bits (77), Expect = 0.94
Identities = 24/101 (23%), Positives = 37/101 (36%), Gaps = 6/101 (5%)
Query: 1183 DRRSRSRSPARKRRSRSRDRDYDRRFKRKKSPQCEAQSSRFSACSLPRSHKSSSLLSRYS 1242
DR S P R+ R RSR RD RR + + + R S RS +S S
Sbjct: 16 DRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDS--RSPRSLRYSSVRR 73
Query: 1243 EQDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRK 1283
+D + ++ + R + R +D+K
Sbjct: 74 SRDRPRRR-SRSVRSI---EQHRRRLRDRSPSNQWRKDDKK 110
>gnl|CDD|223989 COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription
/ DNA replication, recombination, and repair].
Length = 442
Score = 65.5 bits (160), Expect = 1e-10
Identities = 74/384 (19%), Positives = 127/384 (33%), Gaps = 66/384 (17%)
Query: 750 EPTPIQRQAIP----IGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQG 805
E P Q +A+ R + V TG+GKT+ +
Sbjct: 36 ELRPYQEEALDALVKNRRTERRGVIVLPTGAGKTVVAA-----------EAIAE----LK 80
Query: 806 PYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGR 865
+++ PT+EL Q E KF + GG +E + + + +AT
Sbjct: 81 RSTLVLVPTKELLDQWAEALKKFLLLNDEIGIY--GGGEKELEPAK------VTVAT--- 129
Query: 866 LIDVLENR----YLVLNQCTYIVLDEADRMIDMGFE------------------P---DV 900
+ L R + N+ I+ DE + + P D
Sbjct: 130 -VQTLARRQLLDEFLGNEFGLIIFDEVHHLPAPSYRRILELLSAAYPRLGLTATPEREDG 188
Query: 901 QKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRP 960
+I + + + P + + + Y + KY + + T T E A+ R
Sbjct: 189 GRIGDLFDL--IGPIVYEVSLKELIDEGYLAPYKYVE-IKVTLTEDEEREY-AKESARFR 244
Query: 961 ATVYIGSVGKPTERIEQIVYIL-SEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKG 1019
+ + +I + + L+ RG +IF + + A +AK
Sbjct: 245 ELLRARGTLRAENEARRIAIASERKIAAVRGLLLKHARG--DKTLIFASDVEHAYEIAKL 302
Query: 1020 LEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAK 1079
G + G +E+RE L + G +LV V G+DI D ++I
Sbjct: 303 FLAPGI-VEAITGETPKEEREAILERFRTGGIKVLVTVKVLDEGVDIPDADVLIILRPTG 361
Query: 1080 SIEDYTHRIGRTGRA--GKEGLAV 1101
S + R+GR R GKE
Sbjct: 362 SRRLFIQRLGRGLRPAEGKEDTLA 385
Score = 65.5 bits (160), Expect = 1e-10
Identities = 74/384 (19%), Positives = 127/384 (33%), Gaps = 66/384 (17%)
Query: 1425 EPTPIQRQAIP----IGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQG 1480
E P Q +A+ R + V TG+GKT+ +
Sbjct: 36 ELRPYQEEALDALVKNRRTERRGVIVLPTGAGKTVVAA-----------EAIAE----LK 80
Query: 1481 PYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGR 1540
+++ PT+EL Q E KF + GG +E + + + +AT
Sbjct: 81 RSTLVLVPTKELLDQWAEALKKFLLLNDEIGIY--GGGEKELEPAK------VTVAT--- 129
Query: 1541 LIDVLENR----YLVLNQCTYIVLDEADRMIDMGFE------------------P---DV 1575
+ L R + N+ I+ DE + + P D
Sbjct: 130 -VQTLARRQLLDEFLGNEFGLIIFDEVHHLPAPSYRRILELLSAAYPRLGLTATPEREDG 188
Query: 1576 QKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRP 1635
+I + + + P + + + Y + KY + + T T E A+ R
Sbjct: 189 GRIGDLFDL--IGPIVYEVSLKELIDEGYLAPYKYVE-IKVTLTEDEEREY-AKESARFR 244
Query: 1636 ATVYIGSVGKPTERIEQIVYIL-SEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKG 1694
+ + +I + + L+ RG +IF + + A +AK
Sbjct: 245 ELLRARGTLRAENEARRIAIASERKIAAVRGLLLKHARG--DKTLIFASDVEHAYEIAKL 302
Query: 1695 LEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAK 1754
G + G +E+RE L + G +LV V G+DI D ++I
Sbjct: 303 FLAPGI-VEAITGETPKEEREAILERFRTGGIKVLVTVKVLDEGVDIPDADVLIILRPTG 361
Query: 1755 SIEDYTHRIGRTGRA--GKEGLAV 1776
S + R+GR R GKE
Sbjct: 362 SRRLFIQRLGRGLRPAEGKEDTLA 385
Score = 42.8 bits (101), Expect = 0.002
Identities = 65/368 (17%), Positives = 118/368 (32%), Gaps = 75/368 (20%)
Query: 9 EPTPIQRQAIP----IGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQG 64
E P Q +A+ R + V TG+GKT+ +
Sbjct: 36 ELRPYQEEALDALVKNRRTERRGVIVLPTGAGKTVVAA-----------EAIAE----LK 80
Query: 65 PYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGR 124
+++ PT+EL Q E KF + GG +E + + + +AT
Sbjct: 81 RSTLVLVPTKELLDQWAEALKKFLLLNDEIGIY--GGGEKELEPAK------VTVAT--- 129
Query: 125 LIDVLENR----YLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAED 180
+ L R + N+ I+ DE + + ++ILE ++ P
Sbjct: 130 -VQTLARRQLLDEFLGNEFGLIIFDEVHHLPAPSY----RRILEL--LSAAYP------- 175
Query: 181 ENKLLANYNSKKKYRQTVMFTATMP---PAKILEYMPVTNLKPDTEDAEDENKLLANYNS 237
L TAT +I + + + P + + + Y +
Sbjct: 176 ---RLG-------------LTATPEREDGGRIGDLFDL--IGPIVYEVSLKELIDEGYLA 217
Query: 238 KKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYIL-SEQDKRKK 296
KY + + T T E A+ R + + +I + +
Sbjct: 218 PYKYVE-IKVTLTEDEEREY-AKESARFRELLRARGTLRAENEARRIAIASERKIAAVRG 275
Query: 297 LMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGS 356
L+ RG +IF + + A +AK G + G +E+RE L + G
Sbjct: 276 LLLKHARG--DKTLIFASDVEHAYEIAKLFLAPGI-VEAITGETPKEEREAILERFRTGG 332
Query: 357 KDILMAGD 364
+L+
Sbjct: 333 IKVLVTVK 340
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 65.2 bits (158), Expect = 3e-10
Identities = 61/298 (20%), Positives = 129/298 (43%), Gaps = 24/298 (8%)
Query: 415 REREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDE 474
++ E K+ +++ D E + E + K + + E+ +DE
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADE 1395
Query: 475 EEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTK---------EERAAEALRKRQAE 525
+ K KK+ L++ A KK +EA+ K + K E + A+ +K+ E
Sbjct: 1396 AKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1455
Query: 526 ---VEEMRKKMEEERK-----------KRQEFTKEASFESKRENFDARLRRDREKKKEDP 571
EE +KK EE +K K+ + K+ + E+K++ +A+ + +KK ++
Sbjct: 1456 AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA 1515
Query: 572 EEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKERH 631
++ E K + +A + + KK ++ ++ K + +E+ + E
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575
Query: 632 QVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEK-EQEKVRLKKVKKREEKQK 688
+ + A +A+ + K Y E + + EA+K E+ K++ +++KK EE++K
Sbjct: 1576 KNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKK 1633
Score = 61.7 bits (149), Expect = 4e-09
Identities = 44/219 (20%), Positives = 94/219 (42%), Gaps = 12/219 (5%)
Query: 377 RSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKD 436
+ + D ++ + + D+ + ++ + + + E ++ ++ ++ +D
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575
Query: 437 RREGYRR----ERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEEL 492
+ R+ ++ EE K ++E E+ ++E + K KK +++
Sbjct: 1576 KNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKV 1635
Query: 493 LAKKKAEEEARSKPKFLTKEE-----RAAEALRKRQAE---VEEMRKKMEEERKKRQEFT 544
KK E E + K + L K E +AAE +K + + EE +K E+E+K +
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695
Query: 545 KEASFESKRENFDARLRRDREKKKEDPEEKELNKDKERE 583
KEA K E + +++K +E + +E NK K E
Sbjct: 1696 KEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734
Score = 61.7 bits (149), Expect = 4e-09
Identities = 62/322 (19%), Positives = 139/322 (43%), Gaps = 13/322 (4%)
Query: 364 DRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRH 423
D+ R +++ + + + + + + + +++ + ++ E EK+
Sbjct: 1575 DKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634
Query: 424 HRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAK 483
+ + KE + K + E ++ E + ++ + +E + E+ ++E+E K A A
Sbjct: 1635 VEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEAL 1694
Query: 484 K----EPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKK 539
K E EEL KKK EE + + EE + + E EE +KK EE +K
Sbjct: 1695 KKEAEEAKKAEEL--KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKD 1752
Query: 540 RQEFTKEASFESKRENFDARLRRDRE---KKKEDPEEKELNKDKEREGEAIKERYLGLVK 596
+E K A + + E +R+++E +++ D E+++ + +++ + I + + +++
Sbjct: 1753 EEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIE 1812
Query: 597 -KKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKERHQVQ--FFGRGNIAGIDIKAQKRDQ 653
K +ND K + D A ++ + N +E + F + N G D +
Sbjct: 1813 GGKEGNLVINDSKEMED-SAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFN 1871
Query: 654 SKFYGEMLEKRRTEAEKEQEKV 675
+ + ++ E E EK+
Sbjct: 1872 KEKDLKEDDEEEIEEADEIEKI 1893
Score = 61.7 bits (149), Expect = 4e-09
Identities = 73/392 (18%), Positives = 161/392 (41%), Gaps = 67/392 (17%)
Query: 375 KRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDG 434
K+++ ++ D ++ + + D + + + + + E +K + +++
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKA 1548
Query: 435 KDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKK--EPLSLEEL 492
+ ++ ++ EE K K+ + + E + M + EE K A + E + L E
Sbjct: 1549 DELKKAEELKKAEE-----KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEE 1603
Query: 493 LAKKKAE-----EEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEA 547
K KAE EEA+ K + L K E + + + + + E +KK EE +K +E +A
Sbjct: 1604 EKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKA 1663
Query: 548 SFESKRENFDAR----LRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRR 603
+ E+K+ D + ++ E +K+ E + ++ ++ E +K++ KK +++
Sbjct: 1664 AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK 1723
Query: 604 LNDRKFVFDWDASEDTSVDYNSIYKERHQVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEK 663
+ K +++K E +K
Sbjct: 1724 AEE---------------------------------------ENKIKAEEAKKEAEEDKK 1744
Query: 664 RRTEAEK-EQEKVRLKKVKKREEKQKWDDRHWTEKSLDE-----------MTERDWRIFR 711
+ EA+K E+EK ++ +KK EEK+ + R E ++E ++ +
Sbjct: 1745 KAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIF 1804
Query: 712 EDYSITIKGGKVPDPVRNWKEASLPTEILEII 743
++++ I+GGK + V N + + I E+
Sbjct: 1805 DNFANIIEGGKEGNLVINDSKEMEDSAIKEVA 1836
Score = 59.8 bits (144), Expect = 2e-08
Identities = 54/263 (20%), Positives = 117/263 (44%), Gaps = 27/263 (10%)
Query: 365 RRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSRERER----- 419
+++ + + ++ D +++ + + +++ + + + E D++ +
Sbjct: 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAK 1587
Query: 420 ---EKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEE 476
E R + E + K + E ++ EE ++ + +E + EQ+ + EE
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAKKA--EEAKIKAEELKKAEEEKKKVEQLKKKEAEE 1645
Query: 477 DKGAAAKK---EPLSLEELLAKKKAEEEARSKPKFLTKEE---RAAEALRKRQAE---VE 527
K A K E ++ KKAEE+ + + EE +AAEAL+K E E
Sbjct: 1646 KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705
Query: 528 EMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAI 587
E++KK EE+KK +E K E + +++ + KK+ + ++K+ + K+ E E
Sbjct: 1706 ELKKKEAEEKKKAEELKKA---EEEN-----KIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757
Query: 588 KERYLGLVKKKRRVRRLNDRKFV 610
K +L ++K+ +++ V
Sbjct: 1758 KIAHLKKEEEKKAEEIRKEKEAV 1780
Score = 58.2 bits (140), Expect = 4e-08
Identities = 73/359 (20%), Positives = 145/359 (40%), Gaps = 65/359 (18%)
Query: 365 RRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHH 424
+ +R +R++ + D + ++ + + + + E +R+ E R+
Sbjct: 1201 KAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEA 1260
Query: 425 RRDRSKERDGKDRREGYRR---------ERREEEASGSKHKSRDKEGYEPTEQMVISDEE 475
R R + E R+ +++ +EA ++ K + E + E+ +DE
Sbjct: 1261 RMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEA 1320
Query: 476 EDKGAAAKKEPLSL----EELLAK---KKAEEEARSK------------PKFLTKEERAA 516
+ K AKK+ + EE KAE EA + K + ++ A
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKA 1380
Query: 517 EALRKRQAEV---EEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEE 573
+A +K+ E +E +KK EE++KK E K A+ + K + ++ E+KK+ E
Sbjct: 1381 DAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD----EAKKKAEEKKKADEA 1436
Query: 574 KELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKERHQV 633
K+ ++ ++ EA K+ ++ ++ + A E D E
Sbjct: 1437 KKKAEEAKKADEAKKK-----AEEAKKAEEAKKK-------AEEAKKADEAKKKAE---- 1480
Query: 634 QFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEK-EQEKVRLKKVKKREEKQKWDD 691
+A+K D++K E +K+ EA+K + K + + KK EE +K D+
Sbjct: 1481 -------------EAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526
Score = 54.8 bits (131), Expect = 6e-07
Identities = 64/329 (19%), Positives = 141/329 (42%), Gaps = 20/329 (6%)
Query: 375 KRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDG 434
+++ ++ + ++ + + + + + D + + + +++ + K+ + K +
Sbjct: 1443 AKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEA 1502
Query: 435 KDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLA 494
K E ++ ++A +K K+ E + EE K A K+ L++
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE 1562
Query: 495 KKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRE 554
KKKAEE +++ +A EA + +A +EE+ K EEE+K + E K+A E +
Sbjct: 1563 KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKA--EEAKI 1620
Query: 555 NFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLND--RKFVFD 612
+ + + EKKK + +K+ ++K++ E K +K ++ + +K
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEA 1680
Query: 613 WDASEDTSVDYNSIYKERHQVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQ 672
A ED ++ KE + A+K ++ K EK++ E K+
Sbjct: 1681 KKAEEDEKKAAEALKKEAEE---------------AKKAEELKKKEAE-EKKKAEELKKA 1724
Query: 673 EKVRLKKVKKREEKQKWDDRHWTEKSLDE 701
E+ K ++ +++ + D + E DE
Sbjct: 1725 EEENKIKAEEAKKEAEEDKKKAEEAKKDE 1753
Score = 54.0 bits (129), Expect = 9e-07
Identities = 67/350 (19%), Positives = 140/350 (40%), Gaps = 45/350 (12%)
Query: 365 RRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHH 424
R++ + R + + R+ + R ++ + D + E R E R+
Sbjct: 1134 RKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEEL 1193
Query: 425 RRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKK 484
R+ + R+ ER+ EEA ++ + + + E D EE K A ++
Sbjct: 1194 RKAEDARKAEAARK--AEEERKAEEARKAEDAKKAEAVKKAEE--AKKDAEEAKKAEEER 1249
Query: 485 EPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRK----------RQAE----VEEMR 530
+ + + A AR + +E R A+ L+K ++AE +E +
Sbjct: 1250 NNEEIRKFEEARMAHF-ARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAK 1308
Query: 531 KKMEEER-----KKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKER--- 582
KK EE + KK+ E K+ + +K++ +A+ + K + + E +E+
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEA 1368
Query: 583 ---EGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKERHQVQFFGRG 639
+ E K++ KK ++ ++ K + D + + + K++
Sbjct: 1369 AEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD------- 1421
Query: 640 NIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQE-KVRLKKVKKREEKQK 688
+A+K+ + K + +K+ EA+K E K + ++ KK EE +K
Sbjct: 1422 -------EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKK 1464
Score = 52.8 bits (126), Expect = 2e-06
Identities = 60/330 (18%), Positives = 134/330 (40%), Gaps = 16/330 (4%)
Query: 374 RKRRSRSRERDYDRVRE-RDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKER 432
+ ++ D+D + R + + + G+ + + E ++ E + + ++ +
Sbjct: 1070 EGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARK 1129
Query: 433 DGKDRR-EGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEE 491
+ R+ E R+ +A +K ++ + + E+ K A A ++ + +
Sbjct: 1130 AEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRK 1189
Query: 492 LLAKKKAEEEARSKPKFLTKEERAAEALRKRQAE--------VEEMRKKMEEERKKRQEF 543
+KAE+ +++ +EER AE RK + EE +K EE +K +E
Sbjct: 1190 AEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEER 1249
Query: 544 TKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRR 603
E + + RR K E+ + + K E + +A + + KK ++
Sbjct: 1250 NNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKK 1309
Query: 604 LNDRKFVFDW--DASEDTSVDYNSIYKERHQVQFFGRGNIAGIDIKAQKRDQSKFYGEML 661
+ D +E+ ++ K+ + + A + A + + ++ E
Sbjct: 1310 KAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAA 1369
Query: 662 EKRRTEAEKEQEKVRLKKVKKREEKQKWDD 691
EK++ EA+K+ + + KK EEK+K D+
Sbjct: 1370 EKKKEEAKKKADAAK----KKAEEKKKADE 1395
Score = 49.4 bits (117), Expect = 2e-05
Identities = 64/331 (19%), Positives = 133/331 (40%), Gaps = 34/331 (10%)
Query: 370 RSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRS 429
+ + + ++ + + + ++ + + D + + E D+ + E +K + ++
Sbjct: 1361 AAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA 1420
Query: 430 KE--RDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKK--- 484
E + +++++ +++ EEA + + E + E+ EE K AKK
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAE 1480
Query: 485 EPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFT 544
E +E AKKKAEE + + E +A ++AE KK +E +K +
Sbjct: 1481 EAKKADE--AKKKAEEAKKKADEAKKAAEAKKKADEAKKAE---EAKKADEAKKAEEAKK 1535
Query: 545 KEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRL 604
+ + +++ + L++ E KK + ++K K E + + R KK R
Sbjct: 1536 ADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIE 1595
Query: 605 NDRKFVFDWDASEDTSVDYNSIYKERHQVQFFGRGNIAGIDIKAQ---KRDQSKFYGEML 661
K E+ + K IKA+ K ++ K E L
Sbjct: 1596 EVMKLY-----EEEKKMKAEEAKKAEEA------------KIKAEELKKAEEEKKKVEQL 1638
Query: 662 EKRRTEAEKEQEKVR----LKKVKKREEKQK 688
+K+ E +K+ E+++ K+K EE +K
Sbjct: 1639 KKKEAEEKKKAEELKKAEEENKIKAAEEAKK 1669
Score = 49.0 bits (116), Expect = 3e-05
Identities = 65/315 (20%), Positives = 121/315 (38%), Gaps = 28/315 (8%)
Query: 393 TDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRE--RREEEA 450
D +D D D E +R+ E E + K GK E ++ E+A
Sbjct: 1068 QDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDA 1127
Query: 451 SGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLT 510
++ + ++ + E D + + A ++ EE +KAE+ +++
Sbjct: 1128 RKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEE---ARKAEDAKKAEAARKA 1184
Query: 511 KEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKED 570
+E R AE LRK E+ RK E RK E +R+ +AR D +K +
Sbjct: 1185 EEVRKAEELRK----AEDARK-AEAARK----------AEEERKAEEARKAEDAKKAEAV 1229
Query: 571 PEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKER 630
+ +E KD E +A +ER ++K R + + A E D +E+
Sbjct: 1230 KKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEK 1289
Query: 631 HQVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQEKVRLK--------KVKK 682
+ + +A+K+ + + +K+ EA+K+ + + K + K
Sbjct: 1290 KKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349
Query: 683 REEKQKWDDRHWTEK 697
E + D+ E+
Sbjct: 1350 AEAEAAADEAEAAEE 1364
Score = 44.0 bits (103), Expect = 0.001
Identities = 47/242 (19%), Positives = 102/242 (42%), Gaps = 17/242 (7%)
Query: 353 KGGSKDILMAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERD 412
K ++++ A + + + +K ++ + + E + + + + D+ + +
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678
Query: 413 RSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVIS 472
+++ E +++ KE + + E +++ EE+ + K ++E E+
Sbjct: 1679 EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKE 1738
Query: 473 DEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEV--EEMR 530
EE+ K A EE KK EEE + +EE+ AE +RK + V EE+
Sbjct: 1739 AEEDKKKA---------EEA---KKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786
Query: 531 KKMEEERKKRQEFTKE--ASFESKRE-NFDARLRRDREKKKEDPEEKELNKDKEREGEAI 587
++ E+ R + + K+ +F + E + L + K+ ED KE+ K + E
Sbjct: 1787 EEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEA 1846
Query: 588 KE 589
Sbjct: 1847 DA 1848
Score = 42.8 bits (100), Expect = 0.002
Identities = 49/236 (20%), Positives = 96/236 (40%), Gaps = 20/236 (8%)
Query: 482 AKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQ 541
AK++ + E EEA+ +E R AE +K+ + + + + E ++
Sbjct: 1083 AKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKA 1142
Query: 542 EFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRV 601
E ++A E + AR + + +K E+ + E K E +A + R ++K
Sbjct: 1143 EEARKA--EDAKRVEIAR-KAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDA 1199
Query: 602 RRLNDRKFVFDWDASEDTSVDYNSIYKERHQVQFFGRGNIAGIDIKAQKRDQSKFYGEML 661
R+ + A E+ + ++ + + + A D + K+ + + E +
Sbjct: 1200 RKAEAAR-----KAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEI 1254
Query: 662 EK----------RRTEAEKEQEKVRLKKVKKREEKQKWDDRHWTE--KSLDEMTER 705
K RR A K +E + ++KK EEK+K D+ E K DE ++
Sbjct: 1255 RKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKK 1310
>gnl|CDD|182933 PRK11057, PRK11057, ATP-dependent DNA helicase RecQ; Provisional.
Length = 607
Score = 62.8 bits (153), Expect = 1e-09
Identities = 104/408 (25%), Positives = 163/408 (39%), Gaps = 94/408 (23%)
Query: 733 ASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQS 792
SL ++L+ E GY + P Q++ I L RD + V TG GK+L + +P LV +
Sbjct: 10 ESLAKQVLQ--ETFGYQQFRPGQQEIIDAVLSGRDCLVVMPTGGGKSLCYQIPALV-LDG 66
Query: 793 L-----PKIARMED-ADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSRE 846
L P I+ M+D DQ LA G+ + +RE
Sbjct: 67 LTLVVSPLISLMKDQVDQ-----------LLAN-------------GVAAACLNSTQTRE 102
Query: 847 EQGFRLRLGC-----EIVIATPGRLI--DVLENRYLVLNQCTYIVLDEADRMIDMG--FE 897
+Q + GC +++ P RL+ + LE+ L + +DEA + G F
Sbjct: 103 QQ-LEVMAGCRTGQIKLLYIAPERLMMDNFLEH--LAHWNPALLAVDEAHCISQWGHDFR 159
Query: 898 PD------VQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVER 951
P+ +++ +P L T A+D T V
Sbjct: 160 PEYAALGQLRQRFPTLPFMAL---TATADD---------------------TTRQDIVRL 195
Query: 952 LARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDK-RKKLMEVLNRGVKKPVIIFVNQK 1010
L L P + I S +P I Y L E+ K +LM + K II+ N +
Sbjct: 196 LG---LNDP-LIQISSFDRP-----NIRYTLVEKFKPLDQLMRYVQEQRGKSGIIYCNSR 246
Query: 1011 KGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVS 1070
+ A L+ G +A H G + R + + I+VAT G GI+ +V
Sbjct: 247 AKVEDTAARLQSRGISAAAYHAGLDNDVRADVQEAFQRDDLQIVVATVAFGMGINKPNVR 306
Query: 1071 MVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKDDSHLFYDLKQM 1118
V+++D+ ++IE Y TGRAG++GL ++ LFYD M
Sbjct: 307 FVVHFDIPRNIESYYQ---ETGRAGRDGLPA------EAMLFYDPADM 345
Score = 62.8 bits (153), Expect = 1e-09
Identities = 104/408 (25%), Positives = 163/408 (39%), Gaps = 94/408 (23%)
Query: 1408 ASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQS 1467
SL ++L+ E GY + P Q++ I L RD + V TG GK+L + +P LV +
Sbjct: 10 ESLAKQVLQ--ETFGYQQFRPGQQEIIDAVLSGRDCLVVMPTGGGKSLCYQIPALV-LDG 66
Query: 1468 L-----PKIARMED-ADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSRE 1521
L P I+ M+D DQ LA G+ + +RE
Sbjct: 67 LTLVVSPLISLMKDQVDQ-----------LLAN-------------GVAAACLNSTQTRE 102
Query: 1522 EQGFRLRLGC-----EIVIATPGRLI--DVLENRYLVLNQCTYIVLDEADRMIDMG--FE 1572
+Q + GC +++ P RL+ + LE+ L + +DEA + G F
Sbjct: 103 QQ-LEVMAGCRTGQIKLLYIAPERLMMDNFLEH--LAHWNPALLAVDEAHCISQWGHDFR 159
Query: 1573 PD------VQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVER 1626
P+ +++ +P L T A+D T V
Sbjct: 160 PEYAALGQLRQRFPTLPFMAL---TATADD---------------------TTRQDIVRL 195
Query: 1627 LARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDK-RKKLMEVLNRGVKKPVIIFVNQK 1685
L L P + I S +P I Y L E+ K +LM + K II+ N +
Sbjct: 196 LG---LNDP-LIQISSFDRP-----NIRYTLVEKFKPLDQLMRYVQEQRGKSGIIYCNSR 246
Query: 1686 KGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVS 1745
+ A L+ G +A H G + R + + I+VAT G GI+ +V
Sbjct: 247 AKVEDTAARLQSRGISAAAYHAGLDNDVRADVQEAFQRDDLQIVVATVAFGMGINKPNVR 306
Query: 1746 MVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKDDSHLFYDLKQM 1793
V+++D+ ++IE Y TGRAG++GL ++ LFYD M
Sbjct: 307 FVVHFDIPRNIESYYQ---ETGRAGRDGLPA------EAMLFYDPADM 345
Score = 32.0 bits (73), Expect = 3.4
Identities = 18/45 (40%), Positives = 25/45 (55%)
Query: 3 EKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLV 47
E GY + P Q++ I L RD + V TG GK+L + +P LV
Sbjct: 19 ETFGYQQFRPGQQEIIDAVLSGRDCLVVMPTGGGKSLCYQIPALV 63
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family. This model
represents a subfamily of RNA splicing factors including
the Pad-1 protein (N. crassa), CAPER (M. musculus) and
CC1.3 (H.sapiens). These proteins are characterized by
an N-terminal arginine-rich, low complexity domain
followed by three (or in the case of 4 H. sapiens
paralogs, two) RNA recognition domains (rrm: pfam00706).
These splicing factors are closely related to the U2AF
splicing factor family (TIGR01642). A homologous gene
from Plasmodium falciparum was identified in the course
of the analysis of that genome at TIGR and was included
in the seed.
Length = 457
Score = 59.9 bits (145), Expect = 8e-09
Identities = 32/89 (35%), Positives = 39/89 (43%), Gaps = 12/89 (13%)
Query: 366 RSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHR 425
R R R + R R+ R D+ RER RS R RDR R RDR R R
Sbjct: 1 RYRDRE---RGRLRNDTRRSDKGRERSRRRS-----RSRDRSRRRRDRDYYRGRR----G 48
Query: 426 RDRSKERDGKDRREGYRRERREEEASGSK 454
R RS+ + R G R RR++ SG
Sbjct: 49 RSRSRSPNRYYRPRGDRSYRRDDRRSGRN 77
Score = 52.2 bits (125), Expect = 2e-06
Identities = 34/103 (33%), Positives = 45/103 (43%), Gaps = 7/103 (6%)
Query: 366 RSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRS---REREREKR 422
R R+ + R + RER R R RD R R DR R R R RS R R R
Sbjct: 9 RLRNDTR---RSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYYRPRGDR 65
Query: 423 HHRR-DRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYE 464
+RR DR R+ K+ R+ R K+R+++ YE
Sbjct: 66 SYRRDDRRSGRNTKEPLTEAERDDRTVFVLQLALKARERDLYE 108
Score = 50.7 bits (121), Expect = 5e-06
Identities = 28/84 (33%), Positives = 32/84 (38%), Gaps = 11/84 (13%)
Query: 366 RSRSRSPPRKR-----RSRSRERDYDRV-RERDTDRSRDDRGRDRDRDRLERDRSRERER 419
R R R+ R+ RSR R R DR R RD D R RGR R R R R
Sbjct: 7 RGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYYRPRGDRS 66
Query: 420 EKRHHRRDRSK-----ERDGKDRR 438
+R RR +D R
Sbjct: 67 YRRDDRRSGRNTKEPLTEAERDDR 90
Score = 34.9 bits (80), Expect = 0.45
Identities = 20/70 (28%), Positives = 31/70 (44%), Gaps = 3/70 (4%)
Query: 1181 AGDRRSRSRSPARKRRSRSRDRDYDRRFKRKKSPQCEAQSSRFSACSLPRSHKSSSLLSR 1240
G RSR RS +R R R RDRDY R + + +R+ RS++ S
Sbjct: 19 KGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSR---SRSPNRYYRPRGDRSYRRDDRRSG 75
Query: 1241 YSEQDPEEKE 1250
+ ++P +
Sbjct: 76 RNTKEPLTEA 85
Score = 30.6 bits (69), Expect = 10.0
Identities = 26/94 (27%), Positives = 32/94 (34%), Gaps = 18/94 (19%)
Query: 365 RRSRSRSPPRKRRSRSRE---------------RDYDRVRERDTDRSRDDRGRDRDRDRL 409
RR R R R RR RSR R DR R+T + RD +
Sbjct: 35 RRRRDRDYYRGRRGRSRSRSPNRYYRPRGDRSYRRDDRRSGRNTKEPLTEAERDDRTVFV 94
Query: 410 ERDRSREREREKRHHRRDRSKERDG---KDRREG 440
+ + RER+ K RD KDR
Sbjct: 95 LQLALKARERDLYEFFSKVGKVRDVQCIKDRNSR 128
>gnl|CDD|224036 COG1111, MPH1, ERCC4-like helicases [DNA replication, recombination,
and repair].
Length = 542
Score = 59.3 bits (144), Expect = 1e-08
Identities = 37/131 (28%), Positives = 62/131 (47%), Gaps = 10/131 (7%)
Query: 988 RKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKG---------QEQ 1038
R+ L E L + VI+F + A+ + L+K+G A G+ Q++
Sbjct: 354 REILKEQLEKNGDSRVIVFTEYRDTAEEIVNFLKKIGIKARVRFIGQASREGDKGMSQKE 413
Query: 1039 RELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEG 1098
++ ++ + G ++LVAT V G+DI +V +VI Y+ S R GRTGR ++G
Sbjct: 414 QKEIIDQFRKGEYNVLVATSVGEEGLDIPEVDLVIFYEPVPSEIRSIQRKGRTGR-KRKG 472
Query: 1099 LAVSFCTKDDS 1109
V T+
Sbjct: 473 RVVVLVTEGTR 483
Score = 59.3 bits (144), Expect = 1e-08
Identities = 37/131 (28%), Positives = 62/131 (47%), Gaps = 10/131 (7%)
Query: 1663 RKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKG---------QEQ 1713
R+ L E L + VI+F + A+ + L+K+G A G+ Q++
Sbjct: 354 REILKEQLEKNGDSRVIVFTEYRDTAEEIVNFLKKIGIKARVRFIGQASREGDKGMSQKE 413
Query: 1714 RELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEG 1773
++ ++ + G ++LVAT V G+DI +V +VI Y+ S R GRTGR ++G
Sbjct: 414 QKEIIDQFRKGEYNVLVATSVGEEGLDIPEVDLVIFYEPVPSEIRSIQRKGRTGR-KRKG 472
Query: 1774 LAVSFCTKDDS 1784
V T+
Sbjct: 473 RVVVLVTEGTR 483
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 58.0 bits (141), Expect = 4e-08
Identities = 24/101 (23%), Positives = 35/101 (34%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREK 421
A +R R+RR +R + R R R D R R++ RE
Sbjct: 160 AAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERG 219
Query: 422 RHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEG 462
R DR R +DRR+ + RE+ + G
Sbjct: 220 RRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGG 260
Score = 56.8 bits (138), Expect = 8e-08
Identities = 28/101 (27%), Positives = 36/101 (35%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREK 421
+ R R R + R R +R RD RDR R + DR ER R
Sbjct: 163 RTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRD 222
Query: 422 RHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEG 462
RR R + RD +D R RE R + + G
Sbjct: 223 GGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRG 263
Score = 55.3 bits (134), Expect = 3e-07
Identities = 23/104 (22%), Positives = 36/104 (34%), Gaps = 3/104 (2%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSR---ERE 418
R + + R R R R+ + +R R +R RD +RDR + +
Sbjct: 154 TEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGD 213
Query: 419 REKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEG 462
R + RRD R + RR+ + D EG
Sbjct: 214 RREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEG 257
Score = 54.9 bits (133), Expect = 3e-07
Identities = 25/91 (27%), Positives = 29/91 (31%), Gaps = 1/91 (1%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREK 421
G R R R + R RE+ R D D RGR R RDR + RE
Sbjct: 190 RGRREERGRDGDDRDRRDRREQGDRREERGRRDG-GDRRGRRRRRDRRDARGDDNREDRG 248
Query: 422 RHHRRDRSKERDGKDRREGYRRERREEEASG 452
D + RR R R G
Sbjct: 249 DRDGDDGEGRGGRRGRRFRDRDRRGRRGGDG 279
Score = 54.5 bits (132), Expect = 5e-07
Identities = 30/115 (26%), Positives = 43/115 (37%)
Query: 339 GKGQEQRELALNSLKGGSKDILMAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRD 398
G+G EQ + + +RR R R+ + ER R RD D
Sbjct: 146 GEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDR 205
Query: 399 DRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGS 453
R++ R ER R +R R RRDR R +R + R+ + E G
Sbjct: 206 RDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGG 260
Score = 54.1 bits (131), Expect = 6e-07
Identities = 26/102 (25%), Positives = 39/102 (38%), Gaps = 2/102 (1%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREK 421
A +RR R + R ++ E E D + +RD R DR + +
Sbjct: 127 ARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAE 186
Query: 422 RHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGY 463
R R R E G+D + RR+RRE+ + RD
Sbjct: 187 RGERGRR--EERGRDGDDRDRRDRREQGDRREERGRRDGGDR 226
Score = 53.0 bits (128), Expect = 1e-06
Identities = 29/142 (20%), Positives = 52/142 (36%), Gaps = 5/142 (3%)
Query: 338 GGKGQEQRELALNSLKGGSKDILMAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSR 397
+ +E+RE + +G ++ G++ + R+ ERD ER R
Sbjct: 124 AAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERD-----ERRRRGDR 178
Query: 398 DDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKS 457
+DR + +R R R R+ + R R R + ++R +RR + +
Sbjct: 179 EDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDA 238
Query: 458 RDKEGYEPTEQMVISDEEEDKG 479
R + E D E G
Sbjct: 239 RGDDNREDRGDRDGDDGEGRGG 260
Score = 52.6 bits (127), Expect = 2e-06
Identities = 28/97 (28%), Positives = 36/97 (37%), Gaps = 2/97 (2%)
Query: 353 KGGSKDILMAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERD 412
D R R R RR R R R+R R D+R DRD + +
Sbjct: 197 GRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGE 256
Query: 413 RSREREREKRHHRRDRSKERDGKDRREGYRRERREEE 449
R R +R RDR + R G D E RE++
Sbjct: 257 GRGGR-RGRRFRDRDR-RGRRGGDGGNEREPELREDD 291
Score = 52.2 bits (126), Expect = 2e-06
Identities = 20/91 (21%), Positives = 30/91 (32%), Gaps = 4/91 (4%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDYDRVR----ERDTDRSRDDRGRDRDRDRLERDRSRER 417
R R+ R R + R R +R +R R D G R R R R
Sbjct: 181 RQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARG 240
Query: 418 EREKRHHRRDRSKERDGKDRREGYRRERREE 448
+ + + +G+ R G R R+
Sbjct: 241 DDNREDRGDRDGDDGEGRGGRRGRRFRDRDR 271
Score = 51.8 bits (125), Expect = 4e-06
Identities = 26/112 (23%), Positives = 39/112 (34%)
Query: 338 GGKGQEQRELALNSLKGGSKDILMAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSR 397
G ++++ A +G ++ GD R R + R R R R R
Sbjct: 175 RGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRD 234
Query: 398 DDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEE 449
R D DR + + R R ++RD + RR G RE E
Sbjct: 235 RRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPE 286
Score = 49.9 bits (120), Expect = 1e-05
Identities = 20/101 (19%), Positives = 33/101 (32%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREK 421
+ + + +++RER R + G ++ D + E E+
Sbjct: 109 RAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEE 168
Query: 422 RHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEG 462
R RR R D + E R RREE + R
Sbjct: 169 RDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRR 209
Score = 45.3 bits (108), Expect = 3e-04
Identities = 20/101 (19%), Positives = 34/101 (33%), Gaps = 1/101 (0%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRG-RDRDRDRLERDRSRERERE 420
A + + R+RR R R+ + R +R E + ER R
Sbjct: 116 AEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRR 175
Query: 421 KRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKE 461
R ER + RRE R+ + + + + +E
Sbjct: 176 GDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRRE 216
Score = 44.1 bits (105), Expect = 6e-04
Identities = 22/101 (21%), Positives = 36/101 (35%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREK 421
A + +P + RER R + +R+ E ER++
Sbjct: 112 AAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDE 171
Query: 422 RHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEG 462
R R DR + +R E RRE R + + R ++G
Sbjct: 172 RRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQG 212
Score = 42.6 bits (101), Expect = 0.002
Identities = 11/98 (11%), Positives = 29/98 (29%)
Query: 365 RRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHH 424
+ + + + + + R R + R + + +
Sbjct: 101 KAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADA 160
Query: 425 RRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEG 462
+E + RR G R +R+ E G + + ++
Sbjct: 161 AERTEEEERDERRRRGDREDRQAEAERGERGRREERGR 198
Score = 41.4 bits (98), Expect = 0.005
Identities = 18/73 (24%), Positives = 25/73 (34%)
Query: 363 GDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKR 422
D R R+ R +R D R + RG R R +RDR R +
Sbjct: 221 RDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGG 280
Query: 423 HHRRDRSKERDGK 435
+ R +E D
Sbjct: 281 NEREPELREDDVL 293
>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
hydrophilic C-term. This domain is a hydrophilic region
found at the C-terminus of plant and metazoan
pre-mRNA-splicing factor 38 proteins. The function is
not known.
Length = 97
Score = 52.1 bits (125), Expect = 4e-08
Identities = 26/63 (41%), Positives = 31/63 (49%)
Query: 364 DRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRH 423
R R R+R R + R R RD DR+R DRDRDR +R RSR R R +
Sbjct: 35 RGRRSPRRRTRRRSRRRKRSRKRRRRRRDRDRARYRDRDDRDRDRYDRSRSRSRSRSRDR 94
Query: 424 HRR 426
RR
Sbjct: 95 RRR 97
Score = 50.6 bits (121), Expect = 2e-07
Identities = 26/74 (35%), Positives = 34/74 (45%)
Query: 364 DRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRH 423
+ R ++ + R R R R R R R R R RDRDR R R+R+R R
Sbjct: 24 EIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRRRRDRDRARYRDRDDRDRDRYDRS 83
Query: 424 HRRDRSKERDGKDR 437
R RS+ RD + R
Sbjct: 84 RSRSRSRSRDRRRR 97
Score = 42.8 bits (101), Expect = 1e-04
Identities = 23/84 (27%), Positives = 36/84 (42%)
Query: 380 SRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRRE 439
+ + E RD R R R RSR R+R ++ RR R ++R R+
Sbjct: 14 EESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRRRRDRDRARYRDRD 73
Query: 440 GYRRERREEEASGSKHKSRDKEGY 463
R+R + S S+ +SRD+
Sbjct: 74 DRDRDRYDRSRSRSRSRSRDRRRR 97
Score = 40.5 bits (95), Expect = 5e-04
Identities = 25/76 (32%), Positives = 31/76 (40%), Gaps = 3/76 (3%)
Query: 374 RKRRSRSRERDYDRVRERDTDRSRDDRGRDRD---RDRLERDRSREREREKRHHRRDRSK 430
+ R ERD DR R R+R R + R R RDR R R R++ RDR
Sbjct: 22 DEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRRRRDRDRARYRDRDDRDRDRYD 81
Query: 431 ERDGKDRREGYRRERR 446
+ R R RR
Sbjct: 82 RSRSRSRSRSRDRRRR 97
Score = 37.1 bits (86), Expect = 0.010
Identities = 25/95 (26%), Positives = 41/95 (43%), Gaps = 7/95 (7%)
Query: 376 RRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKE---- 431
R + E D D E + + ++ R +RD RS R +R RR RS++
Sbjct: 1 PRVSALEEDLDEEEESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRR 60
Query: 432 ---RDGKDRREGYRRERREEEASGSKHKSRDKEGY 463
RD R+ R+R + S S+ +SR ++
Sbjct: 61 RRDRDRARYRDRDDRDRDRYDRSRSRSRSRSRDRR 95
Score = 35.1 bits (81), Expect = 0.041
Identities = 17/81 (20%), Positives = 28/81 (34%)
Query: 384 DYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRR 443
E D D + + D + + + RR R + R K R+ RR
Sbjct: 1 PRVSALEEDLDEEEESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRRRR 60
Query: 444 ERREEEASGSKHKSRDKEGYE 464
R + A RD++ Y+
Sbjct: 61 RRDRDRARYRDRDDRDRDRYD 81
Score = 29.4 bits (66), Expect = 4.9
Identities = 14/77 (18%), Positives = 28/77 (36%), Gaps = 3/77 (3%)
Query: 389 RERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREE 448
R + D+ + E D R+ E+ R RS R + R +R R+
Sbjct: 2 RVSALEEDLDEEEESEEE---EDDEEIRRKAERDVDRGRRSPRRRTRRRSRRRKRSRKRR 58
Query: 449 EASGSKHKSRDKEGYEP 465
+ ++R ++ +
Sbjct: 59 RRRRDRDRARYRDRDDR 75
>gnl|CDD|224126 COG1205, COG1205, Distinct helicase family with a unique C-terminal
domain including a metal-binding cysteine cluster
[General function prediction only].
Length = 851
Score = 57.8 bits (140), Expect = 5e-08
Identities = 85/380 (22%), Positives = 140/380 (36%), Gaps = 71/380 (18%)
Query: 755 QRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGP--YAIIMA 812
Q A+ + + R+++ TGSGKT +FLLP+L + + P A+++
Sbjct: 75 QVDALRLIREGRNVVVTTGTGSGKTESFLLPIL------------DHLLRDPSARALLLY 122
Query: 813 PTRELAQQIEEETNKFG--TPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVL 870
PT LA E + P + G EE+ +R +I++ P D+L
Sbjct: 123 PTNALANDQAERLRELISDLPGKVTFGRYTGDTPPEERRAIIRNPPDILLTNP----DML 178
Query: 871 E-------NRYLVL-NQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDE 922
+ +L L Y+V+DE +V + + L
Sbjct: 179 HYLLLRNHDAWLWLLRNLKYLVVDEL-HTYRGVQGSEV-----ALLLRRLL--------- 223
Query: 923 NKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVY-- 980
+ L Y S + T +AT+ E A R V + G P +
Sbjct: 224 -RRLRRYGSPLQIICT---SATLANPGE-FAEELFGRDFEVPVDEDGSPRGLRYFVRREP 278
Query: 981 ---ILSEQDKRKKLMEV-------LNRGVKKPVIIFVNQKKGADVLAKGL--------EK 1022
L+E +R L E+ + G++ ++F +K ++L K
Sbjct: 279 PIRELAESIRRSALAELATLAALLVRNGIQ--TLVFFRSRKQVELLYLSPRRRLVREGGK 336
Query: 1023 LGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDM-AKSI 1081
L T G +E+R K G ++AT+ GIDI + VI Y S+
Sbjct: 337 LLDAVSTYRAGLHREERRRIEAEFKEGELLGVIATNALELGIDIGSLDAVIAYGYPGVSV 396
Query: 1082 EDYTHRIGRTGRAGKEGLAV 1101
+ R GR GR G+E L +
Sbjct: 397 LSFRQRAGRAGRRGQESLVL 416
Score = 57.8 bits (140), Expect = 5e-08
Identities = 85/380 (22%), Positives = 140/380 (36%), Gaps = 71/380 (18%)
Query: 1430 QRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGP--YAIIMA 1487
Q A+ + + R+++ TGSGKT +FLLP+L + + P A+++
Sbjct: 75 QVDALRLIREGRNVVVTTGTGSGKTESFLLPIL------------DHLLRDPSARALLLY 122
Query: 1488 PTRELAQQIEEETNKFG--TPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVL 1545
PT LA E + P + G EE+ +R +I++ P D+L
Sbjct: 123 PTNALANDQAERLRELISDLPGKVTFGRYTGDTPPEERRAIIRNPPDILLTNP----DML 178
Query: 1546 E-------NRYLVL-NQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDE 1597
+ +L L Y+V+DE +V + + L
Sbjct: 179 HYLLLRNHDAWLWLLRNLKYLVVDEL-HTYRGVQGSEV-----ALLLRRLL--------- 223
Query: 1598 NKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVY-- 1655
+ L Y S + T +AT+ E A R V + G P +
Sbjct: 224 -RRLRRYGSPLQIICT---SATLANPGE-FAEELFGRDFEVPVDEDGSPRGLRYFVRREP 278
Query: 1656 ---ILSEQDKRKKLMEV-------LNRGVKKPVIIFVNQKKGADVLAKGL--------EK 1697
L+E +R L E+ + G++ ++F +K ++L K
Sbjct: 279 PIRELAESIRRSALAELATLAALLVRNGIQ--TLVFFRSRKQVELLYLSPRRRLVREGGK 336
Query: 1698 LGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDM-AKSI 1756
L T G +E+R K G ++AT+ GIDI + VI Y S+
Sbjct: 337 LLDAVSTYRAGLHREERRRIEAEFKEGELLGVIATNALELGIDIGSLDAVIAYGYPGVSV 396
Query: 1757 EDYTHRIGRTGRAGKEGLAV 1776
+ R GR GR G+E L +
Sbjct: 397 LSFRQRAGRAGRRGQESLVL 416
Score = 48.2 bits (115), Expect = 5e-05
Identities = 36/146 (24%), Positives = 61/146 (41%), Gaps = 28/146 (19%)
Query: 14 QRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGP--YAIIMA 71
Q A+ + + R+++ TGSGKT +FLLP+L + + P A+++
Sbjct: 75 QVDALRLIREGRNVVVTTGTGSGKTESFLLPIL------------DHLLRDPSARALLLY 122
Query: 72 PTRELAQQIEEETNKFG--TPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVL 129
PT LA E + P + G EE+ +R +I++ P D+L
Sbjct: 123 PTNALANDQAERLRELISDLPGKVTFGRYTGDTPPEERRAIIRNPPDILLTNP----DML 178
Query: 130 E-------NRYLVL-NQCTYIVLDEA 147
+ +L L Y+V+DE
Sbjct: 179 HYLLLRNHDAWLWLLRNLKYLVVDEL 204
>gnl|CDD|224125 COG1204, COG1204, Superfamily II helicase [General function
prediction only].
Length = 766
Score = 56.2 bits (136), Expect = 2e-07
Identities = 37/170 (21%), Positives = 72/170 (42%), Gaps = 16/170 (9%)
Query: 720 GGKVPDPVRNWKEASLPTEILEIIEKIGYAEPTPIQRQAI-PIGLQNRDIIGVAETGSGK 778
+ + + L +LEI++ G E Q++A+ L + +++ A TGSGK
Sbjct: 1 EIFMKEEKLATSKVKLDDRVLEILKGDGIDELFNPQQEAVEKGLLSDENVLISAPTGSGK 60
Query: 779 TLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVL 838
TL LL + G + + P + LA++ EE ++ LGIR +
Sbjct: 61 TLIALL-----------AILSTLLEGGGKVVYIVPLKALAEEKYEEFSRL-EELGIRVGI 108
Query: 839 VVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEA 888
G +++ ++++ TP +L + R + + +V+DE
Sbjct: 109 STGDYDLDDERL---ARYDVIVTTPEKLDSLTRKRPSWIEEVDLVVIDEI 155
Score = 56.2 bits (136), Expect = 2e-07
Identities = 37/170 (21%), Positives = 72/170 (42%), Gaps = 16/170 (9%)
Query: 1395 GGKVPDPVRNWKEASLPTEILEIIEKIGYAEPTPIQRQAI-PIGLQNRDIIGVAETGSGK 1453
+ + + L +LEI++ G E Q++A+ L + +++ A TGSGK
Sbjct: 1 EIFMKEEKLATSKVKLDDRVLEILKGDGIDELFNPQQEAVEKGLLSDENVLISAPTGSGK 60
Query: 1454 TLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVL 1513
TL LL + G + + P + LA++ EE ++ LGIR +
Sbjct: 61 TLIALL-----------AILSTLLEGGGKVVYIVPLKALAEEKYEEFSRL-EELGIRVGI 108
Query: 1514 VVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEA 1563
G +++ ++++ TP +L + R + + +V+DE
Sbjct: 109 STGDYDLDDERL---ARYDVIVTTPEKLDSLTRKRPSWIEEVDLVVIDEI 155
Score = 50.8 bits (122), Expect = 7e-06
Identities = 31/136 (22%), Positives = 57/136 (41%), Gaps = 15/136 (11%)
Query: 12 PIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMA 71
P Q L + +++ A TGSGKTL LL + G + +
Sbjct: 35 PQQEAVEKGLLSDENVLISAPTGSGKTLIALL-----------AILSTLLEGGGKVVYIV 83
Query: 72 PTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVLEN 131
P + LA++ EE ++ LGIR + G +++ ++++ TP +L +
Sbjct: 84 PLKALAEEKYEEFSRL-EELGIRVGISTGDYDLDDERL---ARYDVIVTTPEKLDSLTRK 139
Query: 132 RYLVLNQCTYIVLDEA 147
R + + +V+DE
Sbjct: 140 RPSWIEEVDLVVIDEI 155
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 56.0 bits (136), Expect = 2e-07
Identities = 48/155 (30%), Positives = 71/155 (45%), Gaps = 25/155 (16%)
Query: 973 ERIEQIVYILSEQD----KRKKLMEVLNRGVKKP----VIIFVNQKKGADVLAKGLEKLG 1024
R + V E D K +KL E++ + K +I+F + A+ + LEK G
Sbjct: 330 PRFRKAVRKAKELDIEHPKLEKLREIVKEQLGKNPDSRIIVFTQYRDTAEKIVDLLEKEG 389
Query: 1025 YNA------CTLHGGKGQEQRE--LALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYD 1076
A + G KG Q+E L+ + G ++LV+T VA G+DI V +VI Y+
Sbjct: 390 IKAVRFVGQASKDGDKGMSQKEQIEILDKFRAGEFNVLVSTSVAEEGLDIPSVDLVIFYE 449
Query: 1077 MA----KSIEDYTHRIGRTGRAGKEGLAVSFCTKD 1107
+SI+ R GRTGR +EG V K
Sbjct: 450 PVPSEIRSIQ----RKGRTGR-QEEGRVVVLIAKG 479
Score = 56.0 bits (136), Expect = 2e-07
Identities = 48/155 (30%), Positives = 71/155 (45%), Gaps = 25/155 (16%)
Query: 1648 ERIEQIVYILSEQD----KRKKLMEVLNRGVKKP----VIIFVNQKKGADVLAKGLEKLG 1699
R + V E D K +KL E++ + K +I+F + A+ + LEK G
Sbjct: 330 PRFRKAVRKAKELDIEHPKLEKLREIVKEQLGKNPDSRIIVFTQYRDTAEKIVDLLEKEG 389
Query: 1700 YNA------CTLHGGKGQEQRE--LALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYD 1751
A + G KG Q+E L+ + G ++LV+T VA G+DI V +VI Y+
Sbjct: 390 IKAVRFVGQASKDGDKGMSQKEQIEILDKFRAGEFNVLVSTSVAEEGLDIPSVDLVIFYE 449
Query: 1752 MA----KSIEDYTHRIGRTGRAGKEGLAVSFCTKD 1782
+SI+ R GRTGR +EG V K
Sbjct: 450 PVPSEIRSIQ----RKGRTGR-QEEGRVVVLIAKG 479
Score = 33.3 bits (77), Expect = 1.5
Identities = 38/124 (30%), Positives = 59/124 (47%), Gaps = 20/124 (16%)
Query: 30 VAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGT 89
V TG GKT + LLV + L K G +I+APT+ L +Q E KF
Sbjct: 35 VLPTGLGKTA---IALLVIAERLHKK--------GGKVLILAPTKPLVEQHAEFFRKFLN 83
Query: 90 PLGIRTVLVVGGLSREEQGFRLRL--GCEIVIATPGRLI--DVLENRYLVLNQCTYIVLD 145
+ V+ G +S E+ R L ++++ATP ++I D++ R L + ++ D
Sbjct: 84 IPEEKIVVFTGEVSPEK---RAELWEKAKVIVATP-QVIENDLIAGRI-SLEDVSLLIFD 138
Query: 146 EADR 149
EA R
Sbjct: 139 EAHR 142
Score = 33.3 bits (77), Expect = 1.5
Identities = 38/124 (30%), Positives = 59/124 (47%), Gaps = 20/124 (16%)
Query: 771 VAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGT 830
V TG GKT + LLV + L K G +I+APT+ L +Q E KF
Sbjct: 35 VLPTGLGKTA---IALLVIAERLHKK--------GGKVLILAPTKPLVEQHAEFFRKFLN 83
Query: 831 PLGIRTVLVVGGLSREEQGFRLRL--GCEIVIATPGRLI--DVLENRYLVLNQCTYIVLD 886
+ V+ G +S E+ R L ++++ATP ++I D++ R L + ++ D
Sbjct: 84 IPEEKIVVFTGEVSPEK---RAELWEKAKVIVATP-QVIENDLIAGRI-SLEDVSLLIFD 138
Query: 887 EADR 890
EA R
Sbjct: 139 EAHR 142
Score = 33.3 bits (77), Expect = 1.5
Identities = 38/124 (30%), Positives = 59/124 (47%), Gaps = 20/124 (16%)
Query: 1446 VAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGT 1505
V TG GKT + LLV + L K G +I+APT+ L +Q E KF
Sbjct: 35 VLPTGLGKTA---IALLVIAERLHKK--------GGKVLILAPTKPLVEQHAEFFRKFLN 83
Query: 1506 PLGIRTVLVVGGLSREEQGFRLRL--GCEIVIATPGRLI--DVLENRYLVLNQCTYIVLD 1561
+ V+ G +S E+ R L ++++ATP ++I D++ R L + ++ D
Sbjct: 84 IPEEKIVVFTGEVSPEK---RAELWEKAKVIVATP-QVIENDLIAGRI-SLEDVSLLIFD 138
Query: 1562 EADR 1565
EA R
Sbjct: 139 EAHR 142
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 54.2 bits (130), Expect = 6e-07
Identities = 62/344 (18%), Positives = 127/344 (36%), Gaps = 31/344 (9%)
Query: 375 KRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDG 434
KR+ + R + E + D +L+ + E + + + +E
Sbjct: 169 KRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLL 228
Query: 435 KDRREGYRRERR---------EEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKE 485
ER E+E S + +KE + + + EEE + ++E
Sbjct: 229 YLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEE 288
Query: 486 PLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTK 545
L + + K+E L K ER ++ E E+ KK+E+E KK +E
Sbjct: 289 LKLLAKEEEELKSE---------LLKLERRKVDDEEKLKESEKELKKLEKELKKEKEE-- 337
Query: 546 EASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLN 605
E ++E + ++R+ E+++E+ EK K ++ E E + ++ L + +
Sbjct: 338 --IEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSAAKLKE 395
Query: 606 DRKFVFDWDASEDTSVDYNSIYKERHQVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRR 665
+ + + + E + S +E + + ++ + E + +
Sbjct: 396 EELELKNEEEKEAKLLLELSEQEEDL--------LKEEKKEELKIVEELEESLETKQGKL 447
Query: 666 TEAEKEQEKVRLKKVKKR-EEKQKWDDRHWTEKSLDEMTERDWR 708
TE ++E EK LK +K + E K+ D T+
Sbjct: 448 TEEKEELEKQALKLLKDKLELKKSEDLLKETKLVKLLEQLELLL 491
Score = 51.9 bits (124), Expect = 4e-06
Identities = 53/264 (20%), Positives = 103/264 (39%), Gaps = 12/264 (4%)
Query: 442 RRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEE 501
RR EEEA+GS+ K + KE + + + E K + AKK E
Sbjct: 155 RRLEIEEEAAGSREKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYY 214
Query: 502 ARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLR 561
+ L +E K E ++ +++ + ++ E + + E + E L+
Sbjct: 215 QLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIE-SSKQELEKEEEILAQVLK 273
Query: 562 RDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSV 621
++E++KE ++E K +E +E L+K +RR ++ SE
Sbjct: 274 ENKEEEKEKKLQEEELKLLAKE---EEELKSELLKLERRKVDDEEKL-----KESEKELK 325
Query: 622 DYNSIYKERHQVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQEKVRLKKVK 681
K+ + + ++IK + ++ + E LEK + + E+ +E++ KK
Sbjct: 326 KLEKELKKEKEEIEELEKELKELEIKREAEEEEE---EQLEKLQEKLEQLEEELLAKKKL 382
Query: 682 KREEKQKWDDRHWTEKSLDEMTER 705
+ E E L E+
Sbjct: 383 ESERLSSAAKLKEEELELKNEEEK 406
Score = 42.3 bits (99), Expect = 0.003
Identities = 45/259 (17%), Positives = 89/259 (34%), Gaps = 14/259 (5%)
Query: 350 NSLKGGSKDILMAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRL 409
++G KD + S R+ S E E S + +++
Sbjct: 624 KVVEGILKDTELTKLLESAKAKESGLRKGVSLEEGLAEKSELKASLSELTKELLAEQELQ 683
Query: 410 ERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQM 469
E+ S + E + + K+ + +EE + +++DK E
Sbjct: 684 EKAESELAKNEILRRQEEIKKKEQRIKEELKKLKLEKEELLADKVQEAQDKINEELKLLE 743
Query: 470 VISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEM 529
E+E++ ++ + EE ++ +E+ + EE K + E EE
Sbjct: 744 QKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELA-------EEEEKTEKLKVEEEKEEK 796
Query: 530 RKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKE 589
K EEE + +E E K E + +++E +E+EL + E K
Sbjct: 797 LKAQEEELRALEE-------ELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKL 849
Query: 590 RYLGLVKKKRRVRRLNDRK 608
L + +R + +
Sbjct: 850 EKLAEEELERLEEEITKEE 868
Score = 38.0 bits (88), Expect = 0.053
Identities = 29/185 (15%), Positives = 68/185 (36%), Gaps = 16/185 (8%)
Query: 504 SKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRD 563
KP+ + E A R+++ + E ++K +EE + E L+
Sbjct: 151 MKPERRLEIEEEAAGSREKRKKKERLKKLIEETENLAELIID------LEELKLQELKLK 204
Query: 564 REKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDY 623
+ KK + K + E + YL L +++ + + R + ++S+
Sbjct: 205 EQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKE 264
Query: 624 NSIYKERHQVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQEKVRLKKVKKR 683
I + + + +++ + ++L K E + E K+ +KV
Sbjct: 265 EEILAQVLKE----------NKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDE 314
Query: 684 EEKQK 688
E+ ++
Sbjct: 315 EKLKE 319
Score = 31.9 bits (72), Expect = 3.8
Identities = 34/284 (11%), Positives = 83/284 (29%), Gaps = 12/284 (4%)
Query: 340 KGQEQRELALNSLKGGSKDILMAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDD 399
K + ++E S + ++ + + + + + +++
Sbjct: 755 KSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEE 814
Query: 400 RGRDRDRDRLERDRSREREREKRHHRR------DRSKERDGKDRREGYRRERREEEA--S 451
+ L + +E E K + + R + E
Sbjct: 815 AELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELL 874
Query: 452 GSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTK 511
+ + +++ + E ++EE K + + +L E + E A L
Sbjct: 875 LKEEELEEQKLKDELESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKY 934
Query: 512 EERAAEALRKRQAEVEEMR-KKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKED 570
E E L + E E+ K EEE + ++ + + A E+ +D
Sbjct: 935 ESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKD 994
Query: 571 PEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKFVFDWD 614
+KE E E + + + ++ + +
Sbjct: 995 ELKKE---RLEEEKKELLREIIEETCQRFKEFLELFVSINRGLN 1035
Score = 31.5 bits (71), Expect = 6.2
Identities = 40/293 (13%), Positives = 99/293 (33%), Gaps = 16/293 (5%)
Query: 279 ERIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFV--NQKKGADVLAKGLEKLGYNACTL 336
++ E+ K K+L E + K V K + L E+L A L
Sbjct: 759 KKEEEEEEKSELSLKEKELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELL 818
Query: 337 HGGKGQEQRELALNSLKGGSKDILMAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRS 396
+ ++E + + + + +++ + R + ++E
Sbjct: 819 EEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEE 878
Query: 397 RDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHK 456
+ + +D + ++ +E ++E ++ ++ E +E E +
Sbjct: 879 ELEEQKLKDELESKEEKEKEEKKELE------------EESQKDNLLEEKENEIEERIAE 926
Query: 457 SRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAA 516
+E + EE D+ + EE + +E + E
Sbjct: 927 EAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEE 986
Query: 517 EALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKE 569
+ R + E+++ +++EEE+K+ E + + +E + + +R K
Sbjct: 987 KEERYNKDELKK--ERLEEEKKELLREIIEETCQRFKEFLELFVSINRGLNKV 1037
>gnl|CDD|233069 TIGR00643, recG, ATP-dependent DNA helicase RecG. [DNA metabolism,
DNA replication, recombination, and repair].
Length = 630
Score = 53.1 bits (128), Expect = 1e-06
Identities = 82/357 (22%), Positives = 137/357 (38%), Gaps = 72/357 (20%)
Query: 765 NRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEE 824
NR + G + GSGKTL L +L I++ G +MAPT LA+Q
Sbjct: 258 NRLLQG--DVGSGKTLVAALAMLAAIEA------------GYQVALMAPTEILAEQHYNS 303
Query: 825 TNKFGTPLGIRTVLVVGGL---SREEQGFRLRLG-CEIVIATPGRLIDVLENRYLVLNQC 880
PLGI L+ G L R+E + G +V+ T + + +E +
Sbjct: 304 LRNLLAPLGIEVALLTGSLKGKRRKELLETIASGQIHLVVGTHALIQEKVE-----FKRL 358
Query: 881 TYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVM 940
+++DE R G E + K L ++
Sbjct: 359 ALVIIDEQHR---FGVE------------------------QRKKLREKGQGGFTPHVLV 391
Query: 941 FTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNRGVK 1000
+AT P LA + T I + + I ++ + D++ + E + +
Sbjct: 392 MSAT--PIPRTLALTVYGDLDTSIIDELPPGRKPITTVLI---KHDEKDIVYEFIEEEIA 446
Query: 1001 K--------PVI--IFVNQKKGADVLAKGLEKL--GYNACTLHGGKGQEQRELALNSLKG 1048
K P+I K A+ L + L+K YN LHG +++E + +
Sbjct: 447 KGRQAYVVYPLIEESEKLDLKAAEALYERLKKAFPKYNVGLLHGRMKSDEKEAVMEEFRE 506
Query: 1049 GSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRI-GRTGRAGKEGLAVSFC 1104
G DILVAT V G+D+ + ++++ D + H++ GR GR + S+C
Sbjct: 507 GEVDILVATTVIEVGVDVPNATVMVIEDAERFGLSQLHQLRGRVGRGDHQ----SYC 559
Score = 53.1 bits (128), Expect = 1e-06
Identities = 82/357 (22%), Positives = 137/357 (38%), Gaps = 72/357 (20%)
Query: 1440 NRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEE 1499
NR + G + GSGKTL L +L I++ G +MAPT LA+Q
Sbjct: 258 NRLLQG--DVGSGKTLVAALAMLAAIEA------------GYQVALMAPTEILAEQHYNS 303
Query: 1500 TNKFGTPLGIRTVLVVGGL---SREEQGFRLRLG-CEIVIATPGRLIDVLENRYLVLNQC 1555
PLGI L+ G L R+E + G +V+ T + + +E +
Sbjct: 304 LRNLLAPLGIEVALLTGSLKGKRRKELLETIASGQIHLVVGTHALIQEKVE-----FKRL 358
Query: 1556 TYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVM 1615
+++DE R G E + K L ++
Sbjct: 359 ALVIIDEQHR---FGVE------------------------QRKKLREKGQGGFTPHVLV 391
Query: 1616 FTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNRGVK 1675
+AT P LA + T I + + I ++ + D++ + E + +
Sbjct: 392 MSAT--PIPRTLALTVYGDLDTSIIDELPPGRKPITTVLI---KHDEKDIVYEFIEEEIA 446
Query: 1676 K--------PVI--IFVNQKKGADVLAKGLEKL--GYNACTLHGGKGQEQRELALNSLKG 1723
K P+I K A+ L + L+K YN LHG +++E + +
Sbjct: 447 KGRQAYVVYPLIEESEKLDLKAAEALYERLKKAFPKYNVGLLHGRMKSDEKEAVMEEFRE 506
Query: 1724 GSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRI-GRTGRAGKEGLAVSFC 1779
G DILVAT V G+D+ + ++++ D + H++ GR GR + S+C
Sbjct: 507 GEVDILVATTVIEVGVDVPNATVMVIEDAERFGLSQLHQLRGRVGRGDHQ----SYC 559
Score = 38.9 bits (91), Expect = 0.031
Identities = 36/130 (27%), Positives = 55/130 (42%), Gaps = 23/130 (17%)
Query: 24 NRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEE 83
NR + G + GSGKTL L +L I++ G +MAPT LA+Q
Sbjct: 258 NRLLQG--DVGSGKTLVAALAMLAAIEA------------GYQVALMAPTEILAEQHYNS 303
Query: 84 TNKFGTPLGIRTVLVVGGL---SREEQGFRLRLG-CEIVIATPGRLIDVLENRYLVLNQC 139
PLGI L+ G L R+E + G +V+ T + + +E +
Sbjct: 304 LRNLLAPLGIEVALLTGSLKGKRRKELLETIASGQIHLVVGTHALIQEKVE-----FKRL 358
Query: 140 TYIVLDEADR 149
+++DE R
Sbjct: 359 ALVIIDEQHR 368
>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination, and
repair].
Length = 1187
Score = 53.2 bits (128), Expect = 1e-06
Identities = 98/468 (20%), Positives = 162/468 (34%), Gaps = 96/468 (20%)
Query: 1383 RIFREDYSITIKGGKVPDPVRNWKEASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRD 1442
RE ++ + R E E E +K P QR ++ +
Sbjct: 47 NALRERKTL----KNLEFFRRYLWEYE---EFEEFFKKATGFRPWSAQRVWAKRLVRGKS 99
Query: 1443 IIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNK 1502
+A TG GKT F L + +++ A +G I+ PT L +Q+ E K
Sbjct: 100 FAIIAPTGVGKT-TFGLLMSLYL-----------AKKGKRVYIIVPTTTLVRQVYERLKK 147
Query: 1503 FGTPLGIRTVLVV--GGLS-REEQGFRLRL---GCEIVIATPGRLIDVLENRYLVLNQCT 1556
F G VLVV L +E++ R+ +I+I T L E L +
Sbjct: 148 FAEDAGSLDVLVVYHSALPTKEKEEALERIESGDFDILITTSQFLSKRFEE--LSKLKFD 205
Query: 1557 YIVLDEAD------RMID-----MGFEPDV---QKILEYMPVTNLKPDTEDAEDENKLLA 1602
+I +D+ D + +D +GF +V L + L + + AE + L
Sbjct: 206 FIFVDDVDAILKASKNVDRLLRLLGFSEEVIESAYELIKL-RRKLYGE-KRAERVREELR 263
Query: 1603 -----NYNSKKKYRQTVMFTATMPPAVERLA-------------RSYLRRPATVYIGSVG 1644
++K V+ +AT P RL LR +Y+
Sbjct: 264 EVEREREKKRRKLGILVVSSATGKPRGSRLKLFRELLGFEVGSGGEGLRNIVDIYV---- 319
Query: 1645 KPTERIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADV---LAKGLEKLGYN 1701
+E +E++V ++ + L G +IFV G + LA+ L G N
Sbjct: 320 -ESESLEKVVELVKK----------LGDG----GLIFVPIDYGREKAEELAEYLRSHGIN 364
Query: 1702 ACTLHGGKGQEQRELALNSLKGGSKDILVAT----DVAGRGIDI-KDVSMVINYDMAK-- 1754
A +H K E AL + G D+LV V RG+D+ + + Y + K
Sbjct: 365 AELIHAEK-----EEALEDFEEGEVDVLVGVASYYGVLVRGLDLPHRIRYAVFYGVPKFR 419
Query: 1755 -SIEDYTHRIGRTGRAGKEGLAVSFCTKDDSHLFYDLKQMMISSPVST 1801
+ R ++ + + + SP S
Sbjct: 420 FRLTLEESDPKRLLYLLSALSDREARSRLEGLAGRLRRIIRRLSPYSL 467
Score = 52.8 bits (127), Expect = 2e-06
Identities = 92/403 (22%), Positives = 146/403 (36%), Gaps = 92/403 (22%)
Query: 708 RIFREDYSITIKGGKVPDPVRNWKEASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRD 767
RE ++ + R E E E +K P QR ++ +
Sbjct: 47 NALRERKTL----KNLEFFRRYLWEYE---EFEEFFKKATGFRPWSAQRVWAKRLVRGKS 99
Query: 768 IIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNK 827
+A TG GKT F L + +++ A +G I+ PT L +Q+ E K
Sbjct: 100 FAIIAPTGVGKT-TFGLLMSLYL-----------AKKGKRVYIIVPTTTLVRQVYERLKK 147
Query: 828 FGTPLGIRTVLVV--GGLS-REEQGFRLRL---GCEIVIATPGRLIDVLENRYLVLNQCT 881
F G VLVV L +E++ R+ +I+I T L E L +
Sbjct: 148 FAEDAGSLDVLVVYHSALPTKEKEEALERIESGDFDILITTSQFLSKRFEE--LSKLKFD 205
Query: 882 YIVLDEAD------RMID-----MGFEPDV---QKILEYMPVTNLKPDTEDAEDENKLLA 927
+I +D+ D + +D +GF +V L + L + + AE + L
Sbjct: 206 FIFVDDVDAILKASKNVDRLLRLLGFSEEVIESAYELIKL-RRKLYGE-KRAERVREELR 263
Query: 928 -----NYNSKKKYRQTVMFTATMPPAVERLA-------------RSYLRRPATVYIGSVG 969
++K V+ +AT P RL LR +Y+
Sbjct: 264 EVEREREKKRRKLGILVVSSATGKPRGSRLKLFRELLGFEVGSGGEGLRNIVDIYV---- 319
Query: 970 KPTERIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADV---LAKGLEKLGYN 1026
+E +E++V ++ + L G +IFV G + LA+ L G N
Sbjct: 320 -ESESLEKVVELVKK----------LGDG----GLIFVPIDYGREKAEELAEYLRSHGIN 364
Query: 1027 ACTLHGGKGQEQRELALNSLKGGSKDILVAT----DVAGRGID 1065
A +H K E AL + G D+LV V RG+D
Sbjct: 365 AELIHAEK-----EEALEDFEEGEVDVLVGVASYYGVLVRGLD 402
Score = 33.9 bits (78), Expect = 1.0
Identities = 75/360 (20%), Positives = 126/360 (35%), Gaps = 85/360 (23%)
Query: 3 EKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDAD 62
+K P QR ++ + +A TG GKT F L + +++ A
Sbjct: 76 KKATGFRPWSAQRVWAKRLVRGKSFAIIAPTGVGKT-TFGLLMSLYL-----------AK 123
Query: 63 QGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVV--GGLS-REEQGFRLRL---GCE 116
+G I+ PT L +Q+ E KF G VLVV L +E++ R+ +
Sbjct: 124 KGKRVYIIVPTTTLVRQVYERLKKFAEDAGSLDVLVVYHSALPTKEKEEALERIESGDFD 183
Query: 117 IVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLKPDTE 176
I+I T L E L + +I +D+ D ++ +V ++L + +
Sbjct: 184 ILITTSQFLSKRFEE--LSKLKFDFIFVDDVDAILKAS--KNVDRLLRLLGFSE------ 233
Query: 177 DAEDENKLLANYNSKKKYRQTVMFTATMPPAKILEYMPVTNLKPDTEDAEDENKLLANYN 236
E ++K K E + E E E +
Sbjct: 234 --EVIESAYELIKLRRKLYG----------EKRAE-------RVREELREVERER---EK 271
Query: 237 SKKKYRQTVMFTATMPPAVERLA-------------RSYLRRPATVYIGSVGKPTERIEQ 283
++K V+ +AT P RL LR +Y+ +E +E+
Sbjct: 272 KRRKLGILVVSSATGKPRGSRLKLFRELLGFEVGSGGEGLRNIVDIYV-----ESESLEK 326
Query: 284 IVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADV---LAKGLEKLGYNACTLHGGK 340
+V ++ + L G +IFV G + LA+ L G NA +H K
Sbjct: 327 VVELVKK----------LGDG----GLIFVPIDYGREKAEELAEYLRSHGINAELIHAEK 372
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 53.1 bits (128), Expect = 2e-06
Identities = 56/304 (18%), Positives = 114/304 (37%), Gaps = 38/304 (12%)
Query: 406 RDRLERDRSREREREKRHHRRDRS---KERDGKDRREGYRRERREEEASGSKHKSRDKEG 462
E +EK R + ++ + E E + + E
Sbjct: 217 LKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLE- 275
Query: 463 YEPTEQMVISDEEEDKGAAAKKEPLSLEELLAK-KKAEEEARSKPKFLTKEERAAEA-LR 520
E ++ I D E++ K++ LE +A +++ E + + + EA +
Sbjct: 276 -ELNKK--IKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEID 332
Query: 521 KRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDK 580
K AE+EE+ +++EEERK+R + T+E + +E + E KE E ++ KD
Sbjct: 333 KLLAEIEELEREIEEERKRRDKLTEE--YAELKEELEDLRAELEEVDKEFAETRDELKDY 390
Query: 581 EREGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKERHQVQFFGRGN 640
+ E +K L KR + RL + + +
Sbjct: 391 REKLEKLKREINEL---KRELDRLQEEL-------------------QRLSEELADLNAA 428
Query: 641 IAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQEKVRLKKVKKREEKQKWDDRHWTEKSLD 700
IAGI + ++ E +K ++E + +L + E++ +D + ++
Sbjct: 429 IAGI-----EAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEK 483
Query: 701 EMTE 704
E+++
Sbjct: 484 ELSK 487
Score = 43.5 bits (103), Expect = 0.001
Identities = 39/234 (16%), Positives = 80/234 (34%), Gaps = 16/234 (6%)
Query: 360 LMAGDRRSRSRSPPRKRRSRSRERDYDRVRER--DTDRSRDDRGRDRDRDRLERDRSRER 417
M G R+ R + + R+RER R + R D +
Sbjct: 654 AMTGGSRAPRGGILFSRSEPAELQ---RLRERLEGLKRELSSLQSELRRIENRLDELSQE 710
Query: 418 EREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEED 477
+ + KE + ++ E +ER EE ++ E + +E + E+
Sbjct: 711 LSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVK-SELKELEARIEE 769
Query: 478 KGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEER 537
K +L +L EAR + + + L + + +E +++E++
Sbjct: 770 LEEDLHKLEEALNDL--------EARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKL 821
Query: 538 KKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERY 591
+ ++ E + + + +E+ K +E E K+ E E E
Sbjct: 822 NRLTL--EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEEL 873
Score = 37.4 bits (87), Expect = 0.083
Identities = 40/200 (20%), Positives = 72/200 (36%), Gaps = 22/200 (11%)
Query: 376 RRSRSRERDYDRVRER--DTDRSRDDRGRDRD---------RDRLERDRSREREREKRHH 424
R E + D++ + +R ++ + RD ++ LE R+ E +K
Sbjct: 322 ERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFA 381
Query: 425 R-RDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAK 483
RD K+ +++ E +RE E + + + + E + G AK
Sbjct: 382 ETRDELKDY--REKLEKLKREINELKRELDRLQEELQRLSEELADL----NAAIAGIEAK 435
Query: 484 KEPL--SLEELLAK-KKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKR 540
L E+ + KK E + L+K E+ L++ VE+ K++ E
Sbjct: 436 INELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQREL-AE 494
Query: 541 QEFTKEASFESKRENFDARL 560
E AS E R
Sbjct: 495 AEAQARASEERVRGGRAVEE 514
Score = 35.0 bits (81), Expect = 0.42
Identities = 37/190 (19%), Positives = 72/190 (37%), Gaps = 17/190 (8%)
Query: 409 LERDRSREREREKRHHRRDRSKER-------DGKDRREGYRRERREEEASGSKHKSRDKE 461
LE S R E + +E +++ +E + +
Sbjct: 784 LEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRI 843
Query: 462 GYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEE-EARSKPKFLTKEERAAEA-L 519
+ + + + E G + LEE L + +A + S+ L KE EA L
Sbjct: 844 DLKEQIKSIEKEIENLNGKKEE-----LEEELEELEAALRDLESRLGDLKKERDELEAQL 898
Query: 520 RKRQAEVEEMRKKMEEERK--KRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELN 577
R+ + ++EE+ ++E++RK + EA E E + D E +E+ +++
Sbjct: 899 RELERKIEELEAQIEKKRKRLSELKAKLEALEEELSE-IEDPKGEDEEIPEEELSLEDVQ 957
Query: 578 KDKEREGEAI 587
+ +R E I
Sbjct: 958 AELQRVEEEI 967
Score = 33.5 bits (77), Expect = 1.5
Identities = 45/158 (28%), Positives = 67/158 (42%), Gaps = 15/158 (9%)
Query: 408 RLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTE 467
LE + R+ E R + ++ RE R+ E EA K + R E
Sbjct: 869 ELEELEAALRDLESRLGDLKKERDELEAQLRE-LERKIEELEAQIEKKRKRLSE---LKA 924
Query: 468 QMVISDEEEDKGAAAKKEP-------LSLEELLAKKKAEEEARSK--PKFLTKEERAAEA 518
++ +EE + K E LSLE++ A+ + EE P + + E
Sbjct: 925 KLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEV 984
Query: 519 LRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENF 556
L KR E++E R K+EEERK E +E + KRE F
Sbjct: 985 L-KRLDELKEKRAKLEEERKAILERIEEYE-KKKREVF 1020
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 51.1 bits (123), Expect = 6e-06
Identities = 88/369 (23%), Positives = 146/369 (39%), Gaps = 87/369 (23%)
Query: 744 EKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDAD 803
+K G +P +QR L +A TG GKT F L + +++ A
Sbjct: 75 KKTGS-KPWSLQRTWAKRLLLGESFAIIAPTGVGKT-TFGLVMSLYL-----------AK 121
Query: 804 QGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVV---GGLSREEQGFRLRL---GCE 857
+G + I+ PTR L +Q+ E+ KFG +G ++ +E++ F RL +
Sbjct: 122 KGKKSYIIFPTRLLVEQVVEKLEKFGEKVGCGVKILYYHSSLKKKEKEEFLERLKEGDFD 181
Query: 858 IVIATPGRLI---DVLENRYLVLNQCTYIVLDEAD------RMID-----MGF-EPDVQK 902
I++ T L D L + ++ +D+ D + ID +GF E D++K
Sbjct: 182 ILVTTSQFLSKNFDELPKKKF-----DFVFVDDVDAVLKSSKNIDKLLYLLGFSEEDIEK 236
Query: 903 ILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVM--FTATMPPAVERL-------- 952
+E + LK E+ ++ + L ++ K ++ V+ +AT P R+
Sbjct: 237 AMELI---RLKRKYEEIYEKIRELEEKIAELKDKKGVLVVSSATGRPRGNRVKLFRELLG 293
Query: 953 -----ARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFV 1007
YLR YI + +E++V ++ L G +IFV
Sbjct: 294 FEVGSPVFYLRNIVDSYI----VDEDSVEKLVELVKR----------LGDG----GLIFV 335
Query: 1008 NQKKGADV---LAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVA----TDVA 1060
KG + LA+ LE LG NA E + G D+LV V
Sbjct: 336 PSDKGKEYAEELAEYLEDLGINAELAI-----SGFERKFEKFEEGEVDVLVGVASYYGVL 390
Query: 1061 GRGIDIKDV 1069
RGID+ +
Sbjct: 391 VRGIDLPER 399
Score = 51.1 bits (123), Expect = 6e-06
Identities = 88/369 (23%), Positives = 146/369 (39%), Gaps = 87/369 (23%)
Query: 1419 EKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDAD 1478
+K G +P +QR L +A TG GKT F L + +++ A
Sbjct: 75 KKTGS-KPWSLQRTWAKRLLLGESFAIIAPTGVGKT-TFGLVMSLYL-----------AK 121
Query: 1479 QGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVV---GGLSREEQGFRLRL---GCE 1532
+G + I+ PTR L +Q+ E+ KFG +G ++ +E++ F RL +
Sbjct: 122 KGKKSYIIFPTRLLVEQVVEKLEKFGEKVGCGVKILYYHSSLKKKEKEEFLERLKEGDFD 181
Query: 1533 IVIATPGRLI---DVLENRYLVLNQCTYIVLDEAD------RMID-----MGF-EPDVQK 1577
I++ T L D L + ++ +D+ D + ID +GF E D++K
Sbjct: 182 ILVTTSQFLSKNFDELPKKKF-----DFVFVDDVDAVLKSSKNIDKLLYLLGFSEEDIEK 236
Query: 1578 ILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVM--FTATMPPAVERL-------- 1627
+E + LK E+ ++ + L ++ K ++ V+ +AT P R+
Sbjct: 237 AMELI---RLKRKYEEIYEKIRELEEKIAELKDKKGVLVVSSATGRPRGNRVKLFRELLG 293
Query: 1628 -----ARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFV 1682
YLR YI + +E++V ++ L G +IFV
Sbjct: 294 FEVGSPVFYLRNIVDSYI----VDEDSVEKLVELVKR----------LGDG----GLIFV 335
Query: 1683 NQKKGADV---LAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVA----TDVA 1735
KG + LA+ LE LG NA E + G D+LV V
Sbjct: 336 PSDKGKEYAEELAEYLEDLGINAELAI-----SGFERKFEKFEEGEVDVLVGVASYYGVL 390
Query: 1736 GRGIDIKDV 1744
RGID+ +
Sbjct: 391 VRGIDLPER 399
Score = 34.5 bits (80), Expect = 0.65
Identities = 86/384 (22%), Positives = 145/384 (37%), Gaps = 104/384 (27%)
Query: 3 EKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDAD 62
+K G +P +QR L +A TG GKT F L + +++ A
Sbjct: 75 KKTGS-KPWSLQRTWAKRLLLGESFAIIAPTGVGKT-TFGLVMSLYL-----------AK 121
Query: 63 QGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVV---GGLSREEQGFRLRL---GCE 116
+G + I+ PTR L +Q+ E+ KFG +G ++ +E++ F RL +
Sbjct: 122 KGKKSYIIFPTRLLVEQVVEKLEKFGEKVGCGVKILYYHSSLKKKEKEEFLERLKEGDFD 181
Query: 117 IVIATPGRLI---DVLENRYLVLNQCTYIVLDEAD------RMID-----MGF-EPDVQK 161
I++ T L D L + ++ +D+ D + ID +GF E D++K
Sbjct: 182 ILVTTSQFLSKNFDELPKKKF-----DFVFVDDVDAVLKSSKNIDKLLYLLGFSEEDIEK 236
Query: 162 ILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVM--FTATMPPAKILEYMPVTNLK 219
+E + LK E+ ++ + L ++ K ++ V+ +AT +
Sbjct: 237 AMELI---RLKRKYEEIYEKIRELEEKIAELKDKKGVLVVSSAT--------------GR 279
Query: 220 PDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTE 279
P N K +R+ + F V YLR YI +
Sbjct: 280 PRG-------------NRVKLFRELLGFE------VGS-PVFYLRNIVDSYI----VDED 315
Query: 280 RIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADV---LAKGLEKLGYNACTL 336
+E++V ++ L G +IFV KG + LA+ LE LG NA
Sbjct: 316 SVEKLVELVKR----------LGDG----GLIFVPSDKGKEYAEELAEYLEDLGINAELA 361
Query: 337 HGGKGQEQRELALNSLKGGSKDIL 360
E + G D+L
Sbjct: 362 I-----SGFERKFEKFEEGEVDVL 380
>gnl|CDD|215597 PLN03137, PLN03137, ATP-dependent DNA helicase; Q4-like; Provisional.
Length = 1195
Score = 50.3 bits (120), Expect = 1e-05
Identities = 40/128 (31%), Positives = 59/128 (46%), Gaps = 9/128 (7%)
Query: 1679 IIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKD---ILVATDVA 1735
II+ + + +A+ L++ G+ A HG QR K SKD I+ AT
Sbjct: 684 IIYCLSRMDCEKVAERLQEFGHKAAFYHGSMDPAQRAFVQ---KQWSKDEINIICATVAF 740
Query: 1736 GRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKDDSHLFYDLKQMMI 1795
G GI+ DV VI++ + KSIE Y GR GR G+ V + + D + +K M+
Sbjct: 741 GMGINKPDVRFVIHHSLPKSIEGYHQECGRAGRDGQRSSCVLYYSYSD---YIRVKHMIS 797
Query: 1796 SSPVSTCP 1803
V P
Sbjct: 798 QGGVEQSP 805
Score = 49.5 bits (118), Expect = 2e-05
Identities = 34/96 (35%), Positives = 48/96 (50%), Gaps = 6/96 (6%)
Query: 1004 IIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKD---ILVATDVA 1060
II+ + + +A+ L++ G+ A HG QR K SKD I+ AT
Sbjct: 684 IIYCLSRMDCEKVAERLQEFGHKAAFYHGSMDPAQRAFVQ---KQWSKDEINIICATVAF 740
Query: 1061 GRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGK 1096
G GI+ DV VI++ + KSIE Y GR GR G+
Sbjct: 741 GMGINKPDVRFVIHHSLPKSIEGYHQECGRAGRDGQ 776
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 48.3 bits (115), Expect = 3e-05
Identities = 34/165 (20%), Positives = 75/165 (45%), Gaps = 1/165 (0%)
Query: 426 RDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKE 485
R + +++ + + +++ ++ K ++ ++ + EQ +++ + A K+
Sbjct: 54 RIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQ 113
Query: 486 PLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTK 545
++ + KA++ A +K K + E+ A+ K+QAE E K E +KK E K
Sbjct: 114 AEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKK 173
Query: 546 EASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKER 590
+A E+K + +A+ + E+ K E + E +A E
Sbjct: 174 KAEAEAKAKA-EAKAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEA 217
Score = 34.8 bits (80), Expect = 0.37
Identities = 34/150 (22%), Positives = 59/150 (39%), Gaps = 21/150 (14%)
Query: 409 LERDRSREREREKRHHRRDRSKE---RDGKDRREGYRRERREEEASGSKHKSRDKEGYEP 465
E+ R+ E+ R+K +R +++ + + ++ ++++ EEA + +
Sbjct: 80 AEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAE 139
Query: 466 TEQMVISD-----EEEDKG------------AAAKKEPLSLEELLAKKKAEEEARSKPKF 508
E+ + EEE K A K E + + AK KA+ E
Sbjct: 140 AEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAE 199
Query: 509 LTKEERAAEALRKRQAEVEEMRKKMEEERK 538
K + AAEA K +AE E ERK
Sbjct: 200 AAKAKAAAEAAAKAEAE-AAAAAAAEAERK 228
>gnl|CDD|233251 TIGR01054, rgy, reverse gyrase. This model describes reverse gyrase,
found in both archaeal and bacterial thermophiles. This
enzyme, a fusion of a type I topoisomerase domain and a
helicase domain, introduces positive supercoiling to
increase the melting temperature of DNA double strands.
Generally, these gyrases are encoded as a single
polypeptide. An exception was found in Methanopyrus
kandleri, where enzyme is split within the topoisomerase
domain, yielding a heterodimer of gene products
designated RgyB and RgyA [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1171
Score = 48.7 bits (116), Expect = 3e-05
Identities = 83/363 (22%), Positives = 147/363 (40%), Gaps = 59/363 (16%)
Query: 738 EILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIA 797
E E +K +EP IQ+ L+ +A TG GKT F L + +++
Sbjct: 66 EFEEFFKKAVGSEPWSIQKMWAKRVLRGDSFAIIAPTGVGKT-TFGLAMSLFL------- 117
Query: 798 RMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSR----EEQGFRLR 853
A +G I+ PT L Q+ E+ + G+ TV + SR E++ F R
Sbjct: 118 ----AKKGKRCYIILPTTLLVIQVAEKISSLAEKAGVGTVNIGAYHSRLPTKEKKEFMER 173
Query: 854 L---GCEIVIATPGRL---IDVLENRYLVLNQCTYIVLDEAD------RMID-----MGF 896
+ +I+I T L D L ++ +I +D+ D + +D +GF
Sbjct: 174 IENGDFDILITTTMFLSKNYDELGPKF------DFIFVDDVDALLKASKNVDKLLKLLGF 227
Query: 897 -EPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARS 955
E ++K + + LK + L KK ++ +AT P R R+
Sbjct: 228 SEELIEKAWKLIR-LRLKLYRALHAKKRLELLEAIPGKKRGCLIVSSATGRP---RGKRA 283
Query: 956 YLRRPATVYIGSVGKPTERIEQIVYILSE-QDKRKKLMEVLNRGVKKPVIIFVNQKKGAD 1014
L R + VG ++ + +V + E +D ++ L+E++ + + I++V+ G +
Sbjct: 284 KLFRELLGF--EVGGGSDTLRNVVDVYVEDEDLKETLLEIVKK-LGTGGIVYVSIDYGKE 340
Query: 1015 V---LAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVAT----DVAGRGIDIK 1067
+A+ LE G A H K +E E G D+L+ RG+D+
Sbjct: 341 KAEEIAEFLENHGVKAVAYHATKPKEDYEK----FAEGEIDVLIGVASYYGTLVRGLDLP 396
Query: 1068 DVS 1070
+
Sbjct: 397 ERV 399
Score = 48.7 bits (116), Expect = 3e-05
Identities = 83/363 (22%), Positives = 147/363 (40%), Gaps = 59/363 (16%)
Query: 1413 EILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIA 1472
E E +K +EP IQ+ L+ +A TG GKT F L + +++
Sbjct: 66 EFEEFFKKAVGSEPWSIQKMWAKRVLRGDSFAIIAPTGVGKT-TFGLAMSLFL------- 117
Query: 1473 RMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSR----EEQGFRLR 1528
A +G I+ PT L Q+ E+ + G+ TV + SR E++ F R
Sbjct: 118 ----AKKGKRCYIILPTTLLVIQVAEKISSLAEKAGVGTVNIGAYHSRLPTKEKKEFMER 173
Query: 1529 L---GCEIVIATPGRL---IDVLENRYLVLNQCTYIVLDEAD------RMID-----MGF 1571
+ +I+I T L D L ++ +I +D+ D + +D +GF
Sbjct: 174 IENGDFDILITTTMFLSKNYDELGPKF------DFIFVDDVDALLKASKNVDKLLKLLGF 227
Query: 1572 -EPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARS 1630
E ++K + + LK + L KK ++ +AT P R R+
Sbjct: 228 SEELIEKAWKLIR-LRLKLYRALHAKKRLELLEAIPGKKRGCLIVSSATGRP---RGKRA 283
Query: 1631 YLRRPATVYIGSVGKPTERIEQIVYILSE-QDKRKKLMEVLNRGVKKPVIIFVNQKKGAD 1689
L R + VG ++ + +V + E +D ++ L+E++ + + I++V+ G +
Sbjct: 284 KLFRELLGF--EVGGGSDTLRNVVDVYVEDEDLKETLLEIVKK-LGTGGIVYVSIDYGKE 340
Query: 1690 V---LAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVAT----DVAGRGIDIK 1742
+A+ LE G A H K +E E G D+L+ RG+D+
Sbjct: 341 KAEEIAEFLENHGVKAVAYHATKPKEDYEK----FAEGEIDVLIGVASYYGTLVRGLDLP 396
Query: 1743 DVS 1745
+
Sbjct: 397 ERV 399
Score = 33.6 bits (77), Expect = 1.4
Identities = 77/355 (21%), Positives = 135/355 (38%), Gaps = 74/355 (20%)
Query: 3 EKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDAD 62
+K +EP IQ+ L+ +A TG GKT F L + +++ A
Sbjct: 72 KKAVGSEPWSIQKMWAKRVLRGDSFAIIAPTGVGKT-TFGLAMSLFL-----------AK 119
Query: 63 QGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSR----EEQGFRLRL---GC 115
+G I+ PT L Q+ E+ + G+ TV + SR E++ F R+
Sbjct: 120 KGKRCYIILPTTLLVIQVAEKISSLAEKAGVGTVNIGAYHSRLPTKEKKEFMERIENGDF 179
Query: 116 EIVIATPGRL---IDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPVTNLK 172
+I+I T L D L ++ +I +D+ D ++ +V K+L+ L
Sbjct: 180 DILITTTMFLSKNYDELGPKF------DFIFVDDVDALLKAS--KNVDKLLKL-----LG 226
Query: 173 PDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAKILEYMPVTNLKPDTEDAEDENKLL 232
E E KL+ LK + L
Sbjct: 227 FSEELIEKAWKLIR-----------------------------LRLKLYRALHAKKRLEL 257
Query: 233 ANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSE-Q 291
KK ++ +AT P R R+ L R + VG ++ + +V + E +
Sbjct: 258 LEAIPGKKRGCLIVSSATGRP---RGKRAKLFRELLGF--EVGGGSDTLRNVVDVYVEDE 312
Query: 292 DKRKKLMEVLNRGVKKPVIIFVNQKKGADV---LAKGLEKLGYNACTLHGGKGQE 343
D ++ L+E++ + + I++V+ G + +A+ LE G A H K +E
Sbjct: 313 DLKETLLEIVKK-LGTGGIVYVSIDYGKEKAEEIAEFLENHGVKAVAYHATKPKE 366
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 48.6 bits (116), Expect = 4e-05
Identities = 49/255 (19%), Positives = 105/255 (41%), Gaps = 14/255 (5%)
Query: 365 RRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHH 424
S + KR S +++R+ + E + + + L+ + + +
Sbjct: 649 SGSITGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELR 708
Query: 425 RRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKK 484
R+ ER ++ + E E S+ + ++E E E++ +E +++ ++
Sbjct: 709 RQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEEL---EELQERLEELEE 765
Query: 485 EPLSLEELLAKKKAEEEARSKPKFLTKEERAA--EALRKRQAEVEEMRKKMEEERKKRQE 542
E SLEE LAK K E E + + +EE E L + + ++ + +++E ++R+
Sbjct: 766 ELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRER 825
Query: 543 FTKEASF--------ESKRENFDARLRRDREKKKEDPEE-KELNKDKEREGEAIKERYLG 593
+E E K + + L ++ +E EE +EL +KE + +KE
Sbjct: 826 LEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEE 885
Query: 594 LVKKKRRVRRLNDRK 608
+ + +R L
Sbjct: 886 KEELEEELRELESEL 900
Score = 45.1 bits (107), Expect = 4e-04
Identities = 42/258 (16%), Positives = 92/258 (35%), Gaps = 27/258 (10%)
Query: 378 SRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSR---------EREREKRHHRRDR 428
E+ R+ E + + + + ++RLE + + E R
Sbjct: 729 EEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKR 788
Query: 429 SKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLS 488
++ + E E + + ++ E EQ + EEE + K + L
Sbjct: 789 QALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELE 848
Query: 489 LEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEM-------RKKMEEERKKRQ 541
E +K+ EE + ++E + L++ + E EE+ ++ E +++ +
Sbjct: 849 EELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIE 908
Query: 542 EFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKER--YLGLV---- 595
+ + + + E++ E+ E L + ERE E ++E LG V
Sbjct: 909 KLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRA 968
Query: 596 -----KKKRRVRRLNDRK 608
+ + R L ++
Sbjct: 969 IEEYEEVEERYEELKSQR 986
Score = 37.0 bits (86), Expect = 0.11
Identities = 49/316 (15%), Positives = 101/316 (31%), Gaps = 11/316 (3%)
Query: 376 RRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGK 435
R + Y ++ + R E + E + +E +
Sbjct: 206 ERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEA 265
Query: 436 DRR----EGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEE 491
++ + E REE + KE E E + E + + E LEE
Sbjct: 266 EKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLE--ELENELEELEE 323
Query: 492 LLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFES 551
L + K + EA + L + E E L + AE+EE +++ EE+ E FE+
Sbjct: 324 RLEELKEKIEALKEE--LEERETLLEELEQLLAELEEAKEE-LEEKLSALLEELEELFEA 380
Query: 552 KRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKFVF 611
RE L + + + + EE + + E L +K++ + +
Sbjct: 381 LREELA-ELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQT 439
Query: 612 DWDASEDTSVDYNSIYKERHQVQFFGRGNIAGI-DIKAQKRDQSKFYGEMLEKRRTEAEK 670
+ + + + +E +A + + + + L++ E
Sbjct: 440 ELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRA 499
Query: 671 EQEKVRLKKVKKREEK 686
Q + + +
Sbjct: 500 SQGVRAVLEALESGLP 515
Score = 37.0 bits (86), Expect = 0.13
Identities = 54/256 (21%), Positives = 101/256 (39%), Gaps = 53/256 (20%)
Query: 443 RERRE--EEASG-SKHKSRDKEGYEPTEQMVIS-DEEEDKGAAAKKEPLSLEELL-AKKK 497
ERR+ EEA+G SK+K R +E E+ + + ED LEEL +K
Sbjct: 155 EERRKLIEEAAGVSKYKERKEEAERKLERTEENLERLEDL----------LEELEKQLEK 204
Query: 498 AEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTK-----EASFESK 552
E +A ++ + R+ E+ + K++E RK+ +E + E E
Sbjct: 205 LERQAEKAERYQELKA------ELRELELALLLAKLKELRKELEELEEELSRLEEELEEL 258
Query: 553 RENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKFVFD 612
+E + + E K E +EL ++ E E + E + + + + L +R
Sbjct: 259 QEELEEAEKEIEELKS---ELEELREELEELQEELLELKEEIEELEGEISLLRERL---- 311
Query: 613 WDASEDTSVDYNSIYKERHQVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQ 672
E+ + + + +++ I+A K E+ E+ E EQ
Sbjct: 312 ----EELENELEELEERLEELKE---------KIEALKE-------ELEERETLLEELEQ 351
Query: 673 EKVRLKKVKKREEKQK 688
L++ K+ E++
Sbjct: 352 LLAELEEAKEELEEKL 367
Score = 33.5 bits (77), Expect = 1.2
Identities = 36/217 (16%), Positives = 75/217 (34%), Gaps = 17/217 (7%)
Query: 389 RERDTDRSRDDRGRDRDRDRLERDRSRERERE--KRHHRRDRSKERDGKDRREGYRRERR 446
E R R + + + ER + + E K + + + +
Sbjct: 302 GEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKE 361
Query: 447 EEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPL-----SLEELLAKKKAEEE 501
E E S +E +E + + E E + E L SLEE L + E
Sbjct: 362 ELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLE 421
Query: 502 ARSK------PKFLTKEERAAE---ALRKRQAEVEEMRKKMEE-ERKKRQEFTKEASFES 551
+ + + E L + + ++EE+R +++E ER+ + + E
Sbjct: 422 DLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEK 481
Query: 552 KRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIK 588
+ + +ARL R +++ + + + E +
Sbjct: 482 ELSSLEARLDRLEAEQRASQGVRAVLEALESGLPGVY 518
Score = 32.4 bits (74), Expect = 3.0
Identities = 25/134 (18%), Positives = 46/134 (34%), Gaps = 5/134 (3%)
Query: 408 RLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTE 467
E++ E RE + +E + R + E + E
Sbjct: 883 EEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYE-- 940
Query: 468 QMVISDEEEDKGAAAKKEPLSLEELLAKKKAE-EEARSKPKFLTKE-ERAAEALRKRQAE 525
+ E E + ++E +L + + E EE + + L + E EA K
Sbjct: 941 -DTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEV 999
Query: 526 VEEMRKKMEEERKK 539
+EE+ K+ E K+
Sbjct: 1000 IEELDKEKRERFKE 1013
Score = 31.6 bits (72), Expect = 5.1
Identities = 25/140 (17%), Positives = 55/140 (39%), Gaps = 7/140 (5%)
Query: 408 RLERDRSREREREKRHHRRDRSKE-RDGKDRREGYRRERREEEASGSKHKSRDKEGYEPT 466
E++ + +E + + +E R+ + + E + + +++ +
Sbjct: 869 EAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLER----L 924
Query: 467 EQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEV 526
E + EEE + LE + + + E EA EE E + +R E+
Sbjct: 925 EVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEY--EEVEERYEEL 982
Query: 527 EEMRKKMEEERKKRQEFTKE 546
+ R+ +EE ++K E +E
Sbjct: 983 KSQREDLEEAKEKLLEVIEE 1002
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 47.1 bits (112), Expect = 6e-05
Identities = 43/213 (20%), Positives = 82/213 (38%), Gaps = 20/213 (9%)
Query: 385 YDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRE 444
Y+R +++ R + R + + E ++K+ ++R K+ + + +++
Sbjct: 64 YNRQQQQQKSAKRAEEQRKKKEQQQA-----EELQQKQAAEQERLKQLEKERLAAQEQKK 118
Query: 445 RREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARS 504
+ EE A + K + E + A A+ E A KKA EA+
Sbjct: 119 QAEEAAKQAALKQKQAE-------EAAAKAAAAAKAKAEAEAKRAAA--AAKKAAAEAKK 169
Query: 505 KPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDR 564
K + ++ AAEA K++AE E K E +KK + K+ + ++ A +
Sbjct: 170 KAEAEAAKKAAAEA--KKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAA 227
Query: 565 EK----KKEDPEEKELNKDKEREGEAIKERYLG 593
K K E+ K E+ A +
Sbjct: 228 AKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVD 260
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 47.0 bits (111), Expect = 8e-05
Identities = 44/208 (21%), Positives = 84/208 (40%), Gaps = 16/208 (7%)
Query: 375 KRRSRSRERDYDRV-----RERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRS 429
+R +R ER +R R+++ + D+ R++ D E E + + +
Sbjct: 61 ERLARREERRDERFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESR 120
Query: 430 KERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSL 489
+ER+ + EG + ++ + E+ EEE+K E +
Sbjct: 121 EEREEVEETEGVTKSEQKND--------WRDAEECQKEEKEPEPEEEEKPKRGSLEENNG 172
Query: 490 EELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASF 549
E + K K E S+ + A + K + + +E ++EE +KKR+E K
Sbjct: 173 EFMTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEE 232
Query: 550 ES---KRENFDARLRRDREKKKEDPEEK 574
E K+E D + R + EK++ E +
Sbjct: 233 EEQRRKQEEADRKSREEEEKRRLKEEIE 260
Score = 46.2 bits (109), Expect = 1e-04
Identities = 51/224 (22%), Positives = 94/224 (41%), Gaps = 7/224 (3%)
Query: 370 RSPPRKRRSRSRERDY-DRVRERD---TDRSRDDRGRDRDRDRLERD-RSREREREKRHH 424
R R+ R R + +R +E TD+S + R D + E E+E+
Sbjct: 62 RLARREERRDERFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESRE 121
Query: 425 RRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKK 484
R+ +E +G + E R EE + + +E +P + + E K
Sbjct: 122 EREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKH 181
Query: 485 EPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFT 544
+ A+ E + K K++ AA L + + + EE RK +EEE ++R++
Sbjct: 182 TENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQ-- 239
Query: 545 KEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIK 588
+EA +S+ E RL+ + E+++ + EK ++ E K
Sbjct: 240 EEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVPEDGLSEDKK 283
Score = 40.4 bits (94), Expect = 0.007
Identities = 56/251 (22%), Positives = 105/251 (41%), Gaps = 16/251 (6%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRD-------DRGRDRDRDRLERDRS 414
A +RR R R + R + E +V + S++ D + LER
Sbjct: 8 ARERRRRERE--EQLRQKQEEGSLGQVTTQVEVNSQNSVPDEESKTSTDDEAALLERLAR 65
Query: 415 REREREKR-HHRRDRSKERDGKDRREGYRR-ERREEEASGSKHKSRDKEGYEPTEQMVIS 472
RE R++R +R KE + RR +E SG+++++ ++E E + +
Sbjct: 66 REERRDERFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREE 125
Query: 473 DEEEDKGAAAKKEPLSLEELLAKKKA---EEEARSKPKFLTKEERAAEALRKRQAEVEEM 529
EE + ++++ + +K+ E E KPK + EE E + + E
Sbjct: 126 VEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENT 185
Query: 530 RKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKE 589
+ E + E KE + K++ +A L + KKK + K L ++++R + +
Sbjct: 186 FSRGGAE-GAQVEAGKEFE-KLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEAD 243
Query: 590 RYLGLVKKKRR 600
R ++KRR
Sbjct: 244 RKSREEEEKRR 254
Score = 35.4 bits (81), Expect = 0.33
Identities = 31/212 (14%), Positives = 76/212 (35%), Gaps = 9/212 (4%)
Query: 342 QEQRELALNSLKGGSKDILMAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRG 401
Q++ + M D + + + + + SRE + +S
Sbjct: 81 QKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKND 140
Query: 402 RDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKE 461
+ + ++ E E E+ + ++ E + + E + S+ + +
Sbjct: 141 WRDAEECQKEEKEPEPEEEE-----KPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQ 195
Query: 462 GYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRK 521
E + ++++ ++ EE ++K EE + K + ++ E K
Sbjct: 196 VEAGKEFEKLKQKQQEAALELEELKKKREE---RRKVLEEEEQRRKQEEADRKSREEEEK 252
Query: 522 RQAEVEEMRKKMEEERKKRQEFTKEASFESKR 553
R EE+ ++ E +KRQ+ ++ E K+
Sbjct: 253 R-RLKEEIERRRAEAAEKRQKVPEDGLSEDKK 283
Score = 33.1 bits (75), Expect = 1.4
Identities = 51/275 (18%), Positives = 100/275 (36%), Gaps = 13/275 (4%)
Query: 436 DRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAK 495
D E R RR E + K + + T Q V + + K E L +
Sbjct: 3 DEEEAARERRRREREEQLRQKQEEGSLGQVTTQ-VEVNSQNSVPDEESKTSTDDEAALLE 61
Query: 496 KKAEEEARSKPKFLTKEERAAE-ALRKRQAEVEEMRKKMEEERKKRQEFTKEASFE---S 551
+ A E R +F ER E + E ++M+E+ E +E E
Sbjct: 62 RLARREERRDERFSEALERQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESRE 121
Query: 552 KRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKFVF 611
+RE + + ++K D + E + +E+E E +E + K
Sbjct: 122 EREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKH 181
Query: 612 DWDASEDTSVDYNSIYKERHQVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKE 671
+ + + + + + A ++++ K+ + E+R+ E+E
Sbjct: 182 TENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKRE-------ERRKVLEEEE 234
Query: 672 QEKVRLKKVKK-REEKQKWDDRHWTEKSLDEMTER 705
Q + + + +K REE++K + E+ E E+
Sbjct: 235 QRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEK 269
>gnl|CDD|237497 PRK13767, PRK13767, ATP-dependent helicase; Provisional.
Length = 876
Score = 46.8 bits (112), Expect = 1e-04
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 7 YAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPY 66
+ TP QR AIP+ + ++++ + TGSGKTLA L + I L ++ R + + Y
Sbjct: 30 FGTFTPPQRYAIPLIHEGKNVLISSPTGSGKTLAAFLAI---IDELFRLGREGELEDKVY 86
Query: 67 AIIMAPTRELAQQIE 81
+ ++P R L I
Sbjct: 87 CLYVSPLRALNNDIH 101
Score = 46.8 bits (112), Expect = 1e-04
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 748 YAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPY 807
+ TP QR AIP+ + ++++ + TGSGKTLA L + I L ++ R + + Y
Sbjct: 30 FGTFTPPQRYAIPLIHEGKNVLISSPTGSGKTLAAFLAI---IDELFRLGREGELEDKVY 86
Query: 808 AIIMAPTRELAQQIE 822
+ ++P R L I
Sbjct: 87 CLYVSPLRALNNDIH 101
Score = 46.8 bits (112), Expect = 1e-04
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 1423 YAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPY 1482
+ TP QR AIP+ + ++++ + TGSGKTLA L + I L ++ R + + Y
Sbjct: 30 FGTFTPPQRYAIPLIHEGKNVLISSPTGSGKTLAAFLAI---IDELFRLGREGELEDKVY 86
Query: 1483 AIIMAPTRELAQQIE 1497
+ ++P R L I
Sbjct: 87 CLYVSPLRALNNDIH 101
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 45.7 bits (108), Expect = 2e-04
Identities = 41/206 (19%), Positives = 82/206 (39%), Gaps = 15/206 (7%)
Query: 385 YDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRE 444
Y R++ + + + ++ R + + E+ + ++ R + E K+R + ++
Sbjct: 64 YGRIQSQQSSAKKGEQQR---KKKEEQVAEELKPKQAAEQERLKQLE---KERLKAQEQQ 117
Query: 445 RREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARS 504
++ EEA K E + EQ + E+ K A A K + E AK KA EA+
Sbjct: 118 KQAEEAE----KQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEA--AKLKAAAEAKK 171
Query: 505 KPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDR 564
K + + +AAE + + +K E + ++ EA ++K E ++
Sbjct: 172 KAE---EAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEK 228
Query: 565 EKKKEDPEEKELNKDKEREGEAIKER 590
++ + DK ER
Sbjct: 229 AAAEKKKAAAKAKADKAAAAAKAAER 254
Score = 40.3 bits (94), Expect = 0.009
Identities = 43/224 (19%), Positives = 90/224 (40%), Gaps = 38/224 (16%)
Query: 366 RSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHR 425
R +S+ K+ + R++ ++V E + ++ R + +LE++R + +E++K+
Sbjct: 66 RIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLK---QLEKERLKAQEQQKQAEE 122
Query: 426 RDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKE 485
++ + + K + E R+ E++ K++ E AAA+ +
Sbjct: 123 AEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAA------------EAAKLKAAAEAK 170
Query: 486 PLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTK 545
+ E KA EEA++K A A K++AE E K E+ K E
Sbjct: 171 KKAEEA----AKAAEEAKAK---------AEAAAAKKKAEAEA--KAAAEKAKAEAEAKA 215
Query: 546 EASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKE 589
+A +++ EK + ++ ++ A K
Sbjct: 216 KAEKKAEAAA--------EEKAAAEKKKAAAKAKADKAAAAAKA 251
Score = 37.6 bits (87), Expect = 0.058
Identities = 26/123 (21%), Positives = 59/123 (47%), Gaps = 14/123 (11%)
Query: 477 DKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEE 536
D GA ++ + + KK E++ + K E+ AE L+ +QA E+E
Sbjct: 56 DPGAVVQQYGRIQSQQSSAKKGEQQRKKKE------EQVAEELKPKQAA--------EQE 101
Query: 537 RKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVK 596
R K+ E + + E +++ +A + E+K+++ + ++ +++++ EA K +
Sbjct: 102 RLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAA 161
Query: 597 KKR 599
K +
Sbjct: 162 KLK 164
Score = 37.2 bits (86), Expect = 0.067
Identities = 32/128 (25%), Positives = 57/128 (44%), Gaps = 8/128 (6%)
Query: 475 EEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKME 534
E+ + KKE EEL K+ AE+E + ++ER +++QAE E + ++E
Sbjct: 78 EQQR---KKKEEQVAEELKPKQAAEQERLKQ----LEKERLKAQEQQKQAEEAEKQAQLE 130
Query: 535 EERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKEL-NKDKEREGEAIKERYLG 593
+++++ Q A + K E A+ + K K E K+ + + EA +
Sbjct: 131 QKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAA 190
Query: 594 LVKKKRRV 601
KKK
Sbjct: 191 AAKKKAEA 198
>gnl|CDD|224123 COG1202, COG1202, Superfamily II helicase, archaea-specific [General
function prediction only].
Length = 830
Score = 45.9 bits (109), Expect = 2e-04
Identities = 83/386 (21%), Positives = 145/386 (37%), Gaps = 64/386 (16%)
Query: 731 KEASLPTEILEIIEKIGYAEPTPIQRQAIPIGL-QNRDIIGVAETGSGKTLAFLLPLLVW 789
E +P + ++++ G E P+Q A+ GL + +++ V+ T SGKTL
Sbjct: 197 DELDIPEKFKRMLKREGIEELLPVQVLAVEAGLLEGENLLVVSATASGKTL--------- 247
Query: 790 IQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQG 849
I L I R+ G + + P LA Q E+ + + LG++ + V G+S
Sbjct: 248 IGELAGIPRL--LSGGKKMLFLVPLVALANQKYEDFKERYSKLGLKVAIRV-GMS----- 299
Query: 850 FRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPV 909
R++ E V+ D++ Y + D ++ G + + +
Sbjct: 300 -RIKTREEPVVVDTSPDADIIVGTY-----------EGIDYLLRTGKDLGDIGTVVIDEI 347
Query: 910 TNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARS-------YLRRPAT 962
L+ D E + L+ Q + +AT+ E LA+ Y RP
Sbjct: 348 HTLE-DEERGPRLDGLIGRLRYLFPGAQFIYLSATVGNPEE-LAKKLGAKLVLYDERPV- 404
Query: 963 VYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNR--------GVKKPVIIFVNQKKGAD 1014
P ER +V+ +E +K + ++ R G + I+F ++
Sbjct: 405 --------PLER--HLVFARNESEKWDIIARLVKREFSTESSKGYRGQTIVFTYSRRRCH 454
Query: 1015 VLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVIN 1074
LA L G A H G ++R+ + +V T G+D S VI
Sbjct: 455 ELADALTGKGLKAAPYHAGLPYKERKSVERAFAAQELAAVVTTAALAAGVDFP-ASQVIF 513
Query: 1075 YDMAKSIE-----DYTHRIGRTGRAG 1095
+A IE ++ +GR GR
Sbjct: 514 ESLAMGIEWLSVREFQQMLGRAGRPD 539
Score = 45.9 bits (109), Expect = 2e-04
Identities = 83/386 (21%), Positives = 145/386 (37%), Gaps = 64/386 (16%)
Query: 1406 KEASLPTEILEIIEKIGYAEPTPIQRQAIPIGL-QNRDIIGVAETGSGKTLAFLLPLLVW 1464
E +P + ++++ G E P+Q A+ GL + +++ V+ T SGKTL
Sbjct: 197 DELDIPEKFKRMLKREGIEELLPVQVLAVEAGLLEGENLLVVSATASGKTL--------- 247
Query: 1465 IQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQG 1524
I L I R+ G + + P LA Q E+ + + LG++ + V G+S
Sbjct: 248 IGELAGIPRL--LSGGKKMLFLVPLVALANQKYEDFKERYSKLGLKVAIRV-GMS----- 299
Query: 1525 FRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADRMIDMGFEPDVQKILEYMPV 1584
R++ E V+ D++ Y + D ++ G + + +
Sbjct: 300 -RIKTREEPVVVDTSPDADIIVGTY-----------EGIDYLLRTGKDLGDIGTVVIDEI 347
Query: 1585 TNLKPDTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARS-------YLRRPAT 1637
L+ D E + L+ Q + +AT+ E LA+ Y RP
Sbjct: 348 HTLE-DEERGPRLDGLIGRLRYLFPGAQFIYLSATVGNPEE-LAKKLGAKLVLYDERPV- 404
Query: 1638 VYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNR--------GVKKPVIIFVNQKKGAD 1689
P ER +V+ +E +K + ++ R G + I+F ++
Sbjct: 405 --------PLER--HLVFARNESEKWDIIARLVKREFSTESSKGYRGQTIVFTYSRRRCH 454
Query: 1690 VLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVIN 1749
LA L G A H G ++R+ + +V T G+D S VI
Sbjct: 455 ELADALTGKGLKAAPYHAGLPYKERKSVERAFAAQELAAVVTTAALAAGVDFP-ASQVIF 513
Query: 1750 YDMAKSIE-----DYTHRIGRTGRAG 1770
+A IE ++ +GR GR
Sbjct: 514 ESLAMGIEWLSVREFQQMLGRAGRPD 539
Score = 35.1 bits (81), Expect = 0.43
Identities = 28/101 (27%), Positives = 48/101 (47%), Gaps = 12/101 (11%)
Query: 1 IIEKIGYAEPTPIQRQAIPIGL-QNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARME 59
++++ G E P+Q A+ GL + +++ V+ T SGKTL I L I R+
Sbjct: 208 MLKREGIEELLPVQVLAVEAGLLEGENLLVVSATASGKTL---------IGELAGIPRL- 257
Query: 60 DADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVG 100
G + + P LA Q E+ + + LG++ + VG
Sbjct: 258 -LSGGKKMLFLVPLVALANQKYEDFKERYSKLGLKVAIRVG 297
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 44.5 bits (106), Expect = 6e-04
Identities = 28/144 (19%), Positives = 57/144 (39%), Gaps = 3/144 (2%)
Query: 460 KEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEAL 519
+ G E +EEE + AK+E + A++ A EA + K AE
Sbjct: 26 ELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEA 85
Query: 520 RKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKD 579
+ A E + E+E + E A+ K + + + + KKK+ +
Sbjct: 86 AEAAAAAEAAARPAEDEAARPAE---AAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGK 142
Query: 580 KEREGEAIKERYLGLVKKKRRVRR 603
+ + G+ + R ++K++ ++
Sbjct: 143 RGKGGKGRRRRRGRRRRRKKKKKQ 166
Score = 37.9 bits (89), Expect = 0.052
Identities = 25/129 (19%), Positives = 50/129 (38%), Gaps = 2/129 (1%)
Query: 482 AKKEPLSLEELLAK-KKAEEEARSKPKFLTKEE-RAAEALRKRQAEVEEMRKKMEEERKK 539
AK+ +S +ELL K K+ E +S + +EE R EA R+ + E + ++ +
Sbjct: 10 AKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAE 69
Query: 540 RQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKR 599
+ + A+ E +A + + + E + R +A K K +
Sbjct: 70 EEAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKKGPKPK 129
Query: 600 RVRRLNDRK 608
+ +
Sbjct: 130 KKKPKRKAA 138
Score = 37.5 bits (88), Expect = 0.084
Identities = 25/133 (18%), Positives = 45/133 (33%)
Query: 476 EDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEE 535
++ G K ++EE A+K+ + + EE AA + EE
Sbjct: 25 KELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEE 84
Query: 536 ERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLV 595
+ A A R K K+ ++K+ K K+++ + R
Sbjct: 85 AAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRG 144
Query: 596 KKKRRVRRLNDRK 608
K + RR R+
Sbjct: 145 KGGKGRRRRRGRR 157
Score = 33.3 bits (77), Expect = 1.3
Identities = 35/157 (22%), Positives = 59/157 (37%), Gaps = 13/157 (8%)
Query: 431 ERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLE 490
E + + E R E +A + + + E E + EE AAA E
Sbjct: 39 EEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEA---- 94
Query: 491 ELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFE 550
A + AE+EA E AA + ++A ++ K ++++ KR+ +
Sbjct: 95 ---AARPAEDEAARPA------EAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGK 145
Query: 551 SKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAI 587
+ R RR R KKK+ + E + E I
Sbjct: 146 GGKGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPETI 182
>gnl|CDD|152107 pfam11671, Apis_Csd, Complementary sex determiner protein. This
family of proteins represents the complementary sex
determiner in the honeybee. In the honeybee, the
mechanism of sex determination depends on the csd gene
which produces an SR-type protein. Males are homozygous
while females are homozygous for the csd gene.
Heterozygosity generates an active protein which
initiates female development.
Length = 146
Score = 41.2 bits (96), Expect = 0.001
Identities = 21/45 (46%), Positives = 29/45 (64%)
Query: 374 RKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSRERE 418
RKR SRSRER+ + ++ R + R+R RDR ER+RSRE +
Sbjct: 3 RKRYSRSREREQKSYKNENSYREYRETSRERSRDRTERERSREHK 47
Score = 38.5 bits (89), Expect = 0.007
Identities = 17/44 (38%), Positives = 24/44 (54%)
Query: 365 RRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDR 408
R+ SRS R+++S E Y RE +RSRD R+R R+
Sbjct: 3 RKRYSRSREREQKSYKNENSYREYRETSRERSRDRTERERSREH 46
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 42.6 bits (101), Expect = 0.001
Identities = 48/207 (23%), Positives = 90/207 (43%), Gaps = 8/207 (3%)
Query: 403 DRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEG 462
+R++ R E R +ERE+ +R +E D + +E +RE++++ + ++
Sbjct: 82 EREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQE--KREKQKKLREEIDEFNEERIE 139
Query: 463 YEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKR 522
+ E+ + EE+ + + E + + E K + + + E
Sbjct: 140 RK--EEEKEREREEELKILEYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDE 197
Query: 523 QAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKER 582
+ E++E+R + +E +R+E KE KR L+R RE E EEKE +ER
Sbjct: 198 REELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRARE---EQIEEKEERLQEER 254
Query: 583 EGE-AIKERYLGLVKKKRRVRRLNDRK 608
E A +ER L + + + N K
Sbjct: 255 AEEEAERERMLEKQAEDEELEQENAEK 281
Score = 36.8 bits (86), Expect = 0.10
Identities = 47/195 (24%), Positives = 81/195 (41%), Gaps = 34/195 (17%)
Query: 402 RDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKE 461
R+ +R+ R+R E+ERE R + + D ++ + R + +EE + K R KE
Sbjct: 165 REEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEY---ERKERQKE 221
Query: 462 GYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRK 521
K A K+ E A+++ EE KEER E +
Sbjct: 222 ----------------KEEAEKRRRQKQELQRAREEQIEE---------KEERLQEERAE 256
Query: 522 RQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRD----REKKKEDPEEKELN 577
+AE E M +K E+ + QE ++ KR L + E++ + EE+
Sbjct: 257 EEAERERMLEKQAEDEELEQENAEKR--RMKRLEHRRELEQQIEEKEERRAAEREEELEE 314
Query: 578 KDKEREGEAIKERYL 592
++ RE EA ++ +
Sbjct: 315 GERLREEEAERQARI 329
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 43.1 bits (102), Expect = 0.001
Identities = 39/154 (25%), Positives = 76/154 (49%), Gaps = 23/154 (14%)
Query: 388 VRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERRE 447
+ + DR + R +R RDR+ R+ RD R+G++ RE RR RR+
Sbjct: 594 PKAEAKPERQQDRRKPRQNNR--RDRNERRDT------RDNRTRREGRENREENRRNRRQ 645
Query: 448 EEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPK 507
+ + ++R+ + E TE+ + +E+ + +++ +E K++A++EA K
Sbjct: 646 AQQQTA--ETRESQQAEVTEKAR-TQDEQQQAPRRERQRRRNDE---KRQAQQEA----K 695
Query: 508 FLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQ 541
L EE++ + E EE ++++ RK+RQ
Sbjct: 696 ALNVEEQSVQE-----TEQEERVQQVQPRRKQRQ 724
Score = 37.3 bits (87), Expect = 0.095
Identities = 27/142 (19%), Positives = 57/142 (40%), Gaps = 15/142 (10%)
Query: 348 ALNSLKGGSKDILMAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRG-RDRDR 406
AL +L G ++ ++ + P +++ R + R +R RD + RD R R R
Sbjct: 575 ALKALFSGGEETKPQEQPAPKAEAKPERQQDRRKPRQNNR---RDRNERRDTRDNRTRRE 631
Query: 407 DRLERDRSREREREKRHH---RRDRSKERDGKDRREGYRRERREEEASGSKHKSRDK--- 460
R R+ +R R+ + R+ + + R +++ + ++ +K
Sbjct: 632 GRENREENRRNRRQAQQQTAETRESQQAEVTEKARTQ-DEQQQAPRRERQRRRNDEKRQA 690
Query: 461 ----EGYEPTEQMVISDEEEDK 478
+ EQ V E+E++
Sbjct: 691 QQEAKALNVEEQSVQETEQEER 712
Score = 33.1 bits (76), Expect = 1.9
Identities = 40/196 (20%), Positives = 77/196 (39%), Gaps = 68/196 (34%)
Query: 410 ERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQM 469
+ +R ++R + ++++RRDR++ RD +D R RRE RE +++ +
Sbjct: 599 KPERQQDRRKPRQNNRRDRNERRDTRDNRT--RREGRENREENRRNRRQ----------- 645
Query: 470 VISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEM 529
++ E R E + AE K + + E+
Sbjct: 646 -------------------------AQQQTAETR--------ESQQAEVTEKARTQDEQQ 672
Query: 530 RKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKE 589
+ R++RQ RR+ EK++ E K LN +++ E +E
Sbjct: 673 Q----APRRERQR------------------RRNDEKRQAQQEAKALNVEEQSVQETEQE 710
Query: 590 RYLGLVKKKRRVRRLN 605
+ V+ +R+ R+LN
Sbjct: 711 ERVQQVQPRRKQRQLN 726
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 42.6 bits (100), Expect = 0.002
Identities = 29/197 (14%), Positives = 65/197 (32%), Gaps = 8/197 (4%)
Query: 414 SREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISD 473
+E + + + ++ K+ + ++++E+ K + +E E
Sbjct: 93 KPAKEPKNESGKEEEKEKEQVKEEK----KKKKEKPKEEPKDRKPKEEAKEKRPPKEKEK 148
Query: 474 EEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKM 533
E+E K + + + K+ + K K K++ E ++RQA E ++ K
Sbjct: 149 EKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKP 208
Query: 534 EEE---RKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKER 590
EE ++ +E E+ + + + L K A E
Sbjct: 209 EEPDVNEEREKEEDDGKDRETTTSPME-EDESRQSSEISRRSSSSLKKPDPSPSMASPET 267
Query: 591 YLGLVKKKRRVRRLNDR 607
+ + R R
Sbjct: 268 RESSKRTETRPRTSLRP 284
Score = 42.2 bits (99), Expect = 0.003
Identities = 41/236 (17%), Positives = 74/236 (31%), Gaps = 7/236 (2%)
Query: 366 RSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHR 425
R P K + + ++ + R RE + R R + ++ ++++E + +
Sbjct: 137 AKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQ 196
Query: 426 RDRSKER-DGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKK 484
R ++E GK ER +EE G ++ E + + K
Sbjct: 197 RQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKP 256
Query: 485 EPLSLEELLAKK--KAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQE 542
+P + E R + R A A R A RK++ + Q
Sbjct: 257 DPSPSMASPETRESSKRTETRPRTSLRPPSARPASA---RPAPPRVKRKEIVTVLQDAQG 313
Query: 543 FTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKK 598
K S E + D E + + E+ GLV+K
Sbjct: 314 VGKIVS-NVILEGKKSEDEDDENFVVEAAAQAPDIVAGGEDEAEDGEQQGGLVQKI 368
Score = 41.0 bits (96), Expect = 0.006
Identities = 33/184 (17%), Positives = 71/184 (38%), Gaps = 14/184 (7%)
Query: 400 RGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRD 459
+ +++ ++ ++ + + +E K ++ +KE+ +E + ++ EE + K R+
Sbjct: 108 KEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRE 167
Query: 460 KEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEAL 519
+ + + K KK+ EE ++ A E + KP EE
Sbjct: 168 RVRAKSRP-----KKPPKKKPPNKKKEPPEEEK-QRQAAREAVKGKP-----EEPDVNEE 216
Query: 520 RKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKD 579
R+++ + K E +E S E R + + + D PE +E +K
Sbjct: 217 REKEEDDG---KDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKR 273
Query: 580 KERE 583
E
Sbjct: 274 TETR 277
Score = 40.6 bits (95), Expect = 0.007
Identities = 30/185 (16%), Positives = 57/185 (30%), Gaps = 6/185 (3%)
Query: 364 DRRSRSRSPPRKRRSRSRERDYDRVRER-DTDRSRDDRGRDRDRDRLERDRSREREREK- 421
++ + + R ++R+ R + R + + ++ E+ R RE K
Sbjct: 147 EKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKG 206
Query: 422 ----RHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEED 477
+R KE D RE E+E+ S SR S +
Sbjct: 207 KPEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPE 266
Query: 478 KGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEER 537
++K+ L A + K + L+ Q + + + E +
Sbjct: 267 TRESSKRTETRPRTSLRPPSARPASARPAPPRVKRKEIVTVLQDAQGVGKIVSNVILEGK 326
Query: 538 KKRQE 542
K E
Sbjct: 327 KSEDE 331
Score = 39.9 bits (93), Expect = 0.012
Identities = 29/162 (17%), Positives = 58/162 (35%), Gaps = 23/162 (14%)
Query: 428 RSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPL 487
+ K +E +EEE + K K+ E ++ + ++ K KE
Sbjct: 86 KGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKE----EPKDRKPKEEAKEKR 141
Query: 488 SLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEA 547
+E K+K +E+ +P+ E + R R + K + KK++ +E
Sbjct: 142 PPKE---KEKEKEKKVEEPR---DREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEK 195
Query: 548 SFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKE 589
R ++ + +E + ++ERE E
Sbjct: 196 Q-------------RQAAREAVKGKPEEPDVNEEREKEEDDG 224
Score = 36.8 bits (85), Expect = 0.12
Identities = 27/193 (13%), Positives = 65/193 (33%), Gaps = 12/193 (6%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREK 421
S K + + ++ + + + R E+++ +E++ E+
Sbjct: 97 EPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE 156
Query: 422 RHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAA 481
RDR +E+ + R R ++ ++ +K K +E + +
Sbjct: 157 ---PRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKP----- 208
Query: 482 AKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQ 541
+EP EE +K+ ++ + EE + + KK +
Sbjct: 209 --EEPDVNEE--REKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMAS 264
Query: 542 EFTKEASFESKRE 554
T+E+S ++
Sbjct: 265 PETRESSKRTETR 277
Score = 36.4 bits (84), Expect = 0.16
Identities = 28/192 (14%), Positives = 67/192 (34%), Gaps = 8/192 (4%)
Query: 369 SRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRL-----ERDRSREREREKRH 423
+K + + +D E R ++ +++++ E ++ RER R K
Sbjct: 115 EEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKS- 173
Query: 424 HRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAK 483
R + ++ ++++ E ++ +A+ K + +E E+ D+ +D+
Sbjct: 174 -RPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDR-ETTT 231
Query: 484 KEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEF 543
E + + + + S K A+ R+ E + R
Sbjct: 232 SPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTETRPRTSLRPPSARPAS 291
Query: 544 TKEASFESKREN 555
+ A KR+
Sbjct: 292 ARPAPPRVKRKE 303
Score = 34.9 bits (80), Expect = 0.49
Identities = 31/122 (25%), Positives = 58/122 (47%), Gaps = 2/122 (1%)
Query: 487 LSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKE 546
L A K+ E+ P TK + + ++ E E+ ++++EE+KK++E KE
Sbjct: 69 SKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEK--EQVKEEKKKKKEKPKE 126
Query: 547 ASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLND 606
+ K + R +EK+KE ++ E +D+E E + + R KK + + N
Sbjct: 127 EPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNK 186
Query: 607 RK 608
+K
Sbjct: 187 KK 188
Score = 33.7 bits (77), Expect = 0.96
Identities = 25/186 (13%), Positives = 67/186 (36%), Gaps = 2/186 (1%)
Query: 502 ARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLR 561
A++KP K E E ++++ EE +KK E+ +++ ++ + + KR +
Sbjct: 90 AKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKE 149
Query: 562 RDREKKKEDPEEKELNKDKEREGEA--IKERYLGLVKKKRRVRRLNDRKFVFDWDASEDT 619
++++ ++ E+E +++ R + KKK R+ + +
Sbjct: 150 KEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPE 209
Query: 620 SVDYNSIYKERHQVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQEKVRLKK 679
D N ++ + ++ ++ ++ + ++
Sbjct: 210 EPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRE 269
Query: 680 VKKREE 685
KR E
Sbjct: 270 SSKRTE 275
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 42.3 bits (99), Expect = 0.002
Identities = 34/167 (20%), Positives = 69/167 (41%), Gaps = 17/167 (10%)
Query: 444 ERREEEASGSKHKSRD----KEGYEPTEQMVISDEEEDKGAAAKK-EPLSLEELLAKKKA 498
+ ++E A +K ++R K G E ++ +E +K A K + +++EE AK++A
Sbjct: 9 DLKKEAARRAKEEARKRLVAKHGAEISKL---EEENREKEKALPKNDDMTIEE--AKRRA 63
Query: 499 EEEARSKPKFLTKEERAA-------EALRKRQAEVEEMRKKMEEERKKRQEFTKEASFES 551
A++K L K++R E + + + K K+++E T+E + E
Sbjct: 64 AAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEE 123
Query: 552 KRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKK 598
K + K ++K ++ E E ++ K
Sbjct: 124 KAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAA 170
Score = 36.1 bits (83), Expect = 0.18
Identities = 44/219 (20%), Positives = 80/219 (36%), Gaps = 40/219 (18%)
Query: 420 EKRHHRRDRSKERDGKDR-REGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDK 478
EK + R K+ R+ + E + + ++ + M I + +
Sbjct: 4 EKDLEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRA 63
Query: 479 GAAAKKEPLSL--------------EELLAKKKAEEEARSKPKFLTKEER---------- 514
AAAK + +L E+ AK KA A++K L K++R
Sbjct: 64 AAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEE 123
Query: 515 -------AAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKK 567
AA A + + A + + +++ EE + +E T + ++K K+
Sbjct: 124 KAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQ 183
Query: 568 K--------EDPEEKELNKDKEREGEAIKERYLGLVKKK 598
K E+ E+E K K + A K + L K+K
Sbjct: 184 KAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQK 222
Score = 31.1 bits (70), Expect = 6.0
Identities = 44/186 (23%), Positives = 80/186 (43%), Gaps = 18/186 (9%)
Query: 425 RRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKK 484
+ + + + E + + A+ +K + K+ E TE++ +EE DK A K
Sbjct: 109 AKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAK 168
Query: 485 EPLSLEEL---LAKKKA------------EEEARSKPKFLTKEERAAEALRKRQAEVEEM 529
+ + LAK+KA EE+A++K K + A AL K++A
Sbjct: 169 AAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNG 228
Query: 530 RKKMEEERKKRQEFTK-EASFESKRENFDARLRRDREKKKEDPEEKE--LNKDKEREGEA 586
E+ + K K +A+ ++ + A +++ E K+E+P + LNK E E
Sbjct: 229 DSGDEDAKAKAIAAAKAKAAAAARAKTKGAEGKKEEEPKQEEPSVNQPYLNKYVEVIKEK 288
Query: 587 IKERYL 592
+ E L
Sbjct: 289 LGEDVL 294
>gnl|CDD|236794 PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional.
Length = 681
Score = 42.4 bits (101), Expect = 0.002
Identities = 34/93 (36%), Positives = 41/93 (44%), Gaps = 18/93 (19%)
Query: 34 GSGKTL-AFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLG 92
GSGKT+ A L L + G A +MAPT LA+Q E K PLG
Sbjct: 292 GSGKTVVAALAALAA-------------IEAGYQAALMAPTEILAEQHYENLKKLLEPLG 338
Query: 93 IRTVLVVGGLS---REEQGFRLRLG-CEIVIAT 121
IR L+ G L R E + G +IVI T
Sbjct: 339 IRVALLTGSLKGKERREILEAIASGEADIVIGT 371
Score = 42.4 bits (101), Expect = 0.002
Identities = 34/93 (36%), Positives = 41/93 (44%), Gaps = 18/93 (19%)
Query: 775 GSGKTL-AFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLG 833
GSGKT+ A L L + G A +MAPT LA+Q E K PLG
Sbjct: 292 GSGKTVVAALAALAA-------------IEAGYQAALMAPTEILAEQHYENLKKLLEPLG 338
Query: 834 IRTVLVVGGLS---REEQGFRLRLG-CEIVIAT 862
IR L+ G L R E + G +IVI T
Sbjct: 339 IRVALLTGSLKGKERREILEAIASGEADIVIGT 371
Score = 42.4 bits (101), Expect = 0.002
Identities = 34/93 (36%), Positives = 41/93 (44%), Gaps = 18/93 (19%)
Query: 1450 GSGKTL-AFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLG 1508
GSGKT+ A L L + G A +MAPT LA+Q E K PLG
Sbjct: 292 GSGKTVVAALAALAA-------------IEAGYQAALMAPTEILAEQHYENLKKLLEPLG 338
Query: 1509 IRTVLVVGGLS---REEQGFRLRLG-CEIVIAT 1537
IR L+ G L R E + G +IVI T
Sbjct: 339 IRVALLTGSLKGKERREILEAIASGEADIVIGT 371
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 41.6 bits (98), Expect = 0.004
Identities = 40/207 (19%), Positives = 80/207 (38%), Gaps = 28/207 (13%)
Query: 418 EREKRHHRRDRSKERDGKDRREGYRR--ERREEEASGSKHKSRDKEGYEPTEQMVISDEE 475
E++K+ R + + E E + +E + + +K E +
Sbjct: 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKL----EELKEELESL 356
Query: 476 EDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQ-----AEVEEMR 530
E + + E LEEL ++ + EE + R+ A + Q E+E +
Sbjct: 357 EAELEELEAE---LEELESRLEELEEQL-------ETLRSKVAQLELQIASLNNEIERLE 406
Query: 531 KKME--EERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKE-----LNKDKERE 583
++E E+R++R + E + E L+ + E+ +E+ EE + L + E
Sbjct: 407 ARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEEL 466
Query: 584 GEAIKERYLGLVKKKRRVRRLNDRKFV 610
E ++E L +R + +L R
Sbjct: 467 REELEEAEQALDAAERELAQLQARLDS 493
Score = 41.6 bits (98), Expect = 0.004
Identities = 43/249 (17%), Positives = 89/249 (35%), Gaps = 24/249 (9%)
Query: 375 KRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSR----EREREKRHHRRDRSK 430
R E+ +R+ + + + + + +RLE E E E+ + ++ K
Sbjct: 736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLK 795
Query: 431 ERDGKDRREG------YRRERREEEASGSKHKSRDKEGYEPTEQMVISDEE----EDKGA 480
E R E + +S ++ ++ +E+ +
Sbjct: 796 EELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIE 855
Query: 481 AAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKR 540
+ E LEEL+ + E E + E A LR EE+ +++ E KR
Sbjct: 856 SLAAEIEELEELIEE--LESELEALLNERASLEEALALLRSEL---EELSEELRELESKR 910
Query: 541 QEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKK--- 597
E +E E + + LR + + + D ++ L+++ E + +
Sbjct: 911 SELRRELE-ELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEE 969
Query: 598 -KRRVRRLN 605
+RR++RL
Sbjct: 970 ARRRLKRLE 978
Score = 39.3 bits (92), Expect = 0.022
Identities = 55/270 (20%), Positives = 106/270 (39%), Gaps = 36/270 (13%)
Query: 473 DEEEDKGAAAKKEPLSLEELLAKKKAEEEARSK---------PKFLTKEERAAEALRKRQ 523
E E A +KE LEE L + + E E S+ + + E+ E + +
Sbjct: 694 AELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLS 753
Query: 524 AEVEEMRKKMEEERKKRQE-FTKEASFESKRENFDARLRRDREKKKEDPEE-KELNKDKE 581
E+ E+ ++EE ++ +E + A E++ E +A++ + +E+ K E EL +
Sbjct: 754 KELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELT 813
Query: 582 REGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKERHQVQFFGRGNI 641
E L +RR+ R + E+ S D S+ E +++
Sbjct: 814 LLNEEAANLRERLESLERRIAATERR-LEDLEEQIEELSEDIESLAAEIEELEELIEELE 872
Query: 642 AGIDIKAQKRDQSKFYGEML--------------EKRRTEAEKEQEKVRLKKVKKREEKQ 687
+ ++ +R + +L E +R+E +E E++R K + +
Sbjct: 873 SELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLE 932
Query: 688 KWDDRHWTEKSLDEMTERDWRIFREDYSIT 717
E +D + ER E+YS+T
Sbjct: 933 G------LEVRIDNLQER----LSEEYSLT 952
Score = 38.9 bits (91), Expect = 0.028
Identities = 41/233 (17%), Positives = 79/233 (33%), Gaps = 22/233 (9%)
Query: 374 RKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDR-SKER 432
K + R+ + E + R + + + E E E+ R + SKE
Sbjct: 697 EKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKEL 756
Query: 433 -DGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEE 491
+ + E E E ++ ++ +E EQ+ + + + L+L
Sbjct: 757 TELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLN 816
Query: 492 LLA------KKKAEEEARSKPKFLTKEERAAEALRKRQA----EVEEMRKKMEEERKKRQ 541
A + E + + L E E L + E+EE+ + +EE + +
Sbjct: 817 EEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELE 876
Query: 542 EFTKE--------ASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEA 586
E A S+ E LR K+ E +EL + +E+ +
Sbjct: 877 ALLNERASLEEALALLRSELEELSEELRELESKRSE--LRRELEELREKLAQL 927
Score = 36.6 bits (85), Expect = 0.14
Identities = 34/229 (14%), Positives = 83/229 (36%), Gaps = 26/229 (11%)
Query: 487 LSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKE 546
L LEEL + + +E + + + E L++ + ++EE+R ++ E ++ +E KE
Sbjct: 232 LRLEELREELEELQEELKEAE--EELEELTAELQELEEKLEELRLEVSELEEEIEELQKE 289
Query: 547 ----------------------ASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREG 584
A+ E + E +A+L K E EE ++K E
Sbjct: 290 LYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL 349
Query: 585 EAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKERHQVQFFGRGNIAGI 644
+ E +++ + + + E + + +
Sbjct: 350 KEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLN--NEIERLEA 407
Query: 645 DIKAQKRDQSKFYGEMLEKRRTEAEKEQEKVRLKKVKKREEKQKWDDRH 693
++ + + + E+ E + E E ++++ + + EE ++ +
Sbjct: 408 RLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEEL 456
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 41.6 bits (98), Expect = 0.005
Identities = 45/208 (21%), Positives = 88/208 (42%), Gaps = 17/208 (8%)
Query: 402 RDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKE 461
R + + L +++ +E E R S+ + ++ E +E +E E KE
Sbjct: 187 RTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEEL--------KE 238
Query: 462 GYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRK 521
E E+ + S E + K LEE + + K E E L ++ + + L++
Sbjct: 239 EIEELEKELESLEGSKRKLEEKIR--ELEERIEELKKEIEE------LEEKVKELKELKE 290
Query: 522 RQAEVEEMRKKMEEERKKRQEFTKEAS-FESKRENFDARLRRDREKKKEDPEEKELNKDK 580
+ E ++ + EE + +E K S E + + R++ EK++ E K+ K+
Sbjct: 291 KAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKEL 350
Query: 581 EREGEAIKERYLGLVKKKRRVRRLNDRK 608
E+ E ++ER+ + K + L K
Sbjct: 351 EKRLEELEERHELYEEAKAKKEELERLK 378
Score = 36.6 bits (85), Expect = 0.17
Identities = 44/174 (25%), Positives = 81/174 (46%), Gaps = 11/174 (6%)
Query: 437 RREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKK 496
+ E +++ E E + + E + E++ EE + + EP E L K
Sbjct: 550 KLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPF-YNEYLELK 608
Query: 497 KAEEEARSKPKFLTKEER----AAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESK 552
AE+E + K L K E A E L + + +EE+RK++EE KK +++E +E
Sbjct: 609 DAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKK---YSEE-EYEEL 664
Query: 553 RENFDARLRRDREKKKEDPEE-KELNKDKEREGEAIKERYLGLVKKKRRVRRLN 605
RE + L R+ + + EE ++ ++ ++ E +KE K K+ + +L
Sbjct: 665 REEY-LELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLE 717
Score = 35.8 bits (83), Expect = 0.26
Identities = 55/307 (17%), Positives = 110/307 (35%), Gaps = 72/307 (23%)
Query: 403 DRDRDRLERDRSREREREKRHHRRDRSKERDGKDRRE--GYRRERREEEASGSKHKSRDK 460
R +RLE+ R E+ +++ E ++ E E REE K +
Sbjct: 175 KRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELE 234
Query: 461 EGYEPTEQMVISDEEEDKGAAAKKEPLS-LEELLAKKKAEEEARSKPKFLTKEERAAEAL 519
E E E++ E + +E + LEE + + K E E L ++ + + L
Sbjct: 235 ELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEE------LEEKVKELKEL 288
Query: 520 RKRQAEVEEMRKKMEEERKKRQEFTKEAS-FESKRENFDARLRRDREKKKEDPEEKELNK 578
+++ E ++ + EE + +E K S E + + R++ EK++ E K+ K
Sbjct: 289 KEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLK 348
Query: 579 DKEREGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKERHQVQFFGR 638
+ E+ E ++ER + + +
Sbjct: 349 ELEKRLEELEER----HELYEEAKAKKEEL------------------------------ 374
Query: 639 GNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQEKVRLKKVKKREEKQKWDDRHWTEKS 698
E L+KR T E+ + L++++K +E+ E+
Sbjct: 375 --------------------ERLKKRLTGLTPEKLEKELEELEKAKEE--------IEEE 406
Query: 699 LDEMTER 705
+ ++T R
Sbjct: 407 ISKITAR 413
Score = 35.4 bits (82), Expect = 0.39
Identities = 62/248 (25%), Positives = 110/248 (44%), Gaps = 43/248 (17%)
Query: 475 EEDKGAAAKKEPLSLEELLAKKKAEEEARSKP-------KFLTKEERAAEALRKRQAEVE 527
E+ K K + +LEEL K + E+ + K K L KE E L+K+ AE+E
Sbjct: 503 EQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELE 562
Query: 528 EMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAI 587
+ ++EEE + + +E FES E + RL+ E E K+ K+ ERE + +
Sbjct: 563 KKLDELEEELAELLKELEELGFESVEE-LEERLKELEPFYNEYLELKDAEKELEREEKEL 621
Query: 588 K---------ERYLGLVKKK-RRVR-RLNDRKFVFDWDASEDTSVDYNSIYKERHQVQFF 636
K L +K+ +R L + + + + E+ +Y + +E
Sbjct: 622 KKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRE------- 674
Query: 637 GRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQEKVRLKKVKKREEKQKWDDRHWTE 696
+AG+ + E LEKRR E +K EK++ + ++ + K++ + E
Sbjct: 675 ----LAGLRAEL----------EELEKRREEIKKTLEKLKEELEEREKAKKELEK---LE 717
Query: 697 KSLDEMTE 704
K+L+ + E
Sbjct: 718 KALERVEE 725
Score = 35.0 bits (81), Expect = 0.40
Identities = 22/114 (19%), Positives = 50/114 (43%), Gaps = 4/114 (3%)
Query: 495 KKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKR- 553
K+ + KF+ + E E +++++ E+EE+ +++ E + E +E K
Sbjct: 171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEV 230
Query: 554 ENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDR 607
+ + E +K E + L K + E I+E + + K+ + L ++
Sbjct: 231 KELEELKEEIEELEK---ELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK 281
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 40.3 bits (94), Expect = 0.006
Identities = 38/141 (26%), Positives = 59/141 (41%), Gaps = 18/141 (12%)
Query: 471 ISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAA----EALRKRQAEV 526
+ + EE+ + ++E S EE + + E R KP F K++R E ++ +
Sbjct: 4 VLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKAL 63
Query: 527 EEMRKKMEEERKKR-----QEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKE 581
EE K+ EERK+ +E K+ KR D + E+ EE
Sbjct: 64 EEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDENEEE-------- 115
Query: 582 REGEAIKERYLGLVKKKRRVR 602
E EA K R L +K+ R R
Sbjct: 116 -EYEAWKLRELKRIKRDREER 135
Score = 31.0 bits (70), Expect = 5.4
Identities = 39/176 (22%), Positives = 71/176 (40%), Gaps = 27/176 (15%)
Query: 431 ERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLE 490
E + + E EEE S + ++ ++ EP + V + ++D+ ++E + +
Sbjct: 1 ETEVLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFT-RKKDRITIQEREREAAK 59
Query: 491 ----ELLAKKKAEEEARSKPKFLTKEERAAEALRKRQA-------------EVEEMR--- 530
E AK+KAEE R K + +E + L+KR E EE
Sbjct: 60 EKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDENEEEEYEA 119
Query: 531 ------KKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDK 580
K+++ +R++R+E +E + K N RR +K + K K
Sbjct: 120 WKLRELKRIKRDREEREEMEREKAEIEKMRNMTEEERRAELRKNPKVSTNKAKKGK 175
Score = 31.0 bits (70), Expect = 5.7
Identities = 31/130 (23%), Positives = 51/130 (39%), Gaps = 17/130 (13%)
Query: 410 ERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQM 469
+R +ERERE + + + + R+ + EEE + E
Sbjct: 46 DRITIQEREREAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDD 105
Query: 470 VISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEM 529
V +D+E + + E L EL K+ EE E + + +AE+E+M
Sbjct: 106 VDTDDENE---EEEYEAWKLRELKRIKRDREER--------------EEMEREKAEIEKM 148
Query: 530 RKKMEEERKK 539
R EEER+
Sbjct: 149 RNMTEEERRA 158
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 41.2 bits (97), Expect = 0.006
Identities = 48/203 (23%), Positives = 81/203 (39%), Gaps = 18/203 (8%)
Query: 390 ERDTDRSRDD-RGRDRDRDRL-ERDRSREREREKRHHRRDRSKERDGKDRREGYRRERRE 447
E + + R+ +R+R+ L E R E+ RD G D + E R
Sbjct: 257 EAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARR 316
Query: 448 EEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPK 507
EE + RD + + A ++ E+ ++ EE R +
Sbjct: 317 EELEDRDEELRD----------RLEECRVAAQAHNEEAESLREDADDLEERAEELREEAA 366
Query: 508 FLTKE-ERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREK 566
L E E A EA+ R+ E+EE+ +++EE R +R + E+F LR +R++
Sbjct: 367 ELESELEEAREAVEDRREEIEELEEEIEELR-ERFGDAPVDLGNA--EDFLEELREERDE 423
Query: 567 KKEDPEEKELNKDKEREGEAIKE 589
+E E EL E ++E
Sbjct: 424 LRER--EAELEATLRTARERVEE 444
Score = 40.0 bits (94), Expect = 0.014
Identities = 30/173 (17%), Positives = 65/173 (37%), Gaps = 10/173 (5%)
Query: 412 DRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVI 471
+ E + + + E R ER E+ R +E E E+++
Sbjct: 468 ETIEEDRERVEELEAELEDLEEEVEEVEE-RLERAEDLVEAEDRIERLEERREDLEELI- 525
Query: 472 SDEEEDKGAAAKKEPLSLEELLAKKKA-EEEARSKPKFLTKEERAAEALRKRQAEVEEMR 530
++ +++ EEL + E EA K + + E AE R+ AE+
Sbjct: 526 ----AERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKL 581
Query: 531 KKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKERE 583
+++E + + + + + + + R REK++ E + +++ E
Sbjct: 582 AELKERI---ESLERIRTLLAAIADAEDEIERLREKREALAELNDERRERLAE 631
Score = 36.6 bits (85), Expect = 0.16
Identities = 71/326 (21%), Positives = 123/326 (37%), Gaps = 42/326 (12%)
Query: 374 RKRRSRSRERD--YDRVRERDTDRSRDDRGRDRD-RDRLERDRSREREREKRHHRRDRSK 430
RR E + + +RER D + D G D + L +R RERE
Sbjct: 380 EDRREEIEELEEEIEELRERFGD-APVDLGNAEDFLEELREERDELREREA-------EL 431
Query: 431 ERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLE 490
E + RE EA + EG E I ++ E LE
Sbjct: 432 EATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVE--TIEEDRERVE--------ELE 481
Query: 491 ELLAKKKAEEEARSK----PKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKE 546
L + E E + + L + E E L +R+ ++EE+ ER++ E +E
Sbjct: 482 AELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELI----AERRETIEEKRE 537
Query: 547 ASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKR-RVRRLN 605
+ E RE L + E+K+E E E ++ RE A L +K++ + R+
Sbjct: 538 RA-EELRERAAE-LEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIR 595
Query: 606 DRKFVFDWDASEDTSVDYNSIYKERHQVQFFGRGNIAGIDIKAQKRDQ--SKFYGEMLEK 663
A D + + ++R + + K +++ + ++F +E+
Sbjct: 596 TL-----LAAIADAEDEIERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEE 650
Query: 664 ---RRTEAEKEQEKVRLKKVKKREEK 686
+ AE+ E+V K + REE+
Sbjct: 651 AREDKERAEEYLEQVEEKLDELREER 676
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 41.2 bits (97), Expect = 0.006
Identities = 40/153 (26%), Positives = 66/153 (43%), Gaps = 18/153 (11%)
Query: 488 SLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERK------KRQ 541
L E L+K K EE R K +E R KR ++EE +++EEE +
Sbjct: 389 PLAEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEEL 448
Query: 542 EFTKEASFESKRENFDAR----LRRDREKKKEDPEEKELNKDKEREGEAIK--ERYLGLV 595
+ E ES+ E F +R+DRE + D + L K+ E + + ++ ER L +
Sbjct: 449 KREIEK-LESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAEL 507
Query: 596 KKKRRVRRLNDRKFV-----FDWDASEDTSVDY 623
+K R++ V +A E+ +Y
Sbjct: 508 RKMRKLELSGKGTPVKVVEKLTLEAIEEAEEEY 540
Score = 37.0 bits (86), Expect = 0.11
Identities = 40/245 (16%), Positives = 86/245 (35%), Gaps = 46/245 (18%)
Query: 482 AKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKR----QAEVEEMRKKMEEER 537
K+E E+ +++ E K + K E E L + + E+EE+++++E+
Sbjct: 397 VKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLE 456
Query: 538 KKRQEFTKEASFESK---------RENFDARLRRDREKKKEDPEEKELNKDK-----ERE 583
+ + F +E + + R + +KK+ + E++L + + E
Sbjct: 457 SELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELS 516
Query: 584 G-------------EAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIY--- 627
G EAI+E K+ V + D A E +I
Sbjct: 517 GKGTPVKVVEKLTLEAIEEAEEEYGIKEGDVILVEDPSGGGARTAEELIDKKPRAIIRGE 576
Query: 628 -KERHQVQFFGRGNIAGI-----------DIKAQKRDQSKFYGEMLEKRRTEAEKEQEKV 675
+ F + I + + ++ + E +KR E E+ Q++
Sbjct: 577 EMSHAAAEEFFKNEIPVLPEGDVQIIRLDEFAVVDSEELRRAIEEWKKRFEERERRQKEE 636
Query: 676 RLKKV 680
+ ++
Sbjct: 637 DILRI 641
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 40.5 bits (96), Expect = 0.008
Identities = 35/112 (31%), Positives = 64/112 (57%), Gaps = 12/112 (10%)
Query: 496 KKAEEEARSKPKFLTKEERAAEALRK---RQAEVEEMRKKME---EERKKRQEFTK-EAS 548
K+AEEEA+ + L + ++ AEA++K +A+ E + + E E R++R E K E
Sbjct: 34 KEAEEEAK---RILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKR 90
Query: 549 FESKRENFDARLRRDREKKKE-DPEEKELNKDKEREGEAIKERYLGLVKKKR 599
K EN D +L ++++E + +EKEL + K++E E +E L++++
Sbjct: 91 LLQKEENLDRKLELLEKREEELEKKEKELEQ-KQQELEKKEEELEELIEEQL 141
Score = 39.4 bits (93), Expect = 0.021
Identities = 28/105 (26%), Positives = 44/105 (41%), Gaps = 13/105 (12%)
Query: 492 LLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKM-----EEERKKRQEFTKE 546
+ KK AE K EE A L + + E E ++K+ EE K R EF KE
Sbjct: 23 FVRKKIAEA------KIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKE 76
Query: 547 ASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERY 591
+R +R +K++ + EL + +E E E ++
Sbjct: 77 --LRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKEL 119
Score = 35.1 bits (82), Expect = 0.34
Identities = 21/83 (25%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 520 RKRQAEVEEMRKKM-EEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNK 578
+ E EE K++ EE +K+ + KEA E+K E R ++E ++ E ++L K
Sbjct: 30 EAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEK 89
Query: 579 DKEREGEAIKERYLGLVKKKRRV 601
++ E + + L K++ +
Sbjct: 90 RLLQKEENLDRKLELLEKREEEL 112
Score = 34.0 bits (79), Expect = 0.85
Identities = 32/127 (25%), Positives = 60/127 (47%), Gaps = 14/127 (11%)
Query: 476 EDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEE 535
E K A++E + +L + K E EA K L +E + + + E+ E R ++++
Sbjct: 30 EAKIKEAEEE---AKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQK 86
Query: 536 -ERKKRQEFTKEASFESKRENFDAR------LRRDREKKKEDPEEKELN-KDKEREGEAI 587
E++ Q KE + + K E + R ++ E+K+++ E+KE ++ E
Sbjct: 87 LEKRLLQ---KEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
Query: 588 KERYLGL 594
ER GL
Sbjct: 144 LERISGL 150
Score = 30.5 bits (70), Expect = 9.8
Identities = 29/106 (27%), Positives = 48/106 (45%), Gaps = 16/106 (15%)
Query: 471 ISDEEEDKGAAAKKEPLSL---EELLAKKKAEEEARSKPKFLTKEE-----------RAA 516
I +E + + A KKE L E + + E+E R + L K E R
Sbjct: 43 ILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKL 102
Query: 517 EALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRR 562
E L KR+ E+E+ K++E+++++ ++ KE E E L R
Sbjct: 103 ELLEKREEELEKKEKELEQKQQELEK--KEEELEELIEEQLQELER 146
>gnl|CDD|219061 pfam06495, Transformer, Fruit fly transformer protein. This family
consists of transformer proteins from several Drosophila
species and also from Ceratitis capitata (Mediterranean
fruit fly). The transformer locus (tra) produces an RNA
processing protein that alternatively splices the
doublesex pre-mRNA in the sex determination hierarchy of
Drosophila melanogaster.
Length = 182
Score = 38.5 bits (89), Expect = 0.015
Identities = 39/124 (31%), Positives = 53/124 (42%), Gaps = 23/124 (18%)
Query: 361 MAGDRRSRSRSPPRKRRSRSRERD-----YDRVRERDTDRSRDDRG----RDRDRDRLER 411
M D SRS PR R SR+++ D VRERD R+ R R R R
Sbjct: 1 MDADSSSRS---PRDTRRDSRKKEKIPYFADEVRERDRVRNLRQRKTQSTRPTTSHRGRR 57
Query: 412 DRSRER----EREKRHHR-RDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPT 466
RSR R ER R R RS+ R+ D R R ++ + +SR + Y T
Sbjct: 58 TRSRSRSQSAERNSCQRRHRSRSRSRNRSDSRH------RSTSSTERRRRSRSRSRYSRT 111
Query: 467 EQMV 470
+++
Sbjct: 112 PRII 115
>gnl|CDD|223627 COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family
[Transcription / DNA replication, recombination, and
repair].
Length = 866
Score = 40.1 bits (93), Expect = 0.015
Identities = 27/119 (22%), Positives = 46/119 (38%), Gaps = 13/119 (10%)
Query: 986 DKRKKLMEVLNRGVKK-----PVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRE 1040
K + L E+L + + V+IF D+L L+ LG L G ++R+
Sbjct: 692 GKLQALDELLLDKLLEEGHYHKVLIFSQFTPVLDLLEDYLKALGIKYVRLDGSTPAKRRQ 751
Query: 1041 LALNSLK--GGSKDILVATDVAGRGIDIKDVSMVI------NYDMAKSIEDYTHRIGRT 1091
++ K L++ G G+++ VI N + D HRIG+
Sbjct: 752 ELIDRFNADEEEKVFLLSLKAGGLGLNLTGADTVILFDPWWNPAVELQAIDRAHRIGQK 810
Score = 40.1 bits (93), Expect = 0.015
Identities = 27/119 (22%), Positives = 46/119 (38%), Gaps = 13/119 (10%)
Query: 1661 DKRKKLMEVLNRGVKK-----PVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRE 1715
K + L E+L + + V+IF D+L L+ LG L G ++R+
Sbjct: 692 GKLQALDELLLDKLLEEGHYHKVLIFSQFTPVLDLLEDYLKALGIKYVRLDGSTPAKRRQ 751
Query: 1716 LALNSLK--GGSKDILVATDVAGRGIDIKDVSMVI------NYDMAKSIEDYTHRIGRT 1766
++ K L++ G G+++ VI N + D HRIG+
Sbjct: 752 ELIDRFNADEEEKVFLLSLKAGGLGLNLTGADTVILFDPWWNPAVELQAIDRAHRIGQK 810
Score = 32.4 bits (73), Expect = 3.1
Identities = 15/64 (23%), Positives = 27/64 (42%), Gaps = 5/64 (7%)
Query: 292 DKRKKLMEVLNRGVKK-----PVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRE 346
K + L E+L + + V+IF D+L L+ LG L G ++R+
Sbjct: 692 GKLQALDELLLDKLLEEGHYHKVLIFSQFTPVLDLLEDYLKALGIKYVRLDGSTPAKRRQ 751
Query: 347 LALN 350
++
Sbjct: 752 ELID 755
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 38.4 bits (90), Expect = 0.015
Identities = 22/95 (23%), Positives = 44/95 (46%), Gaps = 5/95 (5%)
Query: 517 EALRKRQAEVEEMRKKMEEERKKRQEFTKE-ASFESKRENFDARLRRDREKKKEDPE--- 572
+AL K + +E+++K++EE +++ E + + RE + R E K+ + E
Sbjct: 62 QALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKK 121
Query: 573 -EKELNKDKEREGEAIKERYLGLVKKKRRVRRLND 606
+ EL K ++ + E I++ K R D
Sbjct: 122 LKAELEKYEKNDPERIEKLKEETKVAKEAANRWTD 156
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 40.0 bits (94), Expect = 0.016
Identities = 35/240 (14%), Positives = 76/240 (31%), Gaps = 18/240 (7%)
Query: 402 RDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREG-----------YRRERREEEA 450
+++ + R + + + K R ++ K +++ +R + +
Sbjct: 1146 EVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDE 1205
Query: 451 SGSKHKSRDKEGYEPTEQMVISDEE-EDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFL 509
D + + DEE + K + + L ++ + K +E+ L
Sbjct: 1206 KRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDL 1265
Query: 510 TKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKE 569
+KE + A ++ A + + +K +S K+ KKK+
Sbjct: 1266 SKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKK 1325
Query: 570 DPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKE 629
E+K K K K R + RK D + +D + + E
Sbjct: 1326 KSEKKTARKKK------SKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDE 1379
Score = 34.6 bits (80), Expect = 0.63
Identities = 35/171 (20%), Positives = 63/171 (36%), Gaps = 30/171 (17%)
Query: 456 KSRDKEGYEPTEQMVISDEEEDKGAAAKKE-------PLS---------LEELLAKKKAE 499
K +E E DE++++ A P+ L L KK+ E
Sbjct: 1058 KITAEEEEGAEEDDEADDEDDEEELGAAVSYDYLLSMPIWSLTKEKVEKLNAELEKKEKE 1117
Query: 500 EEA--RSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFD 557
E + PK + E+ L K + +EE + E+E K Q +
Sbjct: 1118 LEKLKNTTPKDMWLED-----LDKFEEALEEQEEVEEKEIAKEQRL-------KSKTKGK 1165
Query: 558 ARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRK 608
A R + KK++ ++K+ + DK ++ + +KR++ D K
Sbjct: 1166 ASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNK 1216
>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
unknown].
Length = 833
Score = 39.6 bits (92), Expect = 0.016
Identities = 40/262 (15%), Positives = 82/262 (31%), Gaps = 34/262 (12%)
Query: 379 RSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRR 438
R + D + + D +++ R+ E + R ++R
Sbjct: 378 RLKLAQDDLTQAELNYAAADQA-ANQEGALNAREDEAEVLSTQEERRDILKNLLADAEKR 436
Query: 439 EGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKA 498
+ E + A ++ + ++E S + A +++
Sbjct: 437 TARQEELNKALAKAKILQADKAAKAYQED-------------ILQREAQSRGKTAAAERS 483
Query: 499 EEEARSKPKFLTKEERAAE--ALRKRQAEV--------EEMRKKMEEERKKRQEFTKEAS 548
+E+ + K L ++ + K +A EE ++EE K+RQ ++A
Sbjct: 484 QEQMTAALKALLAFQQQIADLSGAKEKASDQKSLLWKAEEQYALLKEEAKQRQLQEQKAL 543
Query: 549 FESKRENFDAR--------LRRDREKK--KEDPEEKELNKDKEREGEAIKERYLGLVKKK 598
E K+E + + DR + + +KE D RE A L +
Sbjct: 544 LEHKKETLEYTSQLAELLDQQADRFELSAQAAGSQKERGSDLYREALAQNAAALNKALNE 603
Query: 599 RRVRRLNDRKFVFDWDASEDTS 620
DW A ++
Sbjct: 604 LAAYWSALDLLQGDWKAGALSA 625
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 38.6 bits (90), Expect = 0.024
Identities = 41/167 (24%), Positives = 70/167 (41%), Gaps = 14/167 (8%)
Query: 402 RDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKE 461
+R R+E + R+ +E+ + R++ +D + R+ Y++E ++ + +E
Sbjct: 82 AKLERARVEAEERRKTLQEQTQQEQQRAQYQD-ELARKRYQKELEQQRRQNEELLKMQEE 140
Query: 462 GYEPTEQMVISDEEEDKGAAAK--KEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEAL 519
E M + EEE + +E LE + K E EAR + KEER E +
Sbjct: 141 SVLRQEAMRRATEEEILEMRRETIEEEAELERENIRAKIEAEARGR----AKEERENEDI 196
Query: 520 RKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREK 566
+ ++ K EER+ E K F F A L D+ K
Sbjct: 197 NREM-----LKLKANEERETVLESIKTT-FSHIGGGFRALL-TDKSK 236
Score = 30.5 bits (69), Expect = 6.8
Identities = 36/167 (21%), Positives = 62/167 (37%), Gaps = 17/167 (10%)
Query: 410 ERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQM 469
E+ R E E + + + +++ + + R E ERR+ ++ + + + Y+
Sbjct: 61 EKTRQAELEAKIKEYEAQQAQAKLERARVEA--EERRKTLQEQTQQEQQRAQ-YQ----- 112
Query: 470 VISDEEEDKGAAAKKEPLSL--EELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVE 527
DE K + E EELL K EE + E +R+ E E
Sbjct: 113 ---DELARKRYQKELEQQRRQNEELL--KMQEESVLRQEAMRRATEEEILEMRRETIE-E 166
Query: 528 EMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEK 574
E + E R K + + +REN D + K E+ E
Sbjct: 167 EAELERENIRAKIEAEA-RGRAKEERENEDINREMLKLKANEERETV 212
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 39.1 bits (92), Expect = 0.024
Identities = 23/79 (29%), Positives = 44/79 (55%), Gaps = 5/79 (6%)
Query: 507 KFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREK 566
+FLT ++A + K + VE+ KK EEE+K++++ + + E + + E+
Sbjct: 400 EFLTGSKKATK---KIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEE--EEEKEKKEEE 454
Query: 567 KKEDPEEKELNKDKEREGE 585
K+E+ EE E K++E E +
Sbjct: 455 KEEEEEEAEEEKEEEEEKK 473
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 39.0 bits (91), Expect = 0.026
Identities = 35/112 (31%), Positives = 53/112 (47%), Gaps = 13/112 (11%)
Query: 1003 VIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSL-KGGSKD--ILVATDV 1059
V+IF + D+L L GY C + G G E R+ ++++ K GS+ L++T
Sbjct: 490 VLIFSQMTRLLDILEDYLMYRGYQYCRIDGNTGGEDRDASIDAFNKPGSEKFVFLLSTRA 549
Query: 1060 AGRGIDIKDVSMVINYD------MAKSIEDYTHRIGRTGRAGKEGLAVSFCT 1105
G GI++ +VI YD + +D HRIG+ KE FCT
Sbjct: 550 GGLGINLATADIVILYDSDWNPQVDLQAQDRAHRIGQK----KEVQVFRFCT 597
Score = 39.0 bits (91), Expect = 0.026
Identities = 35/112 (31%), Positives = 53/112 (47%), Gaps = 13/112 (11%)
Query: 1678 VIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSL-KGGSKD--ILVATDV 1734
V+IF + D+L L GY C + G G E R+ ++++ K GS+ L++T
Sbjct: 490 VLIFSQMTRLLDILEDYLMYRGYQYCRIDGNTGGEDRDASIDAFNKPGSEKFVFLLSTRA 549
Query: 1735 AGRGIDIKDVSMVINYD------MAKSIEDYTHRIGRTGRAGKEGLAVSFCT 1780
G GI++ +VI YD + +D HRIG+ KE FCT
Sbjct: 550 GGLGINLATADIVILYDSDWNPQVDLQAQDRAHRIGQK----KEVQVFRFCT 597
>gnl|CDD|234478 TIGR04121, DEXH_lig_assoc, DEXH box helicase, DNA
ligase-associated. Members of this protein family are
DEAD/DEAH box helicases found associated with a
bacterial ATP-dependent DNA ligase, part of a four-gene
system that occurs in about 12 % of prokaryotic
reference genomes. The actual motif in this family is
DE[VILW]H.
Length = 803
Score = 39.1 bits (92), Expect = 0.027
Identities = 26/73 (35%), Positives = 35/73 (47%), Gaps = 6/73 (8%)
Query: 9 EPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAI 68
P P Q + L+ R + +A TGSGKTLA LP SL +A E +G + +
Sbjct: 13 TPRPFQLEMWAAALEGRSGLLIAPTGSGKTLAGFLP------SLIDLAGPEKPKKGLHTL 66
Query: 69 IMAPTRELAQQIE 81
+ P R LA I
Sbjct: 67 YITPLRALAVDIA 79
Score = 39.1 bits (92), Expect = 0.027
Identities = 26/73 (35%), Positives = 35/73 (47%), Gaps = 6/73 (8%)
Query: 750 EPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAI 809
P P Q + L+ R + +A TGSGKTLA LP SL +A E +G + +
Sbjct: 13 TPRPFQLEMWAAALEGRSGLLIAPTGSGKTLAGFLP------SLIDLAGPEKPKKGLHTL 66
Query: 810 IMAPTRELAQQIE 822
+ P R LA I
Sbjct: 67 YITPLRALAVDIA 79
Score = 39.1 bits (92), Expect = 0.027
Identities = 26/73 (35%), Positives = 35/73 (47%), Gaps = 6/73 (8%)
Query: 1425 EPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAI 1484
P P Q + L+ R + +A TGSGKTLA LP SL +A E +G + +
Sbjct: 13 TPRPFQLEMWAAALEGRSGLLIAPTGSGKTLAGFLP------SLIDLAGPEKPKKGLHTL 66
Query: 1485 IMAPTRELAQQIE 1497
+ P R LA I
Sbjct: 67 YITPLRALAVDIA 79
Score = 31.7 bits (73), Expect = 4.6
Identities = 17/62 (27%), Positives = 25/62 (40%)
Query: 1031 HGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGR 1090
HG +EQR ++ G +V T G+D V +VI K + R GR
Sbjct: 282 HGSLDREQRRWVEAAMAAGRLRAVVCTSSLDLGVDFGPVDLVIQIGSPKGVARLLQRAGR 341
Query: 1091 TG 1092
+
Sbjct: 342 SN 343
Score = 31.7 bits (73), Expect = 4.6
Identities = 17/62 (27%), Positives = 25/62 (40%)
Query: 1706 HGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGR 1765
HG +EQR ++ G +V T G+D V +VI K + R GR
Sbjct: 282 HGSLDREQRRWVEAAMAAGRLRAVVCTSSLDLGVDFGPVDLVIQIGSPKGVARLLQRAGR 341
Query: 1766 TG 1767
+
Sbjct: 342 SN 343
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 38.8 bits (91), Expect = 0.032
Identities = 26/112 (23%), Positives = 56/112 (50%), Gaps = 12/112 (10%)
Query: 497 KAEEEARSKPKFLTKEERAAEALRK---RQAEVEEMRKKMEEERKKRQEFTKEASFES-- 551
AEE A+ + + + ++ AE L+K +A+ E + + E ER+ ++ + E
Sbjct: 29 SAEELAK---RIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRL 85
Query: 552 --KRENFDARLRRDREKKKE-DPEEKELNKDKEREGEAIKERYLGLVKKKRR 600
+ E D ++ +K++ + +EKEL+ KE+ + +E L+ ++R
Sbjct: 86 LQREETLDRKMESLDKKEENLEKKEKELSN-KEKNLDEKEEELEELIAEQRE 136
Score = 31.4 bits (72), Expect = 5.0
Identities = 25/102 (24%), Positives = 44/102 (43%), Gaps = 15/102 (14%)
Query: 521 KRQAEVEEMRKKMEEERKKR-QEFTKEASFESKRENF--------DARLRRDREKKKEDP 571
K+ EE+ K++ EE KK + KEA E+K E + + RR+ ++ E
Sbjct: 25 KKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERR 84
Query: 572 -EEKELNKDKEREGEAIKERYLG-----LVKKKRRVRRLNDR 607
++E D++ E KE L L K++ + +
Sbjct: 85 LLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEE 126
Score = 31.0 bits (71), Expect = 6.4
Identities = 26/100 (26%), Positives = 46/100 (46%), Gaps = 13/100 (13%)
Query: 490 EELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQE------- 542
EE L +K E K + L K+E + L ++ ++E +++EE +++E
Sbjct: 89 EETLDRKM--ESLDKKEENLEKKE---KELSNKEKNLDEKEEELEELIAEQREELERISG 143
Query: 543 FTKEASFESKRENFDARLRRDREKK-KEDPEEKELNKDKE 581
T+E + E E + R + K KE EE + DK+
Sbjct: 144 LTQEEAKEILLEEVEEEARHEAAKLIKEIEEEAKEEADKK 183
>gnl|CDD|224121 COG1200, RecG, RecG-like helicase [DNA replication, recombination,
and repair / Transcription].
Length = 677
Score = 38.7 bits (91), Expect = 0.033
Identities = 38/102 (37%), Positives = 52/102 (50%), Gaps = 18/102 (17%)
Query: 24 NRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEE 83
NR + G + GSGKT+ LL +L I + G A +MAPT LA+Q E
Sbjct: 285 NRLLQG--DVGSGKTVVALLAMLAAI------------EAGYQAALMAPTEILAEQHYES 330
Query: 84 TNKFGTPLGIRTVLVVGGL---SREEQGFRLRLG-CEIVIAT 121
K+ PLGIR L+ G L +R+E +L G +IV+ T
Sbjct: 331 LRKWLEPLGIRVALLTGSLKGKARKEILEQLASGEIDIVVGT 372
Score = 38.7 bits (91), Expect = 0.033
Identities = 38/102 (37%), Positives = 52/102 (50%), Gaps = 18/102 (17%)
Query: 765 NRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEE 824
NR + G + GSGKT+ LL +L I + G A +MAPT LA+Q E
Sbjct: 285 NRLLQG--DVGSGKTVVALLAMLAAI------------EAGYQAALMAPTEILAEQHYES 330
Query: 825 TNKFGTPLGIRTVLVVGGL---SREEQGFRLRLG-CEIVIAT 862
K+ PLGIR L+ G L +R+E +L G +IV+ T
Sbjct: 331 LRKWLEPLGIRVALLTGSLKGKARKEILEQLASGEIDIVVGT 372
Score = 38.7 bits (91), Expect = 0.033
Identities = 38/102 (37%), Positives = 52/102 (50%), Gaps = 18/102 (17%)
Query: 1440 NRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEE 1499
NR + G + GSGKT+ LL +L I + G A +MAPT LA+Q E
Sbjct: 285 NRLLQG--DVGSGKTVVALLAMLAAI------------EAGYQAALMAPTEILAEQHYES 330
Query: 1500 TNKFGTPLGIRTVLVVGGL---SREEQGFRLRLG-CEIVIAT 1537
K+ PLGIR L+ G L +R+E +L G +IV+ T
Sbjct: 331 LRKWLEPLGIRVALLTGSLKGKARKEILEQLASGEIDIVVGT 372
>gnl|CDD|235032 PRK02362, PRK02362, ski2-like helicase; Provisional.
Length = 737
Score = 38.8 bits (91), Expect = 0.034
Identities = 47/159 (29%), Positives = 77/159 (48%), Gaps = 21/159 (13%)
Query: 732 EASLPTEILEIIEKIGYAEPTPIQRQAIPIGL-QNRDIIGVAETGSGKTLAFLLPLLVWI 790
E LP ++E E G E P Q +A+ GL ++++ T SGKTL
Sbjct: 5 ELPLPEGVIEFYEAEGIEELYPPQAEAVEAGLLDGKNLLAAIPTASGKTL---------- 54
Query: 791 QSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGL-SREEQG 849
+ ++A ++ +G A+ + P R LA + EE +F LG+R + G SR+E
Sbjct: 55 --IAELAMLKAIARGGKALYIVPLRALASEKFEEFERFE-ELGVRVGISTGDYDSRDE-- 109
Query: 850 FRLRLGC-EIVIATPGRLIDVLENRYLVLNQCTYIVLDE 887
LG +I++AT ++ +L N L+ T +V+DE
Sbjct: 110 ---WLGDNDIIVATSEKVDSLLRNGAPWLDDITCVVVDE 145
Score = 38.8 bits (91), Expect = 0.034
Identities = 47/159 (29%), Positives = 77/159 (48%), Gaps = 21/159 (13%)
Query: 1407 EASLPTEILEIIEKIGYAEPTPIQRQAIPIGL-QNRDIIGVAETGSGKTLAFLLPLLVWI 1465
E LP ++E E G E P Q +A+ GL ++++ T SGKTL
Sbjct: 5 ELPLPEGVIEFYEAEGIEELYPPQAEAVEAGLLDGKNLLAAIPTASGKTL---------- 54
Query: 1466 QSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGL-SREEQG 1524
+ ++A ++ +G A+ + P R LA + EE +F LG+R + G SR+E
Sbjct: 55 --IAELAMLKAIARGGKALYIVPLRALASEKFEEFERFE-ELGVRVGISTGDYDSRDE-- 109
Query: 1525 FRLRLGC-EIVIATPGRLIDVLENRYLVLNQCTYIVLDE 1562
LG +I++AT ++ +L N L+ T +V+DE
Sbjct: 110 ---WLGDNDIIVATSEKVDSLLRNGAPWLDDITCVVVDE 145
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 38.2 bits (89), Expect = 0.035
Identities = 16/66 (24%), Positives = 33/66 (50%), Gaps = 1/66 (1%)
Query: 521 KRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDK 580
KRQ ++E KK+++ KK E E + E + +E+ ++ +E+ ++DK
Sbjct: 366 KRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQA-DEDK 424
Query: 581 EREGEA 586
E+ +
Sbjct: 425 EKRQKD 430
Score = 30.9 bits (70), Expect = 7.3
Identities = 20/92 (21%), Positives = 38/92 (41%), Gaps = 9/92 (9%)
Query: 483 KKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQE 542
E + L+ K +E + K L + ++ EE K +EE+ K++E
Sbjct: 352 LYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEE 411
Query: 543 FTKEASFESKRENFDARLRRDREKKKEDPEEK 574
++ E E D+EK+++D +K
Sbjct: 412 -NEKKQKEQADE--------DKEKRQKDERKK 434
>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
mechanisms].
Length = 733
Score = 38.6 bits (90), Expect = 0.035
Identities = 56/310 (18%), Positives = 103/310 (33%), Gaps = 35/310 (11%)
Query: 772 AETGSGKTLAFLLP-LLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGT 830
A TG GKT A L+ L + + + +R+ I + P R + + + +
Sbjct: 221 APTGYGKTEASLILALALLDEKIKLKSRV---------IYVLPFRTIIEDMYRRAKEIFG 271
Query: 831 PLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADR 890
+ + L + Y IV+
Sbjct: 272 LFSVIGKSLHSSSKEPLLLE---------PDQDILLTLTTNDSY-KKLLLALIVVTPIQI 321
Query: 891 MIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENK----LLANYNSKKKYRQTVMF-TATM 945
+I + + L + + + D + LLA + + V+ +AT+
Sbjct: 322 LI-FSVKGFKFEFLALLLTSLVILDEVHLYADETMLAALLALLEALAEAGVPVLLMSATL 380
Query: 946 PPAVERLARSYLRRPATVYIGSVGKPTERIEQI--VYILSEQDKRKKLMEVLNR-GVK-- 1000
PP ++ + L + V + P E + + +D ++ + L VK
Sbjct: 381 PPFLKEKLKKALGKGREVVENAKFCPKEDEPGLKRKERVDVEDGPQEELIELISEEVKEG 440
Query: 1001 KPVIIFVNQKKGADVLAKGLEKLGYNACTLHG----GKGQEQRELALNSLKGGSKDILVA 1056
K V++ VN A L + L++ G LH +E+ K I+VA
Sbjct: 441 KKVLVIVNTVDRAIELYEKLKEKGPKVLLLHSRFTLKDREEKERELKKLFKQNEGFIVVA 500
Query: 1057 TDVAGRGIDI 1066
T V G+DI
Sbjct: 501 TQVIEAGVDI 510
Score = 38.6 bits (90), Expect = 0.035
Identities = 56/310 (18%), Positives = 103/310 (33%), Gaps = 35/310 (11%)
Query: 1447 AETGSGKTLAFLLP-LLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGT 1505
A TG GKT A L+ L + + + +R+ I + P R + + + +
Sbjct: 221 APTGYGKTEASLILALALLDEKIKLKSRV---------IYVLPFRTIIEDMYRRAKEIFG 271
Query: 1506 PLGIRTVLVVGGLSREEQGFRLRLGCEIVIATPGRLIDVLENRYLVLNQCTYIVLDEADR 1565
+ + L + Y IV+
Sbjct: 272 LFSVIGKSLHSSSKEPLLLE---------PDQDILLTLTTNDSY-KKLLLALIVVTPIQI 321
Query: 1566 MIDMGFEPDVQKILEYMPVTNLKPDTEDAEDENK----LLANYNSKKKYRQTVMF-TATM 1620
+I + + L + + + D + LLA + + V+ +AT+
Sbjct: 322 LI-FSVKGFKFEFLALLLTSLVILDEVHLYADETMLAALLALLEALAEAGVPVLLMSATL 380
Query: 1621 PPAVERLARSYLRRPATVYIGSVGKPTERIEQI--VYILSEQDKRKKLMEVLNR-GVK-- 1675
PP ++ + L + V + P E + + +D ++ + L VK
Sbjct: 381 PPFLKEKLKKALGKGREVVENAKFCPKEDEPGLKRKERVDVEDGPQEELIELISEEVKEG 440
Query: 1676 KPVIIFVNQKKGADVLAKGLEKLGYNACTLHG----GKGQEQRELALNSLKGGSKDILVA 1731
K V++ VN A L + L++ G LH +E+ K I+VA
Sbjct: 441 KKVLVIVNTVDRAIELYEKLKEKGPKVLLLHSRFTLKDREEKERELKKLFKQNEGFIVVA 500
Query: 1732 TDVAGRGIDI 1741
T V G+DI
Sbjct: 501 TQVIEAGVDI 510
Score = 33.6 bits (77), Expect = 1.2
Identities = 53/376 (14%), Positives = 112/376 (29%), Gaps = 42/376 (11%)
Query: 239 KKYRQTVMF-TATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQI--VYILSEQDKRK 295
+ V+ +AT+PP ++ + L + V + P E + + +D +
Sbjct: 367 AEAGVPVLLMSATLPPFLKEKLKKALGKGREVVENAKFCPKEDEPGLKRKERVDVEDGPQ 426
Query: 296 KLMEVLNR-GVK--KPVIIFVNQKKGADVLAKGLEKLGYNACTLHG----GKGQEQRELA 348
+ + L VK K V++ VN A L + L++ G LH +E+
Sbjct: 427 EELIELISEEVKEGKKVLVIVNTVDRAIELYEKLKEKGPKVLLLHSRFTLKDREEKEREL 486
Query: 349 LNSLKGGSKDILMAG-----------DRRSRSRSPPRK------RRSRSRERDYDRV-RE 390
K I++A D +P R +R +++ ++
Sbjct: 487 KKLFKQNEGFIVVATQVIEAGVDIDFDVLITELAPIDSLIQRAGRVNRHGKKENGKIYVY 546
Query: 391 RDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREG-----YRRER 445
D +R + ++LE+ E D + ++ R
Sbjct: 547 NDEERGPYLK---YSYEKLEKKLKSLEELPLEIINGDSLLSDEVDEKINQDILRIQDRIL 603
Query: 446 REEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSK 505
E K + + + + PL +E L ++
Sbjct: 604 SEAIELARLLDLLHKSSPDVLNSIFEYLKFLTDAEEIRDVPLEEKEELDERIESLYNLEI 663
Query: 506 PKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDRE 565
K+L +E + ++ ++E A E + + +
Sbjct: 664 LKYLEEELSEGPTRLGEKENELKLLVGLQELYLNY------AKGEYAWKVSLLVVIEEDG 717
Query: 566 KKKEDPEEKELNKDKE 581
D +E ++ E
Sbjct: 718 LVILDEKENLDSRKFE 733
>gnl|CDD|116627 pfam08017, Fibrinogen_BP, Fibrinogen binding protein. Proteins in
this family bind to fibrinogen. Members of this family
includes the fibrinogen receptor, FbsA, which mediates
platelet aggregation.
Length = 393
Score = 38.3 bits (88), Expect = 0.037
Identities = 54/261 (20%), Positives = 106/261 (40%), Gaps = 30/261 (11%)
Query: 364 DRRSRSRSPPRKRRSRSRERD-----YDRVRERDTDRSRDDRGRDRDRDRLERDRSRERE 418
D +RS+ +RR R E +R + +RS+ + R RD R + E
Sbjct: 37 DAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLE 96
Query: 419 REKRHHRRDRS------KERDGKDRREGYRRERREEEA----SGSKHKSRDKEGYEPTEQ 468
R +R ++RD +++ +G ERR+ +A G+ + R ++ ++
Sbjct: 97 RRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDAENKSQG 156
Query: 469 MVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAE------ALRKR 522
V+ + D E S +L +++ + E +S+ L + +R E L +R
Sbjct: 157 NVLERRQRD------AENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERR 210
Query: 523 QAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDR---EKKKEDPEEKELNKD 579
Q + E + ER++R K +R D + E+++ D E +
Sbjct: 211 QRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNV 270
Query: 580 KEREGEAIKERYLGLVKKKRR 600
ER ++ + G V ++R+
Sbjct: 271 LERRQRDVENKSQGNVLERRQ 291
Score = 36.8 bits (84), Expect = 0.11
Identities = 59/302 (19%), Positives = 122/302 (40%), Gaps = 45/302 (14%)
Query: 381 RERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREG 440
R+RD + + + R +R + + R R+ E + + +R ++RD ++R +G
Sbjct: 34 RQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLER-RQRDAENRSQG 92
Query: 441 YRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEE 500
ERR+ + + V+ + D E S +L +++ +
Sbjct: 93 NVLERRQRDVENKSQGN------------VLERRQRDV------ENKSQGNVLERRQRDA 134
Query: 501 EARSKPKFLTKEERAAE------ALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRE 554
E RS+ L + +R AE L +RQ + E + ER++R K +R
Sbjct: 135 ENRSQGNVLERRQRDAENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERR 194
Query: 555 NFDARLRRD---REKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKFVF 611
D + E+++ D E + ER ++ + G V ++R+ N
Sbjct: 195 QRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVEN------ 248
Query: 612 DWDASEDTSVDYNSIYKERHQVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKE 671
S+ N + + + + +GN+ + Q+ ++K G +LE+R+ +AE +
Sbjct: 249 ---KSQG-----NVLERRQRDAENRSQGNVLE---RRQRDVENKSQGNVLERRQRDAENK 297
Query: 672 QE 673
+
Sbjct: 298 SQ 299
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 38.6 bits (90), Expect = 0.041
Identities = 33/213 (15%), Positives = 69/213 (32%), Gaps = 37/213 (17%)
Query: 403 DRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEG 462
+ +LE +E + KE + R +++ +E + +
Sbjct: 558 KEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEE--- 614
Query: 463 YEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKR 522
+ E E ++AEEE S+ + L + E L+
Sbjct: 615 -LLQSLELSEAENE------------------LEEAEEELESELEKLNLQAELEELLQAA 655
Query: 523 QAEVEEMRKKMEEERKKRQEFTKE-----------ASFESKRENFDARLRRDREKKKEDP 571
E+EE +++E E ++ + + E + E L +K E
Sbjct: 656 LEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGE-- 713
Query: 572 EEKELNKDKEREGEAIKERYLGLVKKKRRVRRL 604
E+ + + + R +A E ++K + L
Sbjct: 714 IEQLIEELESR--KAELEELKKELEKLEKALEL 744
Score = 35.9 bits (83), Expect = 0.22
Identities = 67/432 (15%), Positives = 147/432 (34%), Gaps = 38/432 (8%)
Query: 279 ERIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHG 338
E + ++ +L E +++ + +E L R +++ + + L + LEKL
Sbjct: 274 EELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKL--------- 324
Query: 339 GKGQEQRELALNSLKGGSKDILMAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRD 398
K E+R L + L + + R + E + + + +
Sbjct: 325 -KSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKE---LEKA 380
Query: 399 DRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSR 458
+ + ++ + E E + E RE E E K
Sbjct: 381 LERLKQLEEAIQELKEELAELSAALEEIQEELEEL-EKELEELERELEELEEEI-KKLEE 438
Query: 459 DKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEA 518
E E M+ + A +K P+ +EL + + E E E
Sbjct: 439 QINQLESKELMI-----AELAGAGEKCPVCGQELPEEHEKELLE--------LYELELEE 485
Query: 519 LRKRQAEVEEMRKKMEEERKKRQEFTK-EASFESKRENFDARLRRDREKKKEDPEEKELN 577
L + + +E + EE + +E + E E +A EK ++ E
Sbjct: 486 LEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEEL 545
Query: 578 KDKEREGEA--IKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVD--YNSIYKERHQV 633
++ + + + +KE L + + ++ L + + E + + K+ ++
Sbjct: 546 EELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKEL 605
Query: 634 QFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQEKVRLKKVKKREEKQKWDDRH 693
+ ++ ++ Q + S+ E LE+ E E E EK+ L+ + + ++
Sbjct: 606 E----ERLSQLEELLQSLELSEAENE-LEEAEEELESELEKLNLQAELEELLQAALEELE 660
Query: 694 WTEKSLDEMTER 705
+ L+ R
Sbjct: 661 EKVEELEAEIRR 672
Score = 33.2 bits (76), Expect = 1.9
Identities = 26/128 (20%), Positives = 49/128 (38%), Gaps = 1/128 (0%)
Query: 479 GAAAKKEPLSLEELLAKKKAEEE-ARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEER 537
K+ +EEL + E + L +E + + L + Q E EE + E E
Sbjct: 177 KEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEA 236
Query: 538 KKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKK 597
+ + E E E L + + + +E ++ ER E ++E+ L +
Sbjct: 237 LEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEEL 296
Query: 598 KRRVRRLN 605
+R + L
Sbjct: 297 EREIEELE 304
>gnl|CDD|224557 COG1643, HrpA, HrpA-like helicases [DNA replication, recombination,
and repair].
Length = 845
Score = 38.5 bits (90), Expect = 0.044
Identities = 79/382 (20%), Positives = 141/382 (36%), Gaps = 116/382 (30%)
Query: 764 QNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTREL-----A 818
QN+ +I V ETGSGKT LP + +E+ I R L A
Sbjct: 64 QNQVVIIVGETGSGKTTQ--LPQFL----------LEEGLGIAGKIGCTQPRRLAARSVA 111
Query: 819 QQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRL------GCEIVIATPGRLIDVLEN 872
+++ EE G LG E G+ +R I + T G L+ ++N
Sbjct: 112 ERVAEE---LGEKLG------------ETVGYSIRFESKVSPRTRIKVMTDGILLREIQN 156
Query: 873 RYLVLNQCTYIVLDEA-DRMID----MGFEPDVQKILEYMPVTNLKPD--------TEDA 919
L L+ + +++DEA +R ++ +G D+ + D T DA
Sbjct: 157 DPL-LSGYSVVIIDEAHERSLNTDILLGLLKDLLA--------RRRDDLKLIIMSATLDA 207
Query: 920 EDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIV 979
E + N P +E R+Y P + + +
Sbjct: 208 ERFSAYFGN-----------------APVIEIEGRTY---PVEIRYLPEAEADYILLD-- 245
Query: 980 YILSEQDKRKKLMEVLNRGVKKP---VIIFVNQKKGADVLAKGLEK--LGYNA--CTLHG 1032
++ ++ +++ +++F+ ++ + A+ LEK LG + L+G
Sbjct: 246 ----------AIVAAVDIHLREGSGSILVFLPGQREIERTAEWLEKAELGDDLEILPLYG 295
Query: 1033 GKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIG--- 1089
E++ GG + +++AT++A + I + VI+ +AK + Y R G
Sbjct: 296 ALSAEEQVRVFEPAPGGKRKVVLATNIAETSLTIPGIRYVIDSGLAK-EKRYDPRTGLTR 354
Query: 1090 -------------RTGRAGKEG 1098
R GRAG+ G
Sbjct: 355 LETEPISKASADQRAGRAGRTG 376
Score = 38.5 bits (90), Expect = 0.044
Identities = 79/382 (20%), Positives = 141/382 (36%), Gaps = 116/382 (30%)
Query: 1439 QNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTREL-----A 1493
QN+ +I V ETGSGKT LP + +E+ I R L A
Sbjct: 64 QNQVVIIVGETGSGKTTQ--LPQFL----------LEEGLGIAGKIGCTQPRRLAARSVA 111
Query: 1494 QQIEEETNKFGTPLGIRTVLVVGGLSREEQGFRLRL------GCEIVIATPGRLIDVLEN 1547
+++ EE G LG E G+ +R I + T G L+ ++N
Sbjct: 112 ERVAEE---LGEKLG------------ETVGYSIRFESKVSPRTRIKVMTDGILLREIQN 156
Query: 1548 RYLVLNQCTYIVLDEA-DRMID----MGFEPDVQKILEYMPVTNLKPD--------TEDA 1594
L L+ + +++DEA +R ++ +G D+ + D T DA
Sbjct: 157 DPL-LSGYSVVIIDEAHERSLNTDILLGLLKDLLA--------RRRDDLKLIIMSATLDA 207
Query: 1595 EDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIV 1654
E + N P +E R+Y P + + +
Sbjct: 208 ERFSAYFGN-----------------APVIEIEGRTY---PVEIRYLPEAEADYILLD-- 245
Query: 1655 YILSEQDKRKKLMEVLNRGVKKP---VIIFVNQKKGADVLAKGLEK--LGYNA--CTLHG 1707
++ ++ +++ +++F+ ++ + A+ LEK LG + L+G
Sbjct: 246 ----------AIVAAVDIHLREGSGSILVFLPGQREIERTAEWLEKAELGDDLEILPLYG 295
Query: 1708 GKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIG--- 1764
E++ GG + +++AT++A + I + VI+ +AK + Y R G
Sbjct: 296 ALSAEEQVRVFEPAPGGKRKVVLATNIAETSLTIPGIRYVIDSGLAK-EKRYDPRTGLTR 354
Query: 1765 -------------RTGRAGKEG 1773
R GRAG+ G
Sbjct: 355 LETEPISKASADQRAGRAGRTG 376
>gnl|CDD|224120 COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA
replication, recombination, and repair].
Length = 654
Score = 38.2 bits (89), Expect = 0.044
Identities = 27/115 (23%), Positives = 38/115 (33%), Gaps = 15/115 (13%)
Query: 738 EILEIIEKIGYAEPTPIQRQAIP--IGLQNRDIIGVAE--TGSGKTLAFLLPLLVWIQSL 793
+ + EP P QR+ + E TG+GKTLA+LLP L + +
Sbjct: 3 DDGYLAVAFPGFEPRPEQREMAEAVAEALKGGEGLLIEAPTGTGKTLAYLLPALAYAREE 62
Query: 794 PKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQ 848
K II T+ L +Q+ EE L R
Sbjct: 63 GKK-----------VIISTRTKALQEQLLEEDLPIHKLLKKLGGKFALLKGRSNY 106
Score = 38.2 bits (89), Expect = 0.044
Identities = 27/115 (23%), Positives = 38/115 (33%), Gaps = 15/115 (13%)
Query: 1413 EILEIIEKIGYAEPTPIQRQAIP--IGLQNRDIIGVAE--TGSGKTLAFLLPLLVWIQSL 1468
+ + EP P QR+ + E TG+GKTLA+LLP L + +
Sbjct: 3 DDGYLAVAFPGFEPRPEQREMAEAVAEALKGGEGLLIEAPTGTGKTLAYLLPALAYAREE 62
Query: 1469 PKIARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQ 1523
K II T+ L +Q+ EE L R
Sbjct: 63 GKK-----------VIISTRTKALQEQLLEEDLPIHKLLKKLGGKFALLKGRSNY 106
Score = 37.1 bits (86), Expect = 0.090
Identities = 27/109 (24%), Positives = 36/109 (33%), Gaps = 15/109 (13%)
Query: 3 EKIGYAEPTPIQRQAIP--IGLQNRDIIGVAE--TGSGKTLAFLLPLLVWIQSLPKIARM 58
EP P QR+ + E TG+GKTLA+LLP L + + K
Sbjct: 9 VAFPGFEPRPEQREMAEAVAEALKGGEGLLIEAPTGTGKTLAYLLPALAYAREEGKK--- 65
Query: 59 EDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVVGGLSREEQ 107
II T+ L +Q+ EE L R
Sbjct: 66 --------VIISTRTKALQEQLLEEDLPIHKLLKKLGGKFALLKGRSNY 106
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 38.5 bits (90), Expect = 0.047
Identities = 56/269 (20%), Positives = 100/269 (37%), Gaps = 46/269 (17%)
Query: 443 RERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEP-LSLEELLAKKKAEEE 501
++R E EA + ++R EQ + D+ E A K++ L +L A+ K E
Sbjct: 640 QKRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLE 699
Query: 502 ARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLR 561
+ K++ R+ + E + +E E Q A+ E+ R ARL+
Sbjct: 700 QQQAFLEALKDD-----FRELRTERLAKWQVVEGELDN-QLAQLSAAIEAARTQAKARLK 753
Query: 562 -----RDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDAS 616
DRE D + + K+ +R+ E ++ + ++ VR
Sbjct: 754 ELKKQYDRELASLDVDPNTV-KELKRQIEELETTIERIAVRRPEVRE------------- 799
Query: 617 EDTSVDYNSIYKERHQVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEK-EQEKV 675
Y + +E + R++ L + + AE+ +QE
Sbjct: 800 ------YRAFMQETWL-------------HRDSLREERPNLAIQLRELESSAEELQQELT 840
Query: 676 RLKKVKKREEKQKWDDRHWTEKSLDEMTE 704
RL K K K+ +R EK LD++ E
Sbjct: 841 RLIKDTKLRRKKLEQERKALEKQLDQLDE 869
Score = 33.9 bits (78), Expect = 1.2
Identities = 33/159 (20%), Positives = 57/159 (35%), Gaps = 15/159 (9%)
Query: 382 ERDYDRVRE-RDTDRSRDDRGRDRDRD--RLERDRSREREREKRHHRRDRSKERDGKDRR 438
+R Y+R+++ RD + R E+DR + E + +++ +
Sbjct: 379 QRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKL 438
Query: 439 EGYRRERREEEASGSKHKSRDKEGYEPTEQMVISD--------EEEDKGAAAKKEPLSLE 490
E E E G + D P E + +EE + A A E L E
Sbjct: 439 EFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQLQSE 498
Query: 491 ELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEM 529
+K+ +E + L + ER LR+ E+E
Sbjct: 499 LRQLRKRRDEALEA----LQRAERRLLQLRQALDELELQ 533
Score = 32.7 bits (75), Expect = 2.3
Identities = 37/272 (13%), Positives = 93/272 (34%), Gaps = 9/272 (3%)
Query: 343 EQRELALNSLKGGSKDILMAGDRRSRSRSPPRKRRSR----SRERDYDRVRERDTDRSRD 398
EQ L+ + L+A + R R+ D+++E + +++
Sbjct: 256 EQLLSLELRLQHLHGE-LVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQE 314
Query: 399 DRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEAS--GSKHK 456
+ + +++ D + + D+ R E E EA K
Sbjct: 315 LSAANAKLAADRSELELLEDQKGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGK 374
Query: 457 SRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAA 516
+D + + I ++ E + ++ E ++KA E + ++
Sbjct: 375 HQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLE 434
Query: 517 EALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKEL 576
+ E E+ ++ +++ T + E D L + +E++++ E
Sbjct: 435 AGKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQ 494
Query: 577 NKDKEREGEAIKERYLGLVKKKRRVRRLNDRK 608
+ + R+ ++ L +++ RRL +
Sbjct: 495 LQSELRQLRKRRDEALEALQRAE--RRLLQLR 524
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 38.1 bits (88), Expect = 0.051
Identities = 32/157 (20%), Positives = 65/157 (41%), Gaps = 7/157 (4%)
Query: 483 KKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQE 542
K++ + + + K+A+ E S + + E + L+ R E+E K+ K E
Sbjct: 216 KEKACEIRDQITSKEAQLE--SSREIVKSYENELDPLKNRLKEIEHNLSKI---MKLDNE 270
Query: 543 FTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVR 602
S + + E ++ L EK + +E +LN ++E+ LV +R +
Sbjct: 271 IKALKSRKKQMEKDNSELELKMEKVFQGTDE-QLNDLYHNHQRTVREKERELVDCQRELE 329
Query: 603 RLNDRKFVFDWDASE-DTSVDYNSIYKERHQVQFFGR 638
+LN + + + + +E + +RHQ R
Sbjct: 330 KLNKERRLLNQEKTELLVEQGRLQLQADRHQEHIRAR 366
Score = 36.9 bits (85), Expect = 0.14
Identities = 66/364 (18%), Positives = 141/364 (38%), Gaps = 66/364 (18%)
Query: 374 RKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSK--- 430
+R R +ER+ + +++ R ++++ L ++ R + + RH R++
Sbjct: 310 HQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEHIRARDSL 369
Query: 431 -------------ERDGKDRRE-----GYRRERREEEASGSKHKSRDKEGYEPTEQMVIS 472
ER R+ ER+E+EA + D + E +Q
Sbjct: 370 IQSLATRLELDGFERGPFSERQIKNFHTLVIERQEDEAKTAAQLCADLQSKERLKQEQAD 429
Query: 473 DEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKK 532
+ ++K + + L++ + +KK EE KF+ KE + E R E+++ +K
Sbjct: 430 EIRDEKKGLGRT--IELKKEILEKKQEEL-----KFVIKELQQLEGSSDRILELDQELRK 482
Query: 533 MEEERKKRQEFTKEASFESKRENF----DARLRRDREKKKEDPEEKELNKDKEREGEAIK 588
E E K + K + E+ ++ + + DR+ +K D E ++LN +
Sbjct: 483 AERELSKAE---KNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTT-----R 534
Query: 589 ERYLGLVKKK----RRVRRLNDR------KFVFDWDASEDTSVDYNSIYKERHQVQFFGR 638
+ L K K ++R++ R + + + +S KE +Q R
Sbjct: 535 TQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNKKQLEDWLHSKSKEINQT----R 590
Query: 639 GNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEK------------EQEKVRLKKVKKREEK 686
+A ++ + +Q+K + + + E + E+ L+++K+ EK
Sbjct: 591 DRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLFDVCGSQDEESDLERLKEEIEK 650
Query: 687 QKWD 690
Sbjct: 651 SSKQ 654
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 37.7 bits (87), Expect = 0.059
Identities = 35/145 (24%), Positives = 67/145 (46%), Gaps = 16/145 (11%)
Query: 407 DRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPT 466
D D ER+R++ ++R K R ++ R +R + A K RD P
Sbjct: 183 DEDYPDDMEERDRKRYSEAKNREK-RAKLKNQDNARLKRLVQIA-----KKRD-----PR 231
Query: 467 EQMVISDEEEDKGAAAKKEPL--SLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQA 524
+ ++E++ K E + + LA K + EA++K + + +A A++K+
Sbjct: 232 IKS-FKEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAK 290
Query: 525 EVEEMRKKMEEERKKRQEFTKEASF 549
EV M+K ++ E+K + K+A +
Sbjct: 291 EV--MKKALKMEKKAIKNAAKDADY 313
>gnl|CDD|137505 PRK09751, PRK09751, putative ATP-dependent helicase Lhr; Provisional.
Length = 1490
Score = 38.0 bits (88), Expect = 0.065
Identities = 23/62 (37%), Positives = 30/62 (48%)
Query: 1031 HGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGR 1090
HG +EQR + +LK G +VAT GID+ V +VI S+ RIGR
Sbjct: 308 HGSVSKEQRAITEQALKSGELRCVVATSSLELGIDMGAVDLVIQVATPLSVASGLQRIGR 367
Query: 1091 TG 1092
G
Sbjct: 368 AG 369
Score = 38.0 bits (88), Expect = 0.065
Identities = 23/62 (37%), Positives = 30/62 (48%)
Query: 1706 HGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGR 1765
HG +EQR + +LK G +VAT GID+ V +VI S+ RIGR
Sbjct: 308 HGSVSKEQRAITEQALKSGELRCVVATSSLELGIDMGAVDLVIQVATPLSVASGLQRIGR 367
Query: 1766 TG 1767
G
Sbjct: 368 AG 369
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 37.7 bits (87), Expect = 0.074
Identities = 38/226 (16%), Positives = 84/226 (37%), Gaps = 22/226 (9%)
Query: 384 DYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRR 443
D D + D D +D D + ++ + + E + E + + ++ D +
Sbjct: 3986 DLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDE 4045
Query: 444 ERREEEAS--GSKHKSRDKEGYE--------PTEQMVISDEEEDKGAAAKKEPLSLEE-- 491
+ ++++ S + +++G+E TE V SDEE ++G + + +
Sbjct: 4046 DIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKM 4105
Query: 492 ----LLAKKKAEEEARSK----PKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEF 543
A +A+EE K EE E E++++ ++ E
Sbjct: 4106 DAKSTFASAEADEENTDKGIVGENEELGEEDGVRGNGTADGEFEQVQEDTSTPKEAMSEA 4165
Query: 544 TKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKE 589
++S ++ + +R + ED E + + E +KE
Sbjct: 4166 D--RQYQSLGDHLREWQQANRIHEWEDLTESQSQAFDDSEFMHVKE 4209
Score = 37.3 bits (86), Expect = 0.12
Identities = 38/201 (18%), Positives = 70/201 (34%), Gaps = 7/201 (3%)
Query: 390 ERDTDRSRDDRGRDRDRDRLERDRSRERERE--KRHHRRDRSKERDGKDRREGYRRERRE 447
E D DD D+DR E++ E + + +E + + E + E
Sbjct: 3932 ESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPE 3991
Query: 448 EEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKA--EEEARSK 505
+ K K+ M +DE +++ A K EP+ E+ L + E+ +
Sbjct: 3992 DLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDD 4051
Query: 506 PKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDRE 565
L +++ E + + E + E + + E + + D + D +
Sbjct: 4052 FSDLAEDD---EKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAK 4108
Query: 566 KKKEDPEEKELNKDKEREGEA 586
E E N DK GE
Sbjct: 4109 STFASAEADEENTDKGIVGEN 4129
Score = 35.0 bits (80), Expect = 0.51
Identities = 41/248 (16%), Positives = 94/248 (37%), Gaps = 22/248 (8%)
Query: 361 MAGDRRSRSRSPPRKRRSRSRERDYDR----VRERDTDRSRDDRGRDRDRDRLERDRSRE 416
M G + S S +D D + E ++ + + + + L +
Sbjct: 3864 MNGVTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKS 3923
Query: 417 REREKRHHRRDR-SKERDGKDRREGYRRERREEEAS----GSKHKSRDKEGYEPTEQMVI 471
E+ ++ D SKE D K + R+E+ +EE G + + E Q
Sbjct: 3924 NEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQ-ENNSQPPP 3982
Query: 472 SDE----------EEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRK 521
+E +E +G +K L ++ A + +EEA ++ ++E E
Sbjct: 3983 ENEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNT 4042
Query: 522 RQAEV--EEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKD 579
++ ++ E++ K ++ +E E++ D + ++ E PE++ ++
Sbjct: 4043 LDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNH 4102
Query: 580 KEREGEAI 587
+ + ++
Sbjct: 4103 PKMDAKST 4110
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 35.4 bits (82), Expect = 0.076
Identities = 25/99 (25%), Positives = 45/99 (45%), Gaps = 11/99 (11%)
Query: 444 ERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEAR 503
+R E SK K + + +++E+D+ A K P +KKA+E+ R
Sbjct: 20 SKRSELKKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKRPE------GRKKAKEKLR 73
Query: 504 -----SKPKFLTKEERAAEALRKRQAEVEEMRKKMEEER 537
+K + KE+ E K AE E+ R ++E+++
Sbjct: 74 RDKLKAKKEEAEKEKEKEERFMKALAEAEKERAELEKKK 112
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 37.1 bits (86), Expect = 0.084
Identities = 29/176 (16%), Positives = 66/176 (37%), Gaps = 13/176 (7%)
Query: 415 REREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDE 474
E ++ + +R + + K+ + + +++ + + K+ + + E
Sbjct: 44 LEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKLE 103
Query: 475 EEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKME 534
+ K + L L E + ++E K L AE + K++ + +
Sbjct: 104 ALELLNLEKDKELELLEKELDELSKELQ----KQLQNT---AEIIEKKRENNKNEERLKF 156
Query: 535 EERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKER 590
E KK +E S E +RE F+ +L + E++ + K + +K R
Sbjct: 157 ENEKKLEE-----SLELEREKFEEQLHEANLDLEFKENEEQR-ESKWAILKKLKRR 206
>gnl|CDD|234365 TIGR03817, DECH_helic, helicase/secretion neighborhood putative
DEAH-box helicase. A conserved gene neighborhood widely
spread in the Actinobacteria contains this
uncharacterized DEAH-box family helicase encoded
convergently towards an operon of genes for protein
homologous to type II secretion and pilus formation
proteins. The context suggests that this helicase may
play a role in conjugal transfer of DNA.
Length = 742
Score = 37.4 bits (87), Expect = 0.087
Identities = 40/160 (25%), Positives = 66/160 (41%), Gaps = 22/160 (13%)
Query: 736 PTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPK 795
+++ +E G P Q +A + R ++ T SGK+LA+ LP+L
Sbjct: 22 HPDVVAALEAAGIHRPWQHQARAAELAHAGRHVVVATGTASGKSLAYQLPVL-------- 73
Query: 796 IARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVV---GGLSREEQGFRL 852
+ + D D A+ +APT+ LA L +R V G EE+ +
Sbjct: 74 -SALAD-DPRATALYLAPTKALAADQLRAV----RELTLRGVRPATYDGDTPTEERRW-A 126
Query: 853 RLGCEIVIATPGRL-IDVLEN--RYL-VLNQCTYIVLDEA 888
R V+ P L +L + R+ L + Y+V+DE
Sbjct: 127 REHARYVLTNPDMLHRGILPSHARWARFLRRLRYVVIDEC 166
Score = 37.4 bits (87), Expect = 0.087
Identities = 40/160 (25%), Positives = 66/160 (41%), Gaps = 22/160 (13%)
Query: 1411 PTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPK 1470
+++ +E G P Q +A + R ++ T SGK+LA+ LP+L
Sbjct: 22 HPDVVAALEAAGIHRPWQHQARAAELAHAGRHVVVATGTASGKSLAYQLPVL-------- 73
Query: 1471 IARMEDADQGPYAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVV---GGLSREEQGFRL 1527
+ + D D A+ +APT+ LA L +R V G EE+ +
Sbjct: 74 -SALAD-DPRATALYLAPTKALAADQLRAV----RELTLRGVRPATYDGDTPTEERRW-A 126
Query: 1528 RLGCEIVIATPGRL-IDVLEN--RYL-VLNQCTYIVLDEA 1563
R V+ P L +L + R+ L + Y+V+DE
Sbjct: 127 REHARYVLTNPDMLHRGILPSHARWARFLRRLRYVVIDEC 166
Score = 35.5 bits (82), Expect = 0.31
Identities = 39/149 (26%), Positives = 61/149 (40%), Gaps = 22/149 (14%)
Query: 6 GYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGP 65
G P Q +A + R ++ T SGK+LA+ LP+L + + D D
Sbjct: 33 GIHRPWQHQARAAELAHAGRHVVVATGTASGKSLAYQLPVL---------SALAD-DPRA 82
Query: 66 YAIIMAPTRELAQQIEEETNKFGTPLGIRTVLVV---GGLSREEQGFRLRLGCEIVIATP 122
A+ +APT+ LA L +R V G EE+ + R V+ P
Sbjct: 83 TALYLAPTKALAADQLRAV----RELTLRGVRPATYDGDTPTEERRW-AREHARYVLTNP 137
Query: 123 GRL-IDVLEN--RYL-VLNQCTYIVLDEA 147
L +L + R+ L + Y+V+DE
Sbjct: 138 DMLHRGILPSHARWARFLRRLRYVVIDEC 166
Score = 32.4 bits (74), Expect = 2.7
Identities = 34/147 (23%), Positives = 59/147 (40%), Gaps = 36/147 (24%)
Query: 1679 IIFVNQKKGADVLAKGLEKL-------------GYNACTLHGGKGQEQRELALNSLKGGS 1725
+ FV ++GA+++A +L Y A L +++REL +L+ G
Sbjct: 275 LTFVRSRRGAELVAAIARRLLGEVDPDLAERVAAYRAGYL----PEDRRELE-RALRDGE 329
Query: 1726 KDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKDDSH 1785
+ T+ G+DI + V+ + + GR GR G+ L V +DD
Sbjct: 330 LLGVATTNALELGVDISGLDAVVIAGFPGTRASLWQQAGRAGRRGQGALVV-LVARDD-- 386
Query: 1786 LFYDLKQMMISSPVSTCPPELLNHPDA 1812
P+ T L++HP+A
Sbjct: 387 ------------PLDT---YLVHHPEA 398
Score = 32.0 bits (73), Expect = 4.2
Identities = 29/124 (23%), Positives = 51/124 (41%), Gaps = 20/124 (16%)
Query: 1004 IIFVNQKKGADVLAKGLEKL-------------GYNACTLHGGKGQEQRELALNSLKGGS 1050
+ FV ++GA+++A +L Y A L +++REL +L+ G
Sbjct: 275 LTFVRSRRGAELVAAIARRLLGEVDPDLAERVAAYRAGYL----PEDRRELE-RALRDGE 329
Query: 1051 KDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSFCTKD--D 1108
+ T+ G+DI + V+ + + GR GR G+ L V D D
Sbjct: 330 LLGVATTNALELGVDISGLDAVVIAGFPGTRASLWQQAGRAGRRGQGALVVLVARDDPLD 389
Query: 1109 SHLF 1112
++L
Sbjct: 390 TYLV 393
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim
have been identified in both mice and humans (mTim and
hTIM, respectively). Mammalian TIM is not the true
orthologue of Drosophila TIM, but is the likely
orthologue of a fly gene, timeout (also called tim-2).
mTim has been shown to be essential for embryonic
development, but does not have substantiated circadian
function. Some family members contain a SANT domain in
this region.
Length = 507
Score = 37.0 bits (85), Expect = 0.090
Identities = 50/248 (20%), Positives = 84/248 (33%), Gaps = 17/248 (6%)
Query: 375 KRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLE----RDRSREREREKRHHRRDRSK 430
+R RD D V R R + D+L RE K+ ++ + K
Sbjct: 177 LQRLYEEFRDSDDVLGHIMKNITGKRSRAQVVDKLLQLGLVSEKRELRGGKKRRKKLKPK 236
Query: 431 ERDGKDRREGYRRERREEEASGSKHKSRDKE----GYEPTEQMVISDEEEDKGAAAKKEP 486
+ +G++ E +E EEE + K + E ++ +IS E + +K
Sbjct: 237 QPNGEESGEDDFQEDPEEEEQLPESKPEETEKRVSAFQVEGSTLISAENLRQQLKQEKTS 296
Query: 487 LSLEEL------LAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKR 540
L L A + E+E + E EA+ Q + + K + R R
Sbjct: 297 WPLLWLQSCLIRAADDREEDECDQAVPLVPLTEENEEAMENEQ--FQRLLKAL-GLRPPR 353
Query: 541 QEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRR 600
+K + R R +E+ E EL D + E E ++K R
Sbjct: 354 SGQEGFWRIPAKLSSTQLRRRAASLSGEEEEPEDELKDDVDGEQADESEHETLALRKNAR 413
Query: 601 VRRLNDRK 608
R+
Sbjct: 414 QRKAGLAS 421
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 36.4 bits (84), Expect = 0.096
Identities = 23/136 (16%), Positives = 53/136 (38%), Gaps = 6/136 (4%)
Query: 452 GSKHKSRDKEGYEPTEQMVISDEEE----DKGAAAKKEPLSLEELLAKKKAEEEARSKPK 507
G K KS + P E ++EE K +E + E+L + + ++ +
Sbjct: 66 GLKEKSTSEPTVPPEEAEPHAEEEGQLAVRKTKQKVEEEVK-EQLQSLLEKIVVSKQEED 124
Query: 508 FLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKK 567
KE + E + ++ ++ +E + + E+ E + +++ +
Sbjct: 125 GPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQ-YKQDMKEKASE 183
Query: 568 KEDPEEKELNKDKERE 583
+E+ + KE + ER
Sbjct: 184 QENEDSKEPVEKAERT 199
Score = 31.4 bits (71), Expect = 3.9
Identities = 24/126 (19%), Positives = 49/126 (38%), Gaps = 6/126 (4%)
Query: 467 EQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEV 526
E++V+S +EED K+ L ++ + + R + K E +
Sbjct: 114 EKIVVSKQEEDG--PGKEPQLDEDK--FLLAEDSDDRQETLEAGKVHEETEDSYHVEETA 169
Query: 527 EEMRKKMEEERKKRQEF--TKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREG 584
E K+ +E+ QE +KE +++R + + + +ED ++ KE
Sbjct: 170 SEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELA 229
Query: 585 EAIKER 590
+ E
Sbjct: 230 KEPVEE 235
Score = 30.7 bits (69), Expect = 6.1
Identities = 20/117 (17%), Positives = 36/117 (30%), Gaps = 4/117 (3%)
Query: 415 REREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEP--TEQMVIS 472
+E + D DR+E + EE S H ++
Sbjct: 124 DGPGKEPQLDEDKFLLAEDSDDRQETLEAGKVHEETEDSYHVEETASEQYKQDMKEKASE 183
Query: 473 DEEEDKGAAAKK--EPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVE 527
E ED +K + + + ++ +EE +E A+ + Q EV
Sbjct: 184 QENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEELAKEPVEEQQEVP 240
>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1. TT virus (TTV), isolated
initially from a Japanese patient with hepatitis of
unknown aetiology, has since been found to infect both
healthy and diseased individuals and numerous prevalence
studies have raised questions about its role in
unexplained hepatitis. ORF1 is a large 750 residue
protein. The N-terminal half of this protein corresponds
to the capsid protein.
Length = 525
Score = 36.9 bits (86), Expect = 0.11
Identities = 26/70 (37%), Positives = 27/70 (38%), Gaps = 10/70 (14%)
Query: 361 MAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSRERERE 420
MA R R R RR R R R R R R R R R R R R R R
Sbjct: 1 MAWRWWRRRRRRWRGRRRRRRRRRARRRR----------RRRRVRRRRRGRRRRRRRRRR 50
Query: 421 KRHHRRDRSK 430
+R RR R K
Sbjct: 51 RRRRRRKRKK 60
Score = 31.1 bits (71), Expect = 7.2
Identities = 21/52 (40%), Positives = 26/52 (50%), Gaps = 1/52 (1%)
Query: 395 RSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERR 446
R R RGR R R R R R R R +R R R + R + RR RR+R+
Sbjct: 9 RRRRWRGRRRRRRRRRARRRRRRRRVRRRRRGRRRRRRRRR-RRRRRRRKRK 59
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 36.9 bits (85), Expect = 0.11
Identities = 38/186 (20%), Positives = 72/186 (38%), Gaps = 12/186 (6%)
Query: 403 DRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEG 462
D D D + + E RE + ++ RE +R ++ + DK+
Sbjct: 171 DVDTDSISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLK----EELDKKQ 226
Query: 463 YEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKR 522
+ + +D +D + E ++ K + A + KE++ +KR
Sbjct: 227 IDADKAQQKADFAQDNADKQRDE--VRQKQQEAKNLPKPADTSSP---KEDKQVAENQKR 281
Query: 523 QAE--VEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDK 580
+ E E++K EE K + + ESK +A + +KK +P ++L K K
Sbjct: 282 EIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTK 341
Query: 581 EREGEA 586
+ EA
Sbjct: 342 P-QVEA 346
Score = 34.6 bits (79), Expect = 0.60
Identities = 28/156 (17%), Positives = 69/156 (44%), Gaps = 3/156 (1%)
Query: 388 VRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERRE 447
++ + R+D + + R D +ERE ++ R + KE K + + + +++
Sbjct: 178 SDKKVVEALREDNEKGVNFRRDMTD-LKERESQEDAKRAQQLKEELDKKQIDADKAQQKA 236
Query: 448 EEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPK 507
+ A + K RD E + ++ + D + + + ++ + +KA+ E + +
Sbjct: 237 DFAQDNADKQRD-EVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDE 295
Query: 508 FLTK-EERAAEALRKRQAEVEEMRKKMEEERKKRQE 542
K ++ A L++ E+ + E E +K++E
Sbjct: 296 EALKAKDHKAFDLKQESKASEKEAEDKELEAQKKRE 331
Score = 30.4 bits (68), Expect = 9.9
Identities = 30/150 (20%), Positives = 60/150 (40%), Gaps = 8/150 (5%)
Query: 370 RSPPRKRRSRSRERDYDRVR-ERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDR 428
R+ K ++ D D+ + + D + D+ RD R + + ++ + + + D+
Sbjct: 214 RAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDK 273
Query: 429 SKERDGKDRREGYRRE--RREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEP 486
+ K E + E + +EEA +K E E EDK A+K+
Sbjct: 274 QVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKAS---EKEAEDKELEAQKKR 330
Query: 487 LSLEELLAKKKAEEEARSKPKFLTKEERAA 516
+ E L K K + ++P L ++ +
Sbjct: 331 EPVAEDLQKTK--PQVEAQPTSLNEDAIDS 358
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 37.1 bits (87), Expect = 0.11
Identities = 26/118 (22%), Positives = 57/118 (48%), Gaps = 6/118 (5%)
Query: 480 AAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKK 539
A KE L+E L +KK + + E+ A +A+++ + E +E+ K++ + +K
Sbjct: 540 EALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKG 599
Query: 540 RQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKK 597
K R+ + EKK++ ++++ +++ + G+ +K YL L +K
Sbjct: 600 GYASVKAHELIEARKRLNKAN----EKKEKKKKKQKEKQEELKVGDEVK--YLSLGQK 651
Score = 36.7 bits (86), Expect = 0.13
Identities = 25/114 (21%), Positives = 52/114 (45%), Gaps = 13/114 (11%)
Query: 488 SLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFT-KE 546
SLEEL +++ E++A + AE L++ E +E ++ E++ + E ++
Sbjct: 524 SLEEL--ERELEQKAEE----AEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQ 577
Query: 547 ASFESKRENFD--ARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKK 598
A E+K+E + LR+ ++ + EL + + + + KKK
Sbjct: 578 AIKEAKKEADEIIKELRQLQKGGYASVKAHEL-IEARK---RLNKANEKKEKKK 627
Score = 32.5 bits (75), Expect = 2.5
Identities = 20/80 (25%), Positives = 46/80 (57%), Gaps = 6/80 (7%)
Query: 513 ERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKE--- 569
E A + + + + ++ E+ +EE ++ ++ +EA E+ + + +L+ + E+KKE
Sbjct: 505 EEAKKLIGEDKEKLNELIASLEELERELEQKAEEA--EALLKEAE-KLKEELEEKKEKLQ 561
Query: 570 DPEEKELNKDKEREGEAIKE 589
+ E+K L + ++ +AIKE
Sbjct: 562 EEEDKLLEEAEKEAQQAIKE 581
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 35.8 bits (83), Expect = 0.13
Identities = 35/182 (19%), Positives = 70/182 (38%), Gaps = 24/182 (13%)
Query: 382 ERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGY 441
E D + E + + D D D E + E E EK R +R K++ + + + Y
Sbjct: 38 EDDEEFEIEEEEEEEEVDSDFDDSEDD-EPESDDEEEGEKELQREERLKKK-KRVKTKAY 95
Query: 442 R-----RERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKG-------AAAKKEPLSL 489
+ +++++ A+ S + + + + KE
Sbjct: 96 KEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSSRSSTVQNKEATHE 155
Query: 490 --------EELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRK--KMEEERKK 539
+ + K + + + K K LT+EER AEA + ++ + + + EEE+KK
Sbjct: 156 RLKEREIRRKKIQAKARKRKEKKKEKELTQEERLAEAKETERINLKSLERYEEQEEEKKK 215
Query: 540 RQ 541
+
Sbjct: 216 AK 217
Score = 34.7 bits (80), Expect = 0.30
Identities = 33/130 (25%), Positives = 44/130 (33%), Gaps = 18/130 (13%)
Query: 426 RDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKE 485
E + D EG + +REE K EPT++ D K A
Sbjct: 60 DSEDDEPESDDEEEGEKELQREERLKKKKRVKTKA-YKEPTKKKKKKDPTAAKSPKAAAP 118
Query: 486 PLSLEELLAKKKAEEEARSKPKFLTKEERA---------AEALRKRQAEVEEMRKKMEEE 536
KK E P L R EA +R E E RKK++ +
Sbjct: 119 --------RPKKKSERISWAPTLLDSPRRKSSRSSTVQNKEATHERLKEREIRRKKIQAK 170
Query: 537 RKKRQEFTKE 546
+KR+E KE
Sbjct: 171 ARKRKEKKKE 180
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 36.9 bits (85), Expect = 0.14
Identities = 23/114 (20%), Positives = 43/114 (37%), Gaps = 7/114 (6%)
Query: 426 RDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKE 485
+ K GK E + + + V+ E + A ++
Sbjct: 186 FAKKKSLHGK-------AELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQT 238
Query: 486 PLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKK 539
S L K++A+EE K + L + E LR ++A +EE ++++ RK
Sbjct: 239 QQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKA 292
Score = 30.7 bits (69), Expect = 9.2
Identities = 24/189 (12%), Positives = 57/189 (30%), Gaps = 18/189 (9%)
Query: 413 RSREREREK-RHHRRDRSKERDGKD----------RREGYRRERREEEASGSKHKSRDKE 461
+S+ R R K R K++ + +E + R+ E S + +
Sbjct: 317 QSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHT 376
Query: 462 GYEPTEQMV-ISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALR 520
+ + K + KE L+ A A + + + L+
Sbjct: 377 LTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQ 436
Query: 521 KRQAEV------EEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEK 574
+R AE+ + + E+ ++ E + + + ++ KK
Sbjct: 437 QRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARL 496
Query: 575 ELNKDKERE 583
+++
Sbjct: 497 LELQEEPCP 505
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 36.6 bits (84), Expect = 0.15
Identities = 36/213 (16%), Positives = 79/213 (37%), Gaps = 24/213 (11%)
Query: 365 RRSRSRSPPRKRRSRSRERD--------YDRVRER---------DTDRSRDD--RGRDRD 405
+R + + R+ S E D D V E + D+ +D R D D
Sbjct: 103 KREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPENALPSDEDDKDPNDPYRALDID 162
Query: 406 RDRLERDRSREREREKRHHRRDRSKERDG-----KDRREGYRRERREEEASGSKHKSRDK 460
D+ D + ++ R+ +S E+ K ++ ++E++E+E K K ++
Sbjct: 163 LDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEV 222
Query: 461 EGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALR 520
EG++ + A E + E K + +++ + +
Sbjct: 223 EGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKK 282
Query: 521 KRQAEVEEMRKKMEEERKKRQEFTKEASFESKR 553
K+Q + +E +KK ++ R + + + +
Sbjct: 283 KKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQ 315
Score = 35.0 bits (80), Expect = 0.36
Identities = 43/208 (20%), Positives = 79/208 (37%), Gaps = 15/208 (7%)
Query: 410 ERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQM 469
E R R+R + + ++ R KE+ G+ R E E+ A D E E
Sbjct: 86 EERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQM---VDIVTEEMPENA 142
Query: 470 VISDEEE-DKGAAAKKEPLSLEELLA--------KKKAEEEARSKPKFLTKEERAAEALR 520
+ SDE++ D + + L++ LA K + E ++S K + A E
Sbjct: 143 LPSDEDDKDPNDPYRALDIDLDKPLADSEKLPVQKHRNAETSKSPEK---GDVPAVEKKS 199
Query: 521 KRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDK 580
K+ + E+ K+ E ++ K++E S ++ A E +
Sbjct: 200 KKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAP 259
Query: 581 EREGEAIKERYLGLVKKKRRVRRLNDRK 608
+ E + K+ KK + ++ RK
Sbjct: 260 DSEPDEPKDAEAEETKKSPKHKKKKQRK 287
>gnl|CDD|217047 pfam02455, Hex_IIIa, Hexon-associated protein (IIIa). The major
capsid protein of the adenovirus strain is also known as
a hexon. This is a family of hexon-associated proteins
(protein IIIa).
Length = 491
Score = 36.2 bits (84), Expect = 0.15
Identities = 16/107 (14%), Positives = 24/107 (22%), Gaps = 3/107 (2%)
Query: 364 DRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRH 423
+ S PP S R R + EK
Sbjct: 358 EDDSLFAPPPSVAGPSSPAPSLSRYSSVPRSAFDGRLADSRLPGAGDLLAPL---AEKNA 414
Query: 424 HRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMV 470
+ + D R + Y +E RE S + + E
Sbjct: 415 YNNPLERLVDKMSRWKTYAQELRELNDSFPASRRPRIREAQDEEDDS 461
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 36.1 bits (84), Expect = 0.16
Identities = 28/91 (30%), Positives = 41/91 (45%), Gaps = 28/91 (30%)
Query: 517 EALRK----RQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPE 572
E LRK R+ E E++ K EEER QE +E E K+E R+ + K PE
Sbjct: 256 EVLRKVDKTREEEEEKILKAAEEER---QEEAQEKKEEKKKE------EREAKLAKLSPE 306
Query: 573 EKELNKDKEREGEAIKERYLGLVKKKRRVRR 603
E K +E+E +K++ R+
Sbjct: 307 E--QRKLEEKE-------------RKKQARK 322
Score = 33.4 bits (77), Expect = 0.98
Identities = 17/70 (24%), Positives = 34/70 (48%), Gaps = 8/70 (11%)
Query: 490 EELLAK-----KKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKM-EEERKKRQEF 543
E+L K ++ EE+ + +EE + K++ E E K+ EE++K +E
Sbjct: 255 PEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEE- 313
Query: 544 TKEASFESKR 553
KE ++++
Sbjct: 314 -KERKKQARK 322
Score = 31.1 bits (71), Expect = 5.1
Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 13/68 (19%)
Query: 474 EEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKM 533
EEE++ E EE KK+ ++ KEER A+ L K EE RK
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKK----------KEEREAK-LAKLSP--EEQRKLE 312
Query: 534 EEERKKRQ 541
E+ERKK+
Sbjct: 313 EKERKKQA 320
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 34.9 bits (81), Expect = 0.16
Identities = 23/90 (25%), Positives = 39/90 (43%), Gaps = 4/90 (4%)
Query: 490 EELLAKKKAEEEARSKPKFLTKEERAAE---ALRKRQAEVEEMRKKMEEERKKRQEFTKE 546
E+ ++ +EE + + EE E L + AE E RK+ E ER++ +++ E
Sbjct: 2 EKEEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAKKYAIE 61
Query: 547 ASFESKRENFDARLRRDREKKKEDPEEKEL 576
+ D L R KE+ + K L
Sbjct: 62 KFAKDLLPVLDN-LERALAAAKENEDVKSL 90
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 36.2 bits (84), Expect = 0.18
Identities = 36/138 (26%), Positives = 67/138 (48%), Gaps = 24/138 (17%)
Query: 471 ISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAE----- 525
+ D+ E A+AK+E L + LA+ KAEEE E AL++++ E
Sbjct: 157 LKDDLESLIASAKEELDQLSKKLAELKAEEE-----------EELERALKEKREELLSKL 205
Query: 526 ----VEEMRKKM-EEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDK 580
+ + K E++ R EF +E E R+ ++ +LR++ E++ E E+K N+
Sbjct: 206 EEELLARLESKEAALEKQLRLEFEREK--EELRKKYEEKLRQELERQAEAHEQKLKNELA 263
Query: 581 EREGEAIKERYLGLVKKK 598
+ E ++ + +K+K
Sbjct: 264 LQAIE-LQREFNKEIKEK 280
>gnl|CDD|237791 PRK14701, PRK14701, reverse gyrase; Provisional.
Length = 1638
Score = 36.4 bits (84), Expect = 0.18
Identities = 85/403 (21%), Positives = 150/403 (37%), Gaps = 64/403 (15%)
Query: 696 EKSLDEMTERDWRIFREDYSITIKGGKVPDPVRNWKEASLPTEILEIIEKIGYAEPTPIQ 755
EK L E + R + D W E E E EKI E IQ
Sbjct: 28 EKCLKEPNVFEKYAVRNALKLRGTLKFWEDFYILWNEVE---EFEEFFEKITGFEFWSIQ 84
Query: 756 RQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTR 815
+ L+ + VA TG GK+ F + +++ A +G I+ PT
Sbjct: 85 KTWAKRILRGKSFSIVAPTGMGKS-TFGAFIALFL-----------ALKGKKCYIILPTT 132
Query: 816 ELAQQIEEETNKF--GTPLGIRTVLVVGGLS---REEQGFRLRLG-CEIVIAT----PGR 865
L +Q E+ F L +R V L +EE R+ G +I++ T
Sbjct: 133 LLVKQTVEKIESFCEKANLDVRLVYYHSNLRKKEKEEFLERIENGDFDILVTTAQFLARN 192
Query: 866 LIDVLENRYLVLNQCTYIVLDEAD------RMID-----MGFEPDVQKILEYMPVTNLKP 914
++ ++ +I +D+ D + ID +GF ++ I + + LK
Sbjct: 193 FPEMKHLKF------DFIFVDDVDAFLKASKNIDRSLQLLGFYEEI--IEKAWKIIYLKK 244
Query: 915 --DTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPT 972
+ EDA ++ ++L K + + A+ + R L R + VG
Sbjct: 245 QGNIEDAMEKREILNKEIEKIGNKIGCLIVAS-ATGKAKGDRVKLYRELLGF--EVGSGR 301
Query: 973 ERIEQIV--YILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKG---ADVLAKGLEKLGYNA 1027
+ IV Y+ E+ ++ + E+L + + K +IFV +G A+ + K L + G+
Sbjct: 302 SALRNIVDVYLNPEKIIKEHVRELLKK-LGKGGLIFVPIDEGAEKAEEIEKYLLEDGFKI 360
Query: 1028 CTLHGGKGQEQRELALNSLKGGSKDILVAT----DVAGRGIDI 1066
+ + + + G D L+ RG+D+
Sbjct: 361 ELVSAK-----NKKGFDLFEEGEIDYLIGVATYYGTLVRGLDL 398
Score = 36.4 bits (84), Expect = 0.18
Identities = 85/403 (21%), Positives = 150/403 (37%), Gaps = 64/403 (15%)
Query: 1371 EKSLDEMTERDWRIFREDYSITIKGGKVPDPVRNWKEASLPTEILEIIEKIGYAEPTPIQ 1430
EK L E + R + D W E E E EKI E IQ
Sbjct: 28 EKCLKEPNVFEKYAVRNALKLRGTLKFWEDFYILWNEVE---EFEEFFEKITGFEFWSIQ 84
Query: 1431 RQAIPIGLQNRDIIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTR 1490
+ L+ + VA TG GK+ F + +++ A +G I+ PT
Sbjct: 85 KTWAKRILRGKSFSIVAPTGMGKS-TFGAFIALFL-----------ALKGKKCYIILPTT 132
Query: 1491 ELAQQIEEETNKF--GTPLGIRTVLVVGGLS---REEQGFRLRLG-CEIVIAT----PGR 1540
L +Q E+ F L +R V L +EE R+ G +I++ T
Sbjct: 133 LLVKQTVEKIESFCEKANLDVRLVYYHSNLRKKEKEEFLERIENGDFDILVTTAQFLARN 192
Query: 1541 LIDVLENRYLVLNQCTYIVLDEAD------RMID-----MGFEPDVQKILEYMPVTNLKP 1589
++ ++ +I +D+ D + ID +GF ++ I + + LK
Sbjct: 193 FPEMKHLKF------DFIFVDDVDAFLKASKNIDRSLQLLGFYEEI--IEKAWKIIYLKK 244
Query: 1590 --DTEDAEDENKLLANYNSKKKYRQTVMFTATMPPAVERLARSYLRRPATVYIGSVGKPT 1647
+ EDA ++ ++L K + + A+ + R L R + VG
Sbjct: 245 QGNIEDAMEKREILNKEIEKIGNKIGCLIVAS-ATGKAKGDRVKLYRELLGF--EVGSGR 301
Query: 1648 ERIEQIV--YILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKG---ADVLAKGLEKLGYNA 1702
+ IV Y+ E+ ++ + E+L + + K +IFV +G A+ + K L + G+
Sbjct: 302 SALRNIVDVYLNPEKIIKEHVRELLKK-LGKGGLIFVPIDEGAEKAEEIEKYLLEDGFKI 360
Query: 1703 CTLHGGKGQEQRELALNSLKGGSKDILVAT----DVAGRGIDI 1741
+ + + + G D L+ RG+D+
Sbjct: 361 ELVSAK-----NKKGFDLFEEGEIDYLIGVATYYGTLVRGLDL 398
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 35.5 bits (82), Expect = 0.18
Identities = 26/141 (18%), Positives = 58/141 (41%), Gaps = 22/141 (15%)
Query: 404 RDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGY--RRERREEEASGSKHKSRDKE 461
+ + + + +++EREK + ++R K++ E + ++ ++ + KHK E
Sbjct: 101 QKKLQKLLEEKQKQEREKEREEAEL-RQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAE 159
Query: 462 GYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRK 521
+ E AKK L+E KK +++ + R+
Sbjct: 160 SASSSLSGSAKPERNVSQEEAKKR---LQEWELKKLKQQQQK----------------RE 200
Query: 522 RQAEVEEMRKKMEEERKKRQE 542
+ + +++ EEERK++ E
Sbjct: 201 EERRKQRKKQQEEEERKQKAE 221
Score = 33.5 bits (77), Expect = 0.92
Identities = 35/158 (22%), Positives = 75/158 (47%), Gaps = 18/158 (11%)
Query: 531 KKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKER 590
K+++ ER+ ++ + E +K+ +L++ E+K++ EKE + + R+ A KE+
Sbjct: 77 KEVKLERQAQEAY--ENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLA-KEK 133
Query: 591 YLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKERHQVQFFGRGNIAGIDIKAQK 650
Y ++K + R +A+E S + K V +A+K
Sbjct: 134 YEEWCRQKAQQAA-KQRTPKHKKEAAESASSSLSGSAKPERNV----------SQEEAKK 182
Query: 651 RDQSKFYGEMLEKRRTEAEKEQEKVRLKKVKKREEKQK 688
R Q L+K + + +K +E+ R ++ K++EE+++
Sbjct: 183 RLQE----WELKKLKQQQQKREEERRKQRKKQQEEEER 216
Score = 32.8 bits (75), Expect = 1.3
Identities = 34/135 (25%), Positives = 64/135 (47%), Gaps = 7/135 (5%)
Query: 476 EDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEV---EEMRKK 532
E + A + LS ++ +KK ++ K K ++ER LR+R A+ E R+K
Sbjct: 82 ERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQK 141
Query: 533 MEEERKKRQEFTKEASFESKRENFDARLRRDR----EKKKEDPEEKELNKDKEREGEAIK 588
++ K+R K+ + ES + + +R E+ K+ +E EL K K+++ + +
Sbjct: 142 AQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREE 201
Query: 589 ERYLGLVKKKRRVRR 603
ER K++ R
Sbjct: 202 ERRKQRKKQQEEEER 216
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 36.3 bits (83), Expect = 0.19
Identities = 43/255 (16%), Positives = 88/255 (34%), Gaps = 28/255 (10%)
Query: 386 DRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRRE-----G 440
D RE T RD D L+ + + + HR+D KE D
Sbjct: 303 DDSREVCTKAPEKYEPRDFVTDALQHSKVKLSWDAEDPHRKDLCKEAFTDDGIRDKDFSA 362
Query: 441 YRRERREEEASGS-------KHKSRDKEGYEPTEQMVISDEEEDKGAA------------ 481
Y + +E S K S + + ++V ++ + G
Sbjct: 363 YTASKLSDEDDDSVMESKMQKLFSEKEIDFGLNSELVDMSDDGENGEMEDTFTSHLPASN 422
Query: 482 ----AKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEER 537
K ++E+L K + +E R + + ++ + L+ ++ + + KK +
Sbjct: 423 ESESDDKLETTIEKLDRKLRERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAI 482
Query: 538 KKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKK 597
++ +E + D +L K E+ + N+ +++ ++E ++ K
Sbjct: 483 ERTAASKEELELIKADDEDDEQLDHFDMKSILKAEKFKKNRKLKKKASNLEEGFVFDPKD 542
Query: 598 KRRVRRLNDRKFVFD 612
R V D F D
Sbjct: 543 PRFVAIFEDHNFAID 557
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 36.0 bits (83), Expect = 0.22
Identities = 21/63 (33%), Positives = 34/63 (53%), Gaps = 1/63 (1%)
Query: 481 AAKKEPLSLEELLAKKKAEEEARSKP-KFLTKEERAAEALRKRQAEVEEMRKKMEEERKK 539
A+K+ L+ EEL KKK EE+A+ K K L ++ A+A + Q + + E+K
Sbjct: 7 EAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKS 66
Query: 540 RQE 542
R+
Sbjct: 67 RKR 69
Score = 31.0 bits (70), Expect = 7.2
Identities = 17/69 (24%), Positives = 37/69 (53%), Gaps = 1/69 (1%)
Query: 522 RQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLR-RDREKKKEDPEEKELNKDK 580
+AE+E++R KM+E +K++++ K+ + +E A ++ D K + +E E + +
Sbjct: 995 AEAELEKLRNKMDEIQKQQEKLEKKMNASGYKEKVPANIQEEDARKLTKLLQELEFFEKE 1054
Query: 581 EREGEAIKE 589
+ EA
Sbjct: 1055 SKRLEAETS 1063
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 33.8 bits (78), Expect = 0.22
Identities = 26/83 (31%), Positives = 44/83 (53%), Gaps = 7/83 (8%)
Query: 464 EPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQ 523
EP E+ +D+E DK A +K E+ +KK+E E +K ++ EE+A L K +
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKA--KWEKKQEEKKSEAEKLAK---MSAEEKAEYELEKLE 55
Query: 524 AEVEEMRKKMEEERKKRQEFTKE 546
E+EE+ E R++ + K+
Sbjct: 56 KELEELEA--ELARRELKAEAKK 76
>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
Length = 2033
Score = 36.4 bits (83), Expect = 0.22
Identities = 44/170 (25%), Positives = 58/170 (34%), Gaps = 16/170 (9%)
Query: 367 SRSRSPPRKRRSRSRERDYDRV----RERDTDRSRDDRGRDRDRDRLERDRSREREREKR 422
S S S P R SRS R DRSR R R+RDR R R+R +
Sbjct: 1825 SSSSSSPSSRPSRSATPSLSPSPSPPRRAPVDRSRSGRRRERDRPSANPFRWAPRQRSRA 1884
Query: 423 HHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAA 482
H D + D E R S + G E V E ED A
Sbjct: 1885 DHSPDGTAPGDAPLNLEDGPGRGRPIWTPSSATTLPSRSG---PEDSVDETETEDSAPPA 1941
Query: 483 KKEPLSLEELLAKKKAEEE---------ARSKPKFLTKEERAAEALRKRQ 523
+ P LE A+ + E +S P ++E R + + R+
Sbjct: 1942 RLAPSPLETSRAEDSEDSEYPEYSNPRLGKSPPALKSREARRPSSKQPRR 1991
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 34.7 bits (80), Expect = 0.23
Identities = 25/84 (29%), Positives = 41/84 (48%), Gaps = 12/84 (14%)
Query: 509 LTKEERAAEAL---RKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDRE 565
+ ++RA R++Q E EE E E +K+ E +E + + E L +RE
Sbjct: 3 IGAKKRAKLEEKQARRQQREAEEE----EREERKKLEEKREGERKEEEE-----LEEERE 53
Query: 566 KKKEDPEEKELNKDKEREGEAIKE 589
KKKE+ E KE + +E E ++
Sbjct: 54 KKKEEEERKEREEQARKEQEEYEK 77
Score = 32.4 bits (74), Expect = 1.4
Identities = 24/91 (26%), Positives = 46/91 (50%), Gaps = 7/91 (7%)
Query: 410 ERDRSREREREKRHHRRDRSKERDGK-----DRREGYRRERREEEASGSKHKSRDK-EGY 463
R + RE E E+R R+ ++R+G+ + E +++ EEE + ++R + E Y
Sbjct: 16 ARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEY 75
Query: 464 EP-TEQMVISDEEEDKGAAAKKEPLSLEELL 493
E V+ +E DK +A ++ LE+ +
Sbjct: 76 EKLKSSFVVEEEGTDKLSADEESNELLEDFI 106
Score = 31.6 bits (72), Expect = 2.3
Identities = 20/75 (26%), Positives = 34/75 (45%)
Query: 507 KFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREK 566
K K E ++R+AE EE ++ + E K+ E +E E +RE R +
Sbjct: 6 KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKERE 65
Query: 567 KKEDPEEKELNKDKE 581
++ E++E K K
Sbjct: 66 EQARKEQEEYEKLKS 80
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 35.5 bits (82), Expect = 0.27
Identities = 26/142 (18%), Positives = 63/142 (44%), Gaps = 3/142 (2%)
Query: 441 YRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEE 500
+ E++ +E + K +K E + +S+ ++ +E E ++E
Sbjct: 207 VKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEY 266
Query: 501 EARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARL 560
E +KP + +R +A R ++ +E+ ++ +EE++ +++ + A + + +
Sbjct: 267 EPINKPV---RPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKE 323
Query: 561 RRDREKKKEDPEEKELNKDKER 582
+ KK++ E E K K R
Sbjct: 324 KARARKKEQRKERGEKKKLKRR 345
Score = 34.7 bits (80), Expect = 0.47
Identities = 27/143 (18%), Positives = 57/143 (39%), Gaps = 9/143 (6%)
Query: 428 RSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPL 487
K+R +R E + E+ EAS S + SD+E +
Sbjct: 209 AEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEP 268
Query: 488 SLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEA 547
+ + K+K + + +E+ + L + E ++++KK+ + + + E KE
Sbjct: 269 INKPVRPKRKTKAQRN--------KEKRRKELEREAKEEKQLKKKLAQLARLK-EIAKEV 319
Query: 548 SFESKRENFDARLRRDREKKKED 570
+ + K R++R +KK+
Sbjct: 320 AQKEKARARKKEQRKERGEKKKL 342
Score = 32.4 bits (74), Expect = 2.2
Identities = 22/146 (15%), Positives = 54/146 (36%), Gaps = 12/146 (8%)
Query: 410 ERDRSREREREKRHHRRDRSK----ERDGKDRREGYRRER------REEEASGSKHKSRD 459
E ++R+ +R + K + EG E ++ S + +
Sbjct: 206 EVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESE 265
Query: 460 KEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEAL 519
E + + + +KE + + K + ++ K + KE A+
Sbjct: 266 YEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKE--VAQKE 323
Query: 520 RKRQAEVEEMRKKMEEERKKRQEFTK 545
+ R + E+ +++ E+++ KR++ K
Sbjct: 324 KARARKKEQRKERGEKKKLKRRKLGK 349
Score = 30.4 bits (69), Expect = 9.7
Identities = 23/107 (21%), Positives = 47/107 (43%), Gaps = 8/107 (7%)
Query: 490 EELLAKKKAEEEARSKPKFLTKEERAAEALR------KRQAEVEEMRKKMEEERKKRQEF 543
+E+ A+KK +E R + K L K A L +++ + + +E + E
Sbjct: 205 KEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFES 264
Query: 544 TKEASFESKRENFDARLRRDREK--KKEDPEEKELNKDKEREGEAIK 588
E + R + +R++EK K+ + E KE + K++ + +
Sbjct: 265 EYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLAR 311
>gnl|CDD|151060 pfam10500, SR-25, Nuclear RNA-splicing-associated protein. SR-25,
otherwise known as ADP-ribosylation factor-like factor
6-interacting protein 4, is expressed in virtually all
tissues. At the N-terminus there is a repeat of
serine-arginine (SR repeat), and towards the middle of
the protein there are clusters of both serines and of
basic amino acids. The presence of many nuclear
localisation signals strongly implies that this is a
nuclear protein that may contribute to RNA splicing.
SR-25 is also implicated, along with
heat-shock-protein-27, as a mediator in the Rac1 (GTPase
ras-related C3 botulinum toxin substrate 1) signalling
pathway.
Length = 74
Score = 32.1 bits (73), Expect = 0.28
Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 5/64 (7%)
Query: 366 RSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSRERERE--KRH 423
RSRSRSP K +R+ +V + +T R+R +G D + LE S+ER +E K+
Sbjct: 2 RSRSRSPETKEEYEARQSVIRKVVDPETGRTRLIKG---DGEILEEIVSKERHKEINKQA 58
Query: 424 HRRD 427
R D
Sbjct: 59 TRGD 62
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 35.9 bits (83), Expect = 0.28
Identities = 36/222 (16%), Positives = 67/222 (30%), Gaps = 35/222 (15%)
Query: 374 RKRRSRSRERDYDRVRERDTDRSRDDRGRD-------RDRDRLERDRSREREREKRHHRR 426
R+ R RE D R+R + + + +R D + +
Sbjct: 296 REEERELDARTEALEREADALRTRLEALQGSPAYQDAEELERARADAEALQAAAADARQA 355
Query: 427 DRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEP 486
R E ERR + + ++E EQ+ + E A EP
Sbjct: 356 IREAESR-------LEEERRRLDEEAGRLDDAERELRAAREQLARAAERAGLSPAHTAEP 408
Query: 487 LSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKE 546
+ A +E + + + A + +R +V +R++ + +
Sbjct: 409 DAA-------LAAQELQELGALDARRQDADRVIAQRSEQVALLRRRDDVADRAEATHAA- 460
Query: 547 ASFESKRENFDARLRRDR--EKKKEDPEEKELNKDKEREGEA 586
AR RRD E+ ++ EL + A
Sbjct: 461 -----------ARARRDELDEEAEQAAARAELADEAVHREGA 491
Score = 33.2 bits (76), Expect = 1.5
Identities = 36/222 (16%), Positives = 69/222 (31%), Gaps = 8/222 (3%)
Query: 376 RRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGK 435
+R R + R +R+ R R + + R ERD
Sbjct: 779 LAGAPSDRSLRAAHRRAAEAERQAESAERELARAARKAAAAAAAW---KQARRELERDAA 835
Query: 436 DRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAK 495
D + E K E + + A+ E A+
Sbjct: 836 DLDLPTDPDALEAVGLALKRFGDHLHTLEVAVRELRHAATRAAEQRARAARA---ESDAR 892
Query: 496 KKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEA--SFESKR 553
+ AE+ A ++ + R A V+E+R ++ E R +E E+
Sbjct: 893 EAAEDAAEARAEAEEASLRLRTLEESVGAMVDEIRARLAETRAALASGGRELPRLAEALA 952
Query: 554 ENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLV 595
+AR R + ++ + D E + ++ ++E L +
Sbjct: 953 TAEEARGRAEEKRAEADATLDERAEARDHAIGQLREFALTGL 994
Score = 33.2 bits (76), Expect = 1.7
Identities = 27/163 (16%), Positives = 54/163 (33%), Gaps = 13/163 (7%)
Query: 380 SRERDYDRVRERDTD-----RSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDG 434
+R +D DRV + ++ R RDD + R+ E+ R++ D
Sbjct: 425 ARRQDADRVIAQRSEQVALLRRRDDVADRAEATHAAARARRDELDEEAEQAAARAELADE 484
Query: 435 KDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEE-DKGAAAKKEPLSLEELL 493
REG R + + + ++D + D +
Sbjct: 485 AVHREGARLAWVD---AWQAQLRELTILAVDDQPGALADLDSWDALLQGEAPVRVAVYSA 541
Query: 494 AKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEE 536
+ A+E R + A E L + + + R+++E+
Sbjct: 542 VQPLADELTRERAALRL----AEEVLEEERDALRTERERLEQG 580
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 35.7 bits (83), Expect = 0.29
Identities = 15/60 (25%), Positives = 27/60 (45%), Gaps = 1/60 (1%)
Query: 374 RKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERD 433
R + + + ++ +S DD D D D + +RER E+R RR+ ++R
Sbjct: 121 RDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDD-DDIATRERSLERRRRRREWEEKRA 179
Score = 33.4 bits (77), Expect = 1.4
Identities = 22/97 (22%), Positives = 30/97 (30%), Gaps = 9/97 (9%)
Query: 359 ILMAGDRRSRSRSPPRKRRS--RSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSRE 416
I GD + PR + E D D + D D + D + D
Sbjct: 100 IFDDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKS-EDDEDDDDDDDDDDI 158
Query: 417 REREKRHHRRDRSKERDGKDRREGYRRERREEEASGS 453
RE+ RR R +E E R E G+
Sbjct: 159 ATRERSLERRRRRRE------WEEKRAELEFYYYQGT 189
>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH. This
bacillus clade of FliH proteins is not found by the Pfam
FliH model pfam02108, but is closely related to the
sequences identified by that model. Sequences identified
by this model are observed in flagellar operons in an
analogous position relative to other flagellar operon
genes.
Length = 255
Score = 34.8 bits (80), Expect = 0.30
Identities = 18/75 (24%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 484 KEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEF 543
P + +EL +++ E+ K E AA+ + + +A+ +R+++E+ER + +E
Sbjct: 27 DVPETEQELADEEQEFEQILEK-----AEAEAAQIIEQAEAQAAAIREQIEQERAQWEEE 81
Query: 544 TKEASFESKRENFDA 558
+ E+K+E ++A
Sbjct: 82 RERLIQEAKQEGYEA 96
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 35.3 bits (81), Expect = 0.32
Identities = 43/170 (25%), Positives = 82/170 (48%), Gaps = 19/170 (11%)
Query: 408 RLERDRSREREREKRHH-----RRDRSKERD-GKDRREGYRRERREEEASGSKHKSRDKE 461
+ E ++R+ E E R+ +RS + D G+ ++E + + +K K DK+
Sbjct: 419 QAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQK--DKQ 476
Query: 462 GYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEE--ARSKPKFLTKEERAAEAL 519
+ E+ + E D A+K+ L E +KK EEE AR+ + E AE+L
Sbjct: 477 SMQSMEKRL--KSEADSRVNAEKQ---LAEEKKRKKEEEETAARAAAQAAASREECAESL 531
Query: 520 RKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKE 569
++ + ++E KK+E + K ++E + ++E + R ++ EK+ E
Sbjct: 532 KQAKQDLEMEIKKLEHDLKLKEEECRML----EKEAQELRKYQESEKETE 577
Score = 34.1 bits (78), Expect = 0.79
Identities = 30/107 (28%), Positives = 46/107 (42%), Gaps = 16/107 (14%)
Query: 494 AKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKR 553
AK+K ++ +S K L E + K+ AE E+ RKK EEE R AS E
Sbjct: 470 AKQKDKQSMQSMEKRLKSEADSRVNAEKQLAE-EKKRKKEEEETAARAAAQAAASREECA 528
Query: 554 ENFDARLR-------------RDREKKKEDPEEK--ELNKDKEREGE 585
E+ + + +E++ E++ EL K +E E E
Sbjct: 529 ESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKE 575
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 35.4 bits (82), Expect = 0.34
Identities = 33/164 (20%), Positives = 56/164 (34%), Gaps = 15/164 (9%)
Query: 434 GKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELL 493
GK E E + E D +G P M+ + K +E L +
Sbjct: 320 GKSVSEEDEDEDSDSEEEDEDDDEDDDDGENP--WMLRKKLGKLKEGEDDEENSGLLSMK 377
Query: 494 AKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRK-------KMEEERKKRQEFTKE 546
++AE + + E LR+ EE + K R+K E
Sbjct: 378 FMQRAEARKKEEND------AEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGE 431
Query: 547 ASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKER 590
ESK+ + + +K+ ++ EE E ++ + E A K
Sbjct: 432 KEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLL 475
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 33.4 bits (77), Expect = 0.36
Identities = 31/115 (26%), Positives = 55/115 (47%), Gaps = 9/115 (7%)
Query: 491 ELLAKKKAEEEARSKPKFLTKE-ERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASF 549
LL K++ EE + +E ER E L++R+ E+E+ ++++E K +F KE
Sbjct: 1 RLLEKRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKEN-- 58
Query: 550 ESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRL 604
E+KR A + + EKK +E+E+ + K E E K + ++
Sbjct: 59 EAKRRR--AEKKAEEEKKLRKEKEEEIKELKAELEELKAEI----EKLEEKLEEY 107
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 35.4 bits (81), Expect = 0.36
Identities = 37/226 (16%), Positives = 80/226 (35%), Gaps = 22/226 (9%)
Query: 380 SRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRR- 438
E D E + DD D + ++E + + E + R++ ++ G D +
Sbjct: 135 EDEADLFNESESSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKF 194
Query: 439 -------EGYRRERREEEASGSKHK------SRDKEGYEPTEQMVISDEEED--KGAAAK 483
E EEEA+ D E + E ++EED +G
Sbjct: 195 FKLDEMNEFLEATEAEEEAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEY 254
Query: 484 KEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEF 543
++ +E KK A ++A + KE K++A+ + + E++ ++ +
Sbjct: 255 EDFFDPKEKDKKKDAGDDAELEDDEPDKE------AVKKEADSKPEEEDEEDDEQEDDQD 308
Query: 544 TKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKE 589
+E + + + + E K + + + I++
Sbjct: 309 EEEPPEAAMDKVKLDEPVLEGVDLESPKELSSFEKRQAKLKQQIEQ 354
>gnl|CDD|218292 pfam04851, ResIII, Type III restriction enzyme, res subunit.
Length = 100
Score = 32.6 bits (75), Expect = 0.40
Identities = 19/72 (26%), Positives = 29/72 (40%), Gaps = 14/72 (19%)
Query: 9 EPTPIQRQAIPIGLQNRDIIGVAETGSGKTL-AFLLPLLVWIQSLPKIARMEDADQGPYA 67
E P Q +AI L+ + + V TGSGKTL A L +
Sbjct: 3 ELRPYQEEAIERLLEKKRGLIVMATGSGKTLTAAALIARLAKGKKK-------------V 49
Query: 68 IIMAPTRELAQQ 79
+ + P ++L +Q
Sbjct: 50 LFVVPRKDLLEQ 61
Score = 32.6 bits (75), Expect = 0.40
Identities = 19/72 (26%), Positives = 29/72 (40%), Gaps = 14/72 (19%)
Query: 750 EPTPIQRQAIPIGLQNRDIIGVAETGSGKTL-AFLLPLLVWIQSLPKIARMEDADQGPYA 808
E P Q +AI L+ + + V TGSGKTL A L +
Sbjct: 3 ELRPYQEEAIERLLEKKRGLIVMATGSGKTLTAAALIARLAKGKKK-------------V 49
Query: 809 IIMAPTRELAQQ 820
+ + P ++L +Q
Sbjct: 50 LFVVPRKDLLEQ 61
Score = 32.6 bits (75), Expect = 0.40
Identities = 19/72 (26%), Positives = 29/72 (40%), Gaps = 14/72 (19%)
Query: 1425 EPTPIQRQAIPIGLQNRDIIGVAETGSGKTL-AFLLPLLVWIQSLPKIARMEDADQGPYA 1483
E P Q +AI L+ + + V TGSGKTL A L +
Sbjct: 3 ELRPYQEEAIERLLEKKRGLIVMATGSGKTLTAAALIARLAKGKKK-------------V 49
Query: 1484 IIMAPTRELAQQ 1495
+ + P ++L +Q
Sbjct: 50 LFVVPRKDLLEQ 61
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 35.0 bits (80), Expect = 0.41
Identities = 44/234 (18%), Positives = 95/234 (40%), Gaps = 27/234 (11%)
Query: 375 KRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDG 434
+R+ R + + +R+R TD + D ++ E++R ER +E+R +RD
Sbjct: 409 RRKERRLKEEKERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQR--------DRDE 460
Query: 435 KDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAA------------- 481
+ +E + ++E E + ++ + E Q+ + EE K A+
Sbjct: 461 RYEQEEFETYKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKLASNQLKQRSDLERAH 520
Query: 482 -----AKKEPLSLEELLAKKKAEEEARSKPKFLTKE-ERAAEALRKRQAEVEEMRKKMEE 535
+++ LE+ L + +A E+ + + R+ + EV+ + ++E+
Sbjct: 521 IELEKIREKHEKLEKELKRLRANPESADRGSAVDAGTSRSRADSAGARNEVDRLLDRLEK 580
Query: 536 ERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKE 589
++R + EA +K R +++K E + K E EA +
Sbjct: 581 AEQERDDTEMEAGRLAKELEKAQRHLTKQQEKTEATRIEFERKSAELLEEAERL 634
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 34.2 bits (79), Expect = 0.53
Identities = 30/114 (26%), Positives = 54/114 (47%), Gaps = 13/114 (11%)
Query: 489 LEELLAKKKAEEEARSKP-KFLTKEERA-------AEALRKRQAEVEEMRKKMEE--ERK 538
L+E L K+A EEA + + LT +E+A AEA Q + E +K+ E+ E +
Sbjct: 178 LQEFLNSKEAVEEAILQTDQALTAKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQ 237
Query: 539 KRQEFTKEASFESKRENFDARLRRDREK---KKEDPEEKELNKDKEREGEAIKE 589
+R K E +L ++E+ K +E+ L + + E E++++
Sbjct: 238 ERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQK 291
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 34.7 bits (80), Expect = 0.53
Identities = 20/103 (19%), Positives = 45/103 (43%), Gaps = 10/103 (9%)
Query: 511 KEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDRE----- 565
++ A EA R Q +E +K+E + +E + ++E R RE
Sbjct: 28 RQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLV 87
Query: 566 KKKEDPEEK--ELNKDKER---EGEAIKERYLGLVKKKRRVRR 603
+K+E + + +L+ + + +A+ R L L + ++++
Sbjct: 88 QKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDN 130
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 34.5 bits (80), Expect = 0.54
Identities = 42/202 (20%), Positives = 74/202 (36%), Gaps = 13/202 (6%)
Query: 388 VRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERRE 447
+R + ++ + + + R R R + RE RH + ++ KD R +
Sbjct: 438 IRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAA-ALARVK 496
Query: 448 EEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAA-AKKEPLSLEELLAKKKAEEEA---R 503
+ + + K G P VI+ E K A A++ KKA A R
Sbjct: 497 AKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQAAAAADPKKAAVAAAIAR 556
Query: 504 SKPKFLTKEERAAEALRK---RQAEVEEM-----RKKMEEERKKRQEFTKEASFESKREN 555
+K K ++ AEA + ++A V KK ++ + + A + K+
Sbjct: 557 AKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAA 616
Query: 556 FDARLRRDREKKKEDPEEKELN 577
A + R + KK E E
Sbjct: 617 VAAAIARAKAKKAEQQANAEPE 638
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 34.7 bits (79), Expect = 0.60
Identities = 30/89 (33%), Positives = 39/89 (43%), Gaps = 8/89 (8%)
Query: 366 RSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDR--SREREREKRH 423
+ R R+ R R +R+ +R R +R R +RDRLERDR ERER R
Sbjct: 457 KKRIERLEREERERLERERMERIERERLERERLERER-LERDRLERDRLDRLERERVDRL 515
Query: 424 HRRDRSKERDGKDRREGYRRERREEEASG 452
R +R K RR Y + E S
Sbjct: 516 ER-----DRLEKARRNSYFLKGMENGLSA 539
Score = 33.2 bits (75), Expect = 1.7
Identities = 22/65 (33%), Positives = 33/65 (50%), Gaps = 1/65 (1%)
Query: 394 DRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGS 453
++ R +R +R+RLER+R ERE+ R +ER +DR E R +R E E
Sbjct: 456 EKKRIERLEREERERLERERMERIERERLERER-LERERLERDRLERDRLDRLERERVDR 514
Query: 454 KHKSR 458
+ R
Sbjct: 515 LERDR 519
Score = 32.0 bits (72), Expect = 4.1
Identities = 20/57 (35%), Positives = 27/57 (47%), Gaps = 1/57 (1%)
Query: 364 DRRSRSRSPPRKRRSRSRER-DYDRVRERDTDRSRDDRGRDRDRDRLERDRSRERER 419
+R R R +R RER + +R+ +R R DR DRLERDR + R
Sbjct: 469 ERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLERDRLEKARR 525
Score = 31.6 bits (71), Expect = 4.7
Identities = 36/105 (34%), Positives = 50/105 (47%), Gaps = 10/105 (9%)
Query: 348 ALNSLKGGSKDILMAGDRRSRSRSPPRKRRS-----RSRERDYDRVRERDTDRSRDDR-G 401
A+N GG D A R + RK + R +R +R R +R
Sbjct: 421 AVNGHYGGRVDKDHAERARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIE 480
Query: 402 RDR-DRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRER 445
R+R +R+RLER+R ER+R +R R DR ER+ DR E R E+
Sbjct: 481 RERLERERLERERL-ERDRLERD-RLDRL-ERERVDRLERDRLEK 522
Score = 31.2 bits (70), Expect = 6.9
Identities = 23/68 (33%), Positives = 33/68 (48%), Gaps = 3/68 (4%)
Query: 364 DRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGR-DRDR-DRLERDRSREREREK 421
+R R R R R + +R+ +R R +R R +RDR DRLER+R ER++
Sbjct: 461 ERLEREERE-RLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLERDR 519
Query: 422 RHHRRDRS 429
R S
Sbjct: 520 LEKARRNS 527
>gnl|CDD|187770 cd09639, Cas3_I, CRISPR/Cas system-associated protein Cas3. CRISPR
(Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; DEAD/DEAH box helicase DNA
helicase cas3'; Often but not always is fused to HD
nuclease domain; signature gene for Type I.
Length = 353
Score = 34.3 bits (79), Expect = 0.62
Identities = 39/168 (23%), Positives = 69/168 (41%), Gaps = 18/168 (10%)
Query: 939 VMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKL--MEVLN 996
++ +AT+P ++ A + + E+ +I E DK ++ +E L
Sbjct: 158 LLMSATLPKFLKEYAEKIG-----YVEENEPLDLKPNERAPFIKIESDKVGEISSLERLL 212
Query: 997 RGVKK--PVIIFVNQKKGADVLAKGLEKLG--YNACTLHG----GKGQEQRELALNSLKG 1048
+KK V I VN A + L++ G +H ++ L K
Sbjct: 213 EFIKKGGSVAIIVNTVDRAQEFYQQLKEKGPEEEIMLIHSRFTEKDRAKKEAELLLEFKK 272
Query: 1049 GSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGK 1096
K ++VAT V +DI V ++I ++A I+ R+GR R G+
Sbjct: 273 SEKFVIVATQVIEASLDI-SVDVMIT-ELA-PIDSLIQRLGRLHRYGE 317
Score = 34.3 bits (79), Expect = 0.62
Identities = 39/168 (23%), Positives = 69/168 (41%), Gaps = 18/168 (10%)
Query: 1614 VMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKL--MEVLN 1671
++ +AT+P ++ A + + E+ +I E DK ++ +E L
Sbjct: 158 LLMSATLPKFLKEYAEKIG-----YVEENEPLDLKPNERAPFIKIESDKVGEISSLERLL 212
Query: 1672 RGVKK--PVIIFVNQKKGADVLAKGLEKLG--YNACTLHG----GKGQEQRELALNSLKG 1723
+KK V I VN A + L++ G +H ++ L K
Sbjct: 213 EFIKKGGSVAIIVNTVDRAQEFYQQLKEKGPEEEIMLIHSRFTEKDRAKKEAELLLEFKK 272
Query: 1724 GSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGK 1771
K ++VAT V +DI V ++I ++A I+ R+GR R G+
Sbjct: 273 SEKFVIVATQVIEASLDI-SVDVMIT-ELA-PIDSLIQRLGRLHRYGE 317
>gnl|CDD|223046 PHA03328, PHA03328, nuclear egress lamina protein UL31;
Provisional.
Length = 316
Score = 33.9 bits (78), Expect = 0.69
Identities = 20/61 (32%), Positives = 23/61 (37%), Gaps = 4/61 (6%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDY---DRVRERDTDRSRDDRGRDRDRDRLERDRSRERE 418
RRS S +R +R RD R R R R R R R L D R+R
Sbjct: 4 LALRRSSSSLRRSRRAARRSRRDGRVGSRGRSRYRSRRRSSRRSSTRRAEL-ADTERDRY 62
Query: 419 R 419
R
Sbjct: 63 R 63
Score = 32.8 bits (75), Expect = 1.6
Identities = 22/83 (26%), Positives = 26/83 (31%), Gaps = 19/83 (22%)
Query: 361 MAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSRERERE 420
M RS S R+ R +R RD R R R R RSR R
Sbjct: 1 MDDLALRRSSSSLRRSRRAARRS------------RRDGRVGSRGR---SRYRSRRRSSR 45
Query: 421 KRHHRRDRSKERDGKDRREGYRR 443
+ RR R+ YR
Sbjct: 46 RSSTRR----AELADTERDRYRA 64
Score = 32.0 bits (73), Expect = 2.8
Identities = 19/62 (30%), Positives = 21/62 (33%), Gaps = 3/62 (4%)
Query: 385 YDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRE 444
D R + R R R R R SR R R + R RS R R E E
Sbjct: 2 DDLALRRSSSSLRRSRRAARRSRRDGRVGSRGRSRYR---SRRRSSRRSSTRRAELADTE 58
Query: 445 RR 446
R
Sbjct: 59 RD 60
Score = 32.0 bits (73), Expect = 2.8
Identities = 13/70 (18%), Positives = 22/70 (31%), Gaps = 3/70 (4%)
Query: 1180 MAGDRRSRSRSPARKRRSRSRDRDYDR-RFKRKKSPQCEAQSSRFSACSLPRSHKSSSLL 1238
+ RSR AR+ R R R R++ ++ + + R R
Sbjct: 8 RSSSSLRRSRRAARRSRRDGRVGSRGRSRYRSRRRSSRRSSTRRAELADTERD--RYRAY 65
Query: 1239 SRYSEQDPEE 1248
Y P +
Sbjct: 66 FAYLASSPSD 75
Score = 31.2 bits (71), Expect = 4.7
Identities = 14/55 (25%), Positives = 17/55 (30%)
Query: 393 TDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERRE 447
RS R R R R R R + +R R R RR R+
Sbjct: 6 LRRSSSSLRRSRRAARRSRRDGRVGSRGRSRYRSRRRSSRRSSTRRAELADTERD 60
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 33.5 bits (77), Expect = 0.72
Identities = 15/87 (17%), Positives = 33/87 (37%), Gaps = 9/87 (10%)
Query: 471 ISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMR 530
++EE + AK + + K K EE+ ++K E + E
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAK-------REKEEKGLRELEEDTPED 91
Query: 531 KKMEEERKKRQEFTKEASFESKRENFD 557
+ E+ R ++ + +E+ + + F
Sbjct: 92 ELAEKLRLRKLQ--EESDLNNAADLFG 116
Score = 30.4 bits (69), Expect = 8.7
Identities = 20/71 (28%), Positives = 32/71 (45%), Gaps = 1/71 (1%)
Query: 457 SRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAA 516
S D+E E E+ + K A K + +E ++K E+ R + T E+ A
Sbjct: 36 SWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE-DTPEDELA 94
Query: 517 EALRKRQAEVE 527
E LR R+ + E
Sbjct: 95 EKLRLRKLQEE 105
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 34.0 bits (78), Expect = 0.75
Identities = 31/139 (22%), Positives = 54/139 (38%), Gaps = 23/139 (16%)
Query: 472 SDEEEDKGAAAKKEPLSLEELLAK---KKAEEEARSKPKFLTKEERAAEALRKRQAEVEE 528
+D + GA E L+E + K+ EEE E R A+ KR+ EV E
Sbjct: 96 TDYVVESGATRNYEADKLDEEQEERVEKEREEELAGDAMKK-LENRTAD--SKREMEVLE 152
Query: 529 MRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIK 588
++++E + +R + + E+ E+KE +++E + IK
Sbjct: 153 RLEELKELQSRRADVDVNSMLEALFRR----------------EKKEEEEEEEEDEALIK 196
Query: 589 ERYLGLVKKK-RRVRRLND 606
G ++ RR D
Sbjct: 197 SLSFGPETEEDRRRADDED 215
Score = 30.9 bits (70), Expect = 7.0
Identities = 26/176 (14%), Positives = 58/176 (32%), Gaps = 32/176 (18%)
Query: 442 RRERREEEASGS----KHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKK 497
+ER EE A + ++++ D + E V+ EE K +++ + + +L
Sbjct: 122 EKEREEELAGDAMKKLENRTADSK----REMEVLERLEELKELQSRRADVDVNSMLEAL- 176
Query: 498 AEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKR-------------QEFT 544
F +++ E + +A ++ + E E +R + T
Sbjct: 177 ----------FRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDNDNT 226
Query: 545 KEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRR 600
S + + K+ + K R ++ LV +K+
Sbjct: 227 PSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKPRDALSSLVVRKKA 282
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 33.4 bits (77), Expect = 0.78
Identities = 32/110 (29%), Positives = 55/110 (50%), Gaps = 16/110 (14%)
Query: 488 SLEELLAKKKAEEEARSKPKFLTKEE--RAAEALRKRQAEVEEMRKKMEEERKKRQEFTK 545
SL+E +A+ K EE K +E R E L+K + EVEE+RKK+++ K +Q
Sbjct: 31 SLKEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSL-- 88
Query: 546 EASFESKRENFDARLRRDREKKKEDPEEKELNKDK----EREGEAIKERY 591
+N ARL+ ++ K E E+ + + ERE + + +++
Sbjct: 89 --------KNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKF 130
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 34.3 bits (78), Expect = 0.80
Identities = 19/49 (38%), Positives = 32/49 (65%), Gaps = 3/49 (6%)
Query: 538 KKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEA 586
KKR+E ++A E++++ AR R+REK+KE E+E ++ ER +A
Sbjct: 579 KKREEAVEKAKREAEQK---AREEREREKEKEKEREREREREAERAAKA 624
Score = 33.9 bits (77), Expect = 1.1
Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 404 RDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSR 458
+ R+ RE E++ R R +R KE++ + RE R R +AS S H+SR
Sbjct: 579 KKREEAVEKAKREAEQKAREER-EREKEKEKEREREREREAERAAKASSSSHESR 632
Score = 32.7 bits (74), Expect = 2.5
Identities = 13/52 (25%), Positives = 30/52 (57%), Gaps = 5/52 (9%)
Query: 378 SRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRS 429
++ RE ++ + ++R++R R++ E+++ RERERE+ R ++
Sbjct: 578 AKKREEAVEKAKREAEQKAREEREREK-----EKEKEREREREREAERAAKA 624
Score = 31.6 bits (71), Expect = 5.2
Identities = 18/75 (24%), Positives = 34/75 (45%), Gaps = 19/75 (25%)
Query: 330 GYNACT--------LHGGKGQEQRELALNSLKGGSKDILMAGDRRSRSRSPPRKRRSRSR 381
GYN+C L K ++RE A+ K R + ++ + R + +
Sbjct: 558 GYNSCARTDLYFTPLASSKLAKKREEAVEKAK-----------REAEQKAREEREREKEK 606
Query: 382 ERDYDRVRERDTDRS 396
E++ +R RER+ +R+
Sbjct: 607 EKEREREREREAERA 621
Score = 30.8 bits (69), Expect = 9.1
Identities = 15/38 (39%), Positives = 23/38 (60%)
Query: 552 KRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKE 589
KRE + +R+ E+K + E+E K+KERE E +E
Sbjct: 580 KREEAVEKAKREAEQKAREEREREKEKEKERERERERE 617
>gnl|CDD|237253 PRK12898, secA, preprotein translocase subunit SecA; Reviewed.
Length = 656
Score = 34.2 bits (79), Expect = 0.81
Identities = 15/32 (46%), Positives = 20/32 (62%)
Query: 1036 QEQRELALNSLKGGSKDILVATDVAGRGIDIK 1067
Q+ E A+ + G I VAT++AGRG DIK
Sbjct: 507 QDAEEAAIVARAGQRGRITVATNMAGRGTDIK 538
Score = 34.2 bits (79), Expect = 0.81
Identities = 15/32 (46%), Positives = 20/32 (62%)
Query: 1711 QEQRELALNSLKGGSKDILVATDVAGRGIDIK 1742
Q+ E A+ + G I VAT++AGRG DIK
Sbjct: 507 QDAEEAAIVARAGQRGRITVATNMAGRGTDIK 538
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 33.7 bits (77), Expect = 0.84
Identities = 18/101 (17%), Positives = 35/101 (34%), Gaps = 9/101 (8%)
Query: 382 ERDYDRVRERDTDRSRDDRGRDR--DRDRLERDRSREREREKRHHRRDRSKERDGKDRRE 439
ER + + D+ D + + ++ R E+R + ++RR
Sbjct: 181 ERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEERRAAVERRLAELVEMINWNLEERRR 240
Query: 440 GYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGA 480
R+E+ EE D+ G + S++ E K
Sbjct: 241 DLRKEQELEENVERDSDDEDEHGED-------SEDGETKPE 274
Score = 31.4 bits (71), Expect = 4.5
Identities = 20/100 (20%), Positives = 40/100 (40%), Gaps = 9/100 (9%)
Query: 495 KKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKM---------EEERKKRQEFTK 545
++K +EE KP F + + + ++E M +EER+ E
Sbjct: 165 REKEKEEQPMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEERRAAVERRL 224
Query: 546 EASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGE 585
E N + R R R++++ + + + D++ GE
Sbjct: 225 AELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGE 264
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 33.9 bits (77), Expect = 0.84
Identities = 47/195 (24%), Positives = 77/195 (39%), Gaps = 27/195 (13%)
Query: 406 RDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEP 465
R R R +S + E E+ + D E+D K + +E A + S + EP
Sbjct: 189 RARRFRKKSSKIEIEEVEKKVDDLLEKDMKAESVSVVLKDEKELARQERVSSWENFKEEP 248
Query: 466 TEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAE 525
E + A KKE EE + +EE+ L AE
Sbjct: 249 GEPL--------SRPALKKEKQGAEEEGEEGMSEED-----------------LDVGAAE 283
Query: 526 VEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGE 585
+E K++ E K++Q+ E + K E R E+K+ED E +E + E +
Sbjct: 284 IEN--KEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLAD 341
Query: 586 AIKERYLGLVKKKRR 600
+ E G+ +K+R+
Sbjct: 342 ELNELEKGIEEKRRQ 356
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 34.0 bits (78), Expect = 0.87
Identities = 20/97 (20%), Positives = 41/97 (42%), Gaps = 5/97 (5%)
Query: 488 SLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEER-KKRQEFTKE 546
L L +K+ E++ K L ++E+ + L + E++E + + E K+ QE K
Sbjct: 519 KLSAL--EKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKA 576
Query: 547 ASFESKRENFDARLRRDREKKKEDPEEKELNKDKERE 583
E E+ L+ + K ++ + E +
Sbjct: 577 LKKEV--ESIIRELKEKKIHKAKEIKSIEDLVKLKET 611
Score = 31.7 bits (72), Expect = 5.1
Identities = 23/110 (20%), Positives = 52/110 (47%), Gaps = 7/110 (6%)
Query: 474 EEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKM 533
++ + KE L++ L ++ E + R + K L E+ A EAL+ + EVE + +++
Sbjct: 529 QKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIREL 588
Query: 534 EEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKERE 583
+E++ + E K +L+ ++K + P + +K ++
Sbjct: 589 KEKK-------IHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDKV 631
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 33.2 bits (76), Expect = 1.1
Identities = 29/144 (20%), Positives = 54/144 (37%), Gaps = 14/144 (9%)
Query: 391 RDTDRSRDDRGRDRD--RDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREE 448
R + +D R RD + E +R+ + ++E + D K + EG + E+
Sbjct: 44 RQPGQGTEDELRKRDLRAELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEED 103
Query: 449 EASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLA------KKKAEEEA 502
E + + + + D + LL K++AEE+
Sbjct: 104 EIKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKE 163
Query: 503 RSKPKFLTKEERAAEALRKRQAEV 526
R + EE+AAE + R+ E+
Sbjct: 164 REE------EEKAAEEEKAREEEI 181
Score = 32.0 bits (73), Expect = 2.6
Identities = 25/117 (21%), Positives = 49/117 (41%), Gaps = 3/117 (2%)
Query: 474 EEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKM 533
EE ++ +KKE E K + + + ++ E RKR E
Sbjct: 64 EEAERAHKSKKENKLAIEDADKSTNLDASNEGDE---DDDEEDEIKRKRIEEDARNSDAD 120
Query: 534 EEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKER 590
+ + + + + S + E+ A L R+ EK K++ E++ +++E+ E K R
Sbjct: 121 DSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAR 177
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 33.7 bits (77), Expect = 1.1
Identities = 21/58 (36%), Positives = 35/58 (60%), Gaps = 5/58 (8%)
Query: 491 ELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEAS 548
E LAK++AE E +++ + +EE+AA + QA+ E +E+ R+K Q K+AS
Sbjct: 259 EKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAE-----VEKRREKLQNLLKKAS 311
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 32.4 bits (73), Expect = 1.1
Identities = 29/105 (27%), Positives = 54/105 (51%)
Query: 474 EEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKM 533
EE+++ +++ L EEL + E R + +EERA E K + + EE K+
Sbjct: 42 EEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQE 101
Query: 534 EEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNK 578
+EE+++ Q+ +EA ++ E RL R++ ++ + E E K
Sbjct: 102 QEEQERIQKQKEEAEARAREEAERMRLEREKHFQQIEQERLERKK 146
Score = 30.1 bits (67), Expect = 6.7
Identities = 25/84 (29%), Positives = 47/84 (55%), Gaps = 3/84 (3%)
Query: 510 TKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKE 569
T E A L +++ + E R++ E+ER++++E + E KR + RLRR+ E +++
Sbjct: 19 TDAEEATRLLAEKRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQ 78
Query: 570 DPE---EKELNKDKEREGEAIKER 590
+ E EKE ++ E E +E+
Sbjct: 79 EEERAREKEEKAKRKAEEEEKQEQ 102
>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin).
Length = 585
Score = 33.4 bits (76), Expect = 1.2
Identities = 35/243 (14%), Positives = 72/243 (29%), Gaps = 30/243 (12%)
Query: 364 DRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRH 423
D RS + + + + SR R R R R + +
Sbjct: 255 DESKRSGQLGLEDEAEEEKEEKG-------QESRGLSAVQTYLLRWVNARGRGRSQNRAE 307
Query: 424 HRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTE------------QMVI 471
R+RS+E + K+ + E A+ S + +
Sbjct: 308 --RERSEESEEKELDRASPYQEIEITANLQIPPSDLIRMLRKGDGRKPRGRVEEEGALEA 365
Query: 472 SDEEEDKGAAAKKEP---LSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEE 528
+ ++K + ++ + + P L+ + AE +Q
Sbjct: 366 PEALDEKRELDLDHSRVFENSKDGAVRAPQGAVLEALPNGLSVGDEGAEGKWNQQGPYFP 425
Query: 529 MRKKMEEERKKRQEFTKEASFESKRENFD-ARLRRDREKKKEDPEEKELNKDKEREGEAI 587
+ EE R + E D + ++ E++KE E+K L + +E +
Sbjct: 426 NEENQEEARFR----LPYYPGELSNPWEDKKQWKKSDEERKELYEDKFL-EGEEENDYTL 480
Query: 588 KER 590
+
Sbjct: 481 SMK 483
>gnl|CDD|236779 PRK10864, PRK10864, putative methyltransferase; Provisional.
Length = 346
Score = 33.2 bits (76), Expect = 1.2
Identities = 13/73 (17%), Positives = 25/73 (34%), Gaps = 7/73 (9%)
Query: 385 YDRVRERDTDRSRDDR-GRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRR 443
Y R + R+ + R G+ R + +R R DR+ + + + R
Sbjct: 16 YVRSDDDSDKRTHNPRTGKGGGRP----SGKSRADGGRRPARDDRNSQSRDRKWEDSPWR 71
Query: 444 E--RREEEASGSK 454
R + + K
Sbjct: 72 TVSRAPGDETPEK 84
Score = 31.3 bits (71), Expect = 5.4
Identities = 30/111 (27%), Positives = 39/111 (35%), Gaps = 19/111 (17%)
Query: 351 SLKGGSKDILMAGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRG-RDRDRDRL 409
LKG S + + R KR R R + +SR D G R DR
Sbjct: 4 ELKGKSGKVKVMYVRSDDDSD---KRTHNPRTGKGGG---RPSGKSRADGGRRPARDDRN 57
Query: 410 ERDRSRERE----REKRHHRRDRSKER------DGKDR--REGYRRERREE 448
+ R R+ E R D + E+ GK E RR+R EE
Sbjct: 58 SQSRDRKWEDSPWRTVSRAPGDETPEKADHGGISGKSFIDPEVLRRQRAEE 108
>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
Members of this family of small ribosomal proteins
possess one of three conserved blocks of sequence found
in proteins that stimulate the dissociation of guanine
nucleotides from G-proteins, leaving open the
possibility that MRP-S27 might be a functional partner
of GTP-binding ribosomal proteins.
Length = 417
Score = 33.2 bits (76), Expect = 1.3
Identities = 23/110 (20%), Positives = 53/110 (48%), Gaps = 9/110 (8%)
Query: 447 EEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKK-KAEEEARSK 505
+E + ++ DK ++ ++ +E +K E LLA+ ++ +E RSK
Sbjct: 297 DEASLKKLQEAIDKSESSELDEDLLLEE-------VEKSKQKFEPLLAQYGESFQEWRSK 349
Query: 506 PKFLTKEE-RAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRE 554
+ L K E R+ ++ +R +E+ ++ E+R++ F ++ K++
Sbjct: 350 LQALAKVESRSLLSVEERLPTIEKEDLELYEQRQQLWFFENRKLWQRKKK 399
>gnl|CDD|235449 PRK05415, PRK05415, hypothetical protein; Provisional.
Length = 341
Score = 32.9 bits (76), Expect = 1.3
Identities = 20/72 (27%), Positives = 32/72 (44%), Gaps = 3/72 (4%)
Query: 533 MEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYL 592
M E K R +F +E E + A+ D E++ P + +++E E EA E
Sbjct: 1 MSEPLKPRIDFDEEPLEEEEEPELRAQQAFD-EQEAFAPAAPDEEEEEEGELEAAVE--A 57
Query: 593 GLVKKKRRVRRL 604
L K+ R+L
Sbjct: 58 ALRPKRSLWRKL 69
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 33.5 bits (76), Expect = 1.4
Identities = 27/143 (18%), Positives = 62/143 (43%), Gaps = 24/143 (16%)
Query: 459 DKEGYEPTEQMVISDEEEDKGAA--------AKKEPLSLE-------ELLAKKKAEEEAR 503
+ Y+P ++ +E + + E ++ E EL A+K+ E EA+
Sbjct: 452 SVDSYDPRASLMTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQ 511
Query: 504 S-KPKFLTKEERAAEALRKRQAEVEEMR-----KKMEEERKKRQEFTKEASFESKRENFD 557
K ++ ++ + ++ +V+E K + K+++ + K +K+E
Sbjct: 512 GIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQA 571
Query: 558 ARLRRDREKKKEDPEEKELNKDK 580
L++ KKK+ ++K+L+ K
Sbjct: 572 ENLKK---KKKQIAKQKKLDSKK 591
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 32.4 bits (74), Expect = 1.4
Identities = 14/80 (17%), Positives = 38/80 (47%), Gaps = 4/80 (5%)
Query: 515 AAEALRKRQ----AEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKED 570
+K++ E+E+++K+ EE++K + + K + K ++ + D+ +KK++
Sbjct: 56 EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDE 115
Query: 571 PEEKELNKDKEREGEAIKER 590
E ++ +D +
Sbjct: 116 KEAEDKLEDLTKSYSETLST 135
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 32.3 bits (74), Expect = 1.4
Identities = 23/85 (27%), Positives = 42/85 (49%), Gaps = 3/85 (3%)
Query: 464 EPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKE--ERAAEALRK 521
+ ++ D EE + A +E + EE ++ EE + + L + E + LR
Sbjct: 3 DKEQKTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELKDKYLRA 62
Query: 522 RQAEVEEMRKKMEEERKKRQEFTKE 546
QAE E +RK+ E ER++ +++ E
Sbjct: 63 -QAEFENLRKRTEREREEAKKYAIE 86
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 32.3 bits (74), Expect = 1.4
Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 14/119 (11%)
Query: 483 KKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQE 542
+KE LE+ L K K EE L + ++ ++ K + R+ ++E +K+ +
Sbjct: 62 EKEIEELEKALKKTKDSEEKEE----LKRTLQSMKSRLKTLKNKDREREILKEHKKQEK- 116
Query: 543 FTKEASFESKRENFDARLRRDREKKKEDPEEK--ELNKDKEREGEAIKERYLGLVKKKR 599
E +E + E KK ++K EL K K+ + K+R K+K+
Sbjct: 117 -------ELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAGKEKK 168
Score = 30.7 bits (70), Expect = 4.6
Identities = 21/95 (22%), Positives = 44/95 (46%), Gaps = 12/95 (12%)
Query: 522 RQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDRE---KKKEDPEEKELN- 577
R+ E+EE+ K +++ + ++ + + +S + +DRE K+ +EKEL
Sbjct: 61 REKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIK 120
Query: 578 --------KDKEREGEAIKERYLGLVKKKRRVRRL 604
K E + +K+++ L K K+ + L
Sbjct: 121 EGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKAL 155
>gnl|CDD|109874 pfam00836, Stathmin, Stathmin family. The Stathmin family of
proteins play an important role in the regulation of the
microtubule cytoskeleton. They regulate microtubule
dynamics by promoting depolymerization of microtubules
and/or preventing polymerisation of tubulin
heterodimers.
Length = 140
Score = 31.5 bits (71), Expect = 1.5
Identities = 34/113 (30%), Positives = 57/113 (50%), Gaps = 13/113 (11%)
Query: 480 AAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEER-- 537
++ KK+ LSLEE+ K +A EE R + E + KR+ E E ++K +EE
Sbjct: 35 SSPKKKDLSLEEIQKKLEAAEERRK----SQEAEVLKQLAEKREHEREVLQKAIEENNNF 90
Query: 538 ----KKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEA 586
+++ E+ E++ A+L R +EK K +E+ K+KE + EA
Sbjct: 91 SKMAEEKLTQKMESIKENREAQLAAKLERLQEKDK---HAEEVRKNKELKEEA 140
>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ. Members of
this family are the FliJ protein found, in nearly every
case, in the midst of other flagellar biosynthesis genes
in bacgterial genomes. Typically the fliJ gene is found
adjacent to the gene for the flagellum-specific ATPase
FliI. Sequence scoring in the gray zone between trusted
and noise cutoffs include both probable FliJ proteins
and components of bacterial type III secretion systems.
Length = 141
Score = 31.5 bits (72), Expect = 1.6
Identities = 23/109 (21%), Positives = 54/109 (49%), Gaps = 16/109 (14%)
Query: 511 KEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESK----------RENFDARL 560
+EE+A L K QAE E + ++++ K R+E+ ++A + + F +L
Sbjct: 14 EEEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAGTSALELSNYQRFIRQL 73
Query: 561 RRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLNDRKF 609
+ ++++++ ++E EA +ER L ++ + + +L ++K
Sbjct: 74 DQRIQQQQQELAL------LQQEVEAKRERLLEARRELKALEKLKEKKQ 116
>gnl|CDD|236153 PRK08116, PRK08116, hypothetical protein; Validated.
Length = 268
Score = 32.7 bits (75), Expect = 1.7
Identities = 13/43 (30%), Positives = 16/43 (37%)
Query: 517 EALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDAR 559
EA ++ E E K+ ER K E S ENF
Sbjct: 48 EAEEAKEREEENREKQRRIERLKSNSLLDEKFRNSTFENFLFD 90
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 32.5 bits (74), Expect = 1.7
Identities = 16/78 (20%), Positives = 31/78 (39%), Gaps = 10/78 (12%)
Query: 363 GDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDR---------DRLERDR 413
+ R RS KRR ++ + R++ ++ R+D + +D
Sbjct: 157 DEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSGGQSGLSTKDE 216
Query: 414 SREREREKRHHRRDRSKE 431
+ +R+K HH +R E
Sbjct: 217 PPKEKRQK-HHDPERRLE 233
Score = 31.8 bits (72), Expect = 2.6
Identities = 13/74 (17%), Positives = 27/74 (36%), Gaps = 4/74 (5%)
Query: 396 SRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRR-EGYRRERREEEASGSK 454
+ ++ R D + + DR +++E ++R+ D K G G ++E K
Sbjct: 165 KQKEKRRVEDSQKHKEDRRKKQEEKRRN---DEDKRPGGGGGSSGGQSGLSTKDEPPKEK 221
Query: 455 HKSRDKEGYEPTEQ 468
+ Q
Sbjct: 222 RQKHHDPERRLEPQ 235
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 32.3 bits (74), Expect = 1.8
Identities = 32/149 (21%), Positives = 62/149 (41%), Gaps = 5/149 (3%)
Query: 429 SKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLS 488
E K+ R+ ++ + E EA ++ + E E E V E E + + +
Sbjct: 21 RLEPRIKEIRKALKKAKAELEALNKALEALEIE-LEDLENQVSQLESEIQEIRERIK--R 77
Query: 489 LEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTK--E 546
EE L+ K E E R+ + + +L AE+ E +K+E+E + +E + E
Sbjct: 78 AEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLE 137
Query: 547 ASFESKRENFDARLRRDREKKKEDPEEKE 575
+ + + RE+ +E ++E
Sbjct: 138 KNLAEAEARLEEEVAEIREEGQELSSKRE 166
Score = 30.0 bits (68), Expect = 9.8
Identities = 28/144 (19%), Positives = 57/144 (39%), Gaps = 8/144 (5%)
Query: 405 DRDRLERDRSRER--EREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEG 462
+ + E + + E + + + +E R +R EE + R+
Sbjct: 34 KKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRA 93
Query: 463 YEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKR 522
Q +++ + + E L E + K + E E K + E+ AEA +
Sbjct: 94 LNIEIQ-----IAKERINSLEDELAELMEEIEKLEKEIEDL-KERLERLEKNLAEAEARL 147
Query: 523 QAEVEEMRKKMEEERKKRQEFTKE 546
+ EV E+R++ +E KR+E ++
Sbjct: 148 EEEVAEIREEGQELSSKREELKEK 171
>gnl|CDD|163426 TIGR03714, secA2, accessory Sec system translocase SecA2. Members of
this protein family are homologous to SecA and part of
the accessory Sec system. This system, including both
five core proteins for export and a variable number of
proteins for glycosylation, operates in certain
Gram-positive pathogens for the maturation and delivery
of serine-rich glycoproteins such as the cell surface
glycoprotein GspB in Streptococcus gordonii [Protein
fate, Protein and peptide secretion and trafficking].
Length = 762
Score = 33.1 bits (76), Expect = 1.8
Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 10/66 (15%)
Query: 1053 ILVATDVAGRGIDIK---------DVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSF 1103
+ VAT +AGRG DIK ++++ M S D R GR+GR G G + F
Sbjct: 475 VTVATSMAGRGTDIKLGKGVAELGGLAVIGTERMENSRVDLQLR-GRSGRQGDPGSSQFF 533
Query: 1104 CTKDDS 1109
+ +D
Sbjct: 534 VSLEDD 539
Score = 33.1 bits (76), Expect = 1.8
Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 10/66 (15%)
Query: 1728 ILVATDVAGRGIDIK---------DVSMVINYDMAKSIEDYTHRIGRTGRAGKEGLAVSF 1778
+ VAT +AGRG DIK ++++ M S D R GR+GR G G + F
Sbjct: 475 VTVATSMAGRGTDIKLGKGVAELGGLAVIGTERMENSRVDLQLR-GRSGRQGDPGSSQFF 533
Query: 1779 CTKDDS 1784
+ +D
Sbjct: 534 VSLEDD 539
>gnl|CDD|222914 PHA02666, PHA02666, hypothetical protein; Provisional.
Length = 287
Score = 32.6 bits (73), Expect = 1.8
Identities = 17/104 (16%), Positives = 30/104 (28%), Gaps = 1/104 (0%)
Query: 364 DRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRH 423
+ R +SR P R+ RS R + + + G+ R H
Sbjct: 39 ESRRKSR-PSRQHRSAERTPTTASSLTHENNTAPSRHGKQHSCKASSRSSHNRGSTSSSH 97
Query: 424 HRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTE 467
+ RR + R + ++ +G P E
Sbjct: 98 NHHAHRGPHQSAHRRSKHDAVRDTYQPCPQSPETDLYKGRLPGE 141
Score = 31.1 bits (69), Expect = 5.1
Identities = 14/81 (17%), Positives = 22/81 (27%)
Query: 366 RSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHR 425
R R+ S +R+SR + R T S ++ + R +
Sbjct: 32 RRRANSMESRRKSRPSRQHRSAERTPTTASSLTHENNTAPSRHGKQHSCKASSRSSHNRG 91
Query: 426 RDRSKERDGKDRREGYRRERR 446
S R RR
Sbjct: 92 STSSSHNHHAHRGPHQSAHRR 112
>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
subunit. CENP-Q is one of the components that assembles
onto the CENP-A-nucleosome distal (CAD) centromere. The
centromere, which is the basic element of chromosome
inheritance, is epigenetically determined in mammals.
CENP-A, the centromere-specific histone H3 variant,
assembles an array of nucleosomes and it is this that
seems to be the prime candidate for specifying
centromere identity. CENP-A nucleosomes directly recruit
a proximal CENP-A nucleosome associated complex (NAC)
comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
CENP-C and CENP-H. Assembly of the CENP-A NAC at
centromeres is dependent on CENP-M, CENP-N and CENP-T.
Additionally, there are seven other subunits which make
up the CENP-A-nucleosome distal (CAD) centromere,
CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
CENP-S, also assembling on the CENP-A NAC. Fta7 is the
equivalent component of the fission yeast Sim4 complex.
Length = 159
Score = 31.6 bits (72), Expect = 2.0
Identities = 21/94 (22%), Positives = 40/94 (42%), Gaps = 8/94 (8%)
Query: 481 AAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKR 540
K++ L ++LL +K+A E + L E + E L + E +K E ++
Sbjct: 14 PTKEDSLDYDQLLRRKRALE------RQLAAELKQLELLEEEIRREEAELEKDLEYLQEL 67
Query: 541 QEFTKEASFESKRENFDAR--LRRDREKKKEDPE 572
++ K E + E+ + LR + E+ E
Sbjct: 68 EKNAKALEREREEESKNLHPVLRLLESEVLEENE 101
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 32.7 bits (74), Expect = 2.2
Identities = 40/185 (21%), Positives = 65/185 (35%), Gaps = 9/185 (4%)
Query: 407 DRLERDRSRERERE-KRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEP 465
+ + +E E E K + + + K +EG +E + ++ + E
Sbjct: 661 EESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGE 720
Query: 466 TEQMVISDEEEDK--GAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQ 523
TE DE E + + E E K + E E K E A + +
Sbjct: 721 TEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDE 780
Query: 524 AEVE-----EMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNK 578
E++ EM+ E K E EA + + E D E K E E+ELN
Sbjct: 781 GEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDET-GEQELNA 839
Query: 579 DKERE 583
+ + E
Sbjct: 840 ENQGE 844
Score = 31.5 bits (71), Expect = 5.1
Identities = 52/249 (20%), Positives = 90/249 (36%), Gaps = 24/249 (9%)
Query: 382 ERDYDRVRERDTDRSRDDRGRDRDRDRLE-----RDRSREREREKRHHRRDRSKERDGKD 436
ER ++ E + + D + + + +E E E E E +G+
Sbjct: 688 ERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEG 747
Query: 437 RREGYRR-ERREEEASGSKHKSRDKEGYEPTEQMVISDEEE-----DKGAAAKKEPLSLE 490
EG E + + EG E ++ I E+ D+GA K E
Sbjct: 748 EAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGET 807
Query: 491 ELLAKKKAE--EEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEAS 548
E K + E E ++ + E E + Q E ++ K ++ ++E
Sbjct: 808 EAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEE 867
Query: 549 FESKRENFDARLRRDREKKKEDPEEK-ELNKDKEREGEAIKERYLGLVK-------KKRR 600
E + E + + E+++E+ EE L + R+ +AI YL L+
Sbjct: 868 EEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQAI---YLFLLPIVFPLWLTVPD 924
Query: 601 VRRLNDRKF 609
VRR RKF
Sbjct: 925 VRRQEARKF 933
>gnl|CDD|109943 pfam00906, Hepatitis_core, Hepatitis core antigen. The core
antigen of hepatitis viruses possesses a carboxyl
terminus rich in arginine. On this basis it was
predicted that the core antigen would bind DNA. There is
some experimental evidence to support this.
Length = 182
Score = 31.7 bits (72), Expect = 2.2
Identities = 11/27 (40%), Positives = 15/27 (55%), Gaps = 1/27 (3%)
Query: 365 RRSRSRSP-PRKRRSRSRERDYDRVRE 390
R R R+P PR+RRS+S R +
Sbjct: 153 RSPRRRTPSPRRRRSQSPRRRRSQSPS 179
Score = 31.3 bits (71), Expect = 2.9
Identities = 23/65 (35%), Positives = 33/65 (50%), Gaps = 9/65 (13%)
Query: 1154 ISSPVSTCPPELLNHPDAQHKPGTVMMAGDRRS-RSRSPA-RKRRSRSRDRDYDRRFKRK 1211
I +P + PP N P P T+++ RS R R+P+ R+RRS+S R R +
Sbjct: 125 IRTPPAYRPP---NAPILSTLPETIVVRRRGRSPRRRTPSPRRRRSQSPRR----RRSQS 177
Query: 1212 KSPQC 1216
S QC
Sbjct: 178 PSSQC 182
>gnl|CDD|225708 COG3167, PilO, Tfp pilus assembly protein PilO [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 211
Score = 32.1 bits (73), Expect = 2.2
Identities = 20/56 (35%), Positives = 28/56 (50%), Gaps = 4/56 (7%)
Query: 489 LEELLAKKKAEEEARSKPKFLTKEERAA--EALRKRQAEVEEMRKKMEEERKKRQE 542
LEEL + EEE +S + K +AA EALR + AE+EE + + E
Sbjct: 48 LEELEELEAEEEELKST--YQQKAIQAANLEALRAQLAEMEERFDILLRQLPSDTE 101
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 32.8 bits (75), Expect = 2.2
Identities = 41/234 (17%), Positives = 74/234 (31%), Gaps = 37/234 (15%)
Query: 381 RERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDG------ 434
ER R + R ++R + D DR R R+ R K +E
Sbjct: 612 AERATAREQVRAYFKARAEL--DALLDRRARLRAALRAALKAVAIVLPGEELAELLELAR 669
Query: 435 ----KDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEE-------DKGAAAK 483
+ ++ R+ +E ++ + E Q + +E + G +
Sbjct: 670 QLLEEAEKQAARKASLDERLRDAERALEEAEERHDEAQSALEAWQEQWYDALLEAGLGGR 729
Query: 484 KEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKK---- 539
P + L + +E +AA+ LR+R A +E + EEE +
Sbjct: 730 ASPAGALDALELLQNIKEKL----------QAADDLRQRIAAMERDLARFEEEVEALAEA 779
Query: 540 -RQEFTKEASFESKRENF---DARLRRDREKKKEDPEEKELNKDKEREGEAIKE 589
E + E+ R +K E +E K+ A+ E
Sbjct: 780 VAPEMLGTPADETARALKQRLKRARDTAAAAEKLAEEIEEAEKEVSEAAAALDE 833
Score = 31.3 bits (71), Expect = 5.8
Identities = 35/212 (16%), Positives = 72/212 (33%), Gaps = 13/212 (6%)
Query: 404 RDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGY 463
+ R + ++ + E+ + R + K+++ R R + K +
Sbjct: 172 KTRTWKDLVKALDEAEEELANLRKE-LRQLEKEKQRLERLRRLLPLLAERKALEQQLAAL 230
Query: 464 EPTEQMVISDEEEDKGAAAKKEPLS-----LEELLAKKKAEEEARSKPKFLTKEERAAEA 518
+ E + A A+ L E L +AE + S + L + A EA
Sbjct: 231 GEVIDLPPDAVERYEEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAAIEA 290
Query: 519 LRKRQAEVEEMRKKMEE-----ERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEE 573
L +++ E + + + + A + R KE
Sbjct: 291 LHQQRGEYRNAEQDLPDREGEIANAREAAAALLAQIGPDADEEAVESLRPSLAAKETVT- 349
Query: 574 KELNKDKEREGEAIKERYLGLVKKKRRVRRLN 605
EL K KE +A+K L +++R ++++
Sbjct: 350 -ELEKRKEALDQALKSARDALEERERELKQVR 380
>gnl|CDD|235026 PRK02292, PRK02292, V-type ATP synthase subunit E; Provisional.
Length = 188
Score = 31.9 bits (73), Expect = 2.3
Identities = 26/96 (27%), Positives = 45/96 (46%), Gaps = 7/96 (7%)
Query: 494 AKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKR 553
++ +AE + ++ E A E L R+AE E E E+ + QE + A E+KR
Sbjct: 19 SEIRAEADEEAEEIIAEAEADAEEILEDREAEAER-----EIEQLREQELS-SAKLEAKR 72
Query: 554 ENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKE 589
E +AR + E + E++ + D ++ E K
Sbjct: 73 ERLNAR-KEVLEDVRNQVEDEIASLDGDKREELTKS 107
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 32.7 bits (75), Expect = 2.3
Identities = 28/133 (21%), Positives = 56/133 (42%), Gaps = 14/133 (10%)
Query: 590 RYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKERHQVQFFGRGNIAGIDI--K 647
R ++ R + ++ F + D V YN + E + F G I G+ +
Sbjct: 34 RLFRSLELGRIHEKYSNCSFEWKSDDPLKVRV-YNRDFVEEN----FDEGEIKGVFTLGE 88
Query: 648 AQKRDQSKFYGEMLEKRRTEAEKEQEKVRLKKVKKREEKQKWDDRHWTEKSLDEMTERDW 707
+++ E L+K + E++ E++ + KK EE +K K LD+ ++
Sbjct: 89 ENIEIEAQI--EELKKELKKLEEKIEQLEAEIEKKEEELEK-----AKNKFLDKAWKKLA 141
Query: 708 RIFREDYSITIKG 720
+ + + S +KG
Sbjct: 142 KKYDSNLSEALKG 154
Score = 32.7 bits (75), Expect = 2.3
Identities = 28/133 (21%), Positives = 56/133 (42%), Gaps = 14/133 (10%)
Query: 1265 RYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKERHQVQFFGRGNIAGIDI--K 1322
R ++ R + ++ F + D V YN + E + F G I G+ +
Sbjct: 34 RLFRSLELGRIHEKYSNCSFEWKSDDPLKVRV-YNRDFVEEN----FDEGEIKGVFTLGE 88
Query: 1323 AQKRDQSKFYGEMLEKRRTEAEKEQEKVRLKKVKKREEKQKWDDRHWTEKSLDEMTERDW 1382
+++ E L+K + E++ E++ + KK EE +K K LD+ ++
Sbjct: 89 ENIEIEAQI--EELKKELKKLEEKIEQLEAEIEKKEEELEK-----AKNKFLDKAWKKLA 141
Query: 1383 RIFREDYSITIKG 1395
+ + + S +KG
Sbjct: 142 KKYDSNLSEALKG 154
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 32.8 bits (74), Expect = 2.3
Identities = 42/225 (18%), Positives = 73/225 (32%), Gaps = 22/225 (9%)
Query: 392 DTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEAS 451
D+D D+ + + + KR + + E + E S
Sbjct: 467 DSDSQFDESEGNLRWKEGLASKLAYSQSGKRGRNIQKIFYDESLSPEECIEEYKGESAKS 526
Query: 452 GSKHKSRDKEGYEPTEQMVISDEEEDKGAA--AKKEPLSLEELLAKKKAEEEARSKPKFL 509
+S EP + +S + ++ K EEL KKK A+ K +F
Sbjct: 527 S---ESDLVVQDEPEDFFDVSKVANESISSNHEKLMESEFEEL--KKKWSSLAQLKSRFQ 581
Query: 510 T--------KEERAAEALRKRQAEV---EEMRKKMEEERKKRQEFTKEASFESKRENFDA 558
EE + K E EE E E + T E + +++
Sbjct: 582 KDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYE- 640
Query: 559 RLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRR 603
+RE+ EE N + E G+ K+ ++KR++
Sbjct: 641 ---TEREENARKKEELRGNFELEERGDPEKKDVDWYTEEKRKIEE 682
Score = 31.6 bits (71), Expect = 5.6
Identities = 21/104 (20%), Positives = 44/104 (42%), Gaps = 6/104 (5%)
Query: 500 EEARSKPKFLTKE----ERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKREN 555
EE S P+ + + +R+ + +R + + E K R KE + + +
Sbjct: 976 EEDYSLPREIESKLPLDKRSIAVVSRRIELPVPPECREKHEIKDR--IVKERIKDQEEKE 1033
Query: 556 FDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKR 599
L+R +E++ E++ + ++ + KE +KKKR
Sbjct: 1034 RMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKRLKKKR 1077
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 32.5 bits (74), Expect = 2.4
Identities = 20/140 (14%), Positives = 50/140 (35%), Gaps = 5/140 (3%)
Query: 448 EEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEE--EARSK 505
+ A ++ S++ + + + + K +S KKK +E E +
Sbjct: 179 KSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSS---FFKKKTKEKKEKKEA 235
Query: 506 PKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDRE 565
+ KEE E+ ++ +E + + + ++ K + S E +++
Sbjct: 236 SESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKR 295
Query: 566 KKKEDPEEKELNKDKEREGE 585
K+ + E E ++
Sbjct: 296 KRLKKMMEDEDEDEEMEIVP 315
Score = 32.5 bits (74), Expect = 2.5
Identities = 26/152 (17%), Positives = 56/152 (36%), Gaps = 13/152 (8%)
Query: 450 ASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFL 509
+S K K+++K+ + + + +E E++ +E+
Sbjct: 219 SSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSG 278
Query: 510 TKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKE 569
+ + E K + + + ++K ME+E + + S + E+ E
Sbjct: 279 ERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEES-------------E 325
Query: 570 DPEEKELNKDKEREGEAIKERYLGLVKKKRRV 601
+PE L K +E + E G + +RRV
Sbjct: 326 EPEPPPLPKKEEEKEEVTVSPDGGRRRGRRRV 357
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 32.6 bits (74), Expect = 2.4
Identities = 34/151 (22%), Positives = 59/151 (39%), Gaps = 10/151 (6%)
Query: 405 DRDRLERDRSRERERE-KRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGY 463
D + ER + +ERER+ D SK + EEE S + ++E
Sbjct: 397 DTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESV---EEEEEEEE 453
Query: 464 EPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQ 523
E E+ S+EEE + + EE+ A +EEE + E E +R
Sbjct: 454 EEEEEEQESEEEEGEDEEEE------EEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRN 507
Query: 524 AEVEEMRKKMEEERKKRQEFTKEASFESKRE 554
+E+ + + E ++ + E+ E +
Sbjct: 508 SEMAGISRMSEGQQPRGSSVQPESPQEEPLQ 538
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 32.5 bits (74), Expect = 2.4
Identities = 30/178 (16%), Positives = 61/178 (34%), Gaps = 5/178 (2%)
Query: 389 RERDTDRSRDDRGRDRDRDRLE---RDRSREREREKRHHRRDRSKERDGKDRREGYRRER 445
RE + ++ ++ + E + RE R +R KE + + + Y
Sbjct: 287 REAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIERQKETELEPQERSYFINA 346
Query: 446 REEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSK 505
+ +A + + + + + A + E +L + E+ +
Sbjct: 347 AQRQAQEEAKAAANIAEAIGAQAEAAVETARETEEAERAEQAALVAAAEAAEQEQVEIAV 406
Query: 506 PKFLTKEERAAEALR-KRQAEVEEMRKKMEEERKKRQ-EFTKEASFESKRENFDARLR 561
K E A+A K +AE + K E E K+ E + + E F A ++
Sbjct: 407 RAEAAKAEAEAQAAEIKAEAEAIREKGKAEAEAKRALAEAIQVLGDAAAAELFKALVQ 464
Score = 30.6 bits (69), Expect = 9.4
Identities = 38/192 (19%), Positives = 63/192 (32%), Gaps = 20/192 (10%)
Query: 398 DDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKS 457
D GR R L+ E E EK RD K ++ + A
Sbjct: 209 DALGRRRIAQVLQDAEIAENEAEKETEIAIAEANRDAKLVELEVEQQPAGKTA------- 261
Query: 458 RDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAE 517
E T ++ I E + AA K E A+ AE+ + + +E + A+
Sbjct: 262 ------EQTREVKIILAETEAEVAAWKAETRREAEQAEILAEQAIQEEKAQAEQEVQHAK 315
Query: 518 ALRKRQAEVEEMRKKMEEERKKRQEF-------TKEASFESKRENFDARLRRDREKKKED 570
AL R+ V + ++ E E + ++ + N + E E
Sbjct: 316 ALEAREMRVGLIERQKETELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQAEAAVET 375
Query: 571 PEEKELNKDKER 582
E E + E+
Sbjct: 376 ARETEEAERAEQ 387
>gnl|CDD|165222 PHA02896, PHA02896, A-type inclusion like protein; Provisional.
Length = 616
Score = 32.7 bits (74), Expect = 2.5
Identities = 39/242 (16%), Positives = 78/242 (32%), Gaps = 34/242 (14%)
Query: 368 RSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRG---RDRDRDRLERDRSREREREKRHH 424
+ ++ P + +R+ D D++ E + D + DD +D R + + +KR H
Sbjct: 374 KMKNCPSTNGTLNRDEDEDKIEEDNDDYNNDDEDDLIKDEAPILNRRHGQMDDKYDKRDH 433
Query: 425 RRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKK 484
R +K Y E K D + ++
Sbjct: 434 RYKNNK----------YDIYDDESPRYKYKDDDYDDNDDNDDDHII-------------- 469
Query: 485 EPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFT 544
P L K E+E + ++ +++R R+ + + +
Sbjct: 470 -PKKANNLEDAKDFEDEMMDA---IADDDYTPKSIRNRRNGYLLKDEDRYYYDRGHRNLG 525
Query: 545 KEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRL 604
E E + D D++K +D ++ E + +RY K K ++ L
Sbjct: 526 AEKDPEKPHPDKDEEYPEDKDKSDDDRLNDADKAMRKLEADGFGDRYR---KIKEKMDNL 582
Query: 605 ND 606
D
Sbjct: 583 ED 584
Score = 31.5 bits (71), Expect = 4.7
Identities = 32/213 (15%), Positives = 75/213 (35%), Gaps = 28/213 (13%)
Query: 364 DRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRH 423
D + +P RR + YD+ RD R ++ D + + R + ++ +
Sbjct: 407 DDLIKDEAPILNRRHGQMDDKYDK---------RDHRYKNNKYDIYDDESPRYKYKDDDY 457
Query: 424 HRRDRSKERDGKDRREGYRRERREEEAS----------GSKHKSRDKEGYEPTEQMVISD 473
D + + ++ + ++ E K + GY ++
Sbjct: 458 DDNDDNDDDHIIPKKANNLEDAKDFEDEMMDAIADDDYTPKSIRNRRNGYLLKDEDRYYY 517
Query: 474 EEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEAL---------RKRQA 524
+ + A+K+P ++ E++ +S L ++A L RK +
Sbjct: 518 DRGHRNLGAEKDPEKPHPDKDEEYPEDKDKSDDDRLNDADKAMRKLEADGFGDRYRKIKE 577
Query: 525 EVEEMRKKMEEERKKRQEFTKEASFESKRENFD 557
+++ + ++ RK E K+ +S + D
Sbjct: 578 KMDNLEDDYDDLRKHAIELPKKLDNQSGNGDRD 610
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 32.3 bits (73), Expect = 2.5
Identities = 42/192 (21%), Positives = 69/192 (35%), Gaps = 18/192 (9%)
Query: 416 EREREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTE-----QMV 470
E+ K R + + K RR +E E K R+K +
Sbjct: 101 EKVIHKCKQRLTKLTQYLLKARRLALFKEHGEYMVRKPKMNRREKLRERKALNAAKFEKN 160
Query: 471 ISDEEED--KGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEE 528
I E K PL++ E L K A E + K ++ER E + E+E
Sbjct: 161 IEKELMLRLKSGIYGDTPLNVREHLWNKAATE----REKRQDEKERYVEEEEESDTELEA 216
Query: 529 MRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIK 588
+ E+E+ K+++ K + + + + D +E E NK K I+
Sbjct: 217 VTDDSEKEKTKKKDLEKWLG-SDQSMETSESEEEESSESESDEDEDEDNKGK------IR 269
Query: 589 ERYLGLVKKKRR 600
+R KK R+
Sbjct: 270 KRKTDDAKKSRK 281
>gnl|CDD|185235 PRK15335, PRK15335, type III secretion system protein SpaM;
Provisional.
Length = 147
Score = 31.2 bits (70), Expect = 2.6
Identities = 29/125 (23%), Positives = 63/125 (50%), Gaps = 15/125 (12%)
Query: 467 EQMVISDEEEDKGAAAKKEPL-----SLEELLAKKKAEEEARSKPK---FLTKEERAAEA 518
E +++ ++ED+G A++E + L+ LL +AE S+ + L K+
Sbjct: 22 ESILLRYQDEDRGLQAEEEAILEQIAGLKLLLDTLRAENRQLSREEIYTLLRKQSIVRRQ 81
Query: 519 LRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKREN-------FDARLRRDREKKKEDP 571
++ + ++ ++++K E KKR+EF K++ + ++E R RE ++E+
Sbjct: 82 IKDLELQIIQIQEKRSELEKKREEFQKKSKYWLRKEGNYQRWIIRQKRFYIQREIQQEEA 141
Query: 572 EEKEL 576
E +E+
Sbjct: 142 ESEEI 146
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 32.5 bits (74), Expect = 2.6
Identities = 25/181 (13%), Positives = 65/181 (35%), Gaps = 19/181 (10%)
Query: 404 RDRDRLERDRSRERE--REKRHHRRDRSKERDGKDRREGYR-----RERREEEASGSKHK 456
++ D L + R + + +++R++ + + Y R + + +
Sbjct: 168 KEADSLYKPSGRNPQINQLLEKLKQERNEIDEAEKEYATYHKLLESRRAEHARLAELRSE 227
Query: 457 SRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAA 516
R + + + +E LE+ L +++ EE A + + E+
Sbjct: 228 LRADRDHIRALRDAVELW------PRLQEWKQLEQELTRRR-EELATFPRDGVLRLEKRE 280
Query: 517 EALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKEL 576
L+K +AE++ + ++ E + + +K A +R ++ + EL
Sbjct: 281 AHLQKTEAEIDALLVRLAELKDLASQ-----LIPAKEAVLQALVRLHQQLSEIKASAFEL 335
Query: 577 N 577
Sbjct: 336 T 336
Score = 32.5 bits (74), Expect = 2.6
Identities = 48/259 (18%), Positives = 85/259 (32%), Gaps = 29/259 (11%)
Query: 470 VISDEEEDKGAAAKKEPLSLEELLAKKKAEE-EARSKPKFLTKEERAAEALRKRQAEVEE 528
V++ E + + + P + +LL K K E E K + E+ R A + E
Sbjct: 165 VLNKEADSLYKPSGRNPQ-INQLLEKLKQERNEIDEAEKEYATYHKLLESRRAEHARLAE 223
Query: 529 MRKKMEEERKKRQEFTKE-----ASFESKRENFDARLRRDREKKKEDPEEKELNKDKERE 583
+R ++ +R + E K+ + L R RE+ P + L +K
Sbjct: 224 LRSELRADRDHIRALRDAVELWPRLQEWKQ--LEQELTRRREELATFPRDGVLRLEKREA 281
Query: 584 GEAIKERYLGLVK------KKRRVRRLNDRKFVFDWDASEDTSVDYNSIYK-ERHQVQFF 636
E + + K + + ++ V A + I +
Sbjct: 282 HLQKTEAEIDALLVRLAELKDLASQLIPAKEAVL--QALVRLHQQLSEIKASAFELTETL 339
Query: 637 GRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQEKVRLKKVKKREEKQ---KWDDRH 693
I+A RD+ + G E R + E + + +RE KQ + R
Sbjct: 340 A-------GIEADLRDKEEAAGNGFEAERVHDLRSLECMLRYQSSQRELKQTEAAYCKR- 391
Query: 694 WTEKSLDEMTERDWRIFRE 712
EK L E + R
Sbjct: 392 LDEKRLFEDEAEEEARQRL 410
>gnl|CDD|187841 cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3;
Distinct diverged subfamily of Cas3 helicase domain.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Diverged DNA helicase Cas3'; signature gene for Type I
and subtype I-D.
Length = 353
Score = 32.2 bits (73), Expect = 2.6
Identities = 23/69 (33%), Positives = 30/69 (43%), Gaps = 18/69 (26%)
Query: 22 LQNRD---IIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQ 78
LQ++D I A TG+GKTLA+L PLL AI + PT L +
Sbjct: 9 LQSKDADIIFNTAPTGAGKTLAWLTPLL---------------HGENKAIALYPTNALIE 53
Query: 79 QIEEETNKF 87
E +F
Sbjct: 54 DQTEAIKEF 62
Score = 32.2 bits (73), Expect = 2.6
Identities = 23/69 (33%), Positives = 30/69 (43%), Gaps = 18/69 (26%)
Query: 763 LQNRD---IIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQ 819
LQ++D I A TG+GKTLA+L PLL AI + PT L +
Sbjct: 9 LQSKDADIIFNTAPTGAGKTLAWLTPLL---------------HGENKAIALYPTNALIE 53
Query: 820 QIEEETNKF 828
E +F
Sbjct: 54 DQTEAIKEF 62
Score = 32.2 bits (73), Expect = 2.6
Identities = 23/69 (33%), Positives = 30/69 (43%), Gaps = 18/69 (26%)
Query: 1438 LQNRD---IIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQ 1494
LQ++D I A TG+GKTLA+L PLL AI + PT L +
Sbjct: 9 LQSKDADIIFNTAPTGAGKTLAWLTPLL---------------HGENKAIALYPTNALIE 53
Query: 1495 QIEEETNKF 1503
E +F
Sbjct: 54 DQTEAIKEF 62
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 32.4 bits (74), Expect = 2.7
Identities = 24/107 (22%), Positives = 48/107 (44%), Gaps = 10/107 (9%)
Query: 531 KKMEEERKKRQEF---------TKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKE 581
++++E KK EF KE E E + + ++++++E +KE D+E
Sbjct: 6 RRIKELVKKHSEFINYPIYLWVEKEVEKEVPDEEEEEE-KEEKKEEEEKTTDKEEEVDEE 64
Query: 582 REGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYK 628
E E K++ + + LN K ++ + + T +Y + YK
Sbjct: 65 EEKEEKKKKTKKVKETTTEWELLNKTKPIWTRNPKDVTKEEYAAFYK 111
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 31.5 bits (72), Expect = 2.8
Identities = 17/75 (22%), Positives = 33/75 (44%), Gaps = 1/75 (1%)
Query: 528 EMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAI 587
+ +K+ E + R + KE + +LR + ++E+ EEK +++ER A
Sbjct: 84 NLLEKLIEVSEHRVDSEKEDERYEITSDVRKQLRFLEQLEREE-EEKRDEEERERLLRAA 142
Query: 588 KERYLGLVKKKRRVR 602
K R K++
Sbjct: 143 KSRSEQSRLKQKAKE 157
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 32.2 bits (73), Expect = 2.9
Identities = 23/145 (15%), Positives = 58/145 (40%), Gaps = 11/145 (7%)
Query: 395 RSRDDRGRDR-DRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERR----EEE 449
R +D + +D D + D S + + + + K++ K++++ ++ +++
Sbjct: 209 RGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDD 268
Query: 450 ASGSKHKSRDKEGYE----PTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSK 505
A D EG E +D EE + + + P E + E E
Sbjct: 269 ADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKN 328
Query: 506 PKF--LTKEERAAEALRKRQAEVEE 528
+ L+K+ + + L+ ++ +++
Sbjct: 329 EEEGGLSKKGKKLKKLKGKKNGLDK 353
>gnl|CDD|233483 TIGR01587, cas3_core, CRISPR-associated helicase Cas3. This model
represents the highly conserved core region of an
alignment of Cas3, a protein found in association with
CRISPR repeat elements in a broad range of bacteria and
archaea. Cas3 appears to be a helicase, with regions
found by pfam00270 (DEAD/DEAH box helicase) and pfam00271
(Helicase conserved C-terminal domain). Some but not all
members have an N-terminal HD domain region (pfam01966)
that is not included within this model [Mobile and
extrachromosomal element functions, Other].
Length = 358
Score = 32.0 bits (73), Expect = 3.0
Identities = 39/168 (23%), Positives = 68/168 (40%), Gaps = 15/168 (8%)
Query: 939 VMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKL--MEVLN 996
++ +AT+P ++ A K R E+ +I E DK ++ +E L
Sbjct: 159 LLMSATLPKFLKEYAEKIG--YVEFNEPLDLKEERRFERHRFIKIESDKVGEISSLERLL 216
Query: 997 RGVKK--PVIIFVNQKKGADVLAKGLEKLGYNACT--LHG----GKGQEQRELALNSLKG 1048
+KK + I VN A + L++ LH ++ L +K
Sbjct: 217 EFIKKGGKIAIIVNTVDRAQEFYQQLKENAPEEEIMLLHSRFTEKDRAKKEAELLEEMKK 276
Query: 1049 GSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGK 1096
K ++VAT V +DI ++I ++A I+ R+GR R G+
Sbjct: 277 NEKFVIVATQVIEASLDI-SADVMIT-ELA-PIDSLIQRLGRLHRYGR 321
Score = 32.0 bits (73), Expect = 3.0
Identities = 39/168 (23%), Positives = 68/168 (40%), Gaps = 15/168 (8%)
Query: 1614 VMFTATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKL--MEVLN 1671
++ +AT+P ++ A K R E+ +I E DK ++ +E L
Sbjct: 159 LLMSATLPKFLKEYAEKIG--YVEFNEPLDLKEERRFERHRFIKIESDKVGEISSLERLL 216
Query: 1672 RGVKK--PVIIFVNQKKGADVLAKGLEKLGYNACT--LHG----GKGQEQRELALNSLKG 1723
+KK + I VN A + L++ LH ++ L +K
Sbjct: 217 EFIKKGGKIAIIVNTVDRAQEFYQQLKENAPEEEIMLLHSRFTEKDRAKKEAELLEEMKK 276
Query: 1724 GSKDILVATDVAGRGIDIKDVSMVINYDMAKSIEDYTHRIGRTGRAGK 1771
K ++VAT V +DI ++I ++A I+ R+GR R G+
Sbjct: 277 NEKFVIVATQVIEASLDI-SADVMIT-ELA-PIDSLIQRLGRLHRYGR 321
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 31.6 bits (72), Expect = 3.0
Identities = 13/40 (32%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 530 RKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKE 569
++KM+E +K +EF KE E++ +L++ +EK+ E
Sbjct: 71 QEKMKELQKMMKEFQKEFR-EAQESGDMKKLKKLQEKQME 109
>gnl|CDD|213755 TIGR02977, phageshock_pspA, phage shock protein A. Members of this
family are the phage shock protein PspA, from the phage
shock operon. This is a narrower family than the set of
PspA and its homologs, sometimes several in a genome, as
described by pfam04012. PspA appears to maintain the
protonmotive force under stress conditions that include
overexpression of certain phage secretins, heat shock,
ethanol, and protein export defects [Cellular processes,
Adaptations to atypical conditions].
Length = 219
Score = 31.5 bits (72), Expect = 3.0
Identities = 17/59 (28%), Positives = 33/59 (55%), Gaps = 5/59 (8%)
Query: 484 KEPLSLEELLAKKKAEEEARSKPKFLTKE-ERAAEALRKRQAEVEEMRKKMEEERKKRQ 541
+E L+ L+ K+KA+E A + L +E E L K Q ++ +++ K+ E R +++
Sbjct: 83 REDLARAALIEKQKAQELAEA----LERELAAVEETLAKLQEDIAKLQAKLAEARARQK 137
>gnl|CDD|179302 PRK01558, PRK01558, V-type ATP synthase subunit E; Provisional.
Length = 198
Score = 31.3 bits (71), Expect = 3.1
Identities = 19/70 (27%), Positives = 35/70 (50%), Gaps = 4/70 (5%)
Query: 489 LEELLAKKK----AEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFT 544
+++L+ K K E E + L +E A E + K + E +E++ K E+E +
Sbjct: 5 VKDLINKIKKDGLEEAERLANEIILEAKEEAEEIIAKAEEEAKELKAKAEKEANDYKRHA 64
Query: 545 KEASFESKRE 554
EAS ++ R+
Sbjct: 65 LEASRQAGRD 74
>gnl|CDD|130217 TIGR01147, V_ATP_synt_G, vacuolar ATP synthase, subunit G. This
model describes the vacuolar ATP synthase G subunit in
eukaryotes and includes members from diverse groups
e.g., fungi, plants, parasites etc. V-ATPases are
multi-subunit enzymes composed of two functional
domains: A transmembrane Vo domain and a peripheral
catalytic domain V1. The G subunit is one of the
subunits of the catalytic domain. V-ATPases are
responsible for the acidification of endosomes and
lysosomes, which are part of the central vacuolar system
[Energy metabolism, ATP-proton motive force
interconversion].
Length = 113
Score = 30.2 bits (68), Expect = 3.2
Identities = 20/75 (26%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Query: 523 QAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKER 582
QAE + +K+ E RK++ + K+A E+++E + +R++E K+ + + N E
Sbjct: 13 QAE-KRAAEKVSEARKRKTKRLKQAKEEAQKEVEKYKQQREKEFKEFEAKHLGGNGAAEE 71
Query: 583 EGEAIKERYLGLVKK 597
+ EA + + +KK
Sbjct: 72 KAEAETQAKIREIKK 86
>gnl|CDD|234127 TIGR03158, cas3_cyano, CRISPR-associated helicase Cas3, subtype
CYANO. CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) is a widespread family of
prokaryotic direct repeats with spacers of unique
sequence between consecutive repeats. This protein
family is a CRISPR-associated (Cas) family strictly
associated with the Cyano subtype of CRISPR/Cas locus,
found in several species of Cyanobacteria and several
archaeal species. It contains helicase motifs and
appears to represent the Cas3 protein of the Cyano
subtype of CRISPR/Cas system [Mobile and
extrachromosomal element functions, Other].
Length = 357
Score = 31.8 bits (72), Expect = 3.2
Identities = 22/69 (31%), Positives = 29/69 (42%), Gaps = 18/69 (26%)
Query: 22 LQNRD---IIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQ 78
LQ++D I A TG+GKTLA+L PLL I + PT L +
Sbjct: 9 LQSKDADIIFNTAPTGAGKTLAWLTPLL---------------HGENDTIALYPTNALIE 53
Query: 79 QIEEETNKF 87
E +F
Sbjct: 54 DQTEAIKEF 62
Score = 31.8 bits (72), Expect = 3.2
Identities = 22/69 (31%), Positives = 29/69 (42%), Gaps = 18/69 (26%)
Query: 763 LQNRD---IIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQ 819
LQ++D I A TG+GKTLA+L PLL I + PT L +
Sbjct: 9 LQSKDADIIFNTAPTGAGKTLAWLTPLL---------------HGENDTIALYPTNALIE 53
Query: 820 QIEEETNKF 828
E +F
Sbjct: 54 DQTEAIKEF 62
Score = 31.8 bits (72), Expect = 3.2
Identities = 22/69 (31%), Positives = 29/69 (42%), Gaps = 18/69 (26%)
Query: 1438 LQNRD---IIGVAETGSGKTLAFLLPLLVWIQSLPKIARMEDADQGPYAIIMAPTRELAQ 1494
LQ++D I A TG+GKTLA+L PLL I + PT L +
Sbjct: 9 LQSKDADIIFNTAPTGAGKTLAWLTPLL---------------HGENDTIALYPTNALIE 53
Query: 1495 QIEEETNKF 1503
E +F
Sbjct: 54 DQTEAIKEF 62
>gnl|CDD|131084 TIGR02029, AcsF, magnesium-protoporphyrin IX monomethyl ester
aerobic oxidative cyclase. This model respresents the
oxidative cyclase responsible for forming the
distinctive E-ring of the chlorin ring system under
aerobic conditions. This enzyme is believed to utilize a
binuclear iron center and molecular oxygen. There are
two isoforms of this enzyme in some plants and
cyanobacterai which are differentially regulated based
on the levels of copper and oxygen. This step is
essential in the biosynthesis of both
bacteriochlorophyll and chlorophyll under aerobic
conditions (a separate enzyme, BchE, acts under
anaerobic conditions). This enzyme is found in plants,
cyanobacteria and other photosynthetic bacteria
[Biosynthesis of cofactors, prosthetic groups, and
carriers, Chlorophyll and bacteriochlorphyll].
Length = 337
Score = 31.7 bits (72), Expect = 3.2
Identities = 17/72 (23%), Positives = 29/72 (40%), Gaps = 3/72 (4%)
Query: 494 AKKKAEEEARSKPKFLTKEERAAEALRK--RQAEVEEMRKKMEEERKKRQEFTKEASFES 551
K A +E P+F T + L + E + M +M+ + R F + F+
Sbjct: 3 GTKTASQETLLTPRFYTTDFEEMANLDVSPVENEWDAMLAEMKAD-YNRHHFVRNEEFDQ 61
Query: 552 KRENFDARLRRD 563
E+ D LR+
Sbjct: 62 SWEHIDGELRQA 73
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 32.0 bits (73), Expect = 3.3
Identities = 31/174 (17%), Positives = 66/174 (37%), Gaps = 13/174 (7%)
Query: 519 LRKRQAEVEEMRKKMEEERKKRQEFTKE-ASFESKRENFDARLRR--DREKKKEDP---E 572
L++ Q E+ + KK+ E++ +R + K+ S E++ + +A+L D KK
Sbjct: 40 LKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADL 99
Query: 573 EKELNK--DKEREGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVDYNSIYKER 630
LN +ERE L +++ R D S ++ Y ++ R
Sbjct: 100 NARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPAR 159
Query: 631 -----HQVQFFGRGNIAGIDIKAQKRDQSKFYGEMLEKRRTEAEKEQEKVRLKK 679
+ +I A++ + + E ++ A+ +E+ +
Sbjct: 160 AERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLA 213
>gnl|CDD|177566 PHA03242, PHA03242, envelope glycoprotein M; Provisional.
Length = 428
Score = 32.1 bits (73), Expect = 3.3
Identities = 20/82 (24%), Positives = 29/82 (35%), Gaps = 10/82 (12%)
Query: 1184 RRSRSRSPARKRRSRSRDRDYDRRFKRKKSPQCEAQSSRFSACSLPRSHKSSSLLSRYSE 1243
RR RSR R +R R R Y RR +SS ++ P + +S
Sbjct: 352 RRHRSRFYGHVRDARHRARSYIRR----------VRSSMRNSREAPYAGESDVGRDDDYA 401
Query: 1244 QDPEEKELNKDKEREGEAIKER 1265
E ++ E E + R
Sbjct: 402 STDGEPIYDEVAPDEDEVLYAR 423
>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509). This
family consists of several uncharacterized viral proteins
from the Marek's disease-like viruses. Members of this
family are typically around 400 residues in length. The
function of this family is unknown.
Length = 377
Score = 31.9 bits (72), Expect = 3.4
Identities = 14/31 (45%), Positives = 21/31 (67%), Gaps = 1/31 (3%)
Query: 1182 GDRRSRSRSPARKRRSRSRDRDYDRRFKRKK 1212
G+RR R R+ + RSRSR R RR++R++
Sbjct: 323 GERRGRRRN-RSESRSRSRSRSGSRRYRRRR 352
Score = 31.9 bits (72), Expect = 3.6
Identities = 17/56 (30%), Positives = 24/56 (42%), Gaps = 1/56 (1%)
Query: 363 GDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSRERE 418
G+RR R R+ + RSRSR R R R R R + +D + S +
Sbjct: 323 GERRGRRRNR-SESRSRSRSRSGSRRYRRRRGRGVPGRRSESRQDTVLVSSSEASD 377
Score = 31.5 bits (71), Expect = 3.8
Identities = 20/76 (26%), Positives = 28/76 (36%), Gaps = 10/76 (13%)
Query: 367 SRSRSPPRKRRSRSRERDYDRVRERDTDRSR--DDRGRDRDRDRLERDRSREREREKRHH 424
+R+ +P R S + RGR R+R R RSR R +R+
Sbjct: 291 NRAGTPDDGSAISEDSAGGSHHTMRRPPHSTSGERRGRRRNRSE-SRSRSRSRSGSRRYR 349
Query: 425 RR-------DRSKERD 433
RR RS+ R
Sbjct: 350 RRRGRGVPGRRSESRQ 365
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 32.1 bits (73), Expect = 3.8
Identities = 70/403 (17%), Positives = 136/403 (33%), Gaps = 66/403 (16%)
Query: 248 TATMPPAVERLARSYLRRPATVYIGSVGKPTERIEQIVYILSEQDKRKKLMEVLNRGVKK 307
A ++ + S +R Y+ S+ T +E++ + +++DK +E L + K
Sbjct: 1092 AAEEEEEIDVSSSSGVRGSDYDYLLSMAIGTLTLEKVQELCADRDKLNIEVEDLKKTTPK 1151
Query: 308 PVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKGGSKDILMAGDRRS 367
+ + K D L K L+KL + + QR A K R++
Sbjct: 1152 SLWL-----KDLDALEKELDKLDKEDAKAEEAREKLQRAAARGESGAAKK-----VSRQA 1201
Query: 368 RSRSPPRKRRSRSRERDYDRVRERDTDRSRDD----------RGRDRDRDRLERDRSRER 417
+ P+K ++ E E S + +GR + + +
Sbjct: 1202 PKKPAPKKTTKKASE---SETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAAKEKE 1258
Query: 418 EREKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEED 477
E ++ +DR + + +A ++ + K+ + SD+++D
Sbjct: 1259 EEDEILDLKDRLAAYNLDSAPAQSAKMEETVKAVPARRAAARKKPLASVSVISDSDDDDD 1318
Query: 478 -----------------KGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEAL- 519
+ AA + + AKK+ +S K LT+ + AEA+
Sbjct: 1319 DFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKPAEAIG 1378
Query: 520 RKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKD 579
+ +V +MR F K++ + L R K+ + E
Sbjct: 1379 ISPEKKVRKMRAS---------PFNKKSG---------SVLGRAATNKETESSENVSGSS 1420
Query: 580 KEREGEAIKERYLGLVKKKRRVRRLNDRKFVFDWDASEDTSVD 622
+ E V K R +R N ++ + SE S D
Sbjct: 1421 SSEKDEID-------VSAKPRPQRANRKQTTYVLSDSESESAD 1456
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 30.6 bits (69), Expect = 3.9
Identities = 25/115 (21%), Positives = 49/115 (42%), Gaps = 15/115 (13%)
Query: 469 MVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEE 528
M+ E+E + A++E +E + + E E K K RK + E E
Sbjct: 19 MLGGKEDEGDESDAEEE----DEEIQEALREAEEERKAKH-----------RKMEEEREV 63
Query: 529 MRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKERE 583
MR+ + ++ +++ E ++ E + RL R ++ +E E ++E E
Sbjct: 64 MRQGIRDKYGIKKKEEDEEEPQAAAEEEEGRLGRKKKTPEELAAEAGEEDEEEEE 118
>gnl|CDD|236408 PRK09200, PRK09200, preprotein translocase subunit SecA; Reviewed.
Length = 790
Score = 31.8 bits (73), Expect = 4.0
Identities = 38/147 (25%), Positives = 63/147 (42%), Gaps = 27/147 (18%)
Query: 970 KPTERIEQ--IVYILSEQDKRKKLMEVLNRGVKK-PVIIFVNQKKGADVLAKGL--EKLG 1024
+P RI+ V++ ++ + + EV R PV+I + ++ +K L +
Sbjct: 395 RPIIRIDYPDKVFVTLDEKYKAVIEEVKERHETGRPVLIGTGSIEQSETFSKLLDEAGIP 454
Query: 1025 YNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIK---DVS-----MVINYD 1076
+N L K + E + + G + VAT++AGRG DIK V VI +
Sbjct: 455 HN---LLNAKNAAK-EAQIIAEAGQKGAVTVATNMAGRGTDIKLGEGVHELGGLAVIGTE 510
Query: 1077 MAKSIEDYTHRI-----GRTGRAGKEG 1098
+S R+ GR+GR G G
Sbjct: 511 RMES-----RRVDLQLRGRSGRQGDPG 532
Score = 31.8 bits (73), Expect = 4.0
Identities = 38/147 (25%), Positives = 63/147 (42%), Gaps = 27/147 (18%)
Query: 1645 KPTERIEQ--IVYILSEQDKRKKLMEVLNRGVKK-PVIIFVNQKKGADVLAKGL--EKLG 1699
+P RI+ V++ ++ + + EV R PV+I + ++ +K L +
Sbjct: 395 RPIIRIDYPDKVFVTLDEKYKAVIEEVKERHETGRPVLIGTGSIEQSETFSKLLDEAGIP 454
Query: 1700 YNACTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIK---DVS-----MVINYD 1751
+N L K + E + + G + VAT++AGRG DIK V VI +
Sbjct: 455 HN---LLNAKNAAK-EAQIIAEAGQKGAVTVATNMAGRGTDIKLGEGVHELGGLAVIGTE 510
Query: 1752 MAKSIEDYTHRI-----GRTGRAGKEG 1773
+S R+ GR+GR G G
Sbjct: 511 RMES-----RRVDLQLRGRSGRQGDPG 532
>gnl|CDD|234506 TIGR04221, SecA2_Mycobac, accessory Sec system translocase SecA2,
Actinobacterial type. Members of this family are the
SecA2 subunit of the Mycobacterial type of accessory
secretory system. This family is quite different SecA2 of
the Staph/Strep type (TIGR03714).
Length = 762
Score = 31.7 bits (72), Expect = 4.0
Identities = 32/135 (23%), Positives = 59/135 (43%), Gaps = 19/135 (14%)
Query: 989 KKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKG 1048
+++ EV G +PV++ ++ LA+ L + G L+ E+ + +
Sbjct: 416 EEIAEVHKTG--QPVLVGTQDVAESEELAEALLEAGVPCNVLNAKNDAEEAAIIAEAGDI 473
Query: 1049 GSKDILVATDVAGRGIDIK----------DVSMVINYDMAKSIEDYTHRI-----GRTGR 1093
G+ + V+T +AGRG DI+ V+ + + + T R+ GR GR
Sbjct: 474 GA--VTVSTQMAGRGTDIRLGGSDEADHDRVAELGGLHVIGTGRHRTARLDNQLRGRAGR 531
Query: 1094 AGKEGLAVSFCTKDD 1108
G G +V F + +D
Sbjct: 532 QGDPGSSVFFVSLED 546
Score = 31.7 bits (72), Expect = 4.0
Identities = 32/135 (23%), Positives = 59/135 (43%), Gaps = 19/135 (14%)
Query: 1664 KKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACTLHGGKGQEQRELALNSLKG 1723
+++ EV G +PV++ ++ LA+ L + G L+ E+ + +
Sbjct: 416 EEIAEVHKTG--QPVLVGTQDVAESEELAEALLEAGVPCNVLNAKNDAEEAAIIAEAGDI 473
Query: 1724 GSKDILVATDVAGRGIDIK----------DVSMVINYDMAKSIEDYTHRI-----GRTGR 1768
G+ + V+T +AGRG DI+ V+ + + + T R+ GR GR
Sbjct: 474 GA--VTVSTQMAGRGTDIRLGGSDEADHDRVAELGGLHVIGTGRHRTARLDNQLRGRAGR 531
Query: 1769 AGKEGLAVSFCTKDD 1783
G G +V F + +D
Sbjct: 532 QGDPGSSVFFVSLED 546
>gnl|CDD|215824 pfam00260, Protamine_P1, Protamine P1.
Length = 51
Score = 28.3 bits (63), Expect = 4.4
Identities = 18/51 (35%), Positives = 21/51 (41%)
Query: 396 SRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGYRRERR 446
+R R R R R R R R R +R R R + RR RR RR
Sbjct: 1 ARYRCCRSRSRSRCRRRRRRRCRRRRRRCCRRRRRRVGCCRRRYTRRRRRR 51
Score = 27.6 bits (61), Expect = 9.0
Identities = 18/49 (36%), Positives = 18/49 (36%)
Query: 365 RRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDR 413
R RS R R R R R R R R R R G R R R R
Sbjct: 2 RYRCCRSRSRSRCRRRRRRRCRRRRRRCCRRRRRRVGCCRRRYTRRRRR 50
>gnl|CDD|177283 PHA00451, PHA00451, protein kinase.
Length = 362
Score = 31.3 bits (71), Expect = 4.6
Identities = 18/76 (23%), Positives = 27/76 (35%), Gaps = 6/76 (7%)
Query: 381 RERDYDRVRERDTDRSRDDRGR-DRDRDRLERDRSREREREKRHHRRDRSKERDGKDRRE 439
+R +R + R RD G R + R R R+ R+ + + + R
Sbjct: 242 NQRMIERCKNRKA--KRDPNGTFQVARKAAMKRRKRNRKLRARNAKELAAMRMEANQIR- 298
Query: 440 GYRRERREEEASGSKH 455
R E R G KH
Sbjct: 299 --RNEPRARMLMGDKH 312
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 30.7 bits (70), Expect = 4.7
Identities = 19/100 (19%), Positives = 44/100 (44%), Gaps = 13/100 (13%)
Query: 507 KFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREK 566
++L+ A E L + Q E+ E+++++ E +++ E +E + + +
Sbjct: 56 RYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQ--------ELKQELSTLE 107
Query: 567 KKEDPEEKELNKDKEREGEAIKERYLGLVKKKRRVRRLND 606
+ + +KEL + K+ AI L ++ R +R
Sbjct: 108 AELERLQKELARIKQLSANAI-----ELDEENRELREELA 142
>gnl|CDD|146016 pfam03179, V-ATPase_G, Vacuolar (H+)-ATPase G subunit. This family
represents the eukaryotic vacuolar (H+)-ATPase
(V-ATPase) G subunit. V-ATPases generate an acidic
environment in several intracellular compartments.
Correspondingly, they are found as membrane-attached
proteins in several organelles. They are also found in
the plasma membranes of some specialised cells.
V-ATPases consist of peripheral (V1) and membrane
integral (V0) heteromultimeric complexes. The G subunit
is part of the V1 subunit, but is also thought to be
strongly attached to the V0 complex. It may be involved
in the coupling of ATP degradation to H+ translocation.
Length = 105
Score = 29.5 bits (67), Expect = 4.9
Identities = 28/105 (26%), Positives = 45/105 (42%), Gaps = 20/105 (19%)
Query: 497 KAEEEARSKPKFLTKEERAAEALRKRQA------EVEEMRKKMEEERKKRQEFTKEASFE 550
+AE+EA E R A R +QA E+EE R + E E K+ EA
Sbjct: 11 QAEKEAAEI----VNEARKRRAKRLKQAKEEAEKEIEEYRAQREAEFKE-----FEAEHS 61
Query: 551 SKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGLV 595
R + ++ ++ E+K + EL + + EA+ + L V
Sbjct: 62 GSRGELEKKIEKETEEKID-----ELKRSFNKNKEAVVQMLLSKV 101
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 31.6 bits (72), Expect = 4.9
Identities = 36/210 (17%), Positives = 65/210 (30%), Gaps = 26/210 (12%)
Query: 420 EKRHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKG 479
E+ + D K R KD + Y + +DKE I E E G
Sbjct: 754 EEIQYLEDLLKARLEKDYQSKYLDLIDNLNDAREI---KDKESKAIDLD-DIDFELELIG 809
Query: 480 AA-------------AKKEPLSLEELLAK-----KKAEEEARSKPKF---LTKEERAAEA 518
+ + E L K+ E+ R K K L + A
Sbjct: 810 KQEINIDYILELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRA 869
Query: 519 LRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNK 578
+ AE E + ++ ++ ++F +E + F L + K E+ E
Sbjct: 870 KKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEELAFYDDLALNGGKLPENGTELVEKL 929
Query: 579 DKEREGEAIKERYLGLVKKKR-RVRRLNDR 607
KE+ + ++ +++ R
Sbjct: 930 AKEKSLREKNKDDWKAKEEVEAKLKAFFRR 959
>gnl|CDD|237546 PRK13889, PRK13889, conjugal transfer relaxase TraA; Provisional.
Length = 988
Score = 31.6 bits (72), Expect = 4.9
Identities = 61/253 (24%), Positives = 86/253 (33%), Gaps = 44/253 (17%)
Query: 380 SRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSRERERE-----KRHHRRDRSKERDG 434
SR RD D RDD DRDRL R SR+R ++ +R ER G
Sbjct: 691 SRHRD-----GVDLHYGRDDFA---DRDRLVRTLSRDRAKDMASDYERADPAQSYAERRG 742
Query: 435 ---KDRREGYRRERREEEASG---SKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLS 488
++R R+ E+ G G E + A A+
Sbjct: 743 ITFRERVAEIVRKIVPEKLRGMFDGLRLPDPVPGPEAGRRPERESAAATTDAPARTVAAD 802
Query: 489 LEELL--AKKKA-EEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEER-------- 537
E L A+ +A AR+ +E+ L + E++E RK EE R
Sbjct: 803 PEAALRQARTRALVRHARAVDAIFRMQEQGGPVLPHQVKELQEARKAFEEVRPYGSHDAE 862
Query: 538 ---KKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAIKERYLGL 594
KK E EA+ R R + E EL D R + ER+ L
Sbjct: 863 AAYKKNPELAAEAA--------SGRPARAIRALQ---LETELRTDPARRADRFVERWQKL 911
Query: 595 VKKKRRVRRLNDR 607
+ +R + D
Sbjct: 912 DRASQRQYQAGDM 924
>gnl|CDD|111032 pfam02090, SPAM, Salmonella surface presentation of antigen gene
type M protein.
Length = 147
Score = 30.4 bits (68), Expect = 5.0
Identities = 28/126 (22%), Positives = 68/126 (53%), Gaps = 17/126 (13%)
Query: 467 EQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEE-----RAAEALRK 521
E +++ ++ED+ A++E + +E++ K + R++ + L++EE R +R+
Sbjct: 22 ESILLRYQDEDRRLQAEEEAI-VEQIAGLKLLLDTLRAENRQLSREEIYSLLRKQSIVRR 80
Query: 522 R----QAEVEEMRKKMEEERKKRQEFTKEASFESKREN-------FDARLRRDREKKKED 570
+ + ++ ++++K E KKR+EF +++ + ++E RL RE ++E+
Sbjct: 81 QIKDLELQITQIQEKRCELEKKREEFQEKSKYWLRKEGNYQRWIIRQKRLYIQREIQQEE 140
Query: 571 PEEKEL 576
E +E+
Sbjct: 141 AESEEI 146
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 30.3 bits (69), Expect = 5.2
Identities = 16/75 (21%), Positives = 43/75 (57%), Gaps = 7/75 (9%)
Query: 476 EDKGAAAKKEPLSLEELLAK-KKAEEEARSKPKFLTKEERAA--EALRKRQAEVEEMRKK 532
E + + E L++ + +K E++ + + L++E R A + L+++Q E+++ ++
Sbjct: 41 EKEFKKLQAE---LQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQA 97
Query: 533 MEEE-RKKRQEFTKE 546
++E ++K+QE +
Sbjct: 98 AQQELQQKQQELLQP 112
>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
Length = 159
Score = 30.4 bits (69), Expect = 5.2
Identities = 18/74 (24%), Positives = 36/74 (48%)
Query: 517 EALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKEL 576
+ RQ+E++ +K +E++KK +E + E K + + + K K + +E+
Sbjct: 32 AVIDSRQSEIDNKIEKADEDQKKARELKLKNERELKNAKEEGKKIVEEYKSKAENVYEEI 91
Query: 577 NKDKEREGEAIKER 590
K+ E + I ER
Sbjct: 92 VKEAHEEADLIIER 105
>gnl|CDD|233032 TIGR00580, mfd, transcription-repair coupling factor (mfd). All
proteins in this family for which functions are known are
DNA-dependent ATPases that function in the process of
transcription-coupled DNA repair in which the repair of
the transcribed strand of actively transcribed genes is
repaired at a higher rate than the repair of
non-transcribed regions of the genome and than the
non-transcribed strand of the same gene. This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University). This family is
closely related to the RecG and UvrB families [DNA
metabolism, DNA replication, recombination, and repair].
Length = 926
Score = 31.6 bits (72), Expect = 5.3
Identities = 21/75 (28%), Positives = 32/75 (42%), Gaps = 3/75 (4%)
Query: 1028 CTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVI--NYDMAKSIEDYT 1085
HG + + E + G +LV T + GIDI + + +I D + Y
Sbjct: 690 AIAHGQMTENELEEVMLEFYKGEFQVLVCTTIIETGIDIPNANTIIIERADKFGLAQLYQ 749
Query: 1086 HRIGRTGRAGKEGLA 1100
R GR GR+ K+ A
Sbjct: 750 LR-GRVGRSKKKAYA 763
Score = 31.6 bits (72), Expect = 5.3
Identities = 21/75 (28%), Positives = 32/75 (42%), Gaps = 3/75 (4%)
Query: 1703 CTLHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVI--NYDMAKSIEDYT 1760
HG + + E + G +LV T + GIDI + + +I D + Y
Sbjct: 690 AIAHGQMTENELEEVMLEFYKGEFQVLVCTTIIETGIDIPNANTIIIERADKFGLAQLYQ 749
Query: 1761 HRIGRTGRAGKEGLA 1775
R GR GR+ K+ A
Sbjct: 750 LR-GRVGRSKKKAYA 763
>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
Reviewed.
Length = 830
Score = 31.7 bits (72), Expect = 5.5
Identities = 16/72 (22%), Positives = 29/72 (40%), Gaps = 16/72 (22%)
Query: 361 MAGD------RRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRS 414
MAG+ R+ + P +++ SR R RD D + D D + + +
Sbjct: 1 MAGNDREPIGRKGKPSRPVKQKVSRRRYRDDDDYDDYD----------DYEDEEPMPRKG 50
Query: 415 REREREKRHHRR 426
+ + R+ R R
Sbjct: 51 KGKGRKPRGKRG 62
>gnl|CDD|165391 PHA03118, PHA03118, multifunctional expression regulator;
Provisional.
Length = 474
Score = 31.2 bits (70), Expect = 5.5
Identities = 20/109 (18%), Positives = 44/109 (40%), Gaps = 6/109 (5%)
Query: 1163 PELLNHPDAQHKPGTVMMAGDRRSRSRSPARKRRSRSRDRDYDRRFKRKKSPQCEAQSSR 1222
++ +H +PG + R +S ++ R + RD + R P +
Sbjct: 110 DDINHHGGRDTEPGHAHIENGERKSPKSYNQQSRKKHRDESLRNKHGRPSGPPAMSPGEH 169
Query: 1223 FSACSLPRSHKSSSLLSRYSEQDPEEKELNKDKEREGEAIKERYLGLVK 1271
F ++H + L R++E+D + K+ + + I E+ + +K
Sbjct: 170 FD-----QTHDAEYRL-RFNERDARRDRIRKEYDIPTDKITEQAIETLK 212
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 30.9 bits (70), Expect = 5.6
Identities = 30/133 (22%), Positives = 58/133 (43%), Gaps = 11/133 (8%)
Query: 439 EGYRRERREEEASGSKHKS---RDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLAK 495
E RE++E E + + R ++ E E+ + EE+ K + + L +
Sbjct: 1 EEAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKA---- 56
Query: 496 KKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKREN 555
+ EEE R + E E L E K+EEER+K++ T++ ++E
Sbjct: 57 DELEEENRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQL----QQEL 112
Query: 556 FDARLRRDREKKK 568
+A+ +R +++
Sbjct: 113 REAQEAHERARQE 125
>gnl|CDD|216860 pfam02063, MARCKS, MARCKS family.
Length = 296
Score = 31.0 bits (69), Expect = 6.0
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 6/120 (5%)
Query: 435 KDRREGYRRERREEEASGSKHKSRDKEGYEPTEQMVISDEEEDKGAAAKKEPLSLEELLA 494
K +R +++ + S K+K EG E + +++ AAA E S EE A
Sbjct: 136 KKKRFSFKKSFKLSGFSFKKNKKEAGEGAEAEGAAAEKEGAKEEAAAAAPEAGSGEE--A 193
Query: 495 KKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRE 554
EE + + EE AA+A + E E + + K + E K A + E
Sbjct: 194 AAPGEEAGAAGAEGEAGEEPAADA----EPEQPEAKPEEAAPEKPQAEEAKAAEEQKAEE 249
>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
Toc86/159, G and M domains. The long precursor of the
86K protein originally described is proposed to have
three domains. The N-terminal A-domain is acidic,
repetitive, weakly conserved, readily removed by
proteolysis during chloroplast isolation, and not
required for protein translocation. The other domains
are designated G (GTPase) and M (membrane anchor); this
family includes most of the G domain and all of M
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 763
Score = 31.5 bits (71), Expect = 6.1
Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 12/84 (14%)
Query: 476 EDKGAAAKKEPLSLEELL--AKKKAEEEARSKPKF--LTKEERAAEALRKRQAEVEE--- 528
E +G + + LE+ ++ E+E P F LTK + A + +R+A +EE
Sbjct: 379 EQQGGDEEDSDIELEDSSDSDEESGEDEYDQLPPFKPLTKAQMAKLSKEQRKAYLEEYDY 438
Query: 529 -----MRKKMEEERKKRQEFTKEA 547
+K+ EE K+ + K
Sbjct: 439 RVKLLQKKQWREELKRMKMMKKFG 462
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 31.2 bits (69), Expect = 6.1
Identities = 31/117 (26%), Positives = 55/117 (47%), Gaps = 6/117 (5%)
Query: 464 EPTEQMVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQ 523
E + ++++ + K E ++ +AK E+ + K KE R AEA K++
Sbjct: 53 EAFASLELTEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKR 112
Query: 524 AEVEEMRK--KMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNK 578
A EE K K E+ER ++++ KEA + A ++++ K E E+ E K
Sbjct: 113 AAQEEEHKQWKAEQERIQKEQEKKEADL----KKLQAEKKKEKAVKAEKAEKAEKTK 165
>gnl|CDD|237478 PRK13709, PRK13709, conjugal transfer nickase/helicase TraI;
Provisional.
Length = 1747
Score = 31.3 bits (71), Expect = 6.5
Identities = 29/117 (24%), Positives = 49/117 (41%), Gaps = 23/117 (19%)
Query: 481 AAKKEPLSLEELLAKKKAEEEARSKPKFLTKEERAAEALR------------KRQAEVEE 528
A EP++ E+LA+++AEE R + E RA E +R K + V +
Sbjct: 1633 AGNGEPVT-AEVLAQRQAEEAIRRET-----ERRADEIVRKMAENKPDLPDGKTEQAVRD 1686
Query: 529 MRKKMEEERKKRQEFTKEASFESKRENFDARLRRD-REKKKEDPEEKELNKDKEREG 584
+ + ER + +EA+ R R RE +E+ + L + ER+
Sbjct: 1687 IAGQ---ERDRAAISEREAALPESVLREPQREREAVREVARENLLRERLQQM-ERDM 1739
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 31.1 bits (71), Expect = 6.6
Identities = 40/225 (17%), Positives = 81/225 (36%), Gaps = 41/225 (18%)
Query: 393 TDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDGKDRREGY--RRERREEEA 450
++ +DD RD D + + H E D +++ E ++E +E
Sbjct: 124 SEIDKDDDVRDDDDFDDDDLGDLASDDRAAHFGGGEDDEEDEEEQPERKKSKKEVMKEVI 183
Query: 451 SGSKHKSRDK-----EGYEPTEQMVISDEE------EDKGAAAKKEPLSLEELLAK---K 496
+ SK ++ E + E++ D++ + +P E +
Sbjct: 184 AKSKFYKAERQKAKEEDEDLREEL---DDDFKDLMSLLRTVKPPPKPPMTPEEKDDEYDQ 240
Query: 497 KAEE---EARSKP--KFLTKEERAAEALRKRQAEVEEMRKKMEEERKKR---------QE 542
+ E + R++P + T+EE A E E KK+E ER +R +E
Sbjct: 241 RVRELTFDRRAQPTDRTKTEEELAKEE--------AERLKKLEAERLRRMRGEEEDDEEE 292
Query: 543 FTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKEREGEAI 587
+ S + + F+ + + + +E+E + E E
Sbjct: 293 EDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEED 337
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 29.6 bits (67), Expect = 6.6
Identities = 24/92 (26%), Positives = 38/92 (41%), Gaps = 5/92 (5%)
Query: 512 EERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKR--ENFDARLRRDREKKKE 569
+E A E R+ + E RK++ EERK+ E + E+ + E + + E
Sbjct: 39 QEEAKEKEREERIE---ERKRIREERKQELEKQLKERKEALKLLEEENDDEEDAETEDTE 95
Query: 570 DPEEKELNKDKEREGEAIKERYLGLVKKKRRV 601
D E+ E E +E Y+ K K V
Sbjct: 96 DVEDDEWEGFPEPTVTDYEEEYIDEDKYKTTV 127
>gnl|CDD|227508 COG5181, HSH155, U2 snRNP spliceosome subunit [RNA processing and
modification].
Length = 975
Score = 31.1 bits (70), Expect = 6.6
Identities = 22/103 (21%), Positives = 40/103 (38%), Gaps = 5/103 (4%)
Query: 375 KRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDG 434
+ R+ SR V + + + +D +++ + RE R D + D
Sbjct: 25 EDRNDSR----GNVSQWEATPRTANESKDLKQEQTPLGYIQVREDLYSKTRMDPGFQPDS 80
Query: 435 KDRREGYRRERREEEASGSKHKSR-DKEGYEPTEQMVISDEEE 476
+RE + E SK SR D+ GYEP +++ +
Sbjct: 81 DKKRELELNNTWDMEYPDSKRSSRWDEMGYEPPQEINMCLPAR 123
>gnl|CDD|119440 cd06578, HemD, Uroporphyrinogen-III synthase (HemD) catalyzes the
asymmetrical cyclization of tetrapyrrole (linear) to
uroporphyrinogen-III, the fourth step in the biosynthesis
of heme. This ubiquitous enzyme is present in eukaryotes,
bacteria and archaea. Mutations in the human
uroporphyrinogen-III synthase gene cause congenital
erythropoietic porphyria, a recessive inborn error of
metabolism also known as Gunther disease.
Length = 239
Score = 30.7 bits (70), Expect = 6.7
Identities = 33/154 (21%), Positives = 56/154 (36%), Gaps = 27/154 (17%)
Query: 886 DEADRMIDM----GFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMF 941
+AD + + G E ++E P D + + L Y ++F
Sbjct: 8 PQADELAALLEALGAEVLELPLIEIEP-----LDDAELDAALADLDEY-------DWLIF 55
Query: 942 TATMPPAVE----RLARSYLRRPATVYIGSVG-KPTERIEQI----VYILSEQDKRKKLM 992
T+ P AVE L LR A + I +VG K E + + ++ E D L
Sbjct: 56 TS--PNAVEAFFEALEELGLRALAGLKIAAVGPKTAEALREAGLTADFVPEEGDSEGLLE 113
Query: 993 EVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYN 1026
+ + K I+ + + LA+ L + G
Sbjct: 114 LLELQDGKGKRILRPRGGRAREDLAEALRERGAE 147
Score = 30.7 bits (70), Expect = 6.7
Identities = 33/154 (21%), Positives = 56/154 (36%), Gaps = 27/154 (17%)
Query: 1561 DEADRMIDM----GFEPDVQKILEYMPVTNLKPDTEDAEDENKLLANYNSKKKYRQTVMF 1616
+AD + + G E ++E P D + + L Y ++F
Sbjct: 8 PQADELAALLEALGAEVLELPLIEIEP-----LDDAELDAALADLDEY-------DWLIF 55
Query: 1617 TATMPPAVE----RLARSYLRRPATVYIGSVG-KPTERIEQI----VYILSEQDKRKKLM 1667
T+ P AVE L LR A + I +VG K E + + ++ E D L
Sbjct: 56 TS--PNAVEAFFEALEELGLRALAGLKIAAVGPKTAEALREAGLTADFVPEEGDSEGLLE 113
Query: 1668 EVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYN 1701
+ + K I+ + + LA+ L + G
Sbjct: 114 LLELQDGKGKRILRPRGGRAREDLAEALRERGAE 147
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 31.2 bits (70), Expect = 6.8
Identities = 40/255 (15%), Positives = 89/255 (34%), Gaps = 25/255 (9%)
Query: 375 KRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREKRHHRRDRSKERDG 434
KR ++ + + Y ++R+ ++ + D + + + +R ER + H+ S R
Sbjct: 393 KRVAKIKSKTYRKIRKNRKEKEMALIPKSEDLENEKSEEARALERMTQRHKNTSSWTRKM 452
Query: 435 KDRR---EGYRRERREEEASGSKHKSR--------DKEGYEPTEQMVISDEEEDKGAAAK 483
+R EG R E+ G + R ++ E ++ ++E+
Sbjct: 453 LERASHGEGTREAVNEQIRKGDELMQRIHGKEIMDGEDVSEFSDSDYDTNEQVSTAFEKI 512
Query: 484 KEPLSLEELLA---KKKAEEEARSKPKFLTKEERAAE-ALRKRQAEVEEMRKKMEEERKK 539
+ L+ +L + A + K + E E++E+
Sbjct: 513 RNEEELKGVLGMKFMRDASNRQMAASKISVADLVKVENGDDIDVGELDEVGGDAIYANAG 572
Query: 540 RQEFT------KEASFESKRENFDARLR----RDREKKKEDPEEKELNKDKEREGEAIKE 589
R+E ++ + ++E+F + E E+ EA +
Sbjct: 573 RREVFPVVEQRRKLAPRKRKEDFVTPSTSLEKSMDRILHGQKKRAEGAVVFEKPLEATEN 632
Query: 590 RYLGLVKKKRRVRRL 604
L +K RR++R+
Sbjct: 633 FNPWLDRKMRRIKRI 647
>gnl|CDD|220774 pfam10477, EIF4E-T, Nucleocytoplasmic shuttling protein for mRNA
cap-binding EIF4E. EIF4E-T is the transporter protein
for shuttling the mRNA cap-binding protein EIF4E
protein, targeting it for nuclear import. EIF4E-T
contains several key binding domains including two
functional leucine-rich NESs (nuclear export signals)
between residues 438-447 and 613-638 in the human
protein. The other two binding domains are an
EIF4E-binding site, between residues 27-42 in Q9EST3,
and a bipartite NLS (nuclear localisation signals)
between 194-211, and these lie in family EIF4E-T_N.
EIF4E is the eukaryotic translation initiation factor 4E
that is the rate-limiting factor for cap-dependent
translation initiation.
Length = 520
Score = 31.1 bits (69), Expect = 6.8
Identities = 20/57 (35%), Positives = 27/57 (47%), Gaps = 3/57 (5%)
Query: 395 RSRDDRGRDRDRDRLERDRSREREREKRHHRRDR--SKERDGKDRREGYRRERREEE 449
RD RG +DR+ ER R RER+ + + RR+ SK + RR E E
Sbjct: 149 FERDFRGPRKDRNP-ERSRDRERDYKDKRFRREFGDSKRVFSESRRNDSYTIEEEPE 204
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 28.9 bits (65), Expect = 7.0
Identities = 15/57 (26%), Positives = 22/57 (38%), Gaps = 1/57 (1%)
Query: 382 ERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSRE-REREKRHHRRDRSKERDGKDR 437
E + D D D D+ + DR DR RE E E + ++R G +
Sbjct: 17 EEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAEELAEYLRKRYGDEA 73
Score = 28.6 bits (64), Expect = 8.5
Identities = 20/68 (29%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 524 AEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKERE 583
AEV++ ++ EEE ++ + E F + E D R R ++++ EKE +D E
Sbjct: 6 AEVDDEEEEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRR----EKEEEEDAEEL 61
Query: 584 GEAIKERY 591
E +++RY
Sbjct: 62 AEYLRKRY 69
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 30.8 bits (70), Expect = 7.5
Identities = 18/57 (31%), Positives = 26/57 (45%), Gaps = 2/57 (3%)
Query: 490 EELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERKKRQEFTKE 546
EEL K+ E+E RA E L+K E+ KK+EE ++ QE +
Sbjct: 186 EELRQLKQLEDELEDCDPTELD--RAKEKLKKLLQEIMIKVKKLEELEEELQELESK 240
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 30.0 bits (67), Expect = 7.6
Identities = 21/70 (30%), Positives = 39/70 (55%), Gaps = 5/70 (7%)
Query: 517 EALRKRQAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKEL 576
E + K + E EE+ ++ME ER++ + +E E +RE R + E++ +D E + +
Sbjct: 126 EQIEKARQEREELEERMEWERREEKIDEREDQEEQERE----REEQTIEEQSDDSEHEII 181
Query: 577 NKDKEREGEA 586
+D E E E+
Sbjct: 182 EQD-ESETES 190
>gnl|CDD|226947 COG4581, COG4581, Superfamily II RNA helicase [DNA replication,
recombination, and repair].
Length = 1041
Score = 30.8 bits (70), Expect = 8.2
Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 1/48 (2%)
Query: 733 ASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTL 780
A P ++ + + E P Q++AI I + ++ A T SGKT+
Sbjct: 103 AESPFDLAPPAREYPF-ELDPFQQEAIAILERGESVLVCAPTSSGKTV 149
Score = 30.8 bits (70), Expect = 8.2
Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 1/48 (2%)
Query: 1408 ASLPTEILEIIEKIGYAEPTPIQRQAIPIGLQNRDIIGVAETGSGKTL 1455
A P ++ + + E P Q++AI I + ++ A T SGKT+
Sbjct: 103 AESPFDLAPPAREYPF-ELDPFQQEAIAILERGESVLVCAPTSSGKTV 149
>gnl|CDD|236394 PRK09169, PRK09169, hypothetical protein; Validated.
Length = 2316
Score = 30.8 bits (70), Expect = 8.3
Identities = 22/99 (22%), Positives = 34/99 (34%), Gaps = 6/99 (6%)
Query: 362 AGDRRSRSRSPPRKRRSRSRERDYDRVRERDTDRSRDDRGRDRDRDRLERDRSREREREK 421
A +R R + P R R R R D R D R R R DR E
Sbjct: 6 APHKRRRDAAAPADPRPRRRPRLGDAPAPR--TARADSGATPRGRPRAGADREPTSE--- 60
Query: 422 RHHRRDRSKERDGKDRREGYRRERREEEASGSKHKSRDK 460
+ +R +R + +RE++ ++R +
Sbjct: 61 QLRDYERWLDRAAA-GQLDAQREQQCARLWFLVQQARAR 98
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 29.6 bits (67), Expect = 8.3
Identities = 29/121 (23%), Positives = 57/121 (47%), Gaps = 8/121 (6%)
Query: 480 AAAKKEPLSLEELLAKKKAE-EEARSKPKFLTKEERAAEALRKRQAEVEEMRKKMEEERK 538
A A++ + LLA+ + E EEAR + E +A ++ + EE++ + EEE +
Sbjct: 47 AEAERLKEEAQALLAEYEQELEEAREQ-----ASEIIEQAKKEAEQIAEEIKAEAEEELE 101
Query: 539 KRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDK-EREGEAIKERYLGLVKK 597
+ +E EA E+++E LR + + EK L K E + + + ++ + +
Sbjct: 102 RIKE-AAEAEIEAEKERALEELRAEVAELAVAIAEKLLGKKVDEAAQKDLIDAFIAELGE 160
Query: 598 K 598
Sbjct: 161 N 161
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 30.4 bits (69), Expect = 8.6
Identities = 38/150 (25%), Positives = 60/150 (40%), Gaps = 31/150 (20%)
Query: 469 MVISDEEEDKGAAAKKEPLSLEELLAKKKAEEEARSKPKFLTKEER------AAEALRKR 522
MV SDE K +K+ L A KKA EEAR + +E + E L++R
Sbjct: 110 MVKSDEHMQK---VRKK---LLAEAAAKKASEEARKQ-----RELKKFGKQVQVEKLQER 158
Query: 523 QAEVEEMRKKMEEERKKRQEFTKEASFESKRENFDARLRRDREKKKEDPEEKELNKDKER 582
E ++M +K+++ +KKR+ E FD + + K ++K R
Sbjct: 159 AKEKKDMLEKIKKLKKKRKGGGDEGD------LFDFLV---DDAAKGGSKKKGRKGGAAR 209
Query: 583 EGEAIKERY----LGLVKKKRRVRRLNDRK 608
K + G KKR + N +
Sbjct: 210 GKPNAKRKAKDAKFGFGGKKRGSKS-NTAE 238
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 30.4 bits (69), Expect = 8.6
Identities = 22/75 (29%), Positives = 33/75 (44%), Gaps = 7/75 (9%)
Query: 466 TEQMVISDEEEDKGAAAKKEP-LSLEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQA 524
TE++ + E L+ EE KKK E+ R K K L + E+
Sbjct: 289 TEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQ------NLIE 342
Query: 525 EVEEMRKKMEEERKK 539
E +E+ K+EE +KK
Sbjct: 343 ERKELNSKLEEIQKK 357
>gnl|CDD|163636 cd07393, MPP_DR1119, Deinococcus radiodurans DR1119 and related
proteins, metallophosphatase domain. DR1119 is an
uncharacterized Deinococcus radiodurans protein with a
metallophosphatase domain. The domain present in members
of this family belongs to the metallophosphatase (MPP)
superfamily. MPPs are functionally diverse, but all
share a conserved domain with an active site consisting
of two metal ions (usually manganese, iron, or zinc)
coordinated with octahedral geometry by a cage of
histidine, aspartate, and asparagine residues. The MPP
superfamily includes: Mre11/SbcD-like exonucleases,
Dbr1-like RNA lariat debranching enzymes, YfcE-like
phosphodiesterases, purple acid phosphatases (PAPs),
YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid
sphingomyelinases (ASMases). The conserved domain is a
double beta-sheet sandwich with a di-metal active site
made up of residues located at the C-terminal side of the
sheets. This domain is thought to allow for productive
metal coordination.
Length = 232
Score = 30.0 bits (68), Expect = 8.8
Identities = 23/98 (23%), Positives = 41/98 (41%), Gaps = 22/98 (22%)
Query: 982 LSEQDKRKKLMEV--LNRGVK--------KPVIIFV-----NQKKGADVLAKGLEKLGYN 1026
E+D++ E+ L +K K I+ + N+ ++K +E+ G +
Sbjct: 136 KVEEDEKIFERELERLELSLKAAKKREKEKIKIVMLHYPPANENGDDSPISKLIEEYGVD 195
Query: 1027 ACT---LHGGKGQEQRELALNSLKGGSKDILVATDVAG 1061
C LHG R+ A+N +GG + LV+ D
Sbjct: 196 ICVYGHLHGV----GRDRAINGERGGIRYQLVSADYLN 229
Score = 30.0 bits (68), Expect = 8.8
Identities = 23/98 (23%), Positives = 41/98 (41%), Gaps = 22/98 (22%)
Query: 1657 LSEQDKRKKLMEV--LNRGVK--------KPVIIFV-----NQKKGADVLAKGLEKLGYN 1701
E+D++ E+ L +K K I+ + N+ ++K +E+ G +
Sbjct: 136 KVEEDEKIFERELERLELSLKAAKKREKEKIKIVMLHYPPANENGDDSPISKLIEEYGVD 195
Query: 1702 ACT---LHGGKGQEQRELALNSLKGGSKDILVATDVAG 1736
C LHG R+ A+N +GG + LV+ D
Sbjct: 196 ICVYGHLHGV----GRDRAINGERGGIRYQLVSADYLN 229
>gnl|CDD|233063 TIGR00631, uvrb, excinuclease ABC, B subunit. All proteins in this
family for wich functions are known are DNA helicases
that function in the nucleotide excision repair and are
endonucleases that make the 3' incision next to DNA
damage. They are part of a pathway requiring UvrA, UvrB,
UvrC, and UvrD homologs. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 655
Score = 30.7 bits (70), Expect = 8.9
Identities = 35/127 (27%), Positives = 56/127 (44%), Gaps = 12/127 (9%)
Query: 970 KPTERIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACT 1029
+PT+ Q+ +LSE +R E V++ KK A+ L L++LG
Sbjct: 422 RPTDG--QVDDLLSEIRQRVARNE--------RVLVTTLTKKMAEDLTDYLKELGIKVRY 471
Query: 1030 LHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKS--IEDYTHR 1087
LH +R + L+ G D+LV ++ G+D+ +VS+V D K +
Sbjct: 472 LHSEIDTLERVEIIRDLRLGEFDVLVGINLLREGLDLPEVSLVAILDADKEGFLRSERSL 531
Query: 1088 IGRTGRA 1094
I GRA
Sbjct: 532 IQTIGRA 538
Score = 30.7 bits (70), Expect = 8.9
Identities = 35/127 (27%), Positives = 56/127 (44%), Gaps = 12/127 (9%)
Query: 1645 KPTERIEQIVYILSEQDKRKKLMEVLNRGVKKPVIIFVNQKKGADVLAKGLEKLGYNACT 1704
+PT+ Q+ +LSE +R E V++ KK A+ L L++LG
Sbjct: 422 RPTDG--QVDDLLSEIRQRVARNE--------RVLVTTLTKKMAEDLTDYLKELGIKVRY 471
Query: 1705 LHGGKGQEQRELALNSLKGGSKDILVATDVAGRGIDIKDVSMVINYDMAKS--IEDYTHR 1762
LH +R + L+ G D+LV ++ G+D+ +VS+V D K +
Sbjct: 472 LHSEIDTLERVEIIRDLRLGEFDVLVGINLLREGLDLPEVSLVAILDADKEGFLRSERSL 531
Query: 1763 IGRTGRA 1769
I GRA
Sbjct: 532 IQTIGRA 538
>gnl|CDD|153308 cd07624, BAR_SNX7_30, The Bin/Amphiphysin/Rvs (BAR) domain of
Sorting Nexins 7 and 30. BAR domains are dimerization,
lipid binding and curvature sensing modules found in
many different proteins with diverse functions. Sorting
nexins (SNXs) are Phox homology (PX) domain containing
proteins that are involved in regulating membrane
traffic and protein sorting in the endosomal system.
SNXs differ from each other in their lipid-binding
specificity, subcellular localization and specific
function in the endocytic pathway. A subset of SNXs also
contain BAR domains. The PX-BAR structural unit
determines the specific membrane targeting of SNXs. This
subfamily consists of SNX7, SNX30, and similar proteins.
The specific functions of SNX7 and SNX30 have not been
elucidated. BAR domains form dimers that bind to
membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 200
Score = 30.0 bits (68), Expect = 9.6
Identities = 31/117 (26%), Positives = 50/117 (42%), Gaps = 12/117 (10%)
Query: 463 YEPTEQMVISDEEE------DKGAAAKKEPLSLEELLAKKKAEEEARSKP--KFLTKEER 514
Y P Q+ + E E +A ++ +LE LL E P ++L +
Sbjct: 54 YSPIFQLWSASETELAPLLEGVSSAVERCTAALEVLL---SDHEFVFLPPLREYLLYSDA 110
Query: 515 AAEALRKRQAEVEEMRKKMEEERKKRQEFTKEAS-FESKRENFDARLRRDREKKKED 570
+ L++R E +EE KKR E KE + K E +A L+ D E+ K++
Sbjct: 111 VKDVLKRRDQFQIEYELSVEELNKKRLELLKEVEKLQDKLECANADLKADLERWKQN 167
>gnl|CDD|213783 TIGR03181, PDH_E1_alph_x, pyruvate dehydrogenase E1 component,
alpha subunit. Members of this protein family are the
alpha subunit of the E1 component of pyruvate
dehydrogenase (PDH). This model represents one branch of
a larger family that E1-alpha proteins from
2-oxoisovalerate dehydrogenase, acetoin dehydrogenase,
another PDH clade, etc [Energy metabolism, Pyruvate
dehydrogenase].
Length = 341
Score = 30.2 bits (69), Expect = 9.6
Identities = 15/64 (23%), Positives = 26/64 (40%), Gaps = 13/64 (20%)
Query: 473 DEEEDKGAAAKKEPLS-LEELLAKKKAEEEARSKPKFLTKEERAAEALRKRQAEVEEMRK 531
+EE++ KK+P+ L + L +K L EE+ + +AEV E
Sbjct: 262 TKEEEE-EWRKKDPILRLRKYLERKG-----------LWDEEQEEALEEEAEAEVAEAVA 309
Query: 532 KMEE 535
+
Sbjct: 310 EALA 313
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.134 0.380
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 97,199,281
Number of extensions: 10255920
Number of successful extensions: 21564
Number of sequences better than 10.0: 1
Number of HSP's gapped: 17596
Number of HSP's successfully gapped: 1260
Length of query: 1832
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1721
Effective length of database: 6,014,308
Effective search space: 10350624068
Effective search space used: 10350624068
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 66 (29.3 bits)