RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1907
(1320 letters)
>gnl|CDD|224557 COG1643, HrpA, HrpA-like helicases [DNA replication, recombination,
and repair].
Length = 845
Score = 332 bits (853), Expect = 1e-97
Identities = 125/231 (54%), Positives = 156/231 (67%), Gaps = 7/231 (3%)
Query: 1 MTDGTLHREFLSEPDLASYSVMIIDEAHERTLHTDILFGLVKDIVRF-RSDLKLLISSAT 59
MTDG L RE ++P L+ YSV+IIDEAHER+L+TDIL GL+KD++ R DLKL+I SAT
Sbjct: 145 MTDGILLREIQNDPLLSGYSVVIIDEAHERSLNTDILLGLLKDLLARRRDDLKLIIMSAT 204
Query: 60 LDAQKFSEFFDDAPVFRIPGRRFPVDIYYTKAPEADYI-DACVVSILQIHATQPLGDILV 118
LDA++FS +F +APV I GR +PV+I Y EADYI +V+ + IH + G ILV
Sbjct: 205 LDAERFSAYFGNAPVIEIEGRTYPVEIRYLPEAEADYILLDAIVAAVDIHLREGSGSILV 264
Query: 119 FLTGQEEIETCNEILQDRVRRLGSKIKELIILPVYSNLPTDMQTKIFLPTPPNARKVVLA 178
FL GQ EIE E L+ LG +L ILP+Y L + Q ++F P P RKVVLA
Sbjct: 265 FLPGQREIERTAEWLEKA--ELGD---DLEILPLYGALSAEEQVRVFEPAPGGKRKVVLA 319
Query: 179 TNIAETSLTIDNIIYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAG 229
TNIAETSLTI I YVID G K+ ++ RTG+ L PISKASA+QRAG
Sbjct: 320 TNIAETSLTIPGIRYVIDSGLAKEKRYDPRTGLTRLETEPISKASADQRAG 370
Score = 303 bits (778), Expect = 2e-87
Identities = 126/406 (31%), Positives = 178/406 (43%), Gaps = 54/406 (13%)
Query: 796 VYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAGRAGRVAAGKCFRLYTAWAYKHE 855
YVID G K+ ++ RTG+ L PISKASA+QRAGRAGR G C+RLY+ +
Sbjct: 333 RYVIDSGLAKEKRYDPRTGLTRLETEPISKASADQRAGRAGRTGPGICYRLYSEEDFL-A 391
Query: 856 LEDNTVPEIQRINLGNAVLMLKALGI-NDLIHFDFLDPPPHETLVLALEQLYALGALNHH 914
+ T+PEI R +L VL LK+LGI D+ F FLDPPP + AL L LGAL+
Sbjct: 392 FPEFTLPEILRTDLSGLVLQLKSLGIGQDIAPFPFLDPPPEAAIQAALTLLQELGALDDS 451
Query: 915 GDLTKLGRRMAEFPLDPMMGKMLLASEKYKCVEEIVTIASMLSVNGAI--FYRP-KDKII 971
G LT LG++M+ PLDP + +MLL + + C+ E TIASMLS F R K +
Sbjct: 452 GKLTPLGKQMSLLPLDPRLARMLLTAPEGGCLGEAATIASMLSEQDRESDFSRDVKLRKQ 511
Query: 972 HADTARKNFF------MLGGDHLVLLNIYNQW-----ADADFST-QWCYENFIQHRSMKR 1019
GDHL+LL + A ++ C +++ R
Sbjct: 512 RTAQDLLKRLKRRNAADPRGDHLLLLEAFPDRIARKRAKGEYLRANGCRAMLFPTKALSR 571
Query: 1020 ARDVREQLIGLVQRVE-------------------MELVSNPTETVNIRKAITAGYFYHI 1060
A + L+ + E + +IR A+ AG +I
Sbjct: 572 APWIIAALLVQTSALAGRILAAAEIDEDEWAAQHLPEHCYSEPIWDDIRGALAAGRKLNI 631
Query: 1061 AKLSKSGS-YKTAKHNQTVMIHPNS-SLFEELPRWVLYHELVFTTKEFMRQ--------- 1109
A+L G Y T N V HP+S L L W+ Y E + T K ++R+
Sbjct: 632 AQLQLDGRPYVTLSDNTPVFAHPSSVRLGLVLLEWIKYAEFLRTRKGYLREGRGERWPDV 691
Query: 1110 --VTEIESKWLLEVAP---HYYKPRELEDSTNKKMPKTLGKSRAEL 1150
+ E+ WL E + + + +P L +L
Sbjct: 692 QTLIELLKLWLKEQVKGLRGLDGLT--KAAMKQALPDLLPWDLTQL 735
Score = 207 bits (529), Expect = 4e-55
Identities = 76/137 (55%), Positives = 96/137 (70%), Gaps = 1/137 (0%)
Query: 657 KKKLNIEETKKSLPIYPFKQDLIDAVRDHQVLIIEGETGSGKTTQIPQYLHEAGFTKDKM 716
+I E + LP+ + +++ A+ +QV+II GETGSGKTTQ+PQ+L E G
Sbjct: 37 ANVPDILEYRSGLPVTAVRDEILKAIEQNQVVIIVGETGSGKTTQLPQFLLEEGL-GIAG 95
Query: 717 KIGCTQPRRVAAMSVSARVAEEMGVKLGCEVGYAIRFEDCTSERTIIKYMTDGTLHREFL 776
KIGCTQPRR+AA SV+ RVAEE+G KLG VGY+IRFE S RT IK MTDG L RE
Sbjct: 96 KIGCTQPRRLAARSVAERVAEELGEKLGETVGYSIRFESKVSPRTRIKVMTDGILLREIQ 155
Query: 777 SEPDLASYSVMIIDEAH 793
++P L+ YSV+IIDEAH
Sbjct: 156 NDPLLSGYSVVIIDEAH 172
Score = 89.7 bits (223), Expect = 5e-18
Identities = 44/214 (20%), Positives = 75/214 (35%), Gaps = 43/214 (20%)
Query: 1145 KSRAELGGDHLVLLNIYNQW-----ADADFST-QWCYENFIQHRSMKRARDVREQLIGLV 1198
++ A+ GDHL+LL + A ++ C +++ RA + L+
Sbjct: 524 RNAADPRGDHLLLLEAFPDRIARKRAKGEYLRANGCRAMLFPTKALSRAPWIIAALLVQT 583
Query: 1199 QRVE-------------------MELVSNPTETVNIRKAITAGYFYHIAKLSKSGS-YKT 1238
+ E + +IR A+ AG +IA+L G Y T
Sbjct: 584 SALAGRILAAAEIDEDEWAAQHLPEHCYSEPIWDDIRGALAAGRKLNIAQLQLDGRPYVT 643
Query: 1239 AKHNQTVMIHPNS-SLFEELPRWVLYHELVFTTKEFMRQ-----------VTEIESKWLL 1286
N V HP+S L L W+ Y E + T K ++R+ + E+ WL
Sbjct: 644 LSDNTPVFAHPSSVRLGLVLLEWIKYAEFLRTRKGYLREGRGERWPDVQTLIELLKLWLK 703
Query: 1287 EVAP---HYYKPRELEDSTNKKMPKTLGKSRAEL 1317
E + + + +P L +L
Sbjct: 704 EQVKGLRGLDGLT--KAAMKQALPDLLPWDLTQL 735
>gnl|CDD|233659 TIGR01967, DEAH_box_HrpA, ATP-dependent helicase HrpA. This model
represents HrpA, one of two related but uncharacterized
DEAH-box ATP-dependent helicases in many Proteobacteria
and a few high-GC Gram-positive bacteria. HrpA is about
1300 amino acids long, while its paralog HrpB, also
uncharacterized, is about 800 amino acids long. Related
characterized eukarotic proteins are RNA helicases
associated with pre-mRNA processing [Unknown function,
Enzymes of unknown specificity].
Length = 1283
Score = 209 bits (534), Expect = 9e-55
Identities = 121/369 (32%), Positives = 191/369 (51%), Gaps = 12/369 (3%)
Query: 797 YVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAGRAGRVAAGKCFRLYTAWAYKHEL 856
YVID G + + ++ RT ++ L + PIS+ASANQR GR GRVA G C RLY+ +
Sbjct: 351 YVIDTGTARISRYSYRTKVQRLPIEPISQASANQRKGRCGRVAPGICIRLYSEEDFNSRP 410
Query: 857 EDNTVPEIQRINLGNAVLMLKALGINDLIHFDFLDPPPHETL---VLALEQLYALGALNH 913
E T PEI R NL + +L + AL + D+ F F++ P + LE+L AL
Sbjct: 411 E-FTDPEILRTNLASVILQMLALRLGDIAAFPFIEAPDPRAIRDGFRLLEELGALDDDEA 469
Query: 914 HGDLTKLGRRMAEFPLDPMMGKMLLASEKYKCVEEIVTIASMLSVNGAIFYRPKDKIIHA 973
LT +GR++A+ P+DP + +MLL + + C++E++ IAS LS+ RP +K A
Sbjct: 470 EPQLTPIGRQLAQLPVDPRLARMLLEAHRLGCLQEVLIIASALSIQD-PRERPMEKQQAA 528
Query: 974 DTARKNFFMLGGDHLVLLNIYNQWAD---ADFSTQ---WCYENFIQHRSMKRARDVREQL 1027
D A F D L +N++ + A + Q C + ++ + ++ +D+ QL
Sbjct: 529 DQAHARFKDPRSDFLSRVNLWRHIEEQRQALSANQFRNACRKQYLNYLRVREWQDIYRQL 588
Query: 1028 IGLVQRVEMELVSNPTETVNIRKAITAGYFYHIAKLSKSGSYKTAKHNQTVMIHPNSSLF 1087
+V+ + ++L P + I KA+ +G I + Y A+ + I P S LF
Sbjct: 589 TQVVKELGLKLNEEPADYDAIHKALLSGLLSQIGMKDEKHEYDGAR-GRKFHIFPGSPLF 647
Query: 1088 EELPRWVLYHELVFTTKEFMRQVTEIESKWLLEVAPHYYKPRELEDSTNKKMPKTLGKSR 1147
++ P+WV+ ELV T+K + R V +IE +W+ VA H K E KK + + +
Sbjct: 648 KKPPKWVMAAELVETSKLYARLVAKIEPEWVEPVAGHLIKKNYFEPHWEKKRGQVMAYEK 707
Query: 1148 AELGGDHLV 1156
L G LV
Sbjct: 708 VTLYGLTLV 716
Score = 195 bits (498), Expect = 2e-50
Identities = 96/235 (40%), Positives = 148/235 (62%), Gaps = 15/235 (6%)
Query: 1 MTDGTLHREFLSEPDLASYSVMIIDEAHERTLHTDILFGLVKDIVRFRSDLKLLISSATL 60
MTDG L E + L+ Y +IIDEAHER+L+ D L G +K ++ R DLK++I+SAT+
Sbjct: 162 MTDGILLAETQQDRFLSRYDTIIIDEAHERSLNIDFLLGYLKQLLPRRPDLKIIITSATI 221
Query: 61 DAQKFSEFFDDAPVFRIPGRRFPVDIYY------TKAPEADYIDACVVSILQIHATQPLG 114
D ++FS F++AP+ + GR +PV++ Y + + D ++A + ++ ++ A P G
Sbjct: 222 DPERFSRHFNNAPIIEVSGRTYPVEVRYRPLVEEQEDDDLDQLEAILDAVDELFAEGP-G 280
Query: 115 DILVFLTGQEEIETCNEILQDRVRRLGSKIKELIILPVYSNLPTDMQTKIFLPTPPNARK 174
DIL+FL G+ EI EIL+ R ++ ILP+Y+ L Q ++F P + R+
Sbjct: 281 DILIFLPGEREIRDAAEILRKR------NLRHTEILPLYARLSNKEQQRVF--QPHSGRR 332
Query: 175 VVLATNIAETSLTIDNIIYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAG 229
+VLATN+AETSLT+ I YVID G + + ++ RT ++ L + PIS+ASANQR G
Sbjct: 333 IVLATNVAETSLTVPGIHYVIDTGTARISRYSYRTKVQRLPIEPISQASANQRKG 387
Score = 140 bits (355), Expect = 2e-33
Identities = 63/133 (47%), Positives = 89/133 (66%), Gaps = 1/133 (0%)
Query: 668 SLPIYPFKQDLIDAVRDHQVLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIGCTQPRRVA 727
+LP+ ++D+ +A+ ++QV+II GETGSGKTTQ+P+ E G + IG TQPRR+A
Sbjct: 65 NLPVSAKREDIAEAIAENQVVIIAGETGSGKTTQLPKICLELGRGSHGL-IGHTQPRRLA 123
Query: 728 AMSVSARVAEEMGVKLGCEVGYAIRFEDCTSERTIIKYMTDGTLHREFLSEPDLASYSVM 787
A +V+ R+AEE+G LG +VGY +RF D S T++K MTDG L E + L+ Y +
Sbjct: 124 ARTVAQRIAEELGTPLGEKVGYKVRFHDQVSSNTLVKLMTDGILLAETQQDRFLSRYDTI 183
Query: 788 IIDEAHPKVYVID 800
IIDEAH + ID
Sbjct: 184 IIDEAHERSLNID 196
Score = 76.7 bits (189), Expect = 6e-14
Identities = 39/133 (29%), Positives = 67/133 (50%), Gaps = 1/133 (0%)
Query: 1173 WCYENFIQHRSMKRARDVREQLIGLVQRVEMELVSNPTETVNIRKAITAGYFYHIAKLSK 1232
C + ++ + ++ +D+ QL +V+ + ++L P + I KA+ +G I +
Sbjct: 567 ACRKQYLNYLRVREWQDIYRQLTQVVKELGLKLNEEPADYDAIHKALLSGLLSQIGMKDE 626
Query: 1233 SGSYKTAKHNQTVMIHPNSSLFEELPRWVLYHELVFTTKEFMRQVTEIESKWLLEVAPHY 1292
Y A+ + I P S LF++ P+WV+ ELV T+K + R V +IE +W+ VA H
Sbjct: 627 KHEYDGAR-GRKFHIFPGSPLFKKPPKWVMAAELVETSKLYARLVAKIEPEWVEPVAGHL 685
Query: 1293 YKPRELEDSTNKK 1305
K E KK
Sbjct: 686 IKKNYFEPHWEKK 698
>gnl|CDD|233660 TIGR01970, DEAH_box_HrpB, ATP-dependent helicase HrpB. This model
represents HrpB, one of two related but uncharacterized
DEAH-box ATP-dependent helicases in many Proteobacteria,
but also in a few species of other lineages. The member
from Rhizobium meliloti has been designated HelO. HrpB
is typically about 800 residues in length, while its
paralog HrpA (TIGR01967), also uncharacterized, is about
1300 amino acids long. Related characterized eukarotic
proteins are RNA helicases associated with pre-mRNA
processing [Unknown function, Enzymes of unknown
specificity].
Length = 819
Score = 196 bits (500), Expect = 1e-51
Identities = 98/230 (42%), Positives = 126/230 (54%), Gaps = 9/230 (3%)
Query: 2 TDGTLHREFLSEPDLASYSVMIIDEAHERTLHTDILFGLVKDIVR-FRSDLKLLISSATL 60
T+G L R +P+L +I DE HER+L D+ L D+ R DLK+L SATL
Sbjct: 97 TEGILTRMIQDDPELDGVGALIFDEFHERSLDADLGLALALDVQSSLREDLKILAMSATL 156
Query: 61 DAQKFSEFFDDAPVFRIPGRRFPVDIYYTKAPEADYI-DACVVSILQIHATQPLGDILVF 119
D ++ S DAPV GR FPV+I Y + DA ++ A++ G ILVF
Sbjct: 157 DGERLSSLLPDAPVVESEGRSFPVEIRYLPLRGDQRLEDAVSRAVEHALASET-GSILVF 215
Query: 120 LTGQEEIETCNEILQDRVRRLGSKIKELIILPVYSNLPTDMQTKIFLPTPPNARKVVLAT 179
L GQ EI E L +R L S +++I P+Y L Q + P P RKVVLAT
Sbjct: 216 LPGQAEIRRVQEQLAER---LDS---DVLICPLYGELSLAAQDRAIKPDPQGRRKVVLAT 269
Query: 180 NIAETSLTIDNIIYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAG 229
NIAETSLTI+ I VID G + F+ +TG+ L V IS+ASA QRAG
Sbjct: 270 NIAETSLTIEGIRVVIDSGLARVARFDPKTGITRLETVRISQASATQRAG 319
Score = 113 bits (285), Expect = 2e-25
Identities = 56/142 (39%), Positives = 73/142 (51%), Gaps = 1/142 (0%)
Query: 798 VIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAGRAGRVAAGKCFRLYTAWAYKHELE 857
VID G + F+ +TG+ L V IS+ASA QRAGRAGR+ G C+RL+ + L
Sbjct: 284 VIDSGLARVARFDPKTGITRLETVRISQASATQRAGRAGRLEPGVCYRLW-SEEQHQRLP 342
Query: 858 DNTVPEIQRINLGNAVLMLKALGINDLIHFDFLDPPPHETLVLALEQLYALGALNHHGDL 917
PEI + +L L L G D +LD PP L A + L LGAL+ G L
Sbjct: 343 AQDEPEILQADLSGLALELAQWGAKDPSDLRWLDAPPSVALAAARQLLQRLGALDAQGRL 402
Query: 918 TKLGRRMAEFPLDPMMGKMLLA 939
T G+ MA P + MLL+
Sbjct: 403 TAHGKAMAALGCHPRLAAMLLS 424
Score = 100 bits (251), Expect = 3e-21
Identities = 45/126 (35%), Positives = 69/126 (54%), Gaps = 4/126 (3%)
Query: 669 LPIYPFKQDLIDAVRDHQVLIIEGETGSGKTTQIP-QYLHEAGFTKDKMKIGCTQPRRVA 727
LPI+ L DA+ H +++E G+GK+T +P L G KI +PRR+A
Sbjct: 1 LPIHAVLPALRDALAAHPQVVLEAPPGAGKSTAVPLALLDAPGIGG---KIIMLEPRRLA 57
Query: 728 AMSVSARVAEEMGVKLGCEVGYAIRFEDCTSERTIIKYMTDGTLHREFLSEPDLASYSVM 787
A S + R+A ++G +G VGY +R E+ S RT ++ +T+G L R +P+L +
Sbjct: 58 ARSAAQRLASQLGEAVGQTVGYRVRGENKVSRRTRLEVVTEGILTRMIQDDPELDGVGAL 117
Query: 788 IIDEAH 793
I DE H
Sbjct: 118 IFDEFH 123
>gnl|CDD|182986 PRK11131, PRK11131, ATP-dependent RNA helicase HrpA; Provisional.
Length = 1294
Score = 189 bits (482), Expect = 1e-48
Identities = 117/350 (33%), Positives = 185/350 (52%), Gaps = 29/350 (8%)
Query: 797 YVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAGRAGRVAAGKCFRLYTAWAYKHEL 856
YVIDPG + + ++ RT ++ L + PIS+ASANQR GR GRV+ G C RLY+
Sbjct: 358 YVIDPGTARISRYSYRTKVQRLPIEPISQASANQRKGRCGRVSEGICIRLYS-------- 409
Query: 857 EDN-------TVPEIQRINLGNAVLMLKALGINDLIHFDFLDPPPHETLVLALEQLYALG 909
ED+ T PEI R NL + +L + ALG+ D+ F F++ P + + L LG
Sbjct: 410 EDDFLSRPEFTDPEILRTNLASVILQMTALGLGDIAAFPFVEAPDKRNIQDGVRLLEELG 469
Query: 910 ALN-----HHGDLTKLGRRMAEFPLDPMMGKMLLASEKYKCVEEIVTIASMLSVNGAIFY 964
A+ LT LGR++A+ P+DP + +M+L ++K+ CV E++ I S LS+
Sbjct: 470 AITTDEQASAYKLTPLGRQLAQLPVDPRLARMVLEAQKHGCVREVMIITSALSIQDP-RE 528
Query: 965 RPKDKIIHADTARKNFFMLGGDHLVLLNIYN---QWADADFSTQW---CYENFIQHRSMK 1018
RP DK +D + F D L +N++N + A S Q+ C +++ + ++
Sbjct: 529 RPMDKQQASDEKHRRFADKESDFLAFVNLWNYLQEQQKALSSNQFRRLCRTDYLNYLRVR 588
Query: 1019 RARDVREQLIGLVQRVEMELVSNPTETVNIRKAITAGYFYHIA-KLSKSGSYKTAKHNQT 1077
+D+ QL +V+ + + + S P E I A+ G HI K ++ Y A+ N
Sbjct: 589 EWQDIYTQLRQVVKELGIPVNSEPAEYREIHTALLTGLLSHIGMKDAEKQEYTGAR-NAR 647
Query: 1078 VMIHPNSSLFEELPRWVLYHELVFTTKEFMRQVTEIESKWLLEVAPHYYK 1127
I P S LF++ P+WV+ ELV T++ + R IE +W+ +A H K
Sbjct: 648 FSIFPGSGLFKKPPKWVMVAELVETSRLWGRIAARIEPEWIEPLAQHLIK 697
Score = 186 bits (474), Expect = 1e-47
Identities = 93/236 (39%), Positives = 140/236 (59%), Gaps = 17/236 (7%)
Query: 1 MTDGTLHREFLSEPDLASYSVMIIDEAHERTLHTDILFGLVKDIVRFRSDLKLLISSATL 60
MTDG L E + L Y +IIDEAHER+L+ D + G +K+++ R DLK++I+SAT+
Sbjct: 169 MTDGILLAEIQQDRLLMQYDTIIIDEAHERSLNIDFILGYLKELLPRRPDLKVIITSATI 228
Query: 61 DAQKFSEFFDDAPVFRIPGRRFPVDIYYTKAPEADYIDACVVSILQ--IHATQPL----- 113
D ++FS F++AP+ + GR +PV++ Y P + D LQ A L
Sbjct: 229 DPERFSRHFNNAPIIEVSGRTYPVEVRYR--PIVEEADDTERDQLQAIFDAVDELGREGP 286
Query: 114 GDILVFLTGQEEIETCNEILQDRVRRLGSKIKELIILPVYSNLPTDMQTKIFLPTPPNAR 173
GDIL+F++G+ EI + L ++ ILP+Y+ L Q ++F + R
Sbjct: 287 GDILIFMSGEREIRDTADALNKL------NLRHTEILPLYARLSNSEQNRVF--QSHSGR 338
Query: 174 KVVLATNIAETSLTIDNIIYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAG 229
++VLATN+AETSLT+ I YVIDPG + + ++ RT ++ L + PIS+ASANQR G
Sbjct: 339 RIVLATNVAETSLTVPGIKYVIDPGTARISRYSYRTKVQRLPIEPISQASANQRKG 394
Score = 148 bits (375), Expect = 7e-36
Identities = 68/141 (48%), Positives = 94/141 (66%), Gaps = 3/141 (2%)
Query: 668 SLPIYPFKQDLIDAVRDHQVLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIGCTQPRRVA 727
+LP+ KQD+++A+RDHQV+I+ GETGSGKTTQ+P+ E G + IG TQPRR+A
Sbjct: 72 NLPVSQKKQDILEAIRDHQVVIVAGETGSGKTTQLPKICLELGRGVKGL-IGHTQPRRLA 130
Query: 728 AMSVSARVAEEMGVKLGCEVGYAIRFEDCTSERTIIKYMTDGTLHREFLSEPDLASYSVM 787
A +V+ R+AEE+ +LG VGY +RF D S+ T++K MTDG L E + L Y +
Sbjct: 131 ARTVANRIAEELETELGGCVGYKVRFNDQVSDNTMVKLMTDGILLAEIQQDRLLMQYDTI 190
Query: 788 IIDEAHPKVYVID--PGFCKQ 806
IIDEAH + ID G+ K+
Sbjct: 191 IIDEAHERSLNIDFILGYLKE 211
Score = 59.7 bits (145), Expect = 1e-08
Identities = 43/149 (28%), Positives = 75/149 (50%), Gaps = 8/149 (5%)
Query: 1153 DHLVLLNIYN---QWADADFSTQW---CYENFIQHRSMKRARDVREQLIGLVQRVEMELV 1206
D L +N++N + A S Q+ C +++ + ++ +D+ QL +V+ + + +
Sbjct: 550 DFLAFVNLWNYLQEQQKALSSNQFRRLCRTDYLNYLRVREWQDIYTQLRQVVKELGIPVN 609
Query: 1207 SNPTETVNIRKAITAGYFYHIA-KLSKSGSYKTAKHNQTVMIHPNSSLFEELPRWVLYHE 1265
S P E I A+ G HI K ++ Y A+ N I P S LF++ P+WV+ E
Sbjct: 610 SEPAEYREIHTALLTGLLSHIGMKDAEKQEYTGAR-NARFSIFPGSGLFKKPPKWVMVAE 668
Query: 1266 LVFTTKEFMRQVTEIESKWLLEVAPHYYK 1294
LV T++ + R IE +W+ +A H K
Sbjct: 669 LVETSRLWGRIAARIEPEWIEPLAQHLIK 697
>gnl|CDD|236950 PRK11664, PRK11664, ATP-dependent RNA helicase HrpB; Provisional.
Length = 812
Score = 175 bits (445), Expect = 1e-44
Identities = 92/230 (40%), Positives = 126/230 (54%), Gaps = 9/230 (3%)
Query: 2 TDGTLHREFLSEPDLASYSVMIIDEAHERTLHTDILFGLVKDIVR-FRSDLKLLISSATL 60
T+G L R +P+L+ ++I+DE HER+L D+ L+ D+ + R DLKLLI SATL
Sbjct: 100 TEGILTRMIQRDPELSGVGLVILDEFHERSLQADLALALLLDVQQGLRDDLKLLIMSATL 159
Query: 61 DAQKFSEFFDDAPVFRIPGRRFPVDIYYTKAPEADYID-ACVVSILQIHATQPLGDILVF 119
D + + DAPV GR FPV+ Y P D A + ++ + G +L+F
Sbjct: 160 DNDRLQQLLPDAPVIVSEGRSFPVERRYQPLPAHQRFDEAVARATAELLRQES-GSLLLF 218
Query: 120 LTGQEEIETCNEILQDRVRRLGSKIKELIILPVYSNLPTDMQTKIFLPTPPNARKVVLAT 179
L G EI+ E L RV ++++ P+Y L Q K LP P RKVVLAT
Sbjct: 219 LPGVGEIQRVQEQLASRVA------SDVLLCPLYGALSLAEQQKAILPAPAGRRKVVLAT 272
Query: 180 NIAETSLTIDNIIYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAG 229
NIAETSLTI+ I V+D G + F+ +TG+ L IS+AS QRAG
Sbjct: 273 NIAETSLTIEGIRLVVDSGLERVARFDPKTGLTRLVTQRISQASMTQRAG 322
Score = 97.7 bits (244), Expect = 2e-20
Identities = 54/143 (37%), Positives = 73/143 (51%), Gaps = 1/143 (0%)
Query: 798 VIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAGRAGRVAAGKCFRLYTAWAYKHELE 857
V+D G + F+ +TG+ L IS+AS QRAGRAGR+ G C LY+ +
Sbjct: 287 VVDSGLERVARFDPKTGLTRLVTQRISQASMTQRAGRAGRLEPGICLHLYSKEQAERAAA 346
Query: 858 DNTVPEIQRINLGNAVLMLKALGINDLIHFDFLDPPPHETLVLALEQLYALGALNHHGDL 917
+ PEI +L +L L G +D +LD PP L A L LGAL+ G L
Sbjct: 347 QSE-PEILHSDLSGLLLELLQWGCHDPAQLSWLDQPPAAALAAAKRLLQQLGALDGQGRL 405
Query: 918 TKLGRRMAEFPLDPMMGKMLLAS 940
T GR+MA DP + ML+A+
Sbjct: 406 TARGRKMAALGNDPRLAAMLVAA 428
Score = 88.4 bits (220), Expect = 1e-17
Identities = 44/127 (34%), Positives = 74/127 (58%), Gaps = 4/127 (3%)
Query: 668 SLPIYPFKQDLIDAVRDHQVLIIEGETGSGKTTQIP-QYLHEAGFTKDKMKIGCTQPRRV 726
SLP+ +L+ A++ ++++ TG+GK+T +P Q L G KI +PRR+
Sbjct: 3 SLPVAAVLPELLTALKTAPQVLLKAPTGAGKSTWLPLQLLQHGGING---KIIMLEPRRL 59
Query: 727 AAMSVSARVAEEMGVKLGCEVGYAIRFEDCTSERTIIKYMTDGTLHREFLSEPDLASYSV 786
AA +V+ R+AE++G K G VGY +R E T ++ +T+G L R +P+L+ +
Sbjct: 60 AARNVAQRLAEQLGEKPGETVGYRMRAESKVGPNTRLEVVTEGILTRMIQRDPELSGVGL 119
Query: 787 MIIDEAH 793
+I+DE H
Sbjct: 120 VILDEFH 126
>gnl|CDD|218070 pfam04408, HA2, Helicase associated domain (HA2). This presumed
domain is about 90 amino acid residues in length. It is
found is a diverse set of RNA helicases. Its function is
unknown, however it seems likely to be involved in
nucleic acid binding.
Length = 91
Score = 136 bits (344), Expect = 9e-38
Identities = 54/91 (59%), Positives = 67/91 (73%), Gaps = 1/91 (1%)
Query: 901 ALEQLYALGALNHHGDLTKLGRRMAEFPLDPMMGKMLLASEKYKCVEEIVTIASMLSVNG 960
ALE LY LGAL+ G+LT LGR+MAE PLDP +GKMLL S ++ C++EI+TIA+MLSV
Sbjct: 2 ALELLYELGALDDDGELTPLGRQMAELPLDPRLGKMLLLSAEFGCLDEILTIAAMLSV-P 60
Query: 961 AIFYRPKDKIIHADTARKNFFMLGGDHLVLL 991
+ FYRPK+K AD AR+ F DHL LL
Sbjct: 61 SPFYRPKEKEEEADAARRKFASAESDHLTLL 91
>gnl|CDD|219532 pfam07717, OB_NTP_bind, Oligonucleotide/oligosaccharide-binding
(OB)-fold. This family is found towards the C-terminus
of the DEAD-box helicases (pfam00270). In these helicases
it is apparently always found in association with
pfam04408. There do seem to be a couple of instances
where it occurs by itself - . The structure PDB:3i4u
adopts an OB-fold. helicases (pfam00270). In these
helicases it is apparently always found in association
with pfam04408. This C-terminal domain of the yeast
helicase contains an
oligonucleotide/oligosaccharide-binding (OB)-fold which
seems to be placed at the entrance of the putative
nucleic acid cavity. It also constitutes the binding site
for the G-patch-containing domain of Pfa1p. When found on
DEAH/RHA helicases, this domain is central to the
regulation of the helicase activity through its binding
of both RNA and G-patch domain proteins.
Length = 109
Score = 128 bits (325), Expect = 5e-35
Identities = 49/108 (45%), Positives = 68/108 (62%), Gaps = 8/108 (7%)
Query: 1026 QLIGLVQRVEME------LVSNPTETVNIRKAITAGYFYHIAKLSKSGSYK-TAKHNQTV 1078
QL+ +++R+ + L S+ + IRKA+ AG F ++A+L T K Q V
Sbjct: 2 QLLEILERLGLPEEPSSLLNSSSNDYELIRKALCAGLFPNVARLVDEKGEYKTLKEGQPV 61
Query: 1079 MIHPNSSLFEELP-RWVLYHELVFTTKEFMRQVTEIESKWLLEVAPHY 1125
IHP+S LF + P WV+YHELV TTKE+MR VT I+ +WLLE+APHY
Sbjct: 62 FIHPSSVLFGKKPPEWVVYHELVETTKEYMRDVTAIDPEWLLELAPHY 109
Score = 128 bits (325), Expect = 5e-35
Identities = 49/108 (45%), Positives = 68/108 (62%), Gaps = 8/108 (7%)
Query: 1193 QLIGLVQRVEME------LVSNPTETVNIRKAITAGYFYHIAKLSKSGSYK-TAKHNQTV 1245
QL+ +++R+ + L S+ + IRKA+ AG F ++A+L T K Q V
Sbjct: 2 QLLEILERLGLPEEPSSLLNSSSNDYELIRKALCAGLFPNVARLVDEKGEYKTLKEGQPV 61
Query: 1246 MIHPNSSLFEELP-RWVLYHELVFTTKEFMRQVTEIESKWLLEVAPHY 1292
IHP+S LF + P WV+YHELV TTKE+MR VT I+ +WLLE+APHY
Sbjct: 62 FIHPSSVLFGKKPPEWVVYHELVETTKEYMRDVTAIDPEWLLELAPHY 109
>gnl|CDD|214852 smart00847, HA2, Helicase associated domain (HA2) Add an
annotation. This presumed domain is about 90 amino acid
residues in length. It is found is a diverse set of RNA
helicases. Its function is unknown, however it seems
likely to be involved in nucleic acid binding.
Length = 82
Score = 108 bits (272), Expect = 3e-28
Identities = 45/85 (52%), Positives = 57/85 (67%), Gaps = 3/85 (3%)
Query: 907 ALGALNHHGDLTKLGRRMAEFPLDPMMGKMLLASEKYKCVEEIVTIASMLSVNGAIFYRP 966
LGAL+ G LT LGR+MAE PLDP + KMLLA+ ++ C++EI+TI +MLSV RP
Sbjct: 1 ELGALDDDGRLTPLGRKMAELPLDPRLAKMLLAAAEFGCLDEILTIVAMLSVGD---PRP 57
Query: 967 KDKIIHADTARKNFFMLGGDHLVLL 991
K+K AD AR+ F DHL LL
Sbjct: 58 KEKREDADAARRRFADPESDHLTLL 82
>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
of proteins involved in ATP-dependent RNA or DNA
unwinding. This domain contains the ATP-binding region.
Length = 144
Score = 101 bits (254), Expect = 6e-25
Identities = 31/115 (26%), Positives = 47/115 (40%), Gaps = 9/115 (7%)
Query: 686 QVLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIGCTQPRRVAAMSVSARVAEEMGVKLGC 745
+ +++ TGSGKT + E + ++ P R A V+ R+ E G +
Sbjct: 1 RDVLLAAPTGSGKTLAALLPILELLDSLKGGQVLVLAPTRELANQVAERLKELFGEGI-- 58
Query: 746 EVGYAIRFEDCT------SERTIIKYMTDGTLHREFLSEP-DLASYSVMIIDEAH 793
+VGY I S +T I T G L E L ++I+DEAH
Sbjct: 59 KVGYLIGGTSIKQQEKLLSGKTDIVVGTPGRLLDELERLKLSLKKLDLLILDEAH 113
Score = 52.7 bits (127), Expect = 6e-08
Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 2/61 (3%)
Query: 1 MTDGTLHREFLSEP-DLASYSVMIIDEAHERTLHTDILFGLVKDIVRFRSDLKLLISSAT 59
T G L E L ++I+DEAH L GL K +++ D ++L+ SAT
Sbjct: 85 GTPGRLLDELERLKLSLKKLDLLILDEAHRLLNQGFGLLGL-KILLKLPKDRQVLLLSAT 143
Query: 60 L 60
Sbjct: 144 P 144
>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily.
Length = 201
Score = 87.2 bits (216), Expect = 3e-19
Identities = 32/138 (23%), Positives = 57/138 (41%), Gaps = 8/138 (5%)
Query: 664 ETKKSLPIYPFKQDLIDAV-RDHQVLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIGCTQ 722
E P+ P++++ I+A+ + +I+ TGSGKT EA ++
Sbjct: 2 EKFGFEPLRPYQKEAIEALLSGLRDVILAAPTGSGKTLAALLPALEALKRGKGGRVLVLV 61
Query: 723 PRRVAAMSVSARVAEEM----GVKLGCEVGYAIR--FEDCTSERTIIKYMTDGTLHREFL 776
P R A + + + +G G + R S +T I T G L
Sbjct: 62 PTRELAEQWAEELKKLGPSLGLKVVGLYGGDSKREQLRKLESGKTDILVTTPGRLLDLLE 121
Query: 777 SEP-DLASYSVMIIDEAH 793
++ L++ ++I+DEAH
Sbjct: 122 NDKLSLSNVDLVILDEAH 139
Score = 53.6 bits (129), Expect = 8e-08
Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 8/91 (8%)
Query: 1 MTDGTLHREFLSEP-DLASYSVMIIDEAHERT--LHTDILFGLVKDIVRFRSDLKLLISS 57
T G L ++ L++ ++I+DEAH D L L+K +++LL+ S
Sbjct: 111 TTPGRLLDLLENDKLSLSNVDLVILDEAHRLLDGGFGDQLEKLLK---LLPKNVQLLLLS 167
Query: 58 ATL--DAQKFSEFFDDAPVFRIPGRRFPVDI 86
AT + + E F + PVF G I
Sbjct: 168 ATPPEEIENLLELFLNDPVFIDVGFTPLEPI 198
>gnl|CDD|197757 smart00490, HELICc, helicase superfamily c-terminal domain.
Length = 82
Score = 49.5 bits (119), Expect = 2e-07
Identities = 23/99 (23%), Positives = 38/99 (38%), Gaps = 23/99 (23%)
Query: 131 EILQDRVRRLGSKIKELIILPVYSNLPTDMQTKIFLPTPPNARKVVLATNIAETSLTIDN 190
E L + ++ LG K+ ++ L + + +I KV++AT++AE L +
Sbjct: 1 EELAELLKELGIKV-----ARLHGGLSQEEREEILDKFNNGKIKVLVATDVAERGLDLPG 55
Query: 191 IIYVIDPGFCKQNNFNSRTGMESLTVVPISKASANQRAG 229
+ VI P S AS QR G
Sbjct: 56 VDLVIIYDL------------------PWSPASYIQRIG 76
Score = 29.5 bits (67), Expect = 2.1
Identities = 11/17 (64%), Positives = 12/17 (70%)
Query: 821 VPISKASANQRAGRAGR 837
+P S AS QR GRAGR
Sbjct: 64 LPWSPASYIQRIGRAGR 80
>gnl|CDD|177443 PHA02653, PHA02653, RNA helicase NPH-II; Provisional.
Length = 675
Score = 54.2 bits (131), Expect = 4e-07
Identities = 53/201 (26%), Positives = 91/201 (45%), Gaps = 35/201 (17%)
Query: 16 LASYSVMIIDEAHERTLHTDILFGLV-KDIVRFRSDLKLLISSATL--DAQKFSEFFDDA 72
L Y +IIDE HE DI+ + K I + RS L + +ATL D + EFF +
Sbjct: 289 LFDYGTVIIDEVHEHDQIGDIIIAVARKHIDKIRS---LFLMTATLEDDRDRIKEFFPNP 345
Query: 73 PVFRIPG-RRFPV-DIY----YTKAPEADYIDA---CVVSILQIHATQPLG-DILVFLTG 122
IPG FP+ ++Y Y + YI+ +V+ L+ + T P G +VF+
Sbjct: 346 AFVHIPGGTLFPISEVYVKNKYNPKNKRAYIEEEKKNIVTALKKY-TPPKGSSGIVFV-- 402
Query: 123 QEEIETCNEILQDRVRRLGSKIKELIILPVYSNLP--TDMQTKIFLPTPPNARK---VVL 177
+ C E + L ++ ++ +P ++ K++ + K +++
Sbjct: 403 -ASVSQCEEY----KKYLEKRLPIYDFYIIHGKVPNIDEILEKVY------SSKNPSIII 451
Query: 178 ATNIAETSLTIDNIIYVIDPG 198
+T E+S+TI N +V D G
Sbjct: 452 STPYLESSVTIRNATHVYDTG 472
Score = 32.3 bits (74), Expect = 2.2
Identities = 13/61 (21%), Positives = 25/61 (40%), Gaps = 7/61 (11%)
Query: 653 TEKQKKKLNIEETKKSLPIYPFK-------QDLIDAVRDHQVLIIEGETGSGKTTQIPQY 705
+K + I + P + +A + +++ G TG GKT+Q+P+
Sbjct: 140 VGSKKDTIGILGNPEPFSKIPLASLQPDVQLKIFEAWISRKPVVLTGGTGVGKTSQVPKL 199
Query: 706 L 706
L
Sbjct: 200 L 200
Score = 30.3 bits (69), Expect = 8.4
Identities = 12/26 (46%), Positives = 14/26 (53%)
Query: 823 ISKASANQRAGRAGRVAAGKCFRLYT 848
ISK+ QR GR GRV+ G Y
Sbjct: 487 ISKSMRTQRKGRVGRVSPGTYVYFYD 512
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 50.9 bits (121), Expect = 5e-06
Identities = 45/219 (20%), Positives = 92/219 (42%), Gaps = 8/219 (3%)
Query: 450 DDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESR 509
++ K+ DE ++ + K + + K+A E +A+ K + +
Sbjct: 1480 EEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEA 1539
Query: 510 RKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEK 569
+K EK+K D++ + E +E +E+ E ++ ++ E AK+ ++AR E
Sbjct: 1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKA--EEAKKAEEARIEEV 1597
Query: 570 VQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLL 629
++ Y K K E + + K K E+ K +E+ + E ++ EL
Sbjct: 1598 MKLYEEEKKMKAEEAKKAEEAKIKA---EELKKAEEEKKKVEQLKKKEAEEKKKAEELKK 1654
Query: 630 DNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKS 668
+ + I+A K EE+ + ++ K E+ KK+
Sbjct: 1655 AEEENKIKAAEE---AKKAEEDKKKAEEAKKAEEDEKKA 1690
Score = 45.5 bits (107), Expect = 2e-04
Identities = 45/230 (19%), Positives = 99/230 (43%), Gaps = 23/230 (10%)
Query: 445 LTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKI--IP 502
L ++++ K + + DK + + +D++ EA K+ EAE K +
Sbjct: 1652 LKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKK---EAEEAKKAEELK 1708
Query: 503 KLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHD 562
K E ++K E +K ++ +++A+ E ++ E K+K E K+ +
Sbjct: 1709 KKEAEEKKKAEELKKAEEENKIKAE--------------EAKKEAEEDKKKAEEAKKDEE 1754
Query: 563 KARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQ 622
+ +++ +++ K + ++ E V E+E E+R+ E ++ IF A E
Sbjct: 1755 EKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGG 1814
Query: 623 EQYELLLDN----QIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKS 668
++ L++++ + I+ ++ N E + EK K N E +
Sbjct: 1815 KEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDG 1864
Score = 45.5 bits (107), Expect = 2e-04
Identities = 46/235 (19%), Positives = 98/235 (41%), Gaps = 14/235 (5%)
Query: 439 AFMGHFLTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKL-----E 493
A M HF + E ++ LK ++ + A ++K+ +EA K+ + E
Sbjct: 1260 ARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADE 1319
Query: 494 AESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRK 553
A+ + + K +++K E +K + A+ EA+ DE +E++ E++ E K+K
Sbjct: 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
Query: 554 LLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFR 613
K+ ++ ++ ++ ++ K + +K +E+ + +K DE A +
Sbjct: 1380 ADAAKKKAEEKKKADEAKK----KAEEDKKKADELKKAA-----AAKKKADEAKKKAEEK 1430
Query: 614 VGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKS 668
A + + + + D + K EE + + K EE KK+
Sbjct: 1431 KKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKA 1485
Score = 44.4 bits (104), Expect = 5e-04
Identities = 48/233 (20%), Positives = 95/233 (40%), Gaps = 18/233 (7%)
Query: 451 DRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAE-SRDKIIPKLRIESR 509
D + + + + ++N + AT + EEA K +AE +R K + E
Sbjct: 1069 DEGLKPSYKDFDFDAKEDNRADEATE-EAFGKAEEAKKTETGKAEEARKAEEAKKKAEDA 1127
Query: 510 RKYLEKRKEDKVAEL-EADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELE 568
RK E RK + + EA +D E R+ E ++ E AK+ + AR+ E
Sbjct: 1128 RKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKA--EDAKKAEAARKAE 1185
Query: 569 KVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQ-RKWEDEQMSSAIFRV------------G 615
+V++ + + K E + E+ E+ RK ED + + A+ +
Sbjct: 1186 EVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKA 1245
Query: 616 AKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKS 668
++R ++E + F + + + + + +K ++K +E KK+
Sbjct: 1246 EEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKA 1298
Score = 43.2 bits (101), Expect = 0.001
Identities = 43/226 (19%), Positives = 100/226 (44%), Gaps = 6/226 (2%)
Query: 448 SDDDRK-ERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKL-EAESRDKIIPKLR 505
+D+ +K E + ++ K ++ + A ++K+ +E K +L +AE + K +
Sbjct: 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKK 1571
Query: 506 IESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERE--RRDREHKRKLLELAKEHDK 563
E + ++ E+ EA I + L+ E++ + E ++ E K K EL K ++
Sbjct: 1572 AEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEE 1631
Query: 564 ARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQE 623
+++E++++ + K E + +E + E +K E+++ + + +D +
Sbjct: 1632 KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAA 1691
Query: 624 QYELLLDNQIDFIQALSLPGNRD--KGEEEMTEKQKKKLNIEETKK 667
+ + + L + K EE +++ K+ EE KK
Sbjct: 1692 EALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKK 1737
Score = 42.1 bits (98), Expect = 0.002
Identities = 43/219 (19%), Positives = 89/219 (40%), Gaps = 13/219 (5%)
Query: 463 KNRDKNNTKNVATSRQDKRAYEEAAKRL----KLEAESRDKIIPKLRIESRRKYLEKRKE 518
K + ++ + +EA K+ K E++ K + + +K E +K
Sbjct: 1465 KAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKA 1524
Query: 519 DKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAK-------EHDKARELEKVQ 571
D+ + E DE +E++ E + E +K E K E DK L K +
Sbjct: 1525 DEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAE 1584
Query: 572 RYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDN 631
+ + E + +E++K+ E +K E+ ++ + + ++++ EQ +
Sbjct: 1585 EAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644
Query: 632 QIDFIQAL--SLPGNRDKGEEEMTEKQKKKLNIEETKKS 668
+ + L + N+ K EE + ++ K EE KK+
Sbjct: 1645 EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKA 1683
Score = 38.6 bits (89), Expect = 0.029
Identities = 34/196 (17%), Positives = 91/196 (46%), Gaps = 7/196 (3%)
Query: 476 SRQDKRAYEE--AAKRLKLEAESRDKI--IPKLRIESRRKYLEKRKEDKVAELEADILDD 531
+ + K+A E A+ LK E + K+ + K E ++K E +K ++ +++A
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAK 1668
Query: 532 EYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEK 591
+ +++ E ++ + + K+ L KE ++A++ E++++ + K E + +E
Sbjct: 1669 KAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEN 1728
Query: 592 EKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEE 651
+ +++ E+++ + + KD E +++ L + + + EE
Sbjct: 1729 KIKAEEAKKEAEEDKKKAEEAK---KDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785
Query: 652 MTEKQKKKLNIEETKK 667
E +K+++ +++ K
Sbjct: 1786 DEEDEKRRMEVDKKIK 1801
Score = 36.3 bits (83), Expect = 0.16
Identities = 37/164 (22%), Positives = 69/164 (42%), Gaps = 11/164 (6%)
Query: 453 KERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKI-----IPKLRIE 507
+E+ E +E+ K K ++K+ +EA K+ + + + D++ K E
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADE 1422
Query: 508 SRRKYLEKRKED---KVAELEADILDDEYLFSEQELTERERRDREHKRKLLEL---AKEH 561
+++K EK+K D K AE + + E + E ++ E +K E A+E
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEA 1482
Query: 562 DKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDE 605
KA E +K K + + E + ++ E+ K DE
Sbjct: 1483 KKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526
Score = 35.9 bits (82), Expect = 0.21
Identities = 46/196 (23%), Positives = 81/196 (41%), Gaps = 16/196 (8%)
Query: 478 QDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSE 537
+D R EEA K AE ++ + E RK E RK + + EA +E +E
Sbjct: 1137 EDARKAEEARK-----AEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAE 1191
Query: 538 QELTERERRDREHKRKL-----LELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKE 592
+ + R E RK E A++ + A++ E V++ K + EE + E
Sbjct: 1192 ELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAK-----KDAEEAKKAE 1246
Query: 593 KVPHYEQ-RKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEE 651
+ + E+ RK+E+ +M+ R A E + + L + + K +E
Sbjct: 1247 EERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADE 1306
Query: 652 MTEKQKKKLNIEETKK 667
+K ++ +E KK
Sbjct: 1307 AKKKAEEAKKADEAKK 1322
Score = 35.5 bits (81), Expect = 0.23
Identities = 43/218 (19%), Positives = 83/218 (38%), Gaps = 9/218 (4%)
Query: 452 RKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRK 511
+ E +E + + +A +D R EEA K AE K + E RK
Sbjct: 1135 KAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARK-----AEDAKKAEAARKAEEVRK 1189
Query: 512 YLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQ 571
E RK + + EA +E +E+ + + E +K E K+ ++A++ E+ +
Sbjct: 1190 AEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEER 1249
Query: 572 RYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVG-AKDREHQEQYELLLD 630
+ + + + E RK ++ + + + AK E +++ +
Sbjct: 1250 NNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKK 1309
Query: 631 NQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKS 668
+ +A + K EE + K EE KK+
Sbjct: 1310 KAEEAKKADEA---KKKAEEAKKKADAAKKKAEEAKKA 1344
Score = 32.4 bits (73), Expect = 2.3
Identities = 39/221 (17%), Positives = 84/221 (38%), Gaps = 20/221 (9%)
Query: 447 DSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRI 506
D+++ +K +E + + + RQ EEA K A+ K K +
Sbjct: 1238 DAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARK-----ADELKKAEEKKKA 1292
Query: 507 ESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARE 566
+ +K EK+K D+ + + ++ E +++ E K+K K+ ++A++
Sbjct: 1293 DEAKKAEEKKKADEAKKKAEE---------AKKADEAKKKAEEAKKKADAAKKKAEEAKK 1343
Query: 567 LEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYE 626
+ + E EE E + E +K D + A++++ ++ +
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA------AKKKAEEKKKADEAK 1397
Query: 627 LLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKK 667
+ L K +E +K ++K +E KK
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKK 1438
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 48.2 bits (115), Expect = 3e-05
Identities = 48/223 (21%), Positives = 88/223 (39%), Gaps = 23/223 (10%)
Query: 451 DRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRR 510
KER E + L +K YE+ ++ LK + I +L +
Sbjct: 152 KPKERKEILDELFGLEK---------------YEKLSELLKEVIKEAKAKIEELEGQL-- 194
Query: 511 KYLEKRKEDKVAELEADI---LDDEYLFSEQELTERERRDREHKRKLLELAKEHDKAREL 567
L + ED + LE ++ E + EQE E E+ + +L EL +E ++ EL
Sbjct: 195 SELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEEL 254
Query: 568 EKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYEL 627
+ + + EE E E+ + + E++ + R + E E
Sbjct: 255 KARLLEIESLELEALKIREEELR-ELERLLEELEEKIERL-EELEREIEELEEELEGLRA 312
Query: 628 LLDNQIDFIQAL-SLPGNRDKGEEEMTEKQKKKLNIEETKKSL 669
LL+ + ++ L SL +K EE++ + + + + E K L
Sbjct: 313 LLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNEL 355
Score = 41.3 bits (97), Expect = 0.004
Identities = 40/227 (17%), Positives = 81/227 (35%), Gaps = 16/227 (7%)
Query: 449 DDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIES 508
+D + E LK K ++ EA + E E + + +L+
Sbjct: 199 EDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARL 258
Query: 509 RRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELE 568
+ + K+ E E L+ E+++ E +RE + EL ELE
Sbjct: 259 LEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELE 318
Query: 569 ------KVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQ 622
K + K EK E+++ + + + K +++ + + E
Sbjct: 319 ELLEKLKSLEERLEKLE---EKLEKLESELE--ELAEEK---NELAKLLEERLKELEERL 370
Query: 623 EQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSL 669
E+ E L+ ++ ++ L + +EE+ E I+E + L
Sbjct: 371 EELEKELEKALERLKQLEE--AIQELKEELAELSAALEEIQEELEEL 415
Score = 39.7 bits (93), Expect = 0.013
Identities = 41/264 (15%), Positives = 93/264 (35%), Gaps = 19/264 (7%)
Query: 419 RRTQSPCPSCLKVFKTPKVRAFMGHFLTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQ 478
CP C + + + + + ++ +E E + R++
Sbjct: 454 AGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELREL 513
Query: 479 DKRAYEEAAKRLKLEAESRDKI--IPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFS 536
++ E L+ E +K+ + L E + + + EL +
Sbjct: 514 EEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEEL------RQLEDR 567
Query: 537 EQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYM---------PKGTKGPEKYEE 587
QEL E R + + EL + ++ +EL+K + + + E E
Sbjct: 568 LQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENE 627
Query: 588 VDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQ-EQYELLLDNQIDFIQAL-SLPGNR 645
++E E+ E K + + + ++ E + E+ E + ++ I+ L
Sbjct: 628 LEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKL 687
Query: 646 DKGEEEMTEKQKKKLNIEETKKSL 669
++ E+ E ++ + +EE K L
Sbjct: 688 EELEQLEEELEQLREELEELLKKL 711
Score = 34.4 bits (79), Expect = 0.57
Identities = 43/228 (18%), Positives = 95/228 (41%), Gaps = 18/228 (7%)
Query: 444 FLTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPK 503
L + + K+ +E + LK + + ++ EE K L+ +++ +
Sbjct: 376 ELEKALERLKQLEEAIQELKEELAELSAALEEIQE---ELEELEKELEELERELEELEEE 432
Query: 504 LRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDK 563
++ + + KE +AEL ++ QEL E EH+++LLEL + +
Sbjct: 433 IKKLEEQINQLESKELMIAELAGA--GEKCPVCGQELPE------EHEKELLELYEL--E 482
Query: 564 ARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQE 623
ELE+ + + + E+ EE++++ + E+ E ++ A+ + E E
Sbjct: 483 LEELEE-ELSREKEEAELREEIEELEKELRE--LEEELIELLELEEALKEELEEKLEKLE 539
Query: 624 QYELLLDNQIDFIQALSLPGNRDKGEEEMTE--KQKKKLNIEETKKSL 669
L+ + +Q L + E+ + E + ++L + T+K
Sbjct: 540 NLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEE 587
>gnl|CDD|238034 cd00079, HELICc, Helicase superfamily c-terminal domain; associated
with DEXDc-, DEAD-, and DEAH-box proteins, yeast
initiation factor 4A, Ski2p, and Hepatitis C virus NS3
helicases; this domain is found in a wide variety of
helicases and helicase related proteins; may not be an
autonomously folding unit, but an integral part of the
helicase; 4 helicase superfamilies at present according
to the organization of their signature motifs; all
helicases share the ability to unwind nucleic acid
duplexes with a distinct directional polarity; they
utilize the free energy from nucleoside triphosphate
hydrolysis to fuel their translocation along DNA,
unwinding the duplex in the process.
Length = 131
Score = 43.8 bits (104), Expect = 6e-05
Identities = 26/148 (17%), Positives = 58/148 (39%), Gaps = 34/148 (22%)
Query: 85 DIYYTKAPEADYIDACVVSILQIHATQPLGDILVFLTGQEEIETCNEILQDRVRRLGSKI 144
I P D ++ +L+ H + G +L+F ++ ++ L + +R+ G K+
Sbjct: 1 PIKQYVLPVEDEKLEALLELLKEHLKKG-GKVLIFCPSKKMLD----ELAELLRKPGIKV 55
Query: 145 KELIILPVYSNLPTDMQTKI---FLPTPPNARKVVLATNIAETSLTIDNIIYVIDPGFCK 201
++ + + + ++ F V++AT++ + + N+ VI
Sbjct: 56 A-----ALHGDGSQEEREEVLKDFREGEIV---VLVATDVIARGIDLPNVSVVI------ 101
Query: 202 QNNFNSRTGMESLTVVPISKASANQRAG 229
N++ P S +S QR G
Sbjct: 102 --NYDL----------PWSPSSYLQRIG 117
Score = 33.0 bits (76), Expect = 0.34
Identities = 12/26 (46%), Positives = 13/26 (50%), Gaps = 1/26 (3%)
Query: 822 PISKASANQRAGRAGRVA-AGKCFRL 846
P S +S QR GRAGR G L
Sbjct: 106 PWSPSSYLQRIGRAGRAGQKGTAILL 131
>gnl|CDD|201125 pfam00271, Helicase_C, Helicase conserved C-terminal domain. The
Prosite family is restricted to DEAD/H helicases,
whereas this domain family is found in a wide variety of
helicases and helicase related proteins. It may be that
this is not an autonomously folding unit, but an
integral part of the helicase.
Length = 78
Score = 40.6 bits (96), Expect = 2e-04
Identities = 16/87 (18%), Positives = 36/87 (41%), Gaps = 18/87 (20%)
Query: 143 KIKELIILPVYSNLPTDMQTKIFLPTPPNARKVVLATNIAETSLTIDNIIYVIDPGFCKQ 202
+ + + ++ L + + +I KV++AT++A + + ++ VI
Sbjct: 4 RKPGIKVARLHGGLSQEEREEILEDFRNGKSKVLVATDVAGRGIDLPDVNLVI------- 56
Query: 203 NNFNSRTGMESLTVVPISKASANQRAG 229
N++ +P + AS QR G
Sbjct: 57 -NYD----------LPWNPASYIQRIG 72
Score = 28.6 bits (65), Expect = 4.0
Identities = 10/18 (55%), Positives = 12/18 (66%)
Query: 821 VPISKASANQRAGRAGRV 838
+P + AS QR GRAGR
Sbjct: 60 LPWNPASYIQRIGRAGRA 77
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 43.4 bits (103), Expect = 8e-04
Identities = 39/152 (25%), Positives = 81/152 (53%), Gaps = 26/152 (17%)
Query: 484 EEAAKRLKLEAESR----DKI---IPKLRIESRRKYLEKRKED----KVAELEADI--LD 530
+EAA R+++E +S+ D++ I +L IE R+ L+K K++ ++ +LE ++ L+
Sbjct: 390 DEAAARIRMEIDSKPEELDELDRRIIQLEIE--REALKKEKDEASKERLEDLEKELAELE 447
Query: 531 DEYLFSEQELTERERRDREHKRKLLELAKEHDKAR-ELEKVQRYYMPKGTKGPE-KYEEV 588
+EY +L E+ + ++ + + ++ +E ++ R ELE+ +R K E +Y ++
Sbjct: 448 EEY----ADLEEQWKAEKAAIQGIQQIKEEIEQVRLELEQAEREG--DLAKAAELQYGKL 501
Query: 589 DEKEKVPHYEQRKWEDEQMSSAIFR--VGAKD 618
E EK + K E+ + R V A++
Sbjct: 502 PELEKRLQAAEAK-LGEETKPRLLREEVTAEE 532
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 42.4 bits (100), Expect = 0.002
Identities = 48/237 (20%), Positives = 98/237 (41%), Gaps = 17/237 (7%)
Query: 445 LTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKL 504
L + ++D ++ E +K+ K + +D EEA LEA IP++
Sbjct: 739 LEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALN--DLEARLSHSRIPEI 796
Query: 505 R-----IESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAK 559
+ +E +E R + +L L+ EYL E + + +R D + + K +E
Sbjct: 797 QAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI 856
Query: 560 EHDKAR------ELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFR 613
E+ + ELE+++ ++ + +E DE E + + + E++ + I +
Sbjct: 857 ENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEA--QLRELERKIEELEAQIEK 914
Query: 614 VGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEK-QKKKLNIEETKKSL 669
+ E + + E L + ++ I+ EE E Q + +EE ++L
Sbjct: 915 KRKRLSELKAKLEALEE-ELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRAL 970
Score = 42.4 bits (100), Expect = 0.002
Identities = 60/269 (22%), Positives = 100/269 (37%), Gaps = 49/269 (18%)
Query: 451 DRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRR 510
K E E LK ++ + A RQ EE K + +E ++ IE
Sbjct: 218 KEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLE---EIEQLL 274
Query: 511 KYLEKRKED-----------KVAELEADI--LDDEYLFSEQELTERERRDREHKRKLLEL 557
+ L K+ +D K+ ELEA+I L+ E+EL + E R + + ++ +L
Sbjct: 275 EELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKL 334
Query: 558 AKEHDK-ARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGA 616
E ++ RE+E+ ++ EE E ++ + + E+ A R
Sbjct: 335 LAEIEELEREIEEERKRRDKLT-------EEYAELKEELEDLRAELEEVDKEFAETRDEL 387
Query: 617 KDREHQ------EQYELL--LDNQIDFIQALS---------LPGNRDKG---EEEMTEKQ 656
KD + E EL LD + +Q LS + G K EEE +K
Sbjct: 388 KDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKA 447
Query: 657 KKKLNIEETKKSLPIYPFKQDLIDAVRDH 685
+ E + L DL ++
Sbjct: 448 LEIKKQEWKLEQL-----AADLSKYEQEL 471
Score = 37.4 bits (87), Expect = 0.064
Identities = 37/193 (19%), Positives = 83/193 (43%), Gaps = 11/193 (5%)
Query: 480 KRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLE--KRKEDKVAELEADILDDEYLFSE 537
+R E+A +RL DK+ IE + +E +++ DK+ E E L +E
Sbjct: 314 ERELEDAEERLAKLEAEIDKL--LAEIEELEREIEEERKRRDKLTE-EYAELKEELEDLR 370
Query: 538 QELTERERRDREHKRKLLELAKEHDKA-RELEKVQRYYMPKGTKGPEKYEEVDE-KEKVP 595
EL E ++ E + +L + ++ +K RE+ +++R + EE+ + +
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIA 430
Query: 596 HYEQRKWE-DEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTE 654
E + E +E+ + ++ + ++ L + + L D+ E+E+++
Sbjct: 431 GIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYE---QELYDLKEEYDRVEKELSK 487
Query: 655 KQKKKLNIEETKK 667
Q++ E +
Sbjct: 488 LQRELAEAEAQAR 500
Score = 37.4 bits (87), Expect = 0.066
Identities = 42/224 (18%), Positives = 81/224 (36%), Gaps = 35/224 (15%)
Query: 476 SRQDKRAYEEAAKRL-KLEAESRDKIIPKLRIESRRKYLEKRKED---KVAELEADI--L 529
SR + + +RL L+ E RIE+R L + D K+ E+E +I L
Sbjct: 669 SRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQL 728
Query: 530 DDEYLFSEQELTERERRDREHKRKLLELAKEHD--------KARELEKVQR-----YYMP 576
+ E ++ L E E ++++ + E +L K++
Sbjct: 729 EQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARL 788
Query: 577 KGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKD---------REHQEQYEL 627
++ PE E+ + E+ E + E ++ ++ + ++ +
Sbjct: 789 SHSRIPEIQAELSKLEE----EVSRIE-ARLREIEQKLNRLTLEKEYLEKEIQELQEQRI 843
Query: 628 LLDNQIDFIQALSLPGNRDKGE--EEMTEKQKKKLNIEETKKSL 669
L QI I+ N K E EE+ E + ++E L
Sbjct: 844 DLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDL 887
Score = 31.2 bits (71), Expect = 5.1
Identities = 31/182 (17%), Positives = 67/182 (36%), Gaps = 29/182 (15%)
Query: 452 RKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRK 511
E +E + ++ ++ T + E R +LE ++ + ++ R+
Sbjct: 335 LAEIEELEREI--EEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYRE 392
Query: 512 YLEK--------------------RKEDKVAELEADI--LDDEYLFSEQELTERERRDRE 549
LEK R +++A+L A I ++ + E+E ++ ++
Sbjct: 393 KLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKK 452
Query: 550 HKRKLLELAKEHDKAR-ELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMS 608
+ KL +LA + K EL ++ Y + + E+ E E + R E+
Sbjct: 453 QEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEA----QARASEERVRG 508
Query: 609 SA 610
Sbjct: 509 GR 510
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 41.8 bits (99), Expect = 0.002
Identities = 44/188 (23%), Positives = 76/188 (40%), Gaps = 44/188 (23%)
Query: 480 KRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQE 539
KR EE +RL+ E E D+II +++ E + EKR+ K +L +I ++
Sbjct: 85 KRRQEEYEERLQ-EREQMDEIIERIQEEDEAEAQEKRE--KQKKLREEI--------DEF 133
Query: 540 LTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQ 599
ER R E K + E E K+ Y K E+ EE + + + +
Sbjct: 134 NEERIERKEEEKERERE---------EELKILEYQREKA----EREEEREAERR-----E 175
Query: 600 RKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKK 659
RK E E+ + + + + +E+ + L D +EE K+++K
Sbjct: 176 RKEEKEREVARLRAQQEEAEDEREELDELRA---------------DLYQEEYERKERQK 220
Query: 660 LNIEETKK 667
E K+
Sbjct: 221 EKEEAEKR 228
Score = 33.7 bits (78), Expect = 0.54
Identities = 23/107 (21%), Positives = 48/107 (44%), Gaps = 18/107 (16%)
Query: 480 KRAYEEAAK--RLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFS- 536
EE + R K + E+ + + + E ++ E++ E+K L+ + ++E
Sbjct: 207 DLYQEEYERKERQKEKEEAEKRR--RQKQE-LQRAREEQIEEKEERLQEERAEEEAERER 263
Query: 537 -----------EQELTERER-RDREHKRKLLELAKEHDKARELEKVQ 571
EQE E+ R + EH+R+L + +E ++ R E+ +
Sbjct: 264 MLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREE 310
Score = 33.7 bits (78), Expect = 0.61
Identities = 40/203 (19%), Positives = 83/203 (40%), Gaps = 22/203 (10%)
Query: 477 RQDKRAYEE----AAKRLKLEAESRDKIIPKLRIESRRKYLEKRKE----DKVAELEADI 528
+R EE A ++ + + + R++I RK EK +E K+ E + +
Sbjct: 103 EIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREK 162
Query: 529 LDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKV--QRYYMPKGTKGPEKYE 586
+ E + +E ++RE R + + D+ EL+++ Y K +K +
Sbjct: 163 AEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEK 222
Query: 587 EVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRD 646
E EK + E ++ +EQ+ R+ + E + + E +L+ Q +
Sbjct: 223 EEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQ------------AE 270
Query: 647 KGEEEMTEKQKKKLNIEETKKSL 669
E E +K+++ E ++ L
Sbjct: 271 DEELEQENAEKRRMKRLEHRREL 293
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 42.0 bits (99), Expect = 0.002
Identities = 61/258 (23%), Positives = 117/258 (45%), Gaps = 39/258 (15%)
Query: 457 EFSERLKN-RDKNNTKNVATSRQDKRAYEEAAKRL-KLEAESR--DKIIPKLR-IESRRK 511
E +E+LK +K N+ + YE+ ++L KL+ E + K + KL ++ +
Sbjct: 500 ELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLA 559
Query: 512 YLEKRK---EDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAK-EHDK---- 563
LEK+ E+++AEL L++ S +EL ER + + LEL E +
Sbjct: 560 ELEKKLDELEEELAELL-KELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREE 618
Query: 564 ------ARELEKVQRYYMPKGTKGPEKYEEVDEKEK---VPHYEQRKWEDEQMSSAIFRV 614
EL+K + E +E++E EK YE+ + E ++S + +
Sbjct: 619 KELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGL 678
Query: 615 GAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSLPIYPF 674
A+ E L+ + + I+ +K +EE+ E++K K +E+ +K+L
Sbjct: 679 RAELEE--------LEKRREEIKK-----TLEKLKEELEEREKAKKELEKLEKAL---ER 722
Query: 675 KQDLIDAVRDHQVLIIEG 692
++L + V+ ++ L+ E
Sbjct: 723 VEELREKVKKYKALLKER 740
Score = 35.8 bits (83), Expect = 0.21
Identities = 46/226 (20%), Positives = 100/226 (44%), Gaps = 21/226 (9%)
Query: 462 LKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLR--------IESRRKYL 513
+K R + K + + + +E K L+ ++I +L +E K L
Sbjct: 174 IKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKEL 233
Query: 514 EKRKEDKVAELEADILDDEYLFS--EQELTERERRDREHKRKLLELAKEHDKARELEKVQ 571
E+ KE ++ ELE ++ E E+++ E E R E K+++ EL ++ + +EL++
Sbjct: 234 EELKE-EIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKA 292
Query: 572 RYY------MPKGTKGPEKYEEVDEK--EKVPHYEQRKWEDEQMSSAIFRVGAKDREHQE 623
Y + + E+ + E++ E+R E E+ + + K +E ++
Sbjct: 293 EEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEK 352
Query: 624 QYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLN--IEETKK 667
+ E L + + +A + ++ ++ +T +KL +EE +K
Sbjct: 353 RLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEK 398
Score = 35.4 bits (82), Expect = 0.22
Identities = 46/236 (19%), Positives = 96/236 (40%), Gaps = 30/236 (12%)
Query: 453 KERDEFSERLKNRDK------NNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRI 506
K + E +K ++K +++ + R E ++ E E + I +L
Sbjct: 186 KRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEK 245
Query: 507 ESRRKYLEKRK-EDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKAR 565
E KRK E+K+ ELE I +EL + E ++L EL ++ ++
Sbjct: 246 ELESLEGSKRKLEEKIRELEERI---------EELKKEIEELEEKVKELKELKEKAEEYI 296
Query: 566 ELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQY 625
+L + Y+ + + ++ ++E+ E+R E E+ + + K +E +++
Sbjct: 297 KLSEFYEEYLDELREIEKRLSRLEEEING--IEERIKELEEKEERLEELKKKLKELEKRL 354
Query: 626 ELL------------LDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSL 669
E L +++ ++ +K E+E+ E +K K IEE +
Sbjct: 355 EELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKI 410
Score = 32.0 bits (73), Expect = 2.8
Identities = 48/218 (22%), Positives = 89/218 (40%), Gaps = 36/218 (16%)
Query: 460 ERLKNRDKNNTKNVATSRQDKRAYEEAAKRL----KLEAESRDKIIPKLRIESRRKYLEK 515
+ +N KN + + ++ E+ KR +L E ++ LR +
Sbjct: 158 DDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELP 217
Query: 516 RKEDKVAELEADILDDEYLFSEQELTERERRDRE-HKRKLLELAKEHDKARELEKVQRYY 574
+++ +LE ++ + E L E E E+E E KRKL E +E ++ E
Sbjct: 218 ELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIE-------- 269
Query: 575 MPKGTKGPEKYEEVDE-KEKVPHYEQRKWEDEQMSSAI-FRVGAKDREHQEQYEL-LLDN 631
E +E++E +EKV ++ K + E+ F D + + L L+
Sbjct: 270 --------ELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEE 321
Query: 632 QIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSL 669
+I+ I+ E + E ++K+ +EE KK L
Sbjct: 322 EINGIE------------ERIKELEEKEERLEELKKKL 347
Score = 32.0 bits (73), Expect = 2.9
Identities = 42/164 (25%), Positives = 69/164 (42%), Gaps = 37/164 (22%)
Query: 460 ERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKL-----RIESRRKYLE 514
ERLK R T ++ EE K + E KI ++ I+ +K +E
Sbjct: 375 ERLKKRLTGLTPEKL-----EKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIE 429
Query: 515 KRKEDK----VAELEADILDDEYLFSE------------QELTERERRDREHKRKL---- 554
+ K+ K V E + L E +E+ E+ER+ R+ R+L
Sbjct: 430 ELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVL 489
Query: 555 ---LELAKEHDKAREL----EKVQRYYMPKGTKGPEKYEEVDEK 591
EL K + A +L EK+++Y + + K E+YE++ EK
Sbjct: 490 KKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEK 533
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 41.4 bits (97), Expect = 0.004
Identities = 49/241 (20%), Positives = 79/241 (32%), Gaps = 34/241 (14%)
Query: 452 RKERDEFSERLKNRDK-----NNTKNVATSRQDKRAY-EEAAKRLKLEAESRDKIIPKLR 505
R+E+ F ER++ + Q R E + +L E + +L
Sbjct: 640 REEQAAFEERVEGLLAVLEAQFIDLSTLFCVQRLRVAAELQKEEARLALEGNIERTKELN 699
Query: 506 IESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKAR 565
E R + RKE + + D E F E E++ RE + + LE E A
Sbjct: 700 DELRAELELHRKE-ILDLFDCGTADTEDAFREAAREEQQLTQRESRLESLEAQLEGVAAE 758
Query: 566 ELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQY 625
E E+ E+E E DE++ +V A R+ Q
Sbjct: 759 AYELSAS----------LDQRELKEEELALLEEAIDALDEEVEELHAQVAALSRQ-IAQL 807
Query: 626 ELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSLPIYPFKQDLIDAVRDH 685
E G + E K+ L E+ +K + Q L +A+R
Sbjct: 808 E---------------GGGTVAELRQRRESLKEDLE-EKARKWASLRLAVQVLEEALRLF 851
Query: 686 Q 686
+
Sbjct: 852 K 852
Score = 31.7 bits (72), Expect = 3.5
Identities = 54/265 (20%), Positives = 96/265 (36%), Gaps = 49/265 (18%)
Query: 443 HFLTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLE-AESRDKII 501
H L +S R D+++ + + + + +E K+L+ E R+++
Sbjct: 208 HKLLESRRAEHARLAELRSELRADRDHIRALRDAVELWPRLQEW-KQLEQELTRRREELA 266
Query: 502 --PK---LRIESRRKYLEKRKED------KVAEL-----------EADILDDEYLF---- 535
P+ LR+E R +L+K + + ++AEL EA + L
Sbjct: 267 TFPRDGVLRLEKREAHLQKTEAEIDALLVRLAELKDLASQLIPAKEAVLQALVRLHQQLS 326
Query: 536 ----SEQELTE------RERRDREHKRKLLELAKEHDKARELEKVQRYYMPKG--TKGPE 583
S ELTE + RD+E A+ R LE + RY + +
Sbjct: 327 EIKASAFELTETLAGIEADLRDKEEAAGNGFEAERVHDLRSLECMLRYQSSQRELKQTEA 386
Query: 584 KY-EEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLP 642
Y + +DEK R +EDE A R+ + E + E + Q +
Sbjct: 387 AYCKRLDEK--------RLFEDEAEEEARQRLADDEEEVRAGDEAREEKIAANSQVIDKE 438
Query: 643 GNRDKGEEEMTEKQKKKLNIEETKK 667
+ + T QK++ E+
Sbjct: 439 EVCNLYDRRDTAWQKQRFLREKQTA 463
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 41.2 bits (97), Expect = 0.004
Identities = 39/248 (15%), Positives = 82/248 (33%), Gaps = 57/248 (22%)
Query: 478 QDKRA-YEEAA-----KRLKLEAESR------------------DKIIPKLRIESRR--K 511
+++RA +EEAA K + E E + ++ + L ++ + +
Sbjct: 155 EERRAIFEEAAGISKYKERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAER 214
Query: 512 YLEKRKEDKVAEL-----EADILDDEYLFSEQELTERERRDREHKRKL---------LEL 557
Y E + E + EL + L +E ++EL E E E +L L L
Sbjct: 215 YKELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRL 274
Query: 558 AKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEK-VPHYEQRKWEDEQMSSAIFRVGA 616
+ E + Y E+ E+ +R E+ +
Sbjct: 275 EVSELEEEIEELQKELY--------ALANEISRLEQQKQILRERLANLERQLEELEAQLE 326
Query: 617 KDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSLPIYPFKQ 676
+ ++ L + + L + E E+ E + + +E + L +
Sbjct: 327 ELESKLDELAEELAELEE--KLEELKEELESLEAELEELEAELEELESRLEEL------E 378
Query: 677 DLIDAVRD 684
+ ++ +R
Sbjct: 379 EQLETLRS 386
Score = 35.0 bits (81), Expect = 0.35
Identities = 28/146 (19%), Positives = 52/146 (35%), Gaps = 20/146 (13%)
Query: 446 TDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAES--------- 496
++ + DE +E L + + + + A E + E ES
Sbjct: 323 AQLEELESKLDELAEEL-AELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQL 381
Query: 497 ---RDKIIPKLR----IESRRKYLEKRKED---KVAELEADILDDEYLFSEQELTERERR 546
R K+ + + + LE R E + L+ +I + E EL E +
Sbjct: 382 ETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAE 441
Query: 547 DREHKRKLLELAKEHDKARELEKVQR 572
E + +L EL +E ++ E + R
Sbjct: 442 LEELEEELEELQEELERLEEALEELR 467
Score = 31.6 bits (72), Expect = 3.4
Identities = 43/224 (19%), Positives = 84/224 (37%), Gaps = 23/224 (10%)
Query: 455 RDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLE 514
R E E + ++ + SRQ ++ A RL+ E E ++ I +L E
Sbjct: 704 RKELEELEEELEQLRKELEELSRQISALRKDLA-RLEAEVEQLEERIAQLSKEL------ 756
Query: 515 KRKEDKVAELEADILDDEYLFSEQELTERERRDRE---HKRKLLELAKEHDKAR-ELEKV 570
E ++ ELE + + E +E E E E + + K +L L + D+ R EL +
Sbjct: 757 TELEAEIEELEERLEEAEEELAEAE-AEIEELEAQIEQLKEELKALREALDELRAELTLL 815
Query: 571 QRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWED-----EQMSSAIFRVGAKDREHQEQY 625
+ + E+ + + E+ E +++ I + E + +
Sbjct: 816 NEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESEL 875
Query: 626 ELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSL 669
E LL N+ ++ E+ E ++ +E + L
Sbjct: 876 EALL-NERASLEE-----ALALLRSELEELSEELRELESKRSEL 913
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 40.0 bits (93), Expect = 0.011
Identities = 35/196 (17%), Positives = 73/196 (37%), Gaps = 19/196 (9%)
Query: 473 VATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDE 532
A SR+ ++ +E K+L E E+ ++I L ++ K + K E + +
Sbjct: 163 AAGSREKRKK-KERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYY-QLKEKL 220
Query: 533 YLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKE 592
L E L + E + LL+ D+ E+E ++ E E+E
Sbjct: 221 ELEEENLLYLDYLKLNEERIDLLQ-ELLRDEQEEIESSKQ-------------ELEKEEE 266
Query: 593 KVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEM 652
+ Q E+++ + + ++ E L + + + K E+
Sbjct: 267 IL---AQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKE 323
Query: 653 TEKQKKKLNIEETKKS 668
+K +K+L E+ +
Sbjct: 324 LKKLEKELKKEKEEIE 339
Score = 32.2 bits (73), Expect = 2.6
Identities = 28/206 (13%), Positives = 65/206 (31%), Gaps = 11/206 (5%)
Query: 448 SDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRI- 506
K +E ERL+ +++ E+ K E ++K K
Sbjct: 846 EQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEE 905
Query: 507 ESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKA-- 564
ES++ L + KE+++ E A+ + + + E +++ +
Sbjct: 906 ESQKDNLLEEKENEIEERIAEEAIILLKYESEP-----EELLLEEADEKEKEEDNKEEEE 960
Query: 565 RELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQ 624
+++ G E +EKE+ Y + + + E++ + + E +
Sbjct: 961 ERNKRLLLAKEELGNVNLMAIAEFEEKEE--RYNKDELKKERLEEEKKELLREIIE-ETC 1017
Query: 625 YELLLDNQIDFIQALSLPGNRDKGEE 650
++ L E
Sbjct: 1018 QRFKEFLELFVSINRGLNKVFFYLEL 1043
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 38.6 bits (90), Expect = 0.016
Identities = 49/197 (24%), Positives = 79/197 (40%), Gaps = 30/197 (15%)
Query: 464 NRDKNNTKNVATSRQDKRAYEEAAKRLK-LEAESRDKI---IPKLRIESRRKYLEKRKED 519
R +NN S D A E AK L+ LE+ K + K++ ++R+ LE + +
Sbjct: 15 KRTRNNGPATIASGFDPTALERGAKALRELESSPHAKKAFELSKMQEKTRQAELEAKIK- 73
Query: 520 KVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKAR-----ELEKVQRYY 574
E EA Q ER R + E +RK L+ + ++ R EL + +RY
Sbjct: 74 ---EYEAQQ--------AQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELAR-KRYQ 121
Query: 575 MPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQID 634
K E+ +E E + E+ E M A + R + E L+ +
Sbjct: 122 -----KELEQQRRQNE-ELLKMQEESVLRQEAMRRATEEEILEMRRETIEEEAELEREN- 174
Query: 635 FIQALSLPGNRDKGEEE 651
I+A R + +EE
Sbjct: 175 -IRAKIEAEARGRAKEE 190
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 39.0 bits (90), Expect = 0.019
Identities = 49/222 (22%), Positives = 88/222 (39%), Gaps = 21/222 (9%)
Query: 367 HPVIQLFTILPSSSTSSRLFTTLPSSLAAVPHHIQRQSPAQSYHRMTPGCTDRRTQSPCP 426
HP +QL + + + + P++L Q + + R + C R++ +
Sbjct: 331 HPGLQLAAMEKAKHAEAANYGISPNTLI-------NQRNEEQHGRRSSSCASRQSANNVT 383
Query: 427 SCLKVFKTPKVRAFMGHFLTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDK-RAYEE 485
+ + V + S DDRK + + + + V ++ R +E
Sbjct: 384 NITSITSVTSVASVASVASVPSKDDRKYPQDGATHCHAVNGHYGGRVDKDHAERARIEKE 443
Query: 486 AAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERER 545
A R LE KI+ K RIE LE+ + +++ + ++ E L E+ ER
Sbjct: 444 NAHRKALEM----KILEKKRIER----LEREERERLERERMERIERERLERERLERERLE 495
Query: 546 RDREHKRKLLELAKEHDKARE---LEKVQR--YYMPKGTKGP 582
RDR + +L L +E E LEK +R Y++ G
Sbjct: 496 RDRLERDRLDRLERERVDRLERDRLEKARRNSYFLKGMENGL 537
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 38.5 bits (90), Expect = 0.026
Identities = 45/208 (21%), Positives = 93/208 (44%), Gaps = 24/208 (11%)
Query: 470 TKNVATSRQDKRAYEEAAKRLK-LEAESRDKIIPKLRIESRRKYL-----EKRKEDKVAE 523
++N +S KR +E + L LEA+ +++ + L E R++ + E
Sbjct: 656 SRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELE 715
Query: 524 LEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPE 583
+ + L E E+EL + + R E + +L EL +E ++ +E + E
Sbjct: 716 RQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQE-------------RLEE 762
Query: 584 KYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFI--QALSL 641
EE++ E+ + K E E++ + + E +E+ E + ++D + + SL
Sbjct: 763 LEEELESLEE--ALAKLKEEIEELEEKRQALQEELEELEEELE-EAERRLDALERELESL 819
Query: 642 PGNRDKGEEEMTEKQKKKLNIEETKKSL 669
R++ E+E+ E +++ +EE L
Sbjct: 820 EQRRERLEQEIEELEEEIEELEEKLDEL 847
Score = 37.8 bits (88), Expect = 0.047
Identities = 50/184 (27%), Positives = 89/184 (48%), Gaps = 14/184 (7%)
Query: 452 RKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLR--IESR 509
+E +E E + R + + + Q + E+ + L+ E E ++ + +L +E
Sbjct: 795 LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEEL 854
Query: 510 RKYLEKRKEDKVAELEADI--LDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARE- 566
K LE+ KE ++ ELEA+ L+DE E+E E E RE + +L EL +E +K RE
Sbjct: 855 EKELEELKE-ELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRER 913
Query: 567 ----LEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQ 622
K++R + PE EE++E+ + + + E E++ I +G +
Sbjct: 914 LEELEAKLERL----EVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAI 969
Query: 623 EQYE 626
E+YE
Sbjct: 970 EEYE 973
Score = 34.3 bits (79), Expect = 0.52
Identities = 46/241 (19%), Positives = 94/241 (39%), Gaps = 24/241 (9%)
Query: 445 LTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKL 504
L +E +E E L+ + + + ++ + EEA +LK E E ++ L
Sbjct: 732 LEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQAL 791
Query: 505 RIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKA 564
+ E + E+ +E AE D L+ E E RER ++E + E+
Sbjct: 792 Q-EELEELEEELEE---AERRLDALEREL---ESLEQRRERLEQEIEELEEEI------- 837
Query: 565 RELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQ 624
ELE+ + + ++ EE+ E+ E+ + E E++ + + E +E+
Sbjct: 838 EELEEKLDELEEELEELEKELEELKEEL-----EELEAEKEELEDEL----KELEEEKEE 888
Query: 625 YELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKKSLPIY-PFKQDLIDAVR 683
E L + L + + E E + ++L +E + + ++ L +
Sbjct: 889 LEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELE 948
Query: 684 D 684
Sbjct: 949 R 949
Score = 33.1 bits (76), Expect = 1.1
Identities = 41/190 (21%), Positives = 83/190 (43%), Gaps = 22/190 (11%)
Query: 484 EEAAKRLKLEAESRDKIIPKLRIESR--RKYLEKRKEDKVAELEADILDDEYLFSEQELT 541
EE +RL+ E +K + KL ++ +Y E + E + ELE +L + +EL
Sbjct: 185 EENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELR--ELELALLLAKLKELRKELE 242
Query: 542 ERERRDREHKRKLLELAKE-HDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQR 600
E E + +L EL +E + +E+E+++ + EE+ E+ E+
Sbjct: 243 ELEEELSRLEEELEELQEELEEAEKEIEELK-----------SELEELREE-----LEEL 286
Query: 601 KWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQAL-SLPGNRDKGEEEMTEKQKKK 659
+ E ++ I + + +E+ E L + + + L L + +EE+ E++
Sbjct: 287 QEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLL 346
Query: 660 LNIEETKKSL 669
+E+ L
Sbjct: 347 EELEQLLAEL 356
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 37.2 bits (86), Expect = 0.040
Identities = 36/139 (25%), Positives = 61/139 (43%), Gaps = 15/139 (10%)
Query: 446 TDSDDDRKERDEFSERLK-----NRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKI 500
++ +++ D+ RLK +D+ + ++A EE AKR AE R +
Sbjct: 21 SEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEEEAKRK---AEERKRE 77
Query: 501 IPKLRIESRRKYLE--KRKEDKVAELEADILDDEYLFSEQEL-----TERERRDREHKRK 553
K+ E +K LE KR A ++ DDE E E +R +RDRE + +
Sbjct: 78 TLKIVEEEVKKELELKKRNTLLEANIDDVDTDDENEEEEYEAWKLRELKRIKRDREEREE 137
Query: 554 LLELAKEHDKARELEKVQR 572
+ E +K R + + +R
Sbjct: 138 MEREKAEIEKMRNMTEEER 156
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 38.0 bits (88), Expect = 0.041
Identities = 36/124 (29%), Positives = 57/124 (45%), Gaps = 11/124 (8%)
Query: 452 RKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRK 511
+KE D +L + V+ ++DK++ + KRLK EA+SR +L E +RK
Sbjct: 454 KKENDMLQTKLNSM-------VSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRK 506
Query: 512 YLEKRKEDKVAELEADILDD--EYLFSEQELTERERRDREHKRKLLELAKE--HDKAREL 567
E+ + A A ++ E L ++ E E + EH KL E +A+EL
Sbjct: 507 KEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQEL 566
Query: 568 EKVQ 571
K Q
Sbjct: 567 RKYQ 570
>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
transcription initiation/nucleotide excision repair
factor TFIIH, subunit TFB3 [Cell division and chromosome
partitioning / Transcription / DNA replication,
recombination, and repair].
Length = 314
Score = 36.9 bits (85), Expect = 0.057
Identities = 33/157 (21%), Positives = 59/157 (37%), Gaps = 29/157 (18%)
Query: 409 YHRMTPGCTDR---RTQSPCPS--CLKV-----FKTP-----------KVRAFM----GH 443
YHRM C DR R + CP C K+ F VR +
Sbjct: 35 YHRMCESCVDRIFSRGPAQCPYKGCGKILRKIKFIKQTFEDITVEKEVDVRKRLLRAFNK 94
Query: 444 FLTDSDDDRKERDEFSERLKNRDKN--NTKNVATSRQDKRAYEEAAKRLKLEAESRDKII 501
+ D + +++ E ++ +V+ + + + YEE + L R
Sbjct: 95 EEEEFGGDLAKYNDYLEEVEELVFELLELIDVSLTEEKVKKYEEMNQDSILNNLERPT-- 152
Query: 502 PKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQ 538
P++ ++ L+KR + K +LE I ++E + E
Sbjct: 153 PEVMPGKQKNVLQKRMKLKKRQLERQIEEEERMNKEW 189
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 36.7 bits (85), Expect = 0.12
Identities = 29/172 (16%), Positives = 53/172 (30%), Gaps = 45/172 (26%)
Query: 450 DDRKERDEFSERL----KNRDKNNT--------KNVATSRQDKRAYEE------------ 485
D K DE E L K + + + +++A E+
Sbjct: 178 DLVKALDEAEEELANLRKELRQLEKEKQRLERLRRLLPLLAERKALEQQLAALGEVIDLP 237
Query: 486 ---AAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEAD--ILDD--------- 531
+ + AE R + +E + LE + ++ E+ D +L
Sbjct: 238 PDAVERYEEARAELRAA---RRNLELLTERLEALQA-ELDEISLDEELLAQAAAIEALHQ 293
Query: 532 ---EYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTK 580
EY +EQ+L +RE + L + + E V+ K
Sbjct: 294 QRGEYRNAEQDLPDREGEIANAREAAAALLAQIGPDADEEAVESLRPSLAAK 345
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 35.5 bits (82), Expect = 0.13
Identities = 32/149 (21%), Positives = 65/149 (43%), Gaps = 9/149 (6%)
Query: 453 KERDEFSERLKNRDKNNTK-NVATSRQDKRA--YEEAAKRLKLEAESRDKIIPKLRIESR 509
+E+ E ER++ +++ + ++ A EE K+ + EA+ +K +L E+R
Sbjct: 5 REQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENR 64
Query: 510 RKYLEKRKEDKVAE---LEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARE 566
R LE+ E LEA++ + ++ E ERE+++ E ++ EL + +
Sbjct: 65 R--LEEEAAASEEERERLEAEVDEATAEVAKLE-EEREKKEAETRQLQQELREAQEAHER 121
Query: 567 LEKVQRYYMPKGTKGPEKYEEVDEKEKVP 595
+ T P V+ ++ P
Sbjct: 122 ARQELLEAAAAPTAPPHVAAPVNGEQLEP 150
>gnl|CDD|238550 cd01130, VirB11-like_ATPase, Type IV secretory pathway component
VirB11, and related ATPases. The homohexamer, VirB11 is
one of eleven Vir proteins, which are required for
T-pilus biogenesis and virulence in the transfer of
T-DNA from the Ti (tumor-inducing) plasmid of bacterial
to plant cells. The pilus is a fibrous cell surface
organelle, which mediates adhesion between bacteria
during conjugative transfer or between bacteria and host
eukaryotic cells during infection. VirB11- related
ATPases include the archaeal flagella biosynthesis
protein and the pilus assembly proteins CpaF/TadA and
TrbB. This alignment contains the C-terminal domain,
which is the ATPase.
Length = 186
Score = 35.0 bits (81), Expect = 0.14
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 678 LIDAVRDHQVLIIEGETGSGKTT 700
L AV + ++I G TGSGKTT
Sbjct: 18 LWLAVEARKNILISGGTGSGKTT 40
>gnl|CDD|222258 pfam13604, AAA_30, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. There is a Walker A and
Walker B.
Length = 195
Score = 34.9 bits (81), Expect = 0.17
Identities = 25/121 (20%), Positives = 49/121 (40%), Gaps = 35/121 (28%)
Query: 680 DAVRD-----HQVLIIEGETGSGKTT---QIPQYLHEAGFTKDKMKIGCTQPRRVAAMSV 731
+AVR +V +++G G+GKTT + AG+ RV ++
Sbjct: 8 EAVRHVLTSGDRVAVVQGPAGTGKTTSLKAAREAWEAAGY-------------RVIGLAP 54
Query: 732 SARVAEEMGVKLGCEVGYAIRFEDCTSERTIIKYMTDGTLHREFLSEPDLASYSVMIIDE 791
+ + A+ +G +LG E RT+ + + L + +++++DE
Sbjct: 55 TGKAAKVLGEELGIEA------------RTLASLL--HRWDKGEDPGRVLDAGTLLVVDE 100
Query: 792 A 792
A
Sbjct: 101 A 101
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 34.1 bits (79), Expect = 0.28
Identities = 19/92 (20%), Positives = 44/92 (47%), Gaps = 2/92 (2%)
Query: 515 KRKEDKVAELEADILD-DEYLFSEQELTERERRDREHKRKLLELAKEHDKAR-ELEKVQR 572
+ + ++ +L+ ++ + + + Q E+ ++ RE + EL +E + EL+K++
Sbjct: 65 NKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKA 124
Query: 573 YYMPKGTKGPEKYEEVDEKEKVPHYEQRKWED 604
PE+ E++ E+ KV +W D
Sbjct: 125 ELEKYEKNDPERIEKLKEETKVAKEAANRWTD 156
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 34.5 bits (80), Expect = 0.29
Identities = 16/69 (23%), Positives = 30/69 (43%), Gaps = 11/69 (15%)
Query: 452 RKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRK 511
K R+E E++ + +R E K+ + + E R+ + KL E +RK
Sbjct: 262 DKTREEEEEKILKAAEE-----------ERQEEAQEKKEEKKKEEREAKLAKLSPEEQRK 310
Query: 512 YLEKRKEDK 520
EK ++ +
Sbjct: 311 LEEKERKKQ 319
>gnl|CDD|183704 PRK12726, PRK12726, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 407
Score = 34.7 bits (79), Expect = 0.29
Identities = 28/120 (23%), Positives = 56/120 (46%), Gaps = 9/120 (7%)
Query: 592 EKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEE 651
+ P E+RK ++E++S+ + A +RE + + DF++ L G D +
Sbjct: 104 QSTPLLEERKTQEEELSAMRLELAALNRELAVKMREEREQNSDFVKFLKGRGISDTYVAD 163
Query: 652 MTEKQKKKLNIEETKK-------SLPIYPFKQDLIDA--VRDHQVLIIEGETGSGKTTQI 702
+ +K+ ET +P K + D+ + +H+++ + G+TG GKTT +
Sbjct: 164 FMQAGRKQFKQVETAHLDDITDWFVPYLSGKLAVEDSFDLSNHRIISLIGQTGVGKTTTL 223
>gnl|CDD|220245 pfam09444, MRC1, MRC1-like domain. This putative domain is found
to be the most conserved region in mediator of
replication checkpoint protein 1.
Length = 145
Score = 33.5 bits (77), Expect = 0.30
Identities = 28/119 (23%), Positives = 51/119 (42%), Gaps = 4/119 (3%)
Query: 456 DEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEK 515
DE S+ + + + S++ E AAKR + E E +K + KL + L +
Sbjct: 29 DEDSDDDDDSELEEMIDDEYSKEKAENDELAAKRAEEEREDDEKEVEKLLDDITTGGLRR 88
Query: 516 RKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYY 574
R+ +LE +DE Q + R +++LLE + DK + K + ++
Sbjct: 89 RRGKNGLDLELSDSEDE---LLQRRRLKRRELALMRQRLLE-DGDLDKLAKNPKSKAFF 143
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 34.7 bits (80), Expect = 0.32
Identities = 42/220 (19%), Positives = 89/220 (40%), Gaps = 26/220 (11%)
Query: 452 RKERDEFSERLKN-RDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRR 510
R+ERDE +E ++ ++K + N +E + + E +E
Sbjct: 61 REERDEINEEVQELKEKRDEINAKLQEL----RKEYRELKEKRNEFNLGGRSIKSLEREI 116
Query: 511 KYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKE----HDKARE 566
+ LEK+++ V E + + + ++ E ++ E KL EL E KARE
Sbjct: 117 ERLEKKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKARE 176
Query: 567 L-EKVQRY------YMPKGTKGPEKYEEVDEKEKVPH--YEQRKWEDEQMSSAIFRVGAK 617
+ EK+Q Y + K E+ +E+ ++ H + + + +++ + +
Sbjct: 177 IHEKIQELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNE 236
Query: 618 DREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQK 657
RE +++ + L + R + EE+ E+ +
Sbjct: 237 LRELEKKIKALRAKEKA--------AKRREKREELKERAE 268
>gnl|CDD|227095 COG4753, COG4753, Response regulator containing CheY-like receiver
domain and AraC-type DNA-binding domain [Signal
transduction mechanisms].
Length = 475
Score = 34.6 bits (80), Expect = 0.34
Identities = 31/160 (19%), Positives = 52/160 (32%), Gaps = 16/160 (10%)
Query: 481 RAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQE- 539
+ Y E A++L E K + + K+ K E + + E
Sbjct: 239 KVYREIAEKLGFEEADYFDYRFKKYLGMTPD--QYNKQLKQNITENNARGITAILEELFV 296
Query: 540 -LTERERRDREHK--------RKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDE 590
E +HK L +D+ E VQR + + E+ EE+
Sbjct: 297 MGVEHLLESSKHKDALEIWYLLSLYVQMILNDEGLLEEIVQRIH---DIQSFEELEEITS 353
Query: 591 KEKVPHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLD 630
+ Y + E+ + SS I +V +H L L
Sbjct: 354 QILHVEYVIQSVENIEGSSLIQKVLDYIHKH-FSENLTLK 392
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 35.0 bits (81), Expect = 0.36
Identities = 27/113 (23%), Positives = 50/113 (44%), Gaps = 14/113 (12%)
Query: 447 DSDDDRKERDEFSERLKNR-DKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLR 505
D+ D+R R+ R +NR ++ + ++ + E K A ++D+ R
Sbjct: 622 DTRDNRTRREGRENREENRRNRRQAQQQTAETRESQQAEVTEK-----ARTQDEQQQAPR 676
Query: 506 IE-SRRKYLEKRK---EDKVAELEADILDDEYLFSEQELTERERRDREHKRKL 554
E RR+ EKR+ E K +E + + EQE ++ + R +R+L
Sbjct: 677 RERQRRRNDEKRQAQQEAKALNVEEQSVQET----EQEERVQQVQPRRKQRQL 725
>gnl|CDD|203709 pfam07652, Flavi_DEAD, Flavivirus DEAD domain.
Length = 146
Score = 33.1 bits (76), Expect = 0.38
Identities = 30/104 (28%), Positives = 46/104 (44%), Gaps = 17/104 (16%)
Query: 695 GSGKTTQ-IPQYLHEAGFTKDKMKIGCTQPRRVAAMSVSARVAEEMGVKL-GCEVGY--- 749
G+GKT + +P+ + E +++ P RV V EM L G + Y
Sbjct: 12 GAGKTRKVLPELVREC--IDRRLRTLVLAPTRV--------VLAEMEEALRGLPIRYHTP 61
Query: 750 AIRFEDCTSERTIIKYMTDGTLHREFLSEPDLASYSVMIIDEAH 793
A+ E E I+ M T + LS + +Y V+I+DEAH
Sbjct: 62 AVSSEHTGRE--IVDVMCHATFTQRLLSPVRVPNYEVIIMDEAH 103
>gnl|CDD|215832 pfam00270, DEAD, DEAD/DEAH box helicase. Members of this family
include the DEAD and DEAH box helicases. Helicases are
involved in unwinding nucleic acids. The DEAD box
helicases are involved in various aspects of RNA
metabolism, including nuclear transcription, pre mRNA
splicing, ribosome biogenesis, nucleocytoplasmic
transport, translation, RNA decay and organellar gene
expression.
Length = 169
Score = 33.4 bits (77), Expect = 0.42
Identities = 28/142 (19%), Positives = 50/142 (35%), Gaps = 14/142 (9%)
Query: 672 YPFKQDLIDAVRDHQVLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIG-CTQPRRVAAMS 730
P + I A+ + ++++ TGSGKT + +A K P R A
Sbjct: 1 TPIQAQAIPAILSGKDVLVQAPTGSGKTLAFLLPILQALLPKKGGPQALVLAPTRELAEQ 60
Query: 731 VSARVAEEMGVKLGCEVGYAI-------RFEDCTSERTIIKYMTDGTLHREFLSEPDLAS 783
+ +++ LG V + + I T G L
Sbjct: 61 I-YEELKKLFKILGLRVALLTGGTSLKEQARKLKKGKADILVGTPGRLLDLLRRGKLKLL 119
Query: 784 YSV--MIIDEAHPKVYVIDPGF 803
++ +++DEAH ++D GF
Sbjct: 120 KNLKLLVLDEAH---RLLDMGF 138
>gnl|CDD|150406 pfam09727, CortBP2, Cortactin-binding protein-2. This entry is the
first approximately 250 residues of cortactin-binding
protein 2. In addition to being a positional candidate
for autism this protein is expressed at highest levels
in the brain in humans. The human protein has six
associated ankyrin repeat domains pfam00023 towards the
C-terminus which act as protein-protein interaction
domains.
Length = 193
Score = 33.3 bits (76), Expect = 0.45
Identities = 18/84 (21%), Positives = 36/84 (42%)
Query: 480 KRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQE 539
K +L + + K + +L E R+ +K D L + EQE
Sbjct: 95 KETQRRMLAQLLAAEKRQRKTVLELEEEKRKHIRYMKKSDDFTNLLEQERERLKKLLEQE 154
Query: 540 LTERERRDREHKRKLLELAKEHDK 563
+++ ++++EH++ L L +E K
Sbjct: 155 KSQQAKKEQEHRKLLATLEEELGK 178
>gnl|CDD|238071 cd00124, MYSc, Myosin motor domain. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Myosins are actin-dependent molecular
motors that play important roles in muscle contraction,
cell motility, and organelle transport. The head domain
is a molecular motor, which utilizes ATP hydrolysis to
generate directed movement toward the plus end along
actin filaments. A cyclical interaction between myosin
and actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 679
Score = 34.2 bits (79), Expect = 0.49
Identities = 14/44 (31%), Positives = 24/44 (54%), Gaps = 4/44 (9%)
Query: 676 QDLIDAVRDHQVLIIEGETGSGKTT---QIPQYLHEAGFTKDKM 716
++++ R+ Q +II GE+G+GKT I +YL + D
Sbjct: 78 RNMLRDRRN-QSIIISGESGAGKTENTKLIMKYLASLAGSNDTG 120
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 34.2 bits (79), Expect = 0.59
Identities = 42/159 (26%), Positives = 72/159 (45%), Gaps = 33/159 (20%)
Query: 450 DDRKER-DEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIES 508
+R+E +E ER + + + + ++KR EAA + EAE + + +L S
Sbjct: 526 AERRETIEEKRERAEELRERAAE-LEAEAEEKR---EAAAEAEEEAEEAREEVAEL--NS 579
Query: 509 RRKYLEKRKE--DKVAELEADILDDEYLFSEQELTER---------ERRD-----REHKR 552
+ L++R E +++ L A I + + L E+ ERR+ RE KR
Sbjct: 580 KLAELKERIESLERIRTLLAAI--ADAEDEIERLREKREALAELNDERRERLAEKRERKR 637
Query: 553 KLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEK 591
EL E D+AR +E+ + + E E+V+EK
Sbjct: 638 ---ELEAEFDEAR-IEEARE----DKERAEEYLEQVEEK 668
Score = 33.5 bits (77), Expect = 0.85
Identities = 31/121 (25%), Positives = 52/121 (42%), Gaps = 22/121 (18%)
Query: 451 DRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRR 510
+ KE + ERL N ++ + + E A+ + EA+ +E
Sbjct: 197 EEKEEKDLHERL-NGLESELAELDEEIERYEEQREQARETRDEADEV--------LEE-- 245
Query: 511 KYLEKRKEDKVAELEADILDDEYLFSEQELTERERRD-----REHKRKLLELAKEHDKAR 565
+ E+R+E + LEA+I D +E TERER + R+ + +L EL +E D
Sbjct: 246 -HEERREE--LETLEAEIEDLRETIAE---TEREREELAEEVRDLRERLEELEEERDDLL 299
Query: 566 E 566
Sbjct: 300 A 300
Score = 30.8 bits (70), Expect = 6.0
Identities = 27/118 (22%), Positives = 48/118 (40%), Gaps = 30/118 (25%)
Query: 449 DDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIES 508
++DR+ +E L++ + ++ E A+ L EAE R + R+E
Sbjct: 471 EEDRERVEELEAELED--------LEEEVEEVEERLERAEDLV-EAEDRIE-----RLEE 516
Query: 509 RRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARE 566
RR+ LE+ ++ +E RER + +R EL E ++ RE
Sbjct: 517 RREDLEELIAERRETIEEK---------------RERAEELRERA-AELEAEAEEKRE 558
Score = 30.4 bits (69), Expect = 7.5
Identities = 29/130 (22%), Positives = 49/130 (37%), Gaps = 34/130 (26%)
Query: 452 RKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKI------IPKLR 505
E E E ++ ++ + T +EA + L+ E R+++ I LR
Sbjct: 212 ESELAELDEEIERYEEQREQARETR-------DEADEVLEEHEERREELETLEAEIEDLR 264
Query: 506 -------------------IESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERR 546
+ R + LE+ ++D +AE D D E + E E E R
Sbjct: 265 ETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAV--EARREELEDR 322
Query: 547 DREHKRKLLE 556
D E + +L E
Sbjct: 323 DEELRDRLEE 332
>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional.
Length = 850
Score = 34.2 bits (79), Expect = 0.62
Identities = 32/115 (27%), Positives = 49/115 (42%), Gaps = 15/115 (13%)
Query: 460 ERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKED 519
ER K DK Q + E A+ +E+ + + E+R +E E
Sbjct: 684 ERQKMHDK-----AKAEEQRTKLLELQAESAAVESSGQSRAEALAEAEARL--IEAEAEV 736
Query: 520 KVAELEADILDDEYLFSEQELT-ERERRDREHK--RKLLELAKEHDKARELEKVQ 571
+ AEL A L E E EL R+R++ E + + EL E KA+EL ++
Sbjct: 737 EQAELRAKALRIEA---EAELEKLRKRQELELEYEQAQNEL--EIAKAKELADIE 786
>gnl|CDD|173607 PTZ00417, PTZ00417, lysine-tRNA ligase; Provisional.
Length = 585
Score = 33.8 bits (77), Expect = 0.65
Identities = 28/107 (26%), Positives = 49/107 (45%), Gaps = 25/107 (23%)
Query: 376 LPSSSTSSRLFTTLPSSLAA---------VPHHIQRQSPAQSYHRMTPGCTDRRTQSPCP 426
+P+ T+++L L S + H Q SP YHR PG T+R
Sbjct: 441 MPNPPTAAKLLDQLASHFIENKYPNKPFFIIEHPQIMSPLAKYHRSKPGLTER------- 493
Query: 427 SCLKVFKTPK--VRAFMGHFLTDSDDDRKERDEFSERLKNRDKNNTK 471
L++F K + A+ T+ +D K+++ FS + K+R+K + +
Sbjct: 494 --LEMFICGKEVLNAY-----TELNDPFKQKECFSAQQKDREKGDAE 533
>gnl|CDD|184377 PRK13894, PRK13894, conjugal transfer ATPase TrbB; Provisional.
Length = 319
Score = 33.6 bits (77), Expect = 0.67
Identities = 14/34 (41%), Positives = 24/34 (70%)
Query: 675 KQDLIDAVRDHQVLIIEGETGSGKTTQIPQYLHE 708
++ +I AVR H+ +++ G TGSGKTT + ++E
Sbjct: 138 REAIIAAVRAHRNILVIGGTGSGKTTLVNAIINE 171
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 33.0 bits (76), Expect = 0.71
Identities = 31/144 (21%), Positives = 59/144 (40%), Gaps = 24/144 (16%)
Query: 447 DSDDDRKERDEFSERLKNRDKNNTKNVATSRQ--DKRAYEEAAKRLKLEAESRDKIIPKL 504
++ K + S++ +N N+ S+ + +LK + + + +L
Sbjct: 38 SEAEEVKNEENKSKKKAAPIENAEGNIVFSKVEFADGEQAKKDLKLKKKKKKKKTDYKQL 97
Query: 505 --RIESRRKYLEKRKEDKVAELEADIL--------------DDEYLFS------EQELTE 542
++E+R+K LE+ EDK AE+E DDE L E++ +
Sbjct: 98 LKKLEARKKKLEELDEDKAAEIEEKEKWTKALAKAEGVKVKDDEKLLKKALKRKEKQKKK 157
Query: 543 RERRDREHKRKLLELAKEHDKARE 566
++ +E K K+ + E K RE
Sbjct: 158 SKKEWKERKEKVEKKKAERQKKRE 181
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 33.7 bits (77), Expect = 0.78
Identities = 26/114 (22%), Positives = 50/114 (43%), Gaps = 2/114 (1%)
Query: 453 KERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKY 512
KERD L + DK T+ S Q + EEA ++ K+ +L
Sbjct: 413 KERDGLRAILNSYDKELTETSV-SGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVG 471
Query: 513 LEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARE 566
+K + + + E E +L ++ + +E+ L+ + + K+ L +E D+ R+
Sbjct: 472 QQKDRNNTL-ETEIKLLKEQLVSNERLLSFVKEATNALRLKIETLERERDRLRQ 524
>gnl|CDD|234011 TIGR02782, TrbB_P, P-type conjugative transfer ATPase TrbB. The
TrbB protein is found in the trb locus of Agrobacterium
Ti plasmids where it is involved in the type IV
secretion system for plasmid conjugative transfer. TrbB
is a homolog of the vir system VirB11 ATPase , and the
Flp pilus sytem ATPase TadA [Cellular processes,
Conjugation].
Length = 299
Score = 33.2 bits (76), Expect = 0.78
Identities = 11/23 (47%), Positives = 16/23 (69%)
Query: 678 LIDAVRDHQVLIIEGETGSGKTT 700
L +AV + +++ G TGSGKTT
Sbjct: 125 LREAVLARKNILVVGGTGSGKTT 147
>gnl|CDD|223363 COG0286, HsdM, Type I restriction-modification system
methyltransferase subunit [Defense mechanisms].
Length = 489
Score = 33.2 bits (76), Expect = 0.93
Identities = 20/136 (14%), Positives = 49/136 (36%), Gaps = 15/136 (11%)
Query: 426 PSCLKVFKTPKVRAFMGHFLTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEE 485
P+ + K D+ + E+ +RL + + ++ + +
Sbjct: 367 PTNILFLTKNKPAERNDVLFIDASKEHFEKPLNKKRLTEENIEKIVDTYREFKEIEGFSK 426
Query: 486 AAKRLKLEAESRDKIIPKLRIESRRKYLEKRKED--KVAELEADILDDEYLFSEQELTER 543
+ S ++I + +Y+ + +E+ + EL ++ + + EL E
Sbjct: 427 SV--------SLEEIRENDYNLNPGRYVAEEEEEDIDLDELLQELSEIDA-----ELAEL 473
Query: 544 ERRDREHKRKLLELAK 559
E+ E ++LL K
Sbjct: 474 EKELEEILKELLLNLK 489
>gnl|CDD|218380 pfam05010, TACC, Transforming acidic coiled-coil-containing protein
(TACC). This family contains the proteins TACC 1, 2 and
3 the genes for which are found concentrated in the
centrosomes of eukaryotic and may play a conserved role
in organising centrosomal microtubules. The human TACC
proteins have been linked to cancer and TACC2 has been
identified as a possible tumour suppressor (AZU-1). The
functional homologue (Alp7) in Schizosaccharomyces pombe
has been shown to be required for organisation of
bipolar spindles.
Length = 207
Score = 32.4 bits (74), Expect = 0.93
Identities = 31/123 (25%), Positives = 50/123 (40%), Gaps = 26/123 (21%)
Query: 476 SRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLF 535
S++D A + KR + +E + + K + R + LE K V E E I
Sbjct: 3 SQKDMDAALKKIKREVISSELESEELKKKYEKLRSENLEMGK--IVDEFEKTI------- 53
Query: 536 SEQELTERERRDREHKRKLLELAKEHDKA--------RELEKVQRYYMPKGTKGPEKYEE 587
Q + E +++ KR+L E+ E D+A + + Y EKY+E
Sbjct: 54 -AQMIEESQKQKELSKRELQEVLAEKDQAYADLNSLETSFSDLFKRY--------EKYKE 104
Query: 588 VDE 590
V E
Sbjct: 105 VIE 107
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 33.1 bits (75), Expect = 1.1
Identities = 32/177 (18%), Positives = 65/177 (36%), Gaps = 15/177 (8%)
Query: 460 ERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKED 519
E L+ + + V + + + ++ ++ E RD+I K + + K E+
Sbjct: 189 ETLRQVRQTQGQKVQEHQMELKYLKQYKEK---ACEIRDQITSKEAQLESSREIVKSYEN 245
Query: 520 KVAELEADILDDEYLFS-----EQELTERERR--DREHKRKLLELAKEHDKARELEKVQR 572
++ L+ + + E+ S + E+ + R E LEL E E++
Sbjct: 246 ELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLND 305
Query: 573 YYMPKGTKGPEKYEE-VDEKEKVPHYEQRKWEDEQMSSA----IFRVGAKDREHQEQ 624
Y EK E VD + ++ + + Q + R+ + HQE
Sbjct: 306 LYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEH 362
>gnl|CDD|219890 pfam08537, NBP1, Fungal Nap binding protein NBP1. NBP1 is a
nuclear protein which has been shown in Saccharomyces
cerevisiae to be essential for the G2/M transition of
the cell cycle.
Length = 313
Score = 32.9 bits (75), Expect = 1.1
Identities = 29/162 (17%), Positives = 55/162 (33%), Gaps = 11/162 (6%)
Query: 437 VRAFMGHFLTDSDDDRKER---DEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLE 493
++ F +D ++E DE ER K R K K + + + + A LK
Sbjct: 5 LKGLWKDFWIKTDGRKREYGSLDEPRERSKVRRKRARKTSKSYLKVSKPNTDRASELKRR 64
Query: 494 AESRDKIIPKLR----IESRRKYLEKRKED----KVAELEADILDDEYLFSEQELTERER 545
E R + R + + D + A + + + +++ +R
Sbjct: 65 TERRGPPTKRRRQRTFWNNIKSVFSNDNRDLSKMQNASINYRLPNSPADEEQEDAKDRIL 124
Query: 546 RDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEE 587
R K+K+LEL EL + + + P +
Sbjct: 125 RSEAFKKKVLELKYNKRMLEELRRGSQDGRSNRDRNPSLNLD 166
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 33.3 bits (77), Expect = 1.2
Identities = 28/116 (24%), Positives = 54/116 (46%), Gaps = 12/116 (10%)
Query: 484 EEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTER 543
EEA K + + E +++I L E R+ +K +E + EA+ L +E +++L E
Sbjct: 505 EEAKKLIGEDKEKLNELIASL-EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE 563
Query: 544 ERRDREHK----RKLLELAKEHDKARELEKVQR--YYMPKGTKGPEKYEEVDEKEK 593
E + E ++ ++ AK+ E +++ + + KG K E+ E K
Sbjct: 564 EDKLLEEAEKEAQQAIKEAKK-----EADEIIKELRQLQKGGYASVKAHELIEARK 614
Score = 31.0 bits (71), Expect = 5.4
Identities = 19/99 (19%), Positives = 42/99 (42%), Gaps = 4/99 (4%)
Query: 480 KRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEAD-ILDDEYLFSEQ 538
+ E+ + L+ + E + KL E+ ++ + KE A+ EAD I+ + +
Sbjct: 543 LKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE---AKKEADEIIKELRQLQKG 599
Query: 539 ELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPK 577
+ + RK L A E + ++ ++ ++ K
Sbjct: 600 GYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 32.8 bits (76), Expect = 1.4
Identities = 27/120 (22%), Positives = 56/120 (46%), Gaps = 20/120 (16%)
Query: 471 KNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRI-ESRRKYLEKR---------KEDK 520
K +A ++ + EE AKR+ EA+ + I K + E++ + + R + ++
Sbjct: 26 KKIAEAKI--KEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNE 83
Query: 521 VAELEADILD--------DEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQR 572
+ +LE +L E L +E E++ ++ E K++ LE +E + E++Q
Sbjct: 84 LQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 31.6 bits (71), Expect = 1.8
Identities = 34/134 (25%), Positives = 58/134 (43%), Gaps = 7/134 (5%)
Query: 537 EQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPH 596
EQE E+E +DR + +L A E RE E ++ K + + +E+EK
Sbjct: 43 EQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQ 102
Query: 597 YEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQ 656
EQ + + ++ + A+ RE E+ L+ + F Q R K EE+ ++
Sbjct: 103 EEQERIQKQKEEAE-----ARAREEAERMR--LEREKHFQQIEQERLERKKRLEEIMKRT 155
Query: 657 KKKLNIEETKKSLP 670
+K + KK P
Sbjct: 156 RKSEVSPQVKKEDP 169
Score = 30.4 bits (68), Expect = 3.8
Identities = 26/125 (20%), Positives = 56/125 (44%), Gaps = 2/125 (1%)
Query: 449 DDDRKERDEFSERLKNRDKNNTKNVATSRQDK-RAYEEAAKRLKLEAESRDKIIPKLRIE 507
+ +R+E++E + K R+++ R EE R K E R + + +
Sbjct: 43 EQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQ 102
Query: 508 SRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKAREL 567
++ ++K+KE+ A + + L E+ + E+ E K++L E+ K K+
Sbjct: 103 EEQERIQKQKEEAEARAR-EEAERMRLEREKHFQQIEQERLERKKRLEEIMKRTRKSEVS 161
Query: 568 EKVQR 572
+V++
Sbjct: 162 PQVKK 166
>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
mechanisms].
Length = 733
Score = 32.4 bits (74), Expect = 1.9
Identities = 16/60 (26%), Positives = 26/60 (43%), Gaps = 11/60 (18%)
Query: 18 SYSVMIIDEAHERTLHTD-----ILFGLVKDIVRFRSDLKLLISSATLDAQKFSEFFDDA 72
S++I+DE H L+ D L L++ + + +L+ SATL E A
Sbjct: 338 LTSLVILDEVH---LYADETMLAALLALLEALAEA--GVPVLLMSATLPPF-LKEKLKKA 391
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 32.2 bits (74), Expect = 1.9
Identities = 36/128 (28%), Positives = 63/128 (49%), Gaps = 27/128 (21%)
Query: 465 RDKNNTKNVATSRQDKRAYEEAAKR----LKLEA--ESRDKIIPKLRIE------SRRKY 512
R + K + ++ + + E AK+ LK EA E+++++ KLR E RR
Sbjct: 19 RKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLEAKEEVH-KLRAELERELKERRNE 77
Query: 513 LEKRKEDKVAELEADILD---------DEYLFS-EQELTERERRDREHKRKLLELAKEHD 562
L +R E ++ + E + LD +E L E+EL+ +E+ E + +L EL E
Sbjct: 78 L-QRLERRLLQRE-ETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQR 135
Query: 563 KARELEKV 570
+ ELE++
Sbjct: 136 E--ELERI 141
>gnl|CDD|224308 COG1390, NtpE, Archaeal/vacuolar-type H+-ATPase subunit E [Energy
production and conversion].
Length = 194
Score = 31.6 bits (72), Expect = 2.0
Identities = 28/99 (28%), Positives = 45/99 (45%), Gaps = 9/99 (9%)
Query: 449 DDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAE-SRDKIIPKLRIE 507
+ +E +E E + + K A ++ EE ++ + EAE R +II +E
Sbjct: 13 REAEEEAEEILEEAR-EEAEKIKEEAKREAEEAI-EEILRKAEKEAERERQRIISSALLE 70
Query: 508 SRRKYLEKRKE--DKVAELEADILD----DEYLFSEQEL 540
+RRK LE ++E + V E + L D S QEL
Sbjct: 71 ARRKLLEAKEEILESVFEAVEEKLRNIASDPEYESLQEL 109
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 31.9 bits (73), Expect = 2.1
Identities = 25/160 (15%), Positives = 53/160 (33%), Gaps = 28/160 (17%)
Query: 445 LTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKI---- 500
+DS+D+ E + + + D + + + E A++ K + +I
Sbjct: 129 SSDSEDE--EEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATTRILTPA 186
Query: 501 ----IPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLE 556
I +LR+E K + + + DE + ++ D E K
Sbjct: 187 DFAKIQELRLEKGVDKALGGKLKRRDKDAPERHSDELVDAD---------DIEGPAKK-- 235
Query: 557 LAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPH 596
K+ K + + +G + EK+ K+
Sbjct: 236 --KKQTKEERIATAK-----EGREDREKFGSRKGKKDKEG 268
>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit. This
family includes the vacuolar ATP synthase E subunit, as
well as the archaebacterial ATP synthase E subunit.
Length = 195
Score = 31.5 bits (72), Expect = 2.2
Identities = 24/91 (26%), Positives = 40/91 (43%), Gaps = 11/91 (12%)
Query: 471 KNVATSRQDKRAYEEAAKRLKLEAESRDKI-IPKLRIESRRKYLEKRKEDKVAELEADIL 529
K A +K+ E K+ K +AE +I I + E+R K L R+E +L
Sbjct: 25 KAEAVEEAEKKIEEIYEKKEK-QAEMEKQIIISNAKNEARLKVLNAREE---------LL 74
Query: 530 DDEYLFSEQELTERERRDREHKRKLLELAKE 560
D + +++ L E+K L +L K+
Sbjct: 75 DSVFEEAKERLANLSEDKDEYKDLLKDLIKQ 105
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 31.6 bits (72), Expect = 2.2
Identities = 16/77 (20%), Positives = 35/77 (45%), Gaps = 5/77 (6%)
Query: 484 EEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTER 543
+E + K + ++ K K ++++ + EK K +K + ++ +D E EL E+
Sbjct: 42 DEEKEEEKAKVAAKAK--AKKALKAKIEEKEKAKREKEEKGLRELEEDT---PEDELAEK 96
Query: 544 ERRDREHKRKLLELAKE 560
R + + L A +
Sbjct: 97 LRLRKLQEESDLNNAAD 113
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 32.0 bits (73), Expect = 2.3
Identities = 16/97 (16%), Positives = 38/97 (39%), Gaps = 3/97 (3%)
Query: 476 SRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRR-KYLEKRKEDKVAELEA--DILDDE 532
+ + + ++ + +L E+ K + + ++ +LE+ + E
Sbjct: 406 KEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRRE 465
Query: 533 YLFSEQELTERERRDREHKRKLLELAKEHDKARELEK 569
++ E RDR +R EL ++ + ELE+
Sbjct: 466 VRDKVRKDREIRARDRRIERLEKELEEKKKRVEELER 502
>gnl|CDD|234232 TIGR03499, FlhF, flagellar biosynthetic protein FlhF. [Cellular
processes, Chemotaxis and motility].
Length = 283
Score = 31.9 bits (73), Expect = 2.3
Identities = 28/164 (17%), Positives = 53/164 (32%), Gaps = 16/164 (9%)
Query: 543 RERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKW 602
E +K LE A E ++ + + E +++
Sbjct: 57 AAASAEEEAKKALEPADPAKLPATAEPLELPAPLEEP--AAPAAQAAEPLLPEAELRKEL 114
Query: 603 ED-EQMSSAIFRVGAKDREHQEQ---YELLLDNQI--DFIQALSLPGNRDKGEEEMTEKQ 656
++ + A R E E LL+ + + + L D E+
Sbjct: 115 SALRELLERLLAGLALQRRDPEGAKLLERLLEAGVSEELARELLEKLPEDLDAEDAWRWL 174
Query: 657 KKKLNIEETKKSLPIYPFKQDLIDAVRDHQVLIIEGETGSGKTT 700
++ L + LP+ P + ++ V+ + G TG GKTT
Sbjct: 175 REAL-----EGMLPVKPEEDPIL---ERGGVIALVGPTGVGKTT 210
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 32.1 bits (73), Expect = 2.3
Identities = 26/96 (27%), Positives = 42/96 (43%), Gaps = 6/96 (6%)
Query: 484 EEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTER 543
E+A E + +I KL ++ +K + + E + L E +EL ER
Sbjct: 500 EQAKTFYGEFKEEINVLIEKL-SALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKER 558
Query: 544 ERRDREHKRKLLELAKEHDKA--RELEKVQRYYMPK 577
E R K +L + A+E KA +E+E + R K
Sbjct: 559 E---RNKKLELEKEAQEALKALKKEVESIIRELKEK 591
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 31.7 bits (72), Expect = 2.7
Identities = 36/228 (15%), Positives = 65/228 (28%), Gaps = 37/228 (16%)
Query: 453 KERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRL----------KLEAESRDKIIP 502
+EF + + K+ + E +D+I
Sbjct: 42 NTLEEFEKEANEKRAQYRSAKKKELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAK 101
Query: 503 KLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHD 562
LEK KE ++ E E D L E Q E + RE+ + L E++
Sbjct: 102 L--EALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENE 159
Query: 563 KARE--LEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVGAKDRE 620
K E LE + + + + E E+E+ + + + K +
Sbjct: 160 KKLEESLELEREKFEEQLHEANLDLE--------------FKENEEQRESKWAILKKLKR 205
Query: 621 HQEQYELLLDNQIDFIQALSLPGNR---------DKGEEEMTEKQKKK 659
E + + + S ++ DKG K
Sbjct: 206 RAELGSQQVQGEALELPNESFIRSKFPSDINEKADKGFRGFDCLNKVP 253
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 30.7 bits (70), Expect = 2.9
Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 497 RDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLE 556
R K+ L ++ + K E+ +E +V EL+ + DE L ++ + + RE++RK E
Sbjct: 87 RKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELL-EKELAKLKREKRRENERKQKE 145
Query: 557 LAKEHDK 563
+ KE K
Sbjct: 146 ILKEQMK 152
>gnl|CDD|223813 COG0742, COG0742, N6-adenine-specific methylase [DNA replication,
recombination, and repair].
Length = 187
Score = 31.0 bits (71), Expect = 2.9
Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 3/73 (4%)
Query: 865 QRINLGNAVLMLKALGINDLIHFDFLDPPPHETLVLALEQLYALGALNHHGDLTKLGRRM 924
R+ +A+ LK LG + FLDPP + L+ L L +G L +
Sbjct: 95 ARVLRNDALRALKQLGTREPFDLVFLDPPYAKGLLDKELALLLLE---ENGWLKPGALIV 151
Query: 925 AEFPLDPMMGKML 937
E D + ++
Sbjct: 152 VEHDKDVELPELP 164
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 31.6 bits (70), Expect = 2.9
Identities = 26/108 (24%), Positives = 50/108 (46%), Gaps = 4/108 (3%)
Query: 480 KRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQE 539
K ++A K++ + E+ K+ K E + E + + A+ E + + +EQE
Sbjct: 72 KSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEE----EHKQWKAEQE 127
Query: 540 LTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEE 587
++E+ +E K L+ K+ +KA + EK ++ K P EE
Sbjct: 128 RIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKASTPAPVEE 175
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 31.7 bits (72), Expect = 3.1
Identities = 20/82 (24%), Positives = 38/82 (46%), Gaps = 6/82 (7%)
Query: 513 LEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKVQR 572
L +R E V E E I+ E E+EL + + + + L + + +ELE+ +
Sbjct: 362 LLERLEKDVEEGEKTIVK-EARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEE 420
Query: 573 YYMPKGTKGPEKYEEVDEKEKV 594
++ + +K EE + EK+
Sbjct: 421 NFLKELSK-----EEKELLEKL 437
>gnl|CDD|220178 pfam09321, DUF1978, Domain of unknown function (DUF1978). Members
of this family are found in various hypothetical
proteins produced by the bacterium Chlamydia pneumoniae.
Their exact function has not, as yet, been identified.
Length = 241
Score = 31.0 bits (70), Expect = 3.1
Identities = 16/95 (16%), Positives = 39/95 (41%), Gaps = 7/95 (7%)
Query: 445 LTDSDDDRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRL-KLEAESRDKIIPK 503
+++ +D+ ++++ S ++ + Q R E+ K E E R++ +
Sbjct: 132 VSEREDETEKQETVSSYARDLETALDHIEVEYVQCVREQEDYWKEEDVKEQELREEGGER 191
Query: 504 LRIESRRKYLE------KRKEDKVAELEADILDDE 532
R+E ++ LE K + ++ E + E
Sbjct: 192 KRLEEIQEGLESLEDLLKATKKRLDFAEEHLEKAE 226
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 31.0 bits (71), Expect = 3.3
Identities = 25/98 (25%), Positives = 48/98 (48%), Gaps = 12/98 (12%)
Query: 460 ERLKNRDKNNTKNVATSRQD--------KRAYEEAAK-RLKLEAESRDKIIPKLRIESRR 510
+K +++N K +A Q+ K+A EE + R KL+ +DK K +++R
Sbjct: 37 AEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLK-NLKARL 95
Query: 511 KYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDR 548
K LEK ++ + E+++L+ + E+E E +
Sbjct: 96 KELEKELKN--LKWESEVLEQRFEKVERERDELYDKFE 131
>gnl|CDD|223435 COG0358, DnaG, DNA primase (bacterial type) [DNA replication,
recombination, and repair].
Length = 568
Score = 31.7 bits (72), Expect = 3.3
Identities = 12/49 (24%), Positives = 21/49 (42%), Gaps = 3/49 (6%)
Query: 648 GEEEMTEKQKKKLNIEET---KKSLPIYPFKQDLIDAVRDHQVLIIEGE 693
G + + + K LN ET KK +Y + Q++++EG
Sbjct: 208 GGRVLGDDKPKYLNSPETELFKKGEELYGLDPARKKIAKADQIIVVEGY 256
>gnl|CDD|237260 PRK12906, secA, preprotein translocase subunit SecA; Reviewed.
Length = 796
Score = 31.6 bits (72), Expect = 3.5
Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 663 EETKKSLPIYPFKQDLIDAVRDHQVLIIEGETGSGKT--TQIPQYLH 707
E K+ L + PF +I + H+ I E +TG GKT +P YL+
Sbjct: 71 EGAKRVLGLRPFDVQIIGGIVLHEGNIAEMKTGEGKTLTATLPVYLN 117
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 31.4 bits (71), Expect = 3.7
Identities = 24/142 (16%), Positives = 51/142 (35%), Gaps = 22/142 (15%)
Query: 455 RDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLE 514
RDE +R + + + ++ ++ R +R+ + + R R + R R +
Sbjct: 1 RDEEPDREREKSRGRDRDRSSERPRRRSRDRSRFR------DRHR-----RSRERSYRED 49
Query: 515 KRKEDKVAELEADILDDEYLFSEQELTERERRDR------EHKRKLLELAKEHDKARELE 568
R D+ Y + RR R +H+R+L + + + ++ +
Sbjct: 50 SRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDK 109
Query: 569 KVQRYYMPKGTKGPEKYEEVDE 590
K + + P YE V
Sbjct: 110 KRSLWDIK-----PPGYELVTA 126
>gnl|CDD|240364 PTZ00332, PTZ00332, paraflagellar rod protein; Provisional.
Length = 589
Score = 31.5 bits (71), Expect = 4.0
Identities = 34/118 (28%), Positives = 59/118 (50%), Gaps = 22/118 (18%)
Query: 461 RLKNRDKNNTKNVATSRQDKRAYE-EAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKE- 518
RLK R + + K++ + Q + EA KR E ++ I R+ ++++E
Sbjct: 271 RLKQRCETDLKHIHDAIQKADLEDAEAMKRYATNKEKSERFI--------RENEDRQEEA 322
Query: 519 -DKVAELEADILDDEYLFSE--QELTER-ERRDREHKRKL-----LELAKEHDKAREL 567
+K+ +LE + + L +E +E+ R E DRE KR++ LE+A +H K EL
Sbjct: 323 WNKIQDLERQL---QRLGTERFEEVKRRIEENDREEKRRVEYQQFLEVAGQHKKLLEL 377
>gnl|CDD|215920 pfam00437, T2SE, Type II/IV secretion system protein. This family
contains both type II and type IV pathway secretion
proteins from bacteria. VirB11 ATPase is a subunit of
the Agrobacterium tumefaciens transfer DNA (T-DNA)
transfer system, a type IV secretion pathway required
for delivery of T-DNA and effector proteins to plant
cells during infection.
Length = 273
Score = 30.7 bits (70), Expect = 4.2
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 678 LIDAVRDHQVLIIEGETGSGKTT 700
L AV+ +++ G TGSGKTT
Sbjct: 122 LRQAVQARGNILVSGGTGSGKTT 144
>gnl|CDD|107037 PHA01514, PHA01514, O-antigen conversion protein C.
Length = 485
Score = 31.1 bits (70), Expect = 4.4
Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 7/74 (9%)
Query: 1034 VEMELVSNPTE-TVNIRKAITAGYF---YHIAKLSKSGSYKTAKHNQTVMIHPNSSLFEE 1089
V+ +SN T+ T + + GYF Y IAK +GSY +H++ + I NSSLFE
Sbjct: 183 VKKNSISNITKNTASSVAGLIVGYFAYSYFIAKRLVTGSYNI-EHSKIIEI--NSSLFEG 239
Query: 1090 LPRWVLYHELVFTT 1103
+ VL +F+T
Sbjct: 240 IISNVLSFYRMFST 253
Score = 31.1 bits (70), Expect = 4.4
Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 7/74 (9%)
Query: 1201 VEMELVSNPTE-TVNIRKAITAGYF---YHIAKLSKSGSYKTAKHNQTVMIHPNSSLFEE 1256
V+ +SN T+ T + + GYF Y IAK +GSY +H++ + I NSSLFE
Sbjct: 183 VKKNSISNITKNTASSVAGLIVGYFAYSYFIAKRLVTGSYNI-EHSKIIEI--NSSLFEG 239
Query: 1257 LPRWVLYHELVFTT 1270
+ VL +F+T
Sbjct: 240 IISNVLSFYRMFST 253
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 31.3 bits (71), Expect = 4.5
Identities = 43/245 (17%), Positives = 84/245 (34%), Gaps = 45/245 (18%)
Query: 447 DSDDDRKERDEFSERLKN-RDK---------------------------NNTKNVATSRQ 478
+ + +++R+E E++ R+K +K ++
Sbjct: 47 EEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDDTKAWLKKSKKRQKKKE 106
Query: 479 DKRAYEEAAKRLKLE--AESRDKIIPKLRIESRRKYLEKRKEDKVAEL-EADILDDEYLF 535
+R + E AE + + L++ + + E+ ED + L + +L+DE
Sbjct: 107 AERKKALLLDEKEKERAAEYTSEDLAGLKVGHKVEEFEE-GEDVILTLKDTGVLEDED-- 163
Query: 536 SEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKY-EEVDEKEKV 594
EL E ++E +K LEL K+ + KY EE++ K+K
Sbjct: 164 EGDELENVELVEKEKDKKNLELKKK----KPDYDPDDDDKFNKRSILSKYDEEIEGKKKK 219
Query: 595 PHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTE 654
D S+ +++ QE + L N + + D +
Sbjct: 220 SDNLFTL--DSGGST----DDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMV 273
Query: 655 KQKKK 659
K KK
Sbjct: 274 KFKKP 278
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 31.2 bits (70), Expect = 4.5
Identities = 38/161 (23%), Positives = 74/161 (45%), Gaps = 13/161 (8%)
Query: 447 DSDDDRKERDEFSER---LKNRDKNNTKNVATSRQDKRAYE----EAAKRLKLEAESRDK 499
+ ++ R+ER+E E K+ KN+ ++ +++++ E E KR LE + +
Sbjct: 115 EKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEF 174
Query: 500 IIPKLRIESRRKYLEKRKEDKV-AELEADILDDEYLFSEQELTERERRDREHKRKLLELA 558
+ KL+ + +V A E + L + + EL E ++ RE +RK+LE
Sbjct: 175 MTHKLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEE-LKKKREERRKVLEEE 233
Query: 559 KEHDKARELEKVQRYYMPKGTKGPE----KYEEVDEKEKVP 595
++ K E ++ R K E + E ++++KVP
Sbjct: 234 EQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKRQKVP 274
Score = 30.8 bits (69), Expect = 5.5
Identities = 33/158 (20%), Positives = 66/158 (41%), Gaps = 13/158 (8%)
Query: 445 LTDSDDDRKER-DEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPK 503
L ++ R ER E ER K +T ++ E ++R++ ++ + ++ + +
Sbjct: 63 LARREERRDERFSEALERQKEFKPTSTD---------QSLSEPSRRMQEDSGAENETVEE 113
Query: 504 LRIESRRKYLEKRKEDK-VAELEADILDDEYLFSEQELTERERRDREH-KRKLLELAKEH 561
E R+ E+ +E + V + E + ++E E E + E KR LE
Sbjct: 114 EEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGE 173
Query: 562 DKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQ 599
+L+ + + G +G + E E EK+ +Q
Sbjct: 174 FMTHKLKHTENTFSRGGAEGAQ-VEAGKEFEKLKQKQQ 210
>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein. YkyA is a
family of proteins containing a lipoprotein signal and a
hydrolase domain. It is similar to cell wall binding
proteins and might also be recognisable by a host immune
defence system. It is thus likely to belong to pathways
important for pathogenicity.
Length = 205
Score = 30.4 bits (69), Expect = 4.8
Identities = 35/143 (24%), Positives = 64/143 (44%), Gaps = 20/143 (13%)
Query: 451 DRKERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRR 510
++KE++ + + ++ +N S Q ++ K LK E ES +K + +S +
Sbjct: 46 EKKEQELYDQIIQLGKDDNEAIKKLSDQALANVDKREKLLKKEKESIEK--SEEEFKSAK 103
Query: 511 KYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKV 570
KY+EK ++ K+ + +A L + R + + +L K + KA LEK
Sbjct: 104 KYIEKIEDKKLKK-KAKQLVKVM----------KERYKSYD----KLYKAYKKALNLEK- 147
Query: 571 QRYYMPKGTKGPEKYEEVDEKEK 593
+ Y K +E+DEK K
Sbjct: 148 ELYEYLK--DKDLTLKELDEKIK 168
>gnl|CDD|236148 PRK08074, PRK08074, bifunctional ATP-dependent DNA helicase/DNA
polymerase III subunit epsilon; Validated.
Length = 928
Score = 31.1 bits (71), Expect = 5.0
Identities = 17/62 (27%), Positives = 21/62 (33%), Gaps = 6/62 (9%)
Query: 6 LHREFLSEPDLASYSVMIIDEAH---ERTLHTDILFGLVKDIVRFRSDLKLLISSATLDA 62
L EP L SY +IIDEAH E G + F+ L L +
Sbjct: 443 LTDLTSEEPLLPSYEHIIIDEAHHFEEAASRH---LGEQFSYMSFQLLLSRLGTLEEDGL 499
Query: 63 QK 64
Sbjct: 500 LS 501
Score = 31.1 bits (71), Expect = 5.7
Identities = 12/23 (52%), Positives = 13/23 (56%)
Query: 771 LHREFLSEPDLASYSVMIIDEAH 793
L EP L SY +IIDEAH
Sbjct: 443 LTDLTSEEPLLPSYEHIIIDEAH 465
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 31.0 bits (71), Expect = 5.1
Identities = 30/135 (22%), Positives = 59/135 (43%), Gaps = 20/135 (14%)
Query: 489 RLKLEAESRDKIIPKLRIE---SRRK-------YLE--KRKEDKVAE-----LEADILDD 531
+ K + E+RD I K+ + S+RK +L + ++A LE +
Sbjct: 340 KTKEKRETRDSIAKKIAEKLHTSKRKVRREVLPFLSIIFKHNPELAARLAAFLELTEEEI 399
Query: 532 EYLFSEQELTERERRDREHKRKLLEL-AKEHDKARELEKVQRYYMPKGTKGPEKYEEVDE 590
E+L ++ T++ ++ E K E KE K K + + + +K EE +E
Sbjct: 400 EFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEE--EEEEEKEKKEEEKEE 457
Query: 591 KEKVPHYEQRKWEDE 605
+E+ E+ + E++
Sbjct: 458 EEEEAEEEKEEEEEK 472
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 31.1 bits (70), Expect = 5.1
Identities = 40/205 (19%), Positives = 75/205 (36%), Gaps = 22/205 (10%)
Query: 480 KRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQE 539
K + ++ S K++ LR E K + R+ D+ D E+E
Sbjct: 162 KNILSGNVSDVDTDSISDKKVVEALR-EDNEKGVNFRR---------DMTD----LKERE 207
Query: 540 LTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKVPHYEQ 599
E +R ++ K +L + + DKA++ + K + ++ + P
Sbjct: 208 SQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTS 267
Query: 600 RKWEDEQMSSAIFRVGAKDR-EHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKK 658
ED+Q++ R K + E ++ E L + L K E + +EK+ +
Sbjct: 268 SPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDL-------KQESKASEKEAE 320
Query: 659 KLNIEETKKSLPIYPFKQDLIDAVR 683
+E KK P+ Q V
Sbjct: 321 DKELEAQKKREPVAEDLQKTKPQVE 345
>gnl|CDD|176500 cd08557, PI-PLCc_bacteria_like, Catalytic domain of bacterial
phosphatidylinositol-specific phospholipase C and
similar proteins. This subfamily corresponds to the
catalytic domain present in bacterial
phosphatidylinositol-specific phospholipase C (PI-PLC,
EC 4.6.1.13) and their sequence homologs found in
eukaryota. Bacterial PI-PLCs participate in
Ca2+-independent PI metabolism, hydrolyzing the membrane
lipid phosphatidylinositol (PI) to produce
phosphorylated myo-inositol and diacylglycerol (DAG).
Although their precise physiological function remains
unclear, bacterial PI-PLCs may function as virulence
factors in some pathogenic bacteria. Bacterial PI-PLCs
contain a single TIM-barrel type catalytic domain. Its
catalytic mechanism is based on general base and acid
catalysis utilizing two well conserved histidines, and
consists of two steps, a phosphotransfer and a
phosphodiesterase reaction. Eukaryotic homologs in this
family are named as phosphatidylinositol-specific
phospholipase C X domain containing proteins (PI-PLCXD).
They are distinct from the typical eukaryotic
phosphoinositide-specific phospholipases C (PI-PLC, EC
3.1.4.11), which have a multidomain organization that
consists of a PLC catalytic core domain, and various
regulatory domains. The catalytic core domain is
assembled from two highly conserved X- and Y-regions
split by a divergent linker sequence. In contrast,
eukaryotic PI-PLCXDs contain a single TIM-barrel type
catalytic domain, X domain, which is closely related to
that of bacterial PI-PLCs. Although the biological
function of eukaryotic PI-PLCXDs still remains unclear,
it may be distinct from that of typical eukaryotic
PI-PLCs. This family also includes a distinctly
different type of eukaryotic PLC,
glycosylphosphatidylinositol-specific phospholipase C
(GPI-PLC), an integral membrane protein characterized in
the protozoan parasite Trypanosoma brucei. T. brucei
GPI-PLC hydrolyzes the GPI-anchor on the variant
specific glycoprotein (VSG), releasing dimyristyl
glycerol (DMG), which may facilitate the evasion of the
protozoan to the host's immune system. It does not
require Ca2+ for its activity and is more closely
related to bacterial PI-PLCs, but not mammalian PI-PLCs.
Length = 271
Score = 30.5 bits (69), Expect = 5.3
Identities = 16/58 (27%), Positives = 22/58 (37%), Gaps = 9/58 (15%)
Query: 888 DFLDPPPHETLVLALEQLYALGALNHHGDL---------TKLGRRMAEFPLDPMMGKM 936
DFLD P E ++L LE Y H +L L R P +G++
Sbjct: 90 DFLDAHPSEVVILDLEHEYGGDNGEDHDELDALLRDVLGDPLYRPPVRAGGWPTLGEL 147
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 30.8 bits (70), Expect = 5.4
Identities = 26/156 (16%), Positives = 52/156 (33%), Gaps = 19/156 (12%)
Query: 475 TSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYL 534
+ +R E K KLE++ + IP + ++ + E D+
Sbjct: 26 FVNEVRRCDEVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDL------ 79
Query: 535 FSEQELTERERRDREHKRKLLELAKEHDKARELEKVQRYYMPKGTKGPEKYEEVDEKEKV 594
E E+ E E ++++ EL + L++ + + + + +D K
Sbjct: 80 --EAEIKEVEENLESLEKEINELE---EWLNVLDEEKSFLDE-NLEELSELSNLDIDFKY 133
Query: 595 PHYEQRKWEDEQMSSAIFRVGAKDREHQEQYELLLD 630
+ F G +RE E +E L
Sbjct: 134 LR-------GAEGLKLGFVAGVINREKLEAFERELW 162
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 28.6 bits (64), Expect = 5.5
Identities = 17/58 (29%), Positives = 29/58 (50%), Gaps = 4/58 (6%)
Query: 518 EDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELEKV--QRY 573
E++ E + + L DE F ++ E +RR R R+ + +E A EL + +RY
Sbjct: 14 EEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEE--DAEELAEYLRKRY 69
>gnl|CDD|219665 pfam07959, Fucokinase, L-fucokinase. In the salvage pathway of
GDP-L-fucose, free cytosolic fucose is phosphorylated by
L-fucokinase to form L-fucose-L-phosphate, which is then
further converted to GDP-L-fucose in the reaction
catalyzed by GDP-L-fucose pyrophosphorylase.
Length = 414
Score = 30.6 bits (69), Expect = 5.5
Identities = 21/108 (19%), Positives = 35/108 (32%), Gaps = 26/108 (24%)
Query: 376 LPSSSTSSRLFTTLPSSLAAVP------HHIQRQSPAQSYHR----MTPG----CTDR-R 420
LP++S ++FT LP P + ++ + Y R M PG D
Sbjct: 8 LPNASPLGKIFTPLPVLAPDGPVPLLGDNLLELKLAM--YQRFPPQMPPGVLVTSGDELL 65
Query: 421 TQSPCP---------SCLKVFKTPKVRAFMGHFLTDSDDDRKERDEFS 459
+ P + L + + G F+TDS +
Sbjct: 66 SVGDAPGISFDEPGATALAHPSSLAIATNHGVFVTDSQGSLAHDLTYR 113
>gnl|CDD|238544 cd01124, KaiC, KaiC is a circadian clock protein primarily found in
cyanobacteria KaiC is a RecA-like ATPase, having both
Walker A and Walker B motifs. A related protein is found
in archaea.
Length = 187
Score = 30.0 bits (68), Expect = 5.5
Identities = 8/23 (34%), Positives = 13/23 (56%)
Query: 687 VLIIEGETGSGKTTQIPQYLHEA 709
++ G G+GKTT Q+L+
Sbjct: 1 STLLSGGPGTGKTTFALQFLYAG 23
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 30.8 bits (69), Expect = 5.8
Identities = 52/249 (20%), Positives = 94/249 (37%), Gaps = 17/249 (6%)
Query: 445 LTDSDDDRKER-DEFSERLKNRDKNNTK-----NVATSRQDKRAYEEAAKRLKLEAESRD 498
L + D D ++ D+ E L ++ + +++ K L+A +
Sbjct: 313 LANQDSDMRQHLDKLKEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTASLQAAQEE 372
Query: 499 KIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELA 558
K IE R EK E + +L + ++ +ERR +E K +L L
Sbjct: 373 KATYAGEIEDMRDRYEKT------ERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQ 426
Query: 559 KEHDKARELEKVQRYYMPKGTKGPEKYEEV---DEKEKVPHYEQRKWEDEQMSSAIFRVG 615
+ + LEK+++ K + E+ +E DE+ + +E K E E + + +
Sbjct: 427 TDTNTDTALEKLEKALAEK-ERIIERLKEQRDRDERYEQEEFETYKKEFEDLKEEVQNLQ 485
Query: 616 AKDREHQEQYELLLDNQIDFIQALSLPGNRDKGEEEMTEKQKKKLN-IEETKKSLPIYPF 674
K E + Q ELL + + + EK ++K +E+ K L P
Sbjct: 486 LKLSERELQLELLKEEVSKLASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLRANPE 545
Query: 675 KQDLIDAVR 683
D AV
Sbjct: 546 SADRGSAVD 554
>gnl|CDD|130385 TIGR01318, gltD_gamma_fam, glutamate synthase small subunit family
protein, proteobacterial. This model represents one of
three built for the NADPH-dependent or NADH-dependent
glutamate synthase (EC 1.4.1.13 and 1.4.1.14,
respectively) small subunit and homologs. TIGR01317
describes the small subunit (or equivalent region from
longer forms) in eukaryotes, Gram-positive bacteria, and
some other lineages, both NADH and NADPH-dependent.
TIGR01316 describes a protein of similar length, from
Archaea and a number of bacterial lineages, that forms
glutamate synthase homotetramers without a large
subunit. This model describes both glutatate synthase
small subunit and closely related paralogs of unknown
function from a number of gamma and alpha subdivision
Proteobacteria, including E. coli.
Length = 467
Score = 30.5 bits (69), Expect = 5.9
Identities = 11/35 (31%), Positives = 16/35 (45%), Gaps = 2/35 (5%)
Query: 727 AAMSVSARVAEEMGV--KLGCEVGYAIRFEDCTSE 759
A +S + MG+ L CEVG I +D +
Sbjct: 192 AVLSRRREIFTAMGIEFHLNCEVGRDISLDDLLED 226
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 30.6 bits (70), Expect = 6.5
Identities = 24/100 (24%), Positives = 37/100 (37%), Gaps = 32/100 (32%)
Query: 505 RIESR--------RKY-------LEKRKEDKVAELEADILDDEYLFSEQELTERERRDRE 549
+E R RKY LE + EL + + E S + L + + +
Sbjct: 301 EVEERLFALKSLARKYGVTIEDLLEYLDK-IKEELA-QLDNSE--ESLEALEKEVK---K 353
Query: 550 HKRKLLELAKE-----HDKARELEK-----VQRYYMPKGT 579
K +LLE A+ A+ELEK ++ M K
Sbjct: 354 LKAELLEAAEALSAIRKKAAKELEKEVTAELKALAMEKAR 393
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 29.7 bits (67), Expect = 6.5
Identities = 20/88 (22%), Positives = 35/88 (39%), Gaps = 14/88 (15%)
Query: 485 EAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERE 544
A KR KLE + R + R E+R+E K L+++ +E E E
Sbjct: 4 GAKKRAKLEEKQA-------RRQQREAEEEEREERKK-------LEEKREGERKEEEELE 49
Query: 545 RRDREHKRKLLELAKEHDKARELEKVQR 572
+ K + +E +E E+ ++
Sbjct: 50 EEREKKKEEEERKEREEQARKEQEEYEK 77
>gnl|CDD|222104 pfam13401, AAA_22, AAA domain.
Length = 124
Score = 29.2 bits (66), Expect = 6.5
Identities = 22/116 (18%), Positives = 40/116 (34%), Gaps = 31/116 (26%)
Query: 682 VRDHQVLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIGCTQPRRVAAMSVSARV-AEEMG 740
R + ++ GE+GSGKTT + + + RRV + + +++
Sbjct: 1 RRGAGIGVLTGESGSGKTTLLRRLARQL------------PNRRVVYVEAPSLGTPKDLL 48
Query: 741 VKLGCEVGYAIRF---EDCTSERTIIKYMTDGTLHREFLSEPDLASYSVMIIDEAH 793
K+ +G + + I+ + R L IIDEA
Sbjct: 49 RKILRALGLPLSGGTTAELL--EAILDALKR--RGRPLL-----------IIDEAQ 89
>gnl|CDD|145707 pfam02697, DUF217, Uncharacterized ACR, COG1753. Structural
modelling suggests this domain may bind nucleic acids.
Length = 71
Score = 27.7 bits (62), Expect = 6.8
Identities = 14/61 (22%), Positives = 25/61 (40%), Gaps = 6/61 (9%)
Query: 482 AYEEAAKRLKLEAESRDKIIPKL-----RIESRRKYLEKRKEDKVAELEADILDDEYLFS 536
YE+ +LK +S ++I +L Y E++ ELE ++ + F
Sbjct: 10 VYEKL-LKLKRGGKSFSEVIRELIEKKGNRRKLMDYFGILSEEEADELEKELKEVRKKFR 68
Query: 537 E 537
E
Sbjct: 69 E 69
>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Exonuclease-1 (EXO1) is involved
in multiple, eukaryotic DNA metabolic pathways,
including DNA replication processes (5' flap DNA
endonuclease activity and double stranded DNA
5'-exonuclease activity), DNA repair processes (DNA
mismatch repair (MMR) and post-replication repair
(PRR)), recombination, and telomere integrity. EXO1
functions in the MMS2 error-free branch of the PRR
pathway in the maintenance and repair of stalled
replication forks. Studies also suggest that EXO1 plays
both structural and catalytic roles during MMR-mediated
mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
family of structure-specific, 5' nucleases. These
nucleases contain a PIN (PilT N terminus) domain with a
helical arch/clamp region (I domain) of variable length
(approximately 43 residues in EXO1 PIN domains) and a
H3TH (helix-3-turn-helix) domain, an atypical
helix-hairpin-helix-2-like region. Both the H3TH domain
(not included here) and the helical arch/clamp region
are involved in DNA binding. Nucleases within this group
also have a carboxylate-rich active site that is
involved in binding essential divalent metal ion
cofactors (Mg2+/Mn2+). EXO1 nucleases also have
C-terminal Mlh1- and Msh2-binding domains which allow
interaction with MMR and PRR proteins, respectively.
Length = 210
Score = 29.8 bits (68), Expect = 7.1
Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 2/35 (5%)
Query: 541 TERERRDREHKRKLLELAKEHDKARELEKVQRYYM 575
TE ERR+R + + LE A E + + + +
Sbjct: 88 TEEERRER--REENLEKALELLREGNRSEARECFQ 120
>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
production and conversion].
Length = 108
Score = 28.5 bits (64), Expect = 7.5
Identities = 19/66 (28%), Positives = 30/66 (45%), Gaps = 4/66 (6%)
Query: 484 EEAAKRLKLEAESRDKIIPKLRIESRRKYLEKRKEDKV----AELEADILDDEYLFSEQE 539
E + +K S D+ I + + E+ + E R+E + AE EA+ L E L +E
Sbjct: 6 SEVLREIKKAEISADEEIEEAKEEAEQIIKEAREEAREIIEEAEEEAEKLAQEILEEARE 65
Query: 540 LTERER 545
E E
Sbjct: 66 EAEEEA 71
>gnl|CDD|217188 pfam02689, Herpes_Helicase, Helicase. This family consists of
Helicases from the Herpes viruses. Helicases are
responsible for the unwinding of DNA and are essential
for replication and completion of the viral life cycle.
Length = 801
Score = 30.4 bits (69), Expect = 7.6
Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 7/47 (14%)
Query: 687 VLIIEGETGSGKTTQIPQYLHEAGFTKDKMKIGCTQPRRVAAMSVSA 733
+I G G+GK+T I Q L+E D + G T RVAA ++SA
Sbjct: 61 AYLITGTAGAGKSTSI-QTLNE---NLDCLITGAT---RVAAQNLSA 100
>gnl|CDD|214640 smart00382, AAA, ATPases associated with a variety of cellular
activities. AAA - ATPases associated with a variety of
cellular activities. This profile/alignment only detects
a fraction of this vast family. The poorly conserved
N-terminal helix is missing from the alignment.
Length = 148
Score = 29.3 bits (65), Expect = 7.8
Identities = 9/16 (56%), Positives = 12/16 (75%)
Query: 685 HQVLIIEGETGSGKTT 700
+V++I G GSGKTT
Sbjct: 2 GEVILIVGPPGSGKTT 17
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 29.4 bits (66), Expect = 8.2
Identities = 31/148 (20%), Positives = 57/148 (38%), Gaps = 12/148 (8%)
Query: 455 RDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRKYLE 514
RDE RLK+RD+ Q ++ + L+ E+ K +S
Sbjct: 2 RDEILRRLKDRDRREKAFSELFEQYRQLADRTR---LLKVENDALQAEKYEQQSSHSSSP 58
Query: 515 KRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKAR--------E 566
A+ ++ + +ELTE ++ E ++LL L E ++ R
Sbjct: 59 SADGPGSDAAIAE-MEQKLAKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKT 117
Query: 567 LEKVQRYYMPKGTKGPEKYEEVDEKEKV 594
+ +++ T+ + EE+ EKEK
Sbjct: 118 IAELRSEITSLETEIRDLREELQEKEKD 145
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 29.9 bits (68), Expect = 8.5
Identities = 17/59 (28%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 514 EKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLE-LAKEHDK-ARELEKV 570
+R + + AE E ++L ++ EQ + +ER +EH ++L+E + E +K E E++
Sbjct: 208 AERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERM 266
>gnl|CDD|234015 TIGR02788, VirB11, P-type DNA transfer ATPase VirB11. The VirB11
protein is found in the vir locus of Agrobacterium Ti
plasmids where it is involved in the type IV secretion
system for DNA transfer. VirB11 is believed to be an
ATPase. VirB11 is a homolog of the P-like conjugation
system TrbB protein and the Flp pilus sytem protein
TadA.
Length = 308
Score = 30.0 bits (68), Expect = 8.6
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 674 FKQDLIDAVRDHQVLIIEGETGSGKTT 700
K+ L A+ + +II G TGSGKTT
Sbjct: 133 IKEFLRLAIASRKNIIISGGTGSGKTT 159
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 28.8 bits (65), Expect = 8.6
Identities = 21/73 (28%), Positives = 35/73 (47%), Gaps = 5/73 (6%)
Query: 480 KRAYEEAAKRLKLE-AESRDKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQ 538
K+A EEA ++ + E E R +I R E +++ ++ KE K A + +D+ +E
Sbjct: 36 KKAQEEAKEKEREERIEERKRI----REERKQELEKQLKERKEALKLLEEENDDEEDAET 91
Query: 539 ELTERERRDREHK 551
E TE D
Sbjct: 92 EDTEDVEDDEWEG 104
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 30.1 bits (68), Expect = 8.8
Identities = 30/175 (17%), Positives = 62/175 (35%), Gaps = 34/175 (19%)
Query: 498 DKIIPKLRIESRRKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLEL 557
D+ + +I+ LEK+ E ++ +LE + E EL E E+ E ++K
Sbjct: 272 DEKFERDKIKQLASELEKKLEKELKKLENKLEK-----QEDELEELEKAAEELRQK---- 322
Query: 558 AKEHDKARELEKVQRYY--MPKGTKGPEKYEEVDEKEKVPHYEQRKWEDEQMSSAIFRVG 615
E + + +G K + +E ++ K + + F+
Sbjct: 323 ---------GELLYANLQLIEEGLKSVRLADFYGNEEIKIELDKSK-TPSENAQRYFKKY 372
Query: 616 AKDREHQEQYELLLD---NQIDFIQALSLPGNRDKGEEEMTEKQKKKLNIEETKK 667
K + + + L I + ++ + EK + K IEE ++
Sbjct: 373 KKLKGAKVNLDRQLSELKEAIAYYESA----------KTALEKAEGKKAIEEIRE 417
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 29.8 bits (67), Expect = 9.7
Identities = 23/107 (21%), Positives = 43/107 (40%), Gaps = 5/107 (4%)
Query: 456 DEFSERLKNRDKNN----TKNVATSRQDKRAYEEAAKRLKLEAESRDKIIPKLRIESRRK 511
+ ++++ K N KN S +++ EEA + K E E R ++ K E +
Sbjct: 116 ENTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMN 175
Query: 512 YLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELA 558
K K+ + ELE L L ++ + + + K K +
Sbjct: 176 K-RKNKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEKPN 221
>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 122
Score = 28.6 bits (64), Expect = 9.8
Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 5/59 (8%)
Query: 510 RKYLEKRKEDKVAELEADILDDEYLFSEQELTERERRDREHKRKLLELAKEHDKARELE 568
++ LEKRK D+V + + E +++ +E ER + +KL +L E K E +
Sbjct: 54 QRVLEKRKRDQVLKEQ-----KEEEEAKRLQSELERELMKRAQKLEQLELEKAKEDEEQ 107
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family. This model
represents a subfamily of RNA splicing factors including
the Pad-1 protein (N. crassa), CAPER (M. musculus) and
CC1.3 (H.sapiens). These proteins are characterized by
an N-terminal arginine-rich, low complexity domain
followed by three (or in the case of 4 H. sapiens
paralogs, two) RNA recognition domains (rrm: pfam00706).
These splicing factors are closely related to the U2AF
splicing factor family (TIGR01642). A homologous gene
from Plasmodium falciparum was identified in the course
of the analysis of that genome at TIGR and was included
in the seed.
Length = 457
Score = 29.9 bits (67), Expect = 9.8
Identities = 23/98 (23%), Positives = 40/98 (40%), Gaps = 10/98 (10%)
Query: 453 KERDEFSERLKNRDKNNTKNVATSRQDKRAYEEAAKRLKLEAESRDKII-PKLRIESRRK 511
+ RD RL+N + + K SR+ R+ + + +R RD + R SR
Sbjct: 1 RYRDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRR-----RDRDYYRGRRGRSRSRSP 55
Query: 512 Y-LEKRKEDKVAELEADILDDEYLFSEQELTERERRDR 548
+ + D+ + +++ LTE ER DR
Sbjct: 56 NRYYRPRGDRSYRRDDRRSGRN---TKEPLTEAERDDR 90
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.134 0.394
Gapped
Lambda K H
0.267 0.0723 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 67,747,927
Number of extensions: 6840708
Number of successful extensions: 8057
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7757
Number of HSP's successfully gapped: 313
Length of query: 1320
Length of database: 10,937,602
Length adjustment: 108
Effective length of query: 1212
Effective length of database: 6,147,370
Effective search space: 7450612440
Effective search space used: 7450612440
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (28.8 bits)