RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8497
(2355 letters)
>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
replication, recombination, and repair / Transcription /
Translation, ribosomal structure and biogenesis].
Length = 513
Score = 311 bits (799), Expect = 5e-93
Identities = 142/507 (28%), Positives = 235/507 (46%), Gaps = 37/507 (7%)
Query: 28 IEKPIRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLP 87
+ + Q L++ GF + +PIQ+AA P +L G D + Q+++GTGK+ F++ L K++K
Sbjct: 36 LSPELLQALKDLGFEEPTPIQLAAIPLILAGRDVLGQAQTGTGKTAAFLLPLLQKILKSV 95
Query: 88 -VQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGT----QVERPKRPVQIVV 142
+ +ILAPTRE+AVQI + +R +G ++ L V GG Q+E KR V IVV
Sbjct: 96 ERKYVSALILAPTRELAVQIAEELRKLGKNLGGLRVAVVYGGVSIRKQIEALKRGVDIVV 155
Query: 143 GSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSATY 202
+PGR+ +IK L++ V ++DEAD++++ GF++DI I LPP +Q L+ SAT
Sbjct: 156 ATPGRLLDLIKRGKLDLSGVETLVLDEADRMLDMGFIDDIEKILKALPPDRQTLLFSATM 215
Query: 203 SHDNLATLQKYMVDPLLIR--PEDATRPLLGVKQLVALIPECKNPSLRYVDEEKKLVQLL 260
D ++Y+ DP+ I E R L +KQ E ++ + + L++LL
Sbjct: 216 PDDIRELARRYLNDPVEIEVSVEKLERTLKKIKQFYL---EVESEEEK----LELLLKLL 268
Query: 261 SQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRILV 320
+ ++F + E + E+L F L G Q+ R +L+ FK G++R+LV
Sbjct: 269 KDEDEGRVIVFVRTKRLVEELAESLRKRGFKVAALHGDLPQEERDRALEKFKDGELRVLV 328
Query: 321 TTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIVSA-ESLVKFHS 379
TD+AARG+D +V VIN ++P D Y+HR+GR GR G +G+ I+ V+ E + K
Sbjct: 329 ATDVAARGLDIPDVSHVINYDLPLDPEDYVHRIGRTGRAGRKGVAISFVTEEEEVKKLKR 388
Query: 380 LMGEINLDHAFNVGLVPDNLTGDQINWTQRVQTLLAKPLDQAQEREDVEQTAEESSKVEN 439
+ ++ + +P D + EES +
Sbjct: 389 IE----------------KRLERKLPSAVLLPLD--EPEDAKLLKTTRPGLEEESDISDE 430
Query: 440 TKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDSENDGKPDENGSQRGSKRSKEFKT 499
K L+ + + G +K G+ + P+ S
Sbjct: 431 IKKLKSSKKALLRGLGVRFTLSKLLANLGKEIPGAGDAVTIDPELERRSPNSAD----DI 486
Query: 500 QEKCVKGAGEDENGDRKASKDEQKIDG 526
+ + E K K+
Sbjct: 487 EYILKGLSYRAEERTAKNEAANIKLQS 513
>gnl|CDD|185609 PTZ00424, PTZ00424, helicase 45; Provisional.
Length = 401
Score = 235 bits (601), Expect = 7e-68
Identities = 136/384 (35%), Positives = 215/384 (55%), Gaps = 27/384 (7%)
Query: 9 ARTKDVLSNIDVL---FENDVLIEKPIRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQS 65
A T + SN D + F+ L E +R G+ GF K S IQ +L+GYD I Q+
Sbjct: 14 ASTGTIESNYDEIVDSFDALKLNEDLLR-GIYSYGFEKPSAIQQRGIKPILDGYDTIGQA 72
Query: 66 KSGTGKSIVFVVAALNKLIKLPVQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYF 125
+SGTGK+ FV+AAL +LI + Q +ILAPTRE+A QI VV ++G ++K +
Sbjct: 73 QSGTGKTATFVIAAL-QLIDYDLNACQALILAPTRELAQQIQKVVLALGDYLK-VRCHAC 130
Query: 126 IGGTQV----ERPKRPVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVED 181
+GGT V + K V +VVG+PGR+ MI ++L +D ++LFI+DEAD++++ GF
Sbjct: 131 VGGTVVRDDINKLKAGVHMVVGTPGRVYDMIDKRHLRVDDLKLFILDEADEMLSRGFKGQ 190
Query: 182 ITWIYSQLPPMKQMLVVSATYSHDNLATLQKYMVDP--LLIRPEDATRPLLGVKQLVALI 239
I ++ +LPP Q+ + SAT ++ L K+M DP +L++ ++ T L G++Q +
Sbjct: 191 IYDVFKKLPPDVQVALFSATMPNEILELTTKFMRDPKRILVKKDELT--LEGIRQFYVAV 248
Query: 240 PECKNPSLRYVDEEKK---LVQLLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLS 296
+ EE K L L Q +I+ N + + + + + + +F +
Sbjct: 249 EK----------EEWKFDTLCDLYETLTITQAIIYCNTRRKVDYLTKKMHERDFTVSCMH 298
Query: 297 GAQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRA 356
G DQK R + F+ G R+L+TTDL ARGID V LVIN ++P Y+HR+GR+
Sbjct: 299 GDMDQKDRDLIMREFRSGSTRVLITTDLLARGIDVQQVSLVINYDLPASPENYIHRIGRS 358
Query: 357 GRYGTRGLVITIVSAESLVKFHSL 380
GR+G +G+ I V+ + + + +
Sbjct: 359 GRFGRKGVAINFVTPDDIEQLKEI 382
>gnl|CDD|238167 cd00268, DEADc, DEAD-box helicases. A diverse family of proteins
involved in ATP-dependent RNA unwinding, needed in a
variety of cellular processes including splicing,
ribosome biogenesis and RNA degradation. The name
derives from the sequence of the Walker B motif (motif
II). This domain contains the ATP- binding region.
Length = 203
Score = 205 bits (525), Expect = 3e-60
Identities = 75/205 (36%), Positives = 121/205 (59%), Gaps = 7/205 (3%)
Query: 22 FENDVLIEKPIRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALN 81
FE ++ + + +G+ GF K +PIQ A P +L+G D I Q+++G+GK+ F++ L
Sbjct: 1 FE-ELGLSPELLRGIYALGFEKPTPIQARAIPPLLSGRDVIGQAQTGSGKTAAFLIPILE 59
Query: 82 KLIKLP-VQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGT----QVERPKR 136
KL P PQ +ILAPTRE+A+QI +V R +G H NL V GGT Q+ + KR
Sbjct: 60 KLDPSPKKDGPQALILAPTRELALQIAEVARKLGKH-TNLKVVVIYGGTSIDKQIRKLKR 118
Query: 137 PVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQML 196
IVV +PGR+ +++ L++ V+ ++DEAD++++ GF + I I LP +Q L
Sbjct: 119 GPHIVVATPGRLLDLLERGKLDLSKVKYLVLDEADRMLDMGFEDQIREILKLLPKDRQTL 178
Query: 197 VVSATYSHDNLATLQKYMVDPLLIR 221
+ SAT + +K++ +P+ I
Sbjct: 179 LFSATMPKEVRDLARKFLRNPVRIL 203
>gnl|CDD|236977 PRK11776, PRK11776, ATP-dependent RNA helicase DbpA; Provisional.
Length = 460
Score = 210 bits (537), Expect = 1e-58
Identities = 110/356 (30%), Positives = 183/356 (51%), Gaps = 40/356 (11%)
Query: 34 QGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKL-IKLPVQQPQ 92
L E G+ + +PIQ + P +L G D I Q+K+G+GK+ F + L KL +K + Q
Sbjct: 17 ANLNELGYTEMTPIQAQSLPAILAGKDVIAQAKTGSGKTAAFGLGLLQKLDVKRF--RVQ 74
Query: 93 TIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGTQVERPKRP--------VQIVVGS 144
++L PTRE+A Q+ +R + + N+ V GG P P I+VG+
Sbjct: 75 ALVLCPTRELADQVAKEIRRLARFIPNIKVLTLCGGV----PMGPQIDSLEHGAHIIVGT 130
Query: 145 PGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSATYSH 204
PGRI ++ L++D++ ++DEAD++++ GF + I I Q P +Q L+ SATY
Sbjct: 131 PGRILDHLRKGTLDLDALNTLVLDEADRMLDMGFQDAIDAIIRQAPARRQTLLFSATYP- 189
Query: 205 DNLATL-QKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLR---Y-VDEEKK---L 256
+ +A + Q++ DP+ ++ E P++ Y V +++ L
Sbjct: 190 EGIAAISQRFQRDPVEVKVESTH----------------DLPAIEQRFYEVSPDERLPAL 233
Query: 257 VQLLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKV 316
+LL CV+F N + + + + L F A L G +Q+ R L F
Sbjct: 234 QRLLLHHQPESCVVFCNTKKECQEVADALNAQGFSALALHGDLEQRDRDQVLVRFANRSC 293
Query: 317 RILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIVSAE 372
+LV TD+AARG+D ++ VIN E+ D ++HR+GR GR G++GL +++V+ E
Sbjct: 294 SVLVATDVAARGLDIKALEAVINYELARDPEVHVHRIGRTGRAGSKGLALSLVAPE 349
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 197 bits (501), Expect = 6e-54
Identities = 107/352 (30%), Positives = 189/352 (53%), Gaps = 22/352 (6%)
Query: 32 IRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQ-- 89
+ + + GF +PIQ Y L G+DAI ++++GTGK+ F+++ +N+L++ P
Sbjct: 98 LMHAIHDLGFPYCTPIQAQVLGYTLAGHDAIGRAQTGTGKTAAFLISIINQLLQTPPPKE 157
Query: 90 ----QPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGTQVERPKRPVQ-----I 140
+P+ +I+APTRE+ VQI ++ + L+V F+GG ++ + ++ I
Sbjct: 158 RYMGEPRALIIAPTRELVVQIAKDAAALTKYT-GLNVMTFVGGMDFDKQLKQLEARFCDI 216
Query: 141 VVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPM--KQMLVV 198
+V +PGR+ + +++D V + ++DEAD++++ GF+ + I Q P +Q L+
Sbjct: 217 LVATPGRLLDFNQRGEVHLDMVEVMVLDEADRMLDMGFIPQVRQIIRQTPRKEERQTLLF 276
Query: 199 SATYSHDNLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEKKLVQ 258
SAT++ D + +++ DP ++ E V+Q V + D+ K L
Sbjct: 277 SATFTDDVMNLAKQWTTDPAIVEIEPENVASDTVEQHVYAVAGS--------DKYKLLYN 328
Query: 259 LLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRI 318
L++Q P+ + ++F+N + I E L A LSG Q R+ +L+ F+ GK+R+
Sbjct: 329 LVTQNPWERVMVFANRKDEVRRIEERLVKDGINAAQLSGDVPQHKRIKTLEGFREGKIRV 388
Query: 319 LVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIVS 370
LV TD+A RGI + VIN +P D Y+HR+GR GR G G+ I+
Sbjct: 389 LVATDVAGRGIHIDGISHVINFTLPEDPDDYVHRIGRTGRAGASGVSISFAG 440
>gnl|CDD|235314 PRK04837, PRK04837, ATP-dependent RNA helicase RhlB; Provisional.
Length = 423
Score = 185 bits (471), Expect = 2e-50
Identities = 121/361 (33%), Positives = 196/361 (54%), Gaps = 33/361 (9%)
Query: 25 DVLIEKPIRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLI 84
D + + + L++ GF +PIQ A P L G D Q+++GTGK++ F+ A + L+
Sbjct: 12 DFALHPQVVEALEKKGFHNCTPIQALALPLTLAGRDVAGQAQTGTGKTMAFLTATFHYLL 71
Query: 85 KLP------VQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGG----TQVERP 134
P V QP+ +I+APTRE+AVQI + L + GG Q++
Sbjct: 72 SHPAPEDRKVNQPRALIMAPTRELAVQIHADAEPLAQAT-GLKLGLAYGGDGYDKQLKVL 130
Query: 135 KRPVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQ 194
+ V I++G+ GR+ K ++N+ ++++ ++DEAD++ + GF++DI W++ ++PP Q
Sbjct: 131 ESGVDILIGTTGRLIDYAKQNHINLGAIQVVVLDEADRMFDLGFIKDIRWLFRRMPPANQ 190
Query: 195 ML--VVSATYSH--DNLATLQKYMVDP--LLIRPEDATRPLLGVKQLVALIPECKNPSLR 248
L + SAT S+ LA ++M +P + + PE T +K+ E PS
Sbjct: 191 RLNMLFSATLSYRVRELAF--EHMNNPEYVEVEPEQKTGHR--IKE------ELFYPS-- 238
Query: 249 YVDEEK-KLVQ-LLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLA 306
+EEK +L+Q L+ + ++ +IF+N + R E I +LA L+G QK RL
Sbjct: 239 --NEEKMRLLQTLIEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLR 296
Query: 307 SLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVI 366
L+ F RG + ILV TD+AARG+ V V N ++P D Y+HR+GR GR G G I
Sbjct: 297 ILEEFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGRAGASGHSI 356
Query: 367 T 367
+
Sbjct: 357 S 357
>gnl|CDD|236941 PRK11634, PRK11634, ATP-dependent RNA helicase DeaD; Provisional.
Length = 629
Score = 187 bits (476), Expect = 2e-49
Identities = 107/351 (30%), Positives = 192/351 (54%), Gaps = 17/351 (4%)
Query: 25 DVLIEKPIRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLI 84
D+ ++ PI + L + G+ K SPIQ P++LNG D + +++G+GK+ F + L+ +
Sbjct: 10 DLGLKAPILEALNDLGYEKPSPIQAECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLHN-L 68
Query: 85 KLPVQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGG----TQVERPKRPVQI 140
++ PQ ++LAPTRE+AVQ+ + + H++ ++V GG Q+ ++ QI
Sbjct: 69 DPELKAPQILVLAPTRELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQI 128
Query: 141 VVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSA 200
VVG+PGR+ +K L++ + ++DEAD+++ GF+ED+ I +Q+P Q + SA
Sbjct: 129 VVGTPGRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGHQTALFSA 188
Query: 201 TYSHDNLATLQKYMVDPLLIRPEDA--TRPLLGVKQLVALIPECKNPSLRYVDEEKKLVQ 258
T +++M +P +R + + TRP + ++ + + + LV+
Sbjct: 189 TMPEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYW----------TVWGMRKNEALVR 238
Query: 259 LLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRI 318
L F+ +IF + + E L + + L+G +Q R +L+ K G++ I
Sbjct: 239 FLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQALREQTLERLKDGRLDI 298
Query: 319 LVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIV 369
L+ TD+AARG+D + LV+N +IP D+ +Y+HR+GR GR G G + V
Sbjct: 299 LIATDVAARGLDVERISLVVNYDIPMDSESYVHRIGRTGRAGRAGRALLFV 349
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 177 bits (450), Expect = 2e-47
Identities = 121/369 (32%), Positives = 180/369 (48%), Gaps = 44/369 (11%)
Query: 36 LQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQ---QPQ 92
LQ+ G+ + + IQ A P L+G D + + +GTGK+ F++ AL L+ P + P+
Sbjct: 16 LQDKGYTRPTAIQAEAIPPALDGRDVLGSAPTGTGKTAAFLLPALQHLLDFPRRKSGPPR 75
Query: 93 TIILAPTREIAVQITDVVRSVGAHVKNLHVDYFI--GG----TQVERPKRPVQIVVGSPG 146
+IL PTRE+A+Q+ D R + H H+D GG E IVV +PG
Sbjct: 76 ILILTPTRELAMQVADQARELAKH---THLDIATITGGVAYMNHAEVFSENQDIVVATPG 132
Query: 147 RIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSAT----- 201
R+ Q IK + + +V I+DEAD++++ GF +DI I ++ KQ L+ SAT
Sbjct: 133 RLLQYIKEENFDCRAVETLILDEADRMLDMGFAQDIETIAAETRWRKQTLLFSATLEGDA 192
Query: 202 ---YSHDNLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEKK--- 255
++ L DP+ + E + R + Q R D E K
Sbjct: 193 VQDFAERLLN-------DPVEVEAEPSRRERKKIHQWY----------YRADDLEHKTAL 235
Query: 256 LVQLLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGK 315
L LL Q + ++F + R + L A YL G Q R ++ G+
Sbjct: 236 LCHLLKQPEVTRSIVFVRTRERVHELAGWLRKAGINCCYLEGEMVQAKRNEAIKRLTDGR 295
Query: 316 VRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIVSAESLV 375
V +LV TD+AARGID +V VIN ++P A TYLHR+GR GR G +G I++V A
Sbjct: 296 VNVLVATDVAARGIDIDDVSHVINFDMPRSADTYLHRIGRTGRAGRKGTAISLVEAHD-- 353
Query: 376 KFHSLMGEI 384
H L+G+I
Sbjct: 354 --HLLLGKI 360
>gnl|CDD|235307 PRK04537, PRK04537, ATP-dependent RNA helicase RhlB; Provisional.
Length = 572
Score = 176 bits (447), Expect = 5e-46
Identities = 115/348 (33%), Positives = 189/348 (54%), Gaps = 26/348 (7%)
Query: 35 GLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPV------ 88
GL+ GF + +PIQ P L G D Q+++GTGK++ F+VA +N+L+ P
Sbjct: 23 GLESAGFTRCTPIQALTLPVALPGGDVAGQAQTGTGKTLAFLVAVMNRLLSRPALADRKP 82
Query: 89 QQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGT----QVERPKRPVQIVVGS 144
+ P+ +ILAPTRE+A+QI GA + L GG Q E ++ V +++ +
Sbjct: 83 EDPRALILAPTRELAIQIHKDAVKFGADL-GLRFALVYGGVDYDKQRELLQQGVDVIIAT 141
Query: 145 PGRIKQMIKL-KYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPP--MKQMLVVSAT 201
PGR+ +K K +++ + + ++DEAD++ + GF++DI ++ ++P +Q L+ SAT
Sbjct: 142 PGRLIDYVKQHKVVSLHACEICVLDEADRMFDLGFIKDIRFLLRRMPERGTRQTLLFSAT 201
Query: 202 YSHDNLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEKK--LVQL 259
SH L ++M +P + E T V+Q + P+ DEEK+ L+ L
Sbjct: 202 LSHRVLELAYEHMNEPEKLVVETETITAARVRQRIYF------PA----DEEKQTLLLGL 251
Query: 260 LSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRIL 319
LS++ + ++F N + E + L + LSG QK R + L+ F++G++ IL
Sbjct: 252 LSRSEGARTMVFVNTKAFVERVARTLERHGYRVGVLSGDVPQKKRESLLNRFQKGQLEIL 311
Query: 320 VTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVIT 367
V TD+AARG+ V V N ++P DA Y+HR+GR R G G I+
Sbjct: 312 VATDVAARGLHIDGVKYVYNYDLPFDAEDYVHRIGRTARLGEEGDAIS 359
>gnl|CDD|236722 PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional.
Length = 456
Score = 169 bits (429), Expect = 1e-44
Identities = 112/365 (30%), Positives = 190/365 (52%), Gaps = 28/365 (7%)
Query: 32 IRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQ-- 89
I + + E G+ + +PIQ A P VL G D + +++GTGK+ F + L LI
Sbjct: 12 ILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTGKTAGFTLPLLQHLITRQPHAK 71
Query: 90 --QP-QTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGT----QVERPKRPVQIVV 142
+P + +IL PTRE+A QI + VR ++ N+ GG Q+ + + V ++V
Sbjct: 72 GRRPVRALILTPTRELAAQIGENVRDYSKYL-NIRSLVVFGGVSINPQMMKLRGGVDVLV 130
Query: 143 GSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSATY 202
+PGR+ + + +D V + ++DEAD++++ GF+ DI + ++LP +Q L+ SAT+
Sbjct: 131 ATPGRLLDLEHQNAVKLDQVEILVLDEADRMLDMGFIHDIRRVLAKLPAKRQNLLFSATF 190
Query: 203 SHDNLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEKK---LVQL 259
S D A +K + +PL I V Q V +VD+++K L Q+
Sbjct: 191 SDDIKALAEKLLHNPLEIEVARRNTASEQVTQHV-----------HFVDKKRKRELLSQM 239
Query: 260 LSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRIL 319
+ + + Q ++F+ + + E L + + G + Q AR +L FK G +R+L
Sbjct: 240 IGKGNWQQVLVFTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSGDIRVL 299
Query: 320 VTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIVSAESLVKFHS 379
V TD+AARG+D + V+N E+P+ Y+HR+GR GR G +++V V H
Sbjct: 300 VATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRAAATGEALSLVC----VDEHK 355
Query: 380 LMGEI 384
L+ +I
Sbjct: 356 LLRDI 360
>gnl|CDD|215832 pfam00270, DEAD, DEAD/DEAH box helicase. Members of this family
include the DEAD and DEAH box helicases. Helicases are
involved in unwinding nucleic acids. The DEAD box
helicases are involved in various aspects of RNA
metabolism, including nuclear transcription, pre mRNA
splicing, ribosome biogenesis, nucleocytoplasmic
transport, translation, RNA decay and organellar gene
expression.
Length = 169
Score = 155 bits (393), Expect = 4e-43
Identities = 58/167 (34%), Positives = 101/167 (60%), Gaps = 8/167 (4%)
Query: 45 SPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTIILAPTREIAV 104
+PIQ A P +L+G D +VQ+ +G+GK++ F++ L L+ PQ ++LAPTRE+A
Sbjct: 1 TPIQAQAIPAILSGKDVLVQAPTGSGKTLAFLLPILQALLPKK-GGPQALVLAPTRELAE 59
Query: 105 QITDVVRSVGAHVKNLHVDYFIGGTQVER-----PKRPVQIVVGSPGRIKQMIKLKYLNM 159
QI + ++ + L V GGT ++ K I+VG+PGR+ +++ L +
Sbjct: 60 QIYEELKKLFKI-LGLRVALLTGGTSLKEQARKLKKGKADILVGTPGRLLDLLRRGKLKL 118
Query: 160 DS-VRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSATYSHD 205
++L ++DEA +L++ GF +D+ I S+LPP +Q+L++SAT +
Sbjct: 119 LKNLKLLVLDEAHRLLDMGFGDDLEEILSRLPPDRQILLLSATLPRN 165
>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily.
Length = 201
Score = 150 bits (381), Expect = 5e-41
Identities = 57/205 (27%), Positives = 105/205 (51%), Gaps = 11/205 (5%)
Query: 36 LQENGFVKESPIQMAAFPYVLNGY-DAIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTI 94
+++ GF P Q A +L+G D I+ + +G+GK++ ++ AL L + + +
Sbjct: 1 IEKFGFEPLRPYQKEAIEALLSGLRDVILAAPTGSGKTLAALLPALEALKRGK--GGRVL 58
Query: 95 ILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGTQVERPKR-----PVQIVVGSPGRIK 149
+L PTRE+A Q + ++ +G L V GG R I+V +PGR+
Sbjct: 59 VLVPTRELAEQWAEELKKLGPS-LGLKVVGLYGGDSKREQLRKLESGKTDILVTTPGRLL 117
Query: 150 QMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSATYSHDNLAT 209
+++ L++ +V L I+DEA +L++ GF + + + LP Q+L++SAT +
Sbjct: 118 DLLENDKLSLSNVDLVILDEAHRLLDGGFGDQLEKLLKLLPKNVQLLLLSATPPEEIENL 177
Query: 210 LQKYMVDPLLIRPEDATRPLLGVKQ 234
L+ ++ DP+ I PL ++Q
Sbjct: 178 LELFLNDPVFIDVGF--TPLEPIEQ 200
>gnl|CDD|240273 PTZ00110, PTZ00110, helicase; Provisional.
Length = 545
Score = 144 bits (364), Expect = 2e-35
Identities = 112/366 (30%), Positives = 184/366 (50%), Gaps = 41/366 (11%)
Query: 36 LQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQQ----P 91
L+ GF + +PIQ+ +P L+G D I +++G+GK++ F++ A+ + P+ + P
Sbjct: 145 LKNAGFTEPTPIQVQGWPIALSGRDMIGIAETGSGKTLAFLLPAIVHINAQPLLRYGDGP 204
Query: 92 QTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGTQVERPKRP--------VQIVVG 143
++LAPTRE+A QI + GA K + GG PKR V+I++
Sbjct: 205 IVLVLAPTRELAEQIREQCNKFGASSK-IRNTVAYGGV----PKRGQIYALRRGVEILIA 259
Query: 144 SPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSATYS 203
PGR+ ++ N+ V ++DEAD++++ GF I I SQ+ P +Q L+ SAT+
Sbjct: 260 CPGRLIDFLESNVTNLRRVTYLVLDEADRMLDMGFEPQIRKIVSQIRPDRQTLMWSATWP 319
Query: 204 HD--NLA-TLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEK--KLVQ 258
+ +LA L K + + D T +KQ V ++ E EK KL
Sbjct: 320 KEVQSLARDLCKEEPVHVNVGSLDLT-ACHNIKQEVFVVEE----------HEKRGKLKM 368
Query: 259 LLSQ--TPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKV 316
LL + ++ +IF + + + + L + A + G + Q+ R L+ FK GK
Sbjct: 369 LLQRIMRDGDKILIFVETKKGADFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKS 428
Query: 317 RILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIV------S 370
I++ TD+A+RG+D +V VIN + P+ Y+HR+GR GR G +G T +
Sbjct: 429 PIMIATDVASRGLDVKDVKYVINFDFPNQIEDYVHRIGRTGRAGAKGASYTFLTPDKYRL 488
Query: 371 AESLVK 376
A LVK
Sbjct: 489 ARDLVK 494
>gnl|CDD|215103 PLN00206, PLN00206, DEAD-box ATP-dependent RNA helicase;
Provisional.
Length = 518
Score = 139 bits (351), Expect = 4e-34
Identities = 104/363 (28%), Positives = 180/363 (49%), Gaps = 29/363 (7%)
Query: 34 QGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQ---- 89
L+ G+ +PIQM A P L+G +V + +G+GK+ F+V +++ +
Sbjct: 134 LNLETAGYEFPTPIQMQAIPAALSGRSLLVSADTGSGKTASFLVPIISRCCTIRSGHPSE 193
Query: 90 --QPQTIILAPTREIAVQITDVVRSVGAHVKNL--HVDYFIGG----TQVERPKRPVQIV 141
P ++L PTRE+ VQ+ D + +G K L +GG Q+ R ++ V+++
Sbjct: 194 QRNPLAMVLTPTRELCVQVEDQAKVLG---KGLPFKTALVVGGDAMPQQLYRIQQGVELI 250
Query: 142 VGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSAT 201
VG+PGR+ ++ + +D+V + ++DE D ++ GF + + I+ L Q+L+ SAT
Sbjct: 251 VGTPGRLIDLLSKHDIELDNVSVLVLDEVDCMLERGFRDQVMQIFQALS-QPQVLLFSAT 309
Query: 202 YSHDNLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEKKLVQLL- 260
S + D +LI + RP VKQL + E K +++KL +L
Sbjct: 310 VSPEVEKFASSLAKDIILISIGNPNRPNKAVKQLAIWV-ETKQ-------KKQKLFDILK 361
Query: 261 SQTPFNQ-CVIFSNYQLRPEVICENLANANFGAEYLS--GAQDQKARLASLDSFKRGKVR 317
S+ F V+F + +L +++ + G + LS G + K R + SF G+V
Sbjct: 362 SKQHFKPPAVVFVSSRLGADLLANAITVVT-GLKALSIHGEKSMKERREVMKSFLVGEVP 420
Query: 318 ILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIVSAESLVKF 377
++V T + RG+D V VI ++P+ Y+H++GRA R G +G I V+ E F
Sbjct: 421 VIVATGVLGRGVDLLRVRQVIIFDMPNTIKEYIHQIGRASRMGEKGTAIVFVNEEDRNLF 480
Query: 378 HSL 380
L
Sbjct: 481 PEL 483
>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
of proteins involved in ATP-dependent RNA or DNA
unwinding. This domain contains the ATP-binding region.
Length = 144
Score = 125 bits (315), Expect = 7e-33
Identities = 48/146 (32%), Positives = 82/146 (56%), Gaps = 8/146 (5%)
Query: 60 DAIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTIILAPTREIAVQITDVVRSVGAHVKN 119
D ++ + +G+GK++ ++ L L L + Q ++LAPTRE+A Q+ + ++ +
Sbjct: 2 DVLLAAPTGSGKTLAALLPILELLDSL--KGGQVLVLAPTRELANQVAERLKELFGEGIK 59
Query: 120 LHVDYFIGGT----QVERPKRPVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLIN 175
V Y IGGT Q + IVVG+PGR+ ++ L++ + L I+DEA +L+N
Sbjct: 60 --VGYLIGGTSIKQQEKLLSGKTDIVVGTPGRLLDELERLKLSLKKLDLLILDEAHRLLN 117
Query: 176 TGFVEDITWIYSQLPPMKQMLVVSAT 201
GF I +LP +Q+L++SAT
Sbjct: 118 QGFGLLGLKILLKLPKDRQVLLLSAT 143
>gnl|CDD|238034 cd00079, HELICc, Helicase superfamily c-terminal domain; associated
with DEXDc-, DEAD-, and DEAH-box proteins, yeast
initiation factor 4A, Ski2p, and Hepatitis C virus NS3
helicases; this domain is found in a wide variety of
helicases and helicase related proteins; may not be an
autonomously folding unit, but an integral part of the
helicase; 4 helicase superfamilies at present according
to the organization of their signature motifs; all
helicases share the ability to unwind nucleic acid
duplexes with a distinct directional polarity; they
utilize the free energy from nucleoside triphosphate
hydrolysis to fuel their translocation along DNA,
unwinding the duplex in the process.
Length = 131
Score = 113 bits (286), Expect = 4e-29
Identities = 43/126 (34%), Positives = 68/126 (53%), Gaps = 4/126 (3%)
Query: 247 LRYVDEEKK--LVQLLSQ--TPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQK 302
+ V++EK L++LL + + +IF + + + E L L G Q+
Sbjct: 6 VLPVEDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGDGSQE 65
Query: 303 ARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTR 362
R L F+ G++ +LV TD+ ARGID NV +VIN ++P ++YL R+GRAGR G +
Sbjct: 66 EREEVLKDFREGEIVVLVATDVIARGIDLPNVSVVINYDLPWSPSSYLQRIGRAGRAGQK 125
Query: 363 GLVITI 368
G I +
Sbjct: 126 GTAILL 131
Score = 30.3 bits (69), Expect = 4.8
Identities = 16/57 (28%), Positives = 33/57 (57%), Gaps = 5/57 (8%)
Query: 78 AALNKLIKLPVQQP-QTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGTQVER 133
AL +L+K +++ + +I P++++ ++ +++R G V LH D G+Q ER
Sbjct: 15 EALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGD----GSQEER 67
>gnl|CDD|197757 smart00490, HELICc, helicase superfamily c-terminal domain.
Length = 82
Score = 93.4 bits (233), Expect = 1e-22
Identities = 35/82 (42%), Positives = 47/82 (57%)
Query: 279 EVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVI 338
E + E L L G Q+ R LD F GK+++LV TD+A RG+D VDLVI
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVATDVAERGLDLPGVDLVI 60
Query: 339 NLEIPHDAATYLHRMGRAGRYG 360
++P A+Y+ R+GRAGR G
Sbjct: 61 IYDLPWSPASYIQRIGRAGRAG 82
>gnl|CDD|201125 pfam00271, Helicase_C, Helicase conserved C-terminal domain. The
Prosite family is restricted to DEAD/H helicases,
whereas this domain family is found in a wide variety of
helicases and helicase related proteins. It may be that
this is not an autonomously folding unit, but an
integral part of the helicase.
Length = 78
Score = 90.3 bits (225), Expect = 1e-21
Identities = 33/76 (43%), Positives = 47/76 (61%)
Query: 285 LANANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPH 344
L L G Q+ R L+ F+ GK ++LV TD+A RGID +V+LVIN ++P
Sbjct: 3 LRKPGIKVARLHGGLSQEEREEILEDFRNGKSKVLVATDVAGRGIDLPDVNLVINYDLPW 62
Query: 345 DAATYLHRMGRAGRYG 360
+ A+Y+ R+GRAGR G
Sbjct: 63 NPASYIQRIGRAGRAG 78
>gnl|CDD|224122 COG1201, Lhr, Lhr-like helicases [General function prediction
only].
Length = 814
Score = 60.4 bits (147), Expect = 1e-08
Identities = 76/375 (20%), Positives = 145/375 (38%), Gaps = 93/375 (24%)
Query: 46 PIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTIILA----PTRE 101
P Q A P + +G + ++ + +G+GK+ + +N+L+ L + + I A P +
Sbjct: 25 PPQRYAIPEIHSGENVLIIAPTGSGKTEAAFLPVINELLSLGKGKLEDGIYALYISPLKA 84
Query: 102 IAVQI----TDVVRSVGAHVKNLHVDYFIGGTQVERP---KRPVQIVVGSP--------- 145
+ I + +R +G V H D Q E+ K P I++ +P
Sbjct: 85 LNNDIRRRLEEPLRELGIEVAVRHGDT----PQSEKQKMLKNPPHILITTPESLAILLNS 140
Query: 146 GRIKQMIKLKYLNMDSVRLFIMDEADKLINT--GFVEDITWIYSQLPPMKQMLVVSATYS 203
+ +++++ VR I+DE L + G QL L
Sbjct: 141 PKFRELLR-------DVRYVIVDEIHALAESKRG---------VQL-----AL------- 172
Query: 204 HDNLATLQKYMVDPLLI-------RPEDATRPLLG--VKQLVALIPECKNPSLRYVDEE- 253
+L L++ D I PE+ + L+G + + K ++ +
Sbjct: 173 --SLERLRELAGDFQRIGLSATVGPPEEVAKFLVGFGDPCEIVDVSAAKKLEIKVISPVE 230
Query: 254 -------------KKLVQLLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLS--GA 298
+++ +L+ + +IF+N + E + L G+
Sbjct: 231 DLIYDEELWAALYERIAELVKKH--RTTLIFTNTRSGAERLAFRL-KKLGPDIIEVHHGS 287
Query: 299 QDQKARLASLDSFKRGKVRILVTT---DLAARGIDAANVDLVINLEIPHDAATYLHRMGR 355
++ RL + K G+++ +V T +L GID ++DLVI L P +L R+GR
Sbjct: 288 LSRELRLEVEERLKEGELKAVVATSSLEL---GIDIGDIDLVIQLGSPKSVNRFLQRIGR 344
Query: 356 AGRYGT---RGLVIT 367
AG +G++I
Sbjct: 345 AGHRLGEVSKGIIIA 359
>gnl|CDD|182933 PRK11057, PRK11057, ATP-dependent DNA helicase RecQ; Provisional.
Length = 607
Score = 55.9 bits (135), Expect = 2e-07
Identities = 92/359 (25%), Positives = 150/359 (41%), Gaps = 64/359 (17%)
Query: 29 EKPIRQGLQEN-GFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLP 87
E +Q LQE G+ + P Q VL+G D +V +G GKS+ + + AL
Sbjct: 10 ESLAKQVLQETFGYQQFRPGQQEIIDAVLSGRDCLVVMPTGGGKSLCYQIPAL------- 62
Query: 88 VQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGTQVERPKRPVQIVVGSPGR 147
V T++++P + D + + G + TQ R Q+ V + R
Sbjct: 63 VLDGLTLVVSPLISLMKDQVDQLLANGVAAA------CLNSTQ----TREQQLEVMAGCR 112
Query: 148 IKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVS-----ATY 202
Q IKL L+I E +L+ F+E + P +L V + +
Sbjct: 113 TGQ-IKL---------LYIAPE--RLMMDNFLEHLA----HWNPA--LLAVDEAHCISQW 154
Query: 203 SHD------NLATLQKYMVD----PLLIRPEDATR----PLLGVKQLVALIPECKNPSLR 248
HD L L++ L +D TR LLG+ + I P++R
Sbjct: 155 GHDFRPEYAALGQLRQRFPTLPFMALTATADDTTRQDIVRLLGLNDPLIQISSFDRPNIR 214
Query: 249 Y--VDEEKKLVQLLS--QTPFNQC-VIFSNYQLRPEVICENLANANFG-AEYLSGAQ-DQ 301
Y V++ K L QL+ Q + +I+ N + + E L + A Y +G D
Sbjct: 215 YTLVEKFKPLDQLMRYVQEQRGKSGIIYCNSRAKVEDTAARLQSRGISAAAYHAGLDNDV 274
Query: 302 KARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYG 360
+A + ++F+R ++I+V T GI+ NV V++ +IP + +Y GRAGR G
Sbjct: 275 RADVQ--EAFQRDDLQIVVATVAFGMGINKPNVRFVVHFDIPRNIESYYQETGRAGRDG 331
>gnl|CDD|223989 COG1061, SSL2, DNA or RNA helicases of superfamily II
[Transcription / DNA replication, recombination, and
repair].
Length = 442
Score = 55.1 bits (133), Expect = 3e-07
Identities = 58/342 (16%), Positives = 117/342 (34%), Gaps = 54/342 (15%)
Query: 54 YVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTIILAPTREIAVQITDVVRSV 113
++ +G GK++V A+ +L + T++L PT+E+ Q + ++
Sbjct: 51 NRRTERRGVIVLPTGAGKTVVAA-EAIAELKR------STLVLVPTKELLDQWAEALK-- 101
Query: 114 GAHVKNLHVDYFIGGTQVERPKRPVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKL 173
+ GG + E V + +Q++ N L I DE L
Sbjct: 102 -KFLLLNDEIGIYGGGEKELEPAKVTVATVQTLARRQLLDEFLGN--EFGLIIFDEVHHL 158
Query: 174 INTGFVEDITWIYSQLPPMKQMLVVSAT--YSHDNLATLQKYMVDPLL--IRPEDATRP- 228
+ I L L ++AT ++ P++ + ++
Sbjct: 159 PAPSYRR----ILELLSAAYPRLGLTATPEREDGGRIGDLFDLIGPIVYEVSLKELIDEG 214
Query: 229 -LLGVKQLVALIPECKNPSLRYVDEEKKLVQLLSQTPFN--------------------- 266
L K + + ++ Y E + +LL
Sbjct: 215 YLAPYKYVEIKVTLTEDEEREYAKESARFRELLRARGTLRAENEARRIAIASERKIAAVR 274
Query: 267 ----------QCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKV 316
+ +IF++ I + A E ++G ++ R A L+ F+ G +
Sbjct: 275 GLLLKHARGDKTLIFASDVEHAYEIAKLF-LAPGIVEAITGETPKEEREAILERFRTGGI 333
Query: 317 RILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGR 358
++LVT + G+D + D++I L ++ R+GR R
Sbjct: 334 KVLVTVKVLDEGVDIPDADVLIILRPTGSRRLFIQRLGRGLR 375
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 52.6 bits (127), Expect = 2e-06
Identities = 40/154 (25%), Positives = 69/154 (44%), Gaps = 23/154 (14%)
Query: 232 VKQLVALIPECK--NPSLRYVDEEKKLVQLLSQT----PFNQCVIFSNYQLRPEVICENL 285
++ V E +P L KL +++ + P ++ ++F+ Y+ E I + L
Sbjct: 332 FRKAVRKAKELDIEHPKLE------KLREIVKEQLGKNPDSRIIVFTQYRDTAEKIVDLL 385
Query: 286 ANANFGAEYLSGAQD--------QKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLV 337
A G QK ++ LD F+ G+ +LV+T +A G+D +VDLV
Sbjct: 386 EKEGIKAVRFVGQASKDGDKGMSQKEQIEILDKFRAGEFNVLVSTSVAEEGLDIPSVDLV 445
Query: 338 INLE-IPHDAATYLHRMGRAGRYGTRGLVITIVS 370
I E +P + R GR GR G V+ +++
Sbjct: 446 IFYEPVP-SEIRSIQRKGRTGR-QEEGRVVVLIA 477
>gnl|CDD|224036 COG1111, MPH1, ERCC4-like helicases [DNA replication,
recombination, and repair].
Length = 542
Score = 50.4 bits (121), Expect = 1e-05
Identities = 24/70 (34%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 301 QKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYG 360
QK + +D F++G+ +LV T + G+D VDLVI E + R GR GR
Sbjct: 411 QKEQKEIIDQFRKGEYNVLVATSVGEEGLDIPEVDLVIFYEPVPSEIRSIQRKGRTGR-K 469
Query: 361 TRGLVITIVS 370
+G V+ +V+
Sbjct: 470 RKGRVVVLVT 479
Score = 33.1 bits (76), Expect = 2.0
Identities = 24/106 (22%), Positives = 47/106 (44%), Gaps = 7/106 (6%)
Query: 68 GTGKSIVFVVAALNKLIKLPVQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIG 127
G GK+ + + N+L + + LAPT+ + +Q + R V + + G
Sbjct: 39 GLGKTFIAAMVIANRLRWFG---GKVLFLAPTKPLVLQHAEFCRKVTG-IPEDEIAALTG 94
Query: 128 GTQVERPKR---PVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEA 170
+ E + ++ V +P ++ +K +++D V L I DEA
Sbjct: 95 EVRPEEREELWAKKKVFVATPQVVENDLKAGRIDLDDVSLLIFDEA 140
>gnl|CDD|129701 TIGR00614, recQ_fam, ATP-dependent DNA helicase, RecQ family. All
proteins in this family for which functions are known
are 3'-5' DNA-DNA helicases. These proteins are used for
recombination, recombinational repair, and possibly
maintenance of chromosome stability. This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 470
Score = 50.2 bits (120), Expect = 1e-05
Identities = 73/343 (21%), Positives = 129/343 (37%), Gaps = 63/343 (18%)
Query: 46 PIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAAL--NKLIKLPVQQPQTIILAPTREIA 103
P+Q+ VL G D V +G GKS+ + + AL + + T++++P +
Sbjct: 14 PVQLEVINAVLLGRDCFVVMPTGGGKSLCYQLPALCSDGI---------TLVISPLISLM 64
Query: 104 VQITDVVRSVGAHVKNLHVDYFIGGTQVERPKRPVQIVVGSPGRIKQMIKLKYLN----M 159
+++ G F+ +Q + ++ V + IKL Y+
Sbjct: 65 EDQVLQLKASGIPAT------FLNSSQSKEQQKNVLTDLKDG-----KIKLLYVTPEKCS 113
Query: 160 DSVRLFIMDEADKLINTGFVEDITWI----------YSQLPPMKQML----VVSATYSHD 205
S RL E K I V++ I Y L +KQ +++ T +
Sbjct: 114 ASNRLLQTLEERKGITLIAVDEAHCISQWGHDFRPDYKALGSLKQKFPNVPIMALTATAS 173
Query: 206 NLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEKK------LVQL 259
+R ED R L +K P+L Y K L++
Sbjct: 174 P------------SVR-EDILR-QLNLKNPQIFCTSFDRPNLYYEVRRKTPKILEDLLRF 219
Query: 260 L-SQTPFNQCVIFSNYQLRPEVICENLANANFGA-EYLSGAQDQKARLASLDSFKRGKVR 317
+ + +I+ + + E + +L N A Y +G + AR F+R +++
Sbjct: 220 IRKEFKGKSGIIYCPSRKKSEQVTASLQNLGIAAGAYHAGLEI-SARDDVHHKFQRDEIQ 278
Query: 318 ILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYG 360
++V T GI+ +V VI+ +P +Y GRAGR G
Sbjct: 279 VVVATVAFGMGINKPDVRFVIHYSLPKSMESYYQESGRAGRDG 321
>gnl|CDD|130456 TIGR01389, recQ, ATP-dependent DNA helicase RecQ. The
ATP-dependent DNA helicase RecQ of E. coli is about 600
residues long. This model represents bacterial proteins
with a high degree of similarity in domain architecture
and in primary sequence to E. coli RecQ. The model
excludes eukaryotic and archaeal proteins with RecQ-like
regions, as well as more distantly related bacterial
helicases related to RecQ [DNA metabolism, DNA
replication, recombination, and repair].
Length = 591
Score = 47.4 bits (113), Expect = 8e-05
Identities = 77/352 (21%), Positives = 132/352 (37%), Gaps = 56/352 (15%)
Query: 32 IRQGLQEN-GFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAAL---------N 81
+Q L+ G+ P Q +VL+G D +V +G GKS+ + V AL +
Sbjct: 1 AQQVLKRTFGYDDFRPGQEEIISHVLDGRDVLVVMPTGGGKSLCYQVPALLLKGLTVVIS 60
Query: 82 KLIKLPVQQPQTIILAPTREIAV-------QITDVVRS-VGAHVKNLHVDYFIGGTQVER 133
LI L Q + A + + D+ ++ V +K L+V
Sbjct: 61 PLISLMKDQVDQLRAAGVAAAYLNSTLSAKEQQDIEKALVNGELKLLYV----------- 109
Query: 134 PKRPVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMK 193
+P R++Q L L + L +DEA + W + P +
Sbjct: 110 ----------APERLEQDYFLNMLQRIPIALVAVDEAHCVSQ--------WGHDFRPEYQ 151
Query: 194 QMLVVSATYSHDNLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRY--VD 251
++ ++ + L R LL + I P+LR+ V
Sbjct: 152 RLGSLAERFPQVPRIALTATADAET----RQDIRELLRLADANEFITSFDRPNLRFSVVK 207
Query: 252 EEKKLVQLLS--QTPFNQC-VIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASL 308
+ K LL + Q +I+++ + + E + E L + A K R +
Sbjct: 208 KNNKQKFLLDYLKKHRGQSGIIYASSRKKVEELAERLESQGISALAYHAGLSNKVRAENQ 267
Query: 309 DSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYG 360
+ F V+++V T+ GID NV VI+ ++P + +Y GRAGR G
Sbjct: 268 EDFLYDDVKVMVATNAFGMGIDKPNVRFVIHYDMPGNLESYYQEAGRAGRDG 319
>gnl|CDD|235514 PRK05580, PRK05580, primosome assembly protein PriA; Validated.
Length = 679
Score = 47.1 bits (113), Expect = 1e-04
Identities = 28/82 (34%), Positives = 33/82 (40%), Gaps = 21/82 (25%)
Query: 302 KARLAS-LDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLH--------R 352
K L L F RG+ IL+ T + A+G D NV LV L DA L R
Sbjct: 466 KGALEQLLAQFARGEADILIGTQMLAKGHDFPNVTLVGVL----DADLGLFSPDFRASER 521
Query: 353 M--------GRAGRYGTRGLVI 366
GRAGR G V+
Sbjct: 522 TFQLLTQVAGRAGRAEKPGEVL 543
>gnl|CDD|233040 TIGR00595, priA, primosomal protein N'. All proteins in this
family for which functions are known are components of
the primosome which is involved in replication, repair,
and recombination.This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 505
Score = 46.2 bits (110), Expect = 2e-04
Identities = 25/78 (32%), Positives = 30/78 (38%), Gaps = 12/78 (15%)
Query: 301 QKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLV------INLEIP------HDAAT 348
+ A A L+ F GK IL+ T + A+G NV LV L P
Sbjct: 298 KGAHEALLNQFANGKADILIGTQMIAKGHHFPNVTLVGVLDADSGLHSPDFRAAERGFQL 357
Query: 349 YLHRMGRAGRYGTRGLVI 366
GRAGR G VI
Sbjct: 358 LTQVAGRAGRAEDPGQVI 375
>gnl|CDD|234478 TIGR04121, DEXH_lig_assoc, DEXH box helicase, DNA
ligase-associated. Members of this protein family are
DEAD/DEAH box helicases found associated with a
bacterial ATP-dependent DNA ligase, part of a four-gene
system that occurs in about 12 % of prokaryotic
reference genomes. The actual motif in this family is
DE[VILW]H.
Length = 803
Score = 46.4 bits (111), Expect = 2e-04
Identities = 85/371 (22%), Positives = 160/371 (43%), Gaps = 68/371 (18%)
Query: 46 PIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKL--PVQQPQTIILAPTREIA 103
P Q+ + L G ++ + +G+GK++ + +L L P + T+ + P R +A
Sbjct: 16 PFQLEMWAAALEGRSGLLIAPTGSGKTLAGFLPSLIDLAGPEKPKKGLHTLYITPLRALA 75
Query: 104 VQITDVVRSVGAHVKNLHVDYFIG---G--TQVERP---KRPVQIVVGSPGRIKQMIKLK 155
V D+ R++ A ++ L + + G + ER K+P I++ +P + ++
Sbjct: 76 V---DIARNLQAPIEELGLPIRVETRTGDTSSSERARQRKKPPDILLTTPESLALLLSYP 132
Query: 156 YLNMDSVRLF------IMDEADKLINT--G-FVE-DITWIYSQLPPMKQMLVVSATYSHD 205
D+ RLF ++DE +L + G +E + + +L P + +SAT
Sbjct: 133 ----DAARLFKDLRCVVVDEWHELAGSKRGDQLELALARL-RRLAPGLRRWGLSATIG-- 185
Query: 206 NLATLQKYMV-----DPLLIRPEDATRPLLGVKQLVALIPEC--KNP-----SLRYVDEE 253
NL ++ ++ +L+R + + ++L+PE + P LR + E
Sbjct: 186 NLEEARRVLLGVGGAPAVLVRGKLPKAIPV-----ISLLPESEERFPWAGHLGLRALPE- 239
Query: 254 KKLVQLLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYL------SGAQDQKARLAS 307
+ + Q ++F+N + + E+ + L AN E+ G+ D++ R
Sbjct: 240 --VYAEIDQA--RTTLVFTNTRSQAELWFQALWEAN--PEFALPIALHHGSLDREQRRWV 293
Query: 308 LDSFKRGKVRILVTT---DLAARGIDAANVDLVINLEIPHDAATYLHRMGRAG-RYGTRG 363
+ G++R +V T DL G+D VDLVI + P A L R GR+ R G
Sbjct: 294 EAAMAAGRLRAVVCTSSLDL---GVDFGPVDLVIQIGSPKGVARLLQRAGRSNHRPG-EP 349
Query: 364 LVITIVSAESL 374
+V L
Sbjct: 350 SRALLVPTNRL 360
>gnl|CDD|223588 COG0514, RecQ, Superfamily II DNA helicase [DNA replication,
recombination, and repair].
Length = 590
Score = 45.0 bits (107), Expect = 5e-04
Identities = 24/92 (26%), Positives = 39/92 (42%)
Query: 279 EVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVI 338
E + E L A + R +F +++++V T+ GID +V VI
Sbjct: 244 EELAEWLRKNGISAGAYHAGLSNEERERVQQAFLNDEIKVMVATNAFGMGIDKPDVRFVI 303
Query: 339 NLEIPHDAATYLHRMGRAGRYGTRGLVITIVS 370
+ ++P +Y GRAGR G I + S
Sbjct: 304 HYDLPGSIESYYQETGRAGRDGLPAEAILLYS 335
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 44.4 bits (105), Expect = 8e-04
Identities = 45/252 (17%), Positives = 67/252 (26%), Gaps = 21/252 (8%)
Query: 1804 SPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNESHFGNSRCQPKTAYQSS 1863
+ S D S S + S ++ S P A S
Sbjct: 101 AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDL-SEMLRPVGSPGPPPAASPPAAGASP 159
Query: 1864 SAQSEDGTSYKSISWDAYSVIDEPESYDVSWRTCSSESAAREKNSGQRSRGSGDKAARKS 1923
+A + D S + + S +E S A S + S R S
Sbjct: 160 AAVASDAASSRQAA-LPLSSPEETA-------RAPSSPPAEPPPSTPPAAASPRPPRRSS 211
Query: 1924 -ASLPASQIALRDGK-GDFKSPRKASHSLKDGKRYGNAQSFEAARNSNPYEAPMPGA--- 1978
S AS A G+ + +S S ++ N P P P
Sbjct: 212 PISASASSPAPAPGRSAADDAGASSSDSSS----SESSGCGWGPENECPLPRPAPITLPT 267
Query: 1979 --WQPVLEGTSVFDDAEKSSKSGPKSSKAGPKSSKPGPKSSKPGPKSSKRYPSRSGSSGL 2036
W+ SS S P+ P S PG P + S S S
Sbjct: 268 RIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPG-SGPAPSSPRASSSSSSSRESSS 326
Query: 2037 NNRTEGNSVSHT 2048
++ + + S
Sbjct: 327 SSTSSSSESSRG 338
Score = 33.6 bits (77), Expect = 1.5
Identities = 40/239 (16%), Positives = 64/239 (26%), Gaps = 31/239 (12%)
Query: 1898 SSESAAREKNSGQRSRGSGDKAARKSASLPASQIALRDGKGDFKSPRKASHSLKDGKRYG 1957
SS S + G + + A + + + +G P +S S +R
Sbjct: 239 SSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSS--PRERSP 296
Query: 1958 NAQSFEAARNSNPYEAPMPGAWQPVLEGTSVFDDA-EKSSKSGPKSSKAGPKSSKPGPKS 2016
+ S+P P P + + +S + + +S S S A +S
Sbjct: 297 SPSP------SSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRS 350
Query: 2017 SKPGPKSSKRYPSRSGSSGLNNRTEGNSVSHTWENHFATGTDRRSHQYPAYGSMSRHDLY 2076
P PS +R + + T RR+ A + R
Sbjct: 351 PSPSRPPPPADPSSPRKRPRPSRAPSSPAASA-----GRPTRRRARAAVAGRARRRD--- 402
Query: 2077 FDESQLLGNEALGRGEIRGSGGGFASRVSSVGPANESGLVVAG--WNRCDREGHDPARG 2133
G R S A S A L +G W P G
Sbjct: 403 -------ATGRFPAGRPRPSPLD-AGAASGAFYARYPLLTPSGEPWPGSP----PPPPG 449
>gnl|CDD|224119 COG1198, PriA, Primosomal protein N' (replication factor Y) -
superfamily II helicase [DNA replication, recombination,
and repair].
Length = 730
Score = 43.5 bits (103), Expect = 0.001
Identities = 28/83 (33%), Positives = 34/83 (40%), Gaps = 21/83 (25%)
Query: 306 ASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHR------------- 352
LD F G+ IL+ T + A+G D NV LV L DA T L
Sbjct: 525 DLLDQFANGEADILIGTQMIAKGHDFPNVTLVGVL----DADTGLGSPDFRASERTFQLL 580
Query: 353 ---MGRAGRYGTRGLVITIVSAE 372
GRAGR G G V+ I +
Sbjct: 581 MQVAGRAGRAGKPGEVV-IQTYN 602
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 43.4 bits (102), Expect = 0.002
Identities = 29/132 (21%), Positives = 50/132 (37%), Gaps = 11/132 (8%)
Query: 1746 ESSHKRNRKANDFNDTCGAVANET---FDGEDSKGGSDD---ENNLIVTLSEAGDSEVDV 1799
+ K +D D A+E D E + D+ E N + D D+
Sbjct: 3996 DEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDL 4055
Query: 1800 EGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNESHFGNSRCQPKTA 1859
++ +ED N Q + EST +SDE GE V +++ + + K+
Sbjct: 4056 AEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGE-----VPEDQAIDNHPKMDAKST 4110
Query: 1860 YQSSSAQSEDGT 1871
+ S+ A E+
Sbjct: 4111 FASAEADEENTD 4122
Score = 39.6 bits (92), Expect = 0.029
Identities = 58/329 (17%), Positives = 119/329 (36%), Gaps = 37/329 (11%)
Query: 422 QEREDVEQTAEESSKVENTKPLRG-APSKEVTKKGASKE--GTKWNLTDGQSEEES-DSE 477
E +D+E+ A E + A E G +K+ ++ +D + E + D E
Sbjct: 3832 NEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEE 3891
Query: 478 NDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGANSAEKSLEDK 537
+ P++ + K E +E ++ E ++ ++ A+ +E + K+LEDK
Sbjct: 3892 VNDIPEDLSNSLNEKLWDE-PNEEDLLET--EQKSNEQSAANNESDLVSKEDDNKALEDK 3948
Query: 538 HGKKITNSFEAVGEIVQNDTETTKENNQTDEVPEENG--LVGNPNKAEVGDLNECESCTD 595
+ +++ E + + DE+ + P + E DL E
Sbjct: 3949 D-----------RQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPE------ 3991
Query: 596 SFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDP-QGNNANQREIR--- 651
++E + ++ ++ + ++K++ D ++ DP + NN +I+
Sbjct: 3992 DLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDD 4051
Query: 652 -------DNSATLDKTEKESVNNVESKDERPPSTANNPDGTEWEDIPSDTIPNDETSNDN 704
D D E+ N ES ++ S G ED D P + +
Sbjct: 4052 FSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTF 4111
Query: 705 KLTPVDFDYINDFVNQTPSETGDRPLVHG 733
D + + + E G+ V G
Sbjct: 4112 ASAEADEENTDKGIVGENEELGEEDGVRG 4140
>gnl|CDD|137505 PRK09751, PRK09751, putative ATP-dependent helicase Lhr;
Provisional.
Length = 1490
Score = 43.4 bits (102), Expect = 0.002
Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 2/62 (3%)
Query: 296 SGAQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGR 355
S +++Q+A + K G++R +V T GID VDLVI + P A+ L R+GR
Sbjct: 310 SVSKEQRAITEQ--ALKSGELRCVVATSSLELGIDMGAVDLVIQVATPLSVASGLQRIGR 367
Query: 356 AG 357
AG
Sbjct: 368 AG 369
>gnl|CDD|224126 COG1205, COG1205, Distinct helicase family with a unique C-terminal
domain including a metal-binding cysteine cluster
[General function prediction only].
Length = 851
Score = 42.4 bits (100), Expect = 0.003
Identities = 40/180 (22%), Positives = 65/180 (36%), Gaps = 17/180 (9%)
Query: 294 YLSGAQDQKARLASLDSFKRGKVRILVTTDLAAR-GIDAANVDLVINLEIP-HDAATYLH 351
Y +G ++ R + FK G++ ++ T A GID ++D VI P ++
Sbjct: 344 YRAGLHREERRRIEAE-FKEGELLGVIAT-NALELGIDIGSLDAVIAYGYPGVSVLSFRQ 401
Query: 352 RMGRAGRYGTRGLVITIVSAESLVKFHSLMGEINLDHAFNVGLVPDNLTGDQINWTQRVQ 411
R GRAGR G LV+ ++ ++ L ++ E G V D +
Sbjct: 402 RAGRAGRRGQESLVLVVLRSDPLDSYYLRHPE--ELLETGFGPVESVRVDDNNEYLLDGH 459
Query: 412 TLLA-----------KPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEG 460
L A P D + E E E R + ++GAS+
Sbjct: 460 LLCADAELPLKDRELLPSDGSLEEEIARSLVGEGQLHHENDKYRPSSRGLCPREGASQVS 519
Score = 32.8 bits (75), Expect = 2.9
Identities = 15/56 (26%), Positives = 32/56 (57%), Gaps = 2/56 (3%)
Query: 48 QMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTIILAPTREIA 103
Q+ A + G + +V + +G+GK+ F++ L+ L++ P + ++L PT +A
Sbjct: 75 QVDALRLIREGRNVVVTTGTGSGKTESFLLPILDHLLRDP--SARALLLYPTNALA 128
>gnl|CDD|224125 COG1204, COG1204, Superfamily II helicase [General function
prediction only].
Length = 766
Score = 42.0 bits (99), Expect = 0.005
Identities = 25/134 (18%), Positives = 63/134 (47%), Gaps = 16/134 (11%)
Query: 46 PIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKL--PVQQPQTIILAPTR--- 100
P Q A +L+ + ++ + +G+GK+++ ++A L+ L++ V + + P +
Sbjct: 35 PQQEAVEKGLLSDENVLISAPTGSGKTLIALLAILSTLLEGGGKV-----VYIVPLKALA 89
Query: 101 -EIAVQITDVVRSVGAHVKNLHVDYFIGGTQVERPKRPVQIVVGSPGRIKQMIKLKYLNM 159
E + + + +G V DY + ++ R ++V +P ++ + + + +
Sbjct: 90 EEKYEEFSRLEE-LGIRVGISTGDYDLDDERLARY----DVIVTTPEKLDSLTRKRPSWI 144
Query: 160 DSVRLFIMDEADKL 173
+ V L ++DE L
Sbjct: 145 EEVDLVVIDEIHLL 158
Score = 32.7 bits (75), Expect = 3.1
Identities = 20/77 (25%), Positives = 29/77 (37%), Gaps = 11/77 (14%)
Query: 304 RLASLDSFKRGKVRILVTTDLAARG---------IDAANVDLVINLEIPHDAATYLHRMG 354
R D+F++GK+++LV+T A G I + L G
Sbjct: 329 RQLVEDAFRKGKIKVLVSTPTLAAGVNLPARTVIIKDTRRYDPKGGIVDIPVLDVLQMAG 388
Query: 355 RAGR--YGTRGLVITIV 369
RAGR Y G I +
Sbjct: 389 RAGRPGYDDYGEAIILA 405
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 40.9 bits (95), Expect = 0.011
Identities = 48/267 (17%), Positives = 97/267 (36%), Gaps = 20/267 (7%)
Query: 414 LAKPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEE 473
K ++A++ ++ ++ AEE+ K + K K+ + + E K ++EE
Sbjct: 1462 AKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEA 1521
Query: 474 SDSENDGKPDENGSQRGSKRSKEFKTQEKC-----VKGAGEDENGDRKASKDEQKIDGAN 528
++ K +E +K+++E K ++ +K A E + + +E K
Sbjct: 1522 KKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALR 1581
Query: 529 SAE--KSLEDKHGKKITNSFEAV----GEIVQNDTETTKENNQTDEVPEENGLVGNPNKA 582
AE K E+ +++ +E E + E + + + EE V K
Sbjct: 1582 KAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK 1641
Query: 583 EVGDLNECESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQG 642
E + + E E + + +K + KKK + +K D++
Sbjct: 1642 EAEEKKKAE---------ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692
Query: 643 NNANQREIRDNSATLDKTEKESVNNVE 669
+ E + L K E E E
Sbjct: 1693 ALKKEAEEAKKAEELKKKEAEEKKKAE 1719
Score = 37.0 bits (85), Expect = 0.15
Identities = 59/280 (21%), Positives = 117/280 (41%), Gaps = 28/280 (10%)
Query: 415 AKPLDQAQEREDVEQT-------------AEESSKVENTKPLRGAPSKEVTKKGASKEGT 461
A+ L +A+E++ E+ AEE+ K E + E KK ++E
Sbjct: 1554 AEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAK 1613
Query: 462 KWNLTDGQSEEESDSENDGKPDENGSQRGS---KRSKEFKTQEKCVKGAGEDENGDRKAS 518
K ++EE +E + K E ++ + K+++E K E+ K +E +KA
Sbjct: 1614 KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEA--KKAE 1671
Query: 519 KDEQKIDGANSAEKSLEDKHGKKITNSFEA--VGEIVQNDTETTKENNQTDEVPEENGLV 576
+D++K + A AE+ + + EA E+ + + E K+ + + EEN +
Sbjct: 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKI- 1730
Query: 577 GNPNKAEVGDLNECESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGD 636
KAE E + + +E + K+ +++K + +K+K+ + D
Sbjct: 1731 ----KAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786
Query: 637 KRDPQGNNANQREIRD---NSATLDKTEKESVNNVESKDE 673
+ D + ++I+D N A + + KE + E
Sbjct: 1787 EEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKE 1826
Score = 35.9 bits (82), Expect = 0.40
Identities = 48/237 (20%), Positives = 88/237 (37%), Gaps = 5/237 (2%)
Query: 416 KPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESD 475
K D+A++ E+ ++ E K E K A K K + K ++ E +
Sbjct: 1290 KKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349
Query: 476 SENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGANSAEKSLE 535
+E + DE + + E K +E K + + K DE K +K+ E
Sbjct: 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409
Query: 536 DKHG---KKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKAE--VGDLNEC 590
K KK + + E + E K+ + + E KAE E
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEA 1469
Query: 591 ESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQ 647
+ ++ E + + D E K+K ++KK + K+K + K+ + A++
Sbjct: 1470 KKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526
Score = 35.5 bits (81), Expect = 0.45
Identities = 59/256 (23%), Positives = 102/256 (39%), Gaps = 25/256 (9%)
Query: 420 QAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDSEND 479
+A + + E+ AEE+ K E+ K E KK A + ++EEE ++E
Sbjct: 1203 EAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAK--------KAEEERNNEEI 1254
Query: 480 GKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGANSAEKSLEDKHG 539
K +E ++R K +E + A E K +++++K D A AE+ +
Sbjct: 1255 RKFEEARMAHFARRQAAIKAEEA--RKADE-----LKKAEEKKKADEAKKAEEKKKADEA 1307
Query: 540 KKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKAEVGDLNECESCTDSFHM 599
KK + E + E K+ + + EE KA E E+ D
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEA------KKAAEAAKAEAEAAADEAEA 1361
Query: 600 NEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQREIRDNSATLDK 659
E E KE K+K +KKK + K+K + K+ + + E++ +A K
Sbjct: 1362 AEEKAEAA--EKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419
Query: 660 TE--KESVNNVESKDE 673
+ K+ + DE
Sbjct: 1420 ADEAKKKAEEKKKADE 1435
Score = 34.3 bits (78), Expect = 0.98
Identities = 53/269 (19%), Positives = 104/269 (38%), Gaps = 16/269 (5%)
Query: 408 QRVQTLLAKPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTD 467
++ K + ++ +++++ A K + K K K ++E K +
Sbjct: 1391 KKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAK 1450
Query: 468 GQSEEESDSENDGKPDENG--SQRGSKRSKEFKTQEKCVKGAGEDENGD---RKASKDEQ 522
++EE +E K E + K+++E K ++ K A E + +KA++ ++
Sbjct: 1451 KKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKK 1510
Query: 523 KIDGANSAEKSLEDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEV--PEENGLVGNPN 580
K D A AE++ + KK + +A + + E + +E+ EE
Sbjct: 1511 KADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAK 1570
Query: 581 KAEVGDLNECESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDP 640
KAE D N + E + ++ +E +K K ++KK ++ K K K+
Sbjct: 1571 KAE-EDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK-- 1627
Query: 641 QGNNANQREIRDNSATLDKTEKESVNNVE 669
E + L K E E E
Sbjct: 1628 ------AEEEKKKVEQLKKKEAEEKKKAE 1650
Score = 32.8 bits (74), Expect = 2.8
Identities = 49/251 (19%), Positives = 99/251 (39%), Gaps = 14/251 (5%)
Query: 414 LAKPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEE 473
K ++ ++ ++ ++ AEE+ K + K K K ++E K + ++EE
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEA 1482
Query: 474 SDSENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGANSAEKS 533
++ K E K++ E K + K A E + + DE K A A+K+
Sbjct: 1483 KKADEAKKKAE----EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAK--KAEEAKKA 1536
Query: 534 LEDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKAEVGDLNECESC 593
E K ++ + E E K+ + + E+ + KAE + E
Sbjct: 1537 DEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMA--LRKAEE--AKKAEEA 1592
Query: 594 TDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQREIRDN 653
M + +E + K + +K ++K K + +K +K+ + + E +
Sbjct: 1593 RIEEVMKLYEEEKKM----KAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648
Query: 654 SATLDKTEKES 664
+ L K E+E+
Sbjct: 1649 AEELKKAEEEN 1659
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of AF4
(Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain. Lilliputian
represents a novel pair-rule gene that acts in
cytoskeleton regulation, segmentation and morphogenesis
in Drosophila.
Length = 1154
Score = 40.3 bits (94), Expect = 0.017
Identities = 53/270 (19%), Positives = 94/270 (34%), Gaps = 29/270 (10%)
Query: 1782 ENNLIVTLSEAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHS 1841
E++L ++ SE D E E S ++ E A +S S +S ES S S
Sbjct: 357 EDDLKLSSSEDSDEEQATEKPPSRNTPPSAP---SSNPEPAASSSGSSSSSSGSESSSGS 413
Query: 1842 VKSNESHFGNSRCQPKTAYQSSSAQSEDGTSYK--------SISWDAYSVIDEPESYDVS 1893
+ES +S + +++S + E ++ K ++ S + S
Sbjct: 414 DSESESSSSDS--EENEPPRTASPEPEPPSTNKWQLDNWLNKVNPHKVSPAESVSSNPPI 471
Query: 1894 WRTCSSESAAREKNSGQRSRGSGDK---AARKSASLPASQIALRDGKGDFKSPRKASHSL 1950
+ E + S ++++ +Q G+G KSP S +
Sbjct: 472 KQPMEKEGKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGPESGRGKQKSP-AQSEAP 530
Query: 1951 KDGKRYGNAQSFEAARNSNPYEAPMPGAWQPVLEGTSVFDDAEKSSKSGPKSSKAG---- 2006
+ G Q + + S E GT + + ++ PK++ G
Sbjct: 531 PQRRTVGKKQPKKPEKASAGDERTGLRPESEP--GTLPYGSSVQTPPDRPKAATKGSRKP 588
Query: 2007 -----PKSSKPGPKSSKPGPKSSKRYPSRS 2031
PKSS P P + K KS + +S
Sbjct: 589 SPRKEPKSSVP-PAAEKRKYKSPSKIVPKS 617
>gnl|CDD|237497 PRK13767, PRK13767, ATP-dependent helicase; Provisional.
Length = 876
Score = 39.9 bits (94), Expect = 0.019
Identities = 20/54 (37%), Positives = 32/54 (59%)
Query: 304 RLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAG 357
RL + KRG+++++V++ GID +DLV+ L P + L R+GRAG
Sbjct: 329 RLEVEEKLKRGELKVVVSSTSLELGIDIGYIDLVVLLGSPKSVSRLLQRIGRAG 382
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 39.0 bits (91), Expect = 0.029
Identities = 19/165 (11%), Positives = 58/165 (35%), Gaps = 4/165 (2%)
Query: 1729 SSSHSQSEFDPKRLHSKESSHKRNRKANDFNDTCGAVANETFDGEDSKGGSDDENNLIVT 1788
++S+ + +S N + + + +K DD +L
Sbjct: 67 NTSNQDNNDKKFSTIDSSTSDSNNIIDFIYK----NLPQTNINQLLTKNKYDDNYSLTTL 122
Query: 1789 LSEAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNESH 1848
+ + D+ P + ++ N+S + +S+ +N ++S ++D+ S+ +
Sbjct: 123 IQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNT 182
Query: 1849 FGNSRCQPKTAYQSSSAQSEDGTSYKSISWDAYSVIDEPESYDVS 1893
++ + + + + + + + S + +S S
Sbjct: 183 KPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDS 227
Score = 37.1 bits (86), Expect = 0.12
Identities = 36/233 (15%), Positives = 81/233 (34%), Gaps = 5/233 (2%)
Query: 1779 SDDENNLIVTLSEAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESD 1838
D S+ + + + +S + D NN+ N + + S D ++ D +
Sbjct: 32 KDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNI 91
Query: 1839 SHSVKSNESHFGNSRCQPKTAYQSSSAQSEDGTSYKSISWDAYSVIDEPESYDVSWRTCS 1898
+ N ++ K Y + + + + +++ D S ++P + + S +
Sbjct: 92 IDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDI-SDYEQPRNSEKSTNDSN 150
Query: 1899 SESAAREKNSGQRSRGSGDKAARKSASLPASQIALRDGKGDFKSPRKASHSLKDGKRYGN 1958
S + KN DKA + A + K + K + + + +
Sbjct: 151 KNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQP--NSPKPTQPNQSNSQPAS 208
Query: 1959 AQSFEAARNSNPYEAPMPGAWQPVLEGTSVFDDAEKSSKSGPKSSKAGPKSSK 2011
+ +S ++ A +L+ S +DA+K+ K SK +
Sbjct: 209 DDTANQKSSSKDNQSMSDSALDSILDQYS--EDAKKTQKDYASQSKKDKTETS 259
Score = 37.1 bits (86), Expect = 0.13
Identities = 38/209 (18%), Positives = 69/209 (33%), Gaps = 16/209 (7%)
Query: 516 KASKDEQKIDGANSAEKSLEDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEV-----P 570
K S D++ +S K + +N + D+ T+ NN D +
Sbjct: 44 KKSNDDETSKDTSS--KDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQ 101
Query: 571 EENGLVGNPNKAEVGDLNECESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSK---KKK 627
+ NK + + + F++N + + N+++S S K
Sbjct: 102 TNINQLLTKNKYDDN-YSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKND 160
Query: 628 DRKRKGNGDKRDPQGNNANQREIRDNSATLDKTEKESVNNVESKDERPPSTANNPDGTE- 686
+ DK D Q ++ S + K + N + TAN ++
Sbjct: 161 TDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKD 220
Query: 687 ---WEDIPSDTIPNDETSNDNKLTPVDFD 712
D D+I D+ S D K T D+
Sbjct: 221 NQSMSDSALDSIL-DQYSEDAKKTQKDYA 248
>gnl|CDD|234365 TIGR03817, DECH_helic, helicase/secretion neighborhood putative
DEAH-box helicase. A conserved gene neighborhood widely
spread in the Actinobacteria contains this
uncharacterized DEAH-box family helicase encoded
convergently towards an operon of genes for protein
homologous to type II secretion and pilus formation
proteins. The context suggests that this helicase may
play a role in conjugal transfer of DNA.
Length = 742
Score = 38.9 bits (91), Expect = 0.032
Identities = 24/87 (27%), Positives = 42/87 (48%), Gaps = 5/87 (5%)
Query: 288 ANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAA 347
A + A YL +D++ +L G++ + TT+ G+D + +D V+ P A
Sbjct: 307 AAYRAGYL--PEDRRELERALRD---GELLGVATTNALELGVDISGLDAVVIAGFPGTRA 361
Query: 348 TYLHRMGRAGRYGTRGLVITIVSAESL 374
+ + GRAGR G LV+ + + L
Sbjct: 362 SLWQQAGRAGRRGQGALVVLVARDDPL 388
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that
may also include truncated forms and/or pseudogenes.
Members are predicted lipoproteins with a conserved
signal peptidase II processing and lipid attachment
site. Note that the name for certain characterized
members, p72, reflects an anomalous apparent molecular
weight, given a theoretical MW of about 61 kDa.
Length = 541
Score = 38.7 bits (90), Expect = 0.038
Identities = 24/113 (21%), Positives = 46/113 (40%), Gaps = 6/113 (5%)
Query: 557 TETTKENNQTDEVPEENGLVGNPNKAEVGDLNECESCTDSFHMNEHSDENDLGNNTKESQ 616
+ T+ + Q ++ PE P + D E ++ NE+ DE + + E +
Sbjct: 25 STTSSNSKQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKK 84
Query: 617 KQKGH-KSKKKKDRKRKGNGDKR----DPQGNNANQREI-RDNSATLDKTEKE 663
K + +K + DK+ P +A+Q + + + LDK +KE
Sbjct: 85 PDPSKNKEEIEKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDFSDLDKLKKE 137
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 38.5 bits (90), Expect = 0.046
Identities = 48/269 (17%), Positives = 94/269 (34%), Gaps = 35/269 (13%)
Query: 448 SKEVTKKGASKEGTKWNLTDGQSEEESDSENDGKPDENGSQRG-SKRSKEFKTQEKCVKG 506
+E+ +K K ++ + + EE D ++D D+ + K+ + K E +
Sbjct: 311 GEELRRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEEN 370
Query: 507 AG--------------EDENGD-----RKASKDEQKIDGANSAEKSLEDKHGKKITNSFE 547
+G ++EN R+ + E++ D E+ + G++
Sbjct: 371 SGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEE-ENEEPSKKNVGRRKFG--P 427
Query: 548 AVGEIVQNDTETTKENNQTDEVP----EENGLVGNPNKAEVGDLNECESCTDSFHMNEHS 603
GE + KEN + EE L N+ ++ E
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEE 487
Query: 604 DENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQREIRDNSATLDKTEKE 663
+E D N ++ G KS KK+D K+K + + A + + K +KE
Sbjct: 488 EELDEENPWLKTTSSVG-KSAKKQDSKKKSSSKL-----DKAANKISKAAVKVKKKKKKE 541
Query: 664 SVNNVESK--DERPPSTANNPDGTEWEDI 690
+++ DE + D + +D
Sbjct: 542 KSIDLDDDLIDEEDSIKLDVDDEEDEDDE 570
Score = 33.9 bits (78), Expect = 1.2
Identities = 18/121 (14%), Positives = 39/121 (32%), Gaps = 4/121 (3%)
Query: 417 PLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDS 476
+ + + + ++ E + E K K E + + EEE D
Sbjct: 437 KKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKE----EEEEELDE 492
Query: 477 ENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGANSAEKSLED 536
EN + + +K+ K + A + K ++K + + + L D
Sbjct: 493 ENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLID 552
Query: 537 K 537
+
Sbjct: 553 E 553
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 38.4 bits (89), Expect = 0.050
Identities = 46/269 (17%), Positives = 99/269 (36%), Gaps = 21/269 (7%)
Query: 423 EREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDSENDGKP 482
+ + E T E + + + E + A +EG + +SE E +E G+
Sbjct: 634 DVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQ 693
Query: 483 DENGSQRGSKRSKEFKTQEKCVKGAGEDE--------------NGDRKASKDEQKIDGAN 528
+ G + + +T+ + V+ GE E G+ + E + +G +
Sbjct: 694 EGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKH 753
Query: 529 SAEKSLEDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKAEVGDLN 588
E + K + + E E +++ E + + E+ + G G +
Sbjct: 754 EVETEGDRKETE---HEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETE-- 808
Query: 589 ECESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQR 648
E +D+ ++ + T E + ++ + K+D K G D G ++ +
Sbjct: 809 AGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSD--GGDSEEE 866
Query: 649 EIRDNSATLDKTEKESVNNVESKDERPPS 677
E + ++ E+E E ++E P S
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEEPLS 895
Score = 35.7 bits (82), Expect = 0.40
Identities = 37/209 (17%), Positives = 74/209 (35%), Gaps = 8/209 (3%)
Query: 417 PLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTK-KGASKEGTKWNLTDGQSEEESD 475
P ++ E+E + + + + + T+ +G EG +G+ E E +
Sbjct: 686 PAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGE-EVEDE 744
Query: 476 SENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKD-EQKIDGANSAEKSL 534
E + + G ++ E + + + EDE G+ +A +D E K D +
Sbjct: 745 GEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDE-GEIQAGEDGEMKGDEGAEGKVEH 803
Query: 535 EDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKA----EVGDLNEC 590
E + + E E +DTE E + + E G K D +
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDS 863
Query: 591 ESCTDSFHMNEHSDENDLGNNTKESQKQK 619
E + E +E + +E + ++
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
>gnl|CDD|224123 COG1202, COG1202, Superfamily II helicase, archaea-specific
[General function prediction only].
Length = 830
Score = 38.6 bits (90), Expect = 0.054
Identities = 31/123 (25%), Positives = 53/123 (43%), Gaps = 8/123 (6%)
Query: 267 QCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTDLAA 326
Q ++F+ + R + + L A K R + +F ++ +VTT A
Sbjct: 442 QTIVFTYSRRRCHELADALTGKGLKAAPYHAGLPYKERKSVERAFAAQELAAVVTTAALA 501
Query: 327 RGID-AANVDLVINLEIPHD---AATYLHRMGRAGR--YGTRGLVITIVSAESLVKFHSL 380
G+D A+ + +L + + + +GRAGR Y RG V +V E K+H+
Sbjct: 502 AGVDFPASQVIFESLAMGIEWLSVREFQQMLGRAGRPDYHDRGKVYLLV--EPGKKYHAS 559
Query: 381 MGE 383
M E
Sbjct: 560 MEE 562
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 37.2 bits (86), Expect = 0.13
Identities = 37/222 (16%), Positives = 85/222 (38%), Gaps = 26/222 (11%)
Query: 409 RVQTLLAKPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDG 468
R++ +++K + + E+ E+ + + + T PSK + G S
Sbjct: 383 RLEEVISKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESP------SMAS 436
Query: 469 QSEEESDSENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGAN 528
Q EE +S + + +E + + S+E + +++ E+E + E++++G++
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDE------EEEEEVEADNGSEEEMEGSS 490
Query: 529 SAEKSLEDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKAEVGDLN 588
+ E+ + E G ++ + + ++ E P+E P + E
Sbjct: 491 EGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQE-----EPLQPES---- 541
Query: 589 ECESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRK 630
D+ + E SDE L + S + + K
Sbjct: 542 -----MDAESVGEESDEELLAEESPLSSHTELEGVATPVETK 578
Score = 33.3 bits (76), Expect = 1.7
Identities = 21/136 (15%), Positives = 45/136 (33%), Gaps = 13/136 (9%)
Query: 1745 KESSHKRNRKANDFNDTCGAVANETFDGEDSKGGSDDENN--------LIVTLSEAGDSE 1796
+E +R ++ + + + + ++S+ E SE + E
Sbjct: 409 RERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGE 468
Query: 1797 VDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNESHFGNSRCQP 1856
+ E E E E N S+ + E +++ E + +S + S +
Sbjct: 469 DEEEEE-----EVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPR 523
Query: 1857 KTAYQSSSAQSEDGTS 1872
++ Q S Q E
Sbjct: 524 GSSVQPESPQEEPLQP 539
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 35.6 bits (82), Expect = 0.38
Identities = 22/94 (23%), Positives = 40/94 (42%), Gaps = 1/94 (1%)
Query: 893 DRTKRRSRPRKNKENTDILNTLGQDKTSAHGRTPSGGHRTPERRRSDGVRNDTRDERNGA 952
DR++ R R R+++E + ++ +D+ R+P + RR D R +R R+
Sbjct: 30 DRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIE 89
Query: 953 YTGDKGGRNRGHSVNSRKDHSFAGETPRSKGGFD 986
+ R+R S RKD G++
Sbjct: 90 -QHRRRLRDRSPSNQWRKDDKKRSLWDIKPPGYE 122
Score = 32.2 bits (73), Expect = 3.6
Identities = 27/146 (18%), Positives = 46/146 (31%), Gaps = 28/146 (19%)
Query: 1043 EQPVDEKKSRRELSQKMDSDRYRPDSDGDPNRRPSSQESLHHNDSLDIKLRKKLKPGHRL 1102
E+P E++ R + S+R R R S+ H S + R+ +P R
Sbjct: 3 EEPDREREKSRGRDRDRSSERPR------RRSRDRSRFRDRHRRSRERSYREDSRPRDRR 56
Query: 1103 QRNSKKRHSKISTERSNSNSAPREKSSDAKQAANEEENITGGNDENKTPVASEVRNKNKK 1162
+ +S+ S + S PR +S + + +
Sbjct: 57 RYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRR-------------------RLRD 97
Query: 1163 VKGVKEKNPSIVGDEKRRYWWRSPPG 1188
+ D+KR W PPG
Sbjct: 98 RSPSNQWRKD---DKKRSLWDIKPPG 120
>gnl|CDD|188395 TIGR03865, PQQ_CXXCW, PQQ-dependent catabolism-associated CXXCW
motif protein. Members of this protein family have a
CXXXCW motif, consistent with a possible role in redox
cofactor binding. This protein family shows strong
relationships by phylogenetic profiling and conserved
gene neighborhoods with a transport system for alcohols
metabolized by PQQ-dependent enzymes.
Length = 162
Score = 33.9 bits (78), Expect = 0.40
Identities = 19/62 (30%), Positives = 25/62 (40%), Gaps = 5/62 (8%)
Query: 674 RPPSTANNPDGTEWEDIPSDTIPND----ETSNDNKLTPVDFDYINDFVNQTPSETGDRP 729
RPP N +GT W D P ++P T N L P Y+ + + RP
Sbjct: 60 RPPKPKNLLEGTVWRDEPHLSLPGSLWLPNTGYGN-LAPAWQAYLRRGLARATGGDKGRP 118
Query: 730 LV 731
LV
Sbjct: 119 LV 120
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 35.3 bits (81), Expect = 0.44
Identities = 45/268 (16%), Positives = 92/268 (34%), Gaps = 35/268 (13%)
Query: 453 KKGASKEGTKWNLTDGQSEEESDSENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDEN 512
+KG G + D + ++E D + K E+G + SK+ K+ + K K +D+
Sbjct: 203 EKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNK--KKLDDDKK 260
Query: 513 GDRKASKDEQKIDGAN--------------SAEKSLEDKHGKKITNSFEAVGEIVQNDTE 558
G R D + D + SA + ++ K++ A EI Q++
Sbjct: 261 GKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDS 320
Query: 559 TTKENNQTDEVPEENGLVGNPNKAEVGDL-----NECESCTDSFHMNEHSDENDLGNNTK 613
E +E EE G + K + + ++ + + +
Sbjct: 321 EESE----EEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLV 376
Query: 614 ESQKQKGHKSKKKKDR---KRKGNGDKRDP--QGNNANQREIRDNSAT-----LDKTEKE 663
++KQK K ++ D +G R + ++ + S + K + E
Sbjct: 377 TAKKQKEPKKEEPVDSNPSSPGNSGPARPSPESKDKGKRKAANEVSKSPASVPAKKLKTE 436
Query: 664 SVNNVESKDERPPSTANNPDGTEWEDIP 691
+ S P + + + + D
Sbjct: 437 NAPKSSSGKSTPQTFSGSKSSSNAADGG 464
Score = 32.2 bits (73), Expect = 4.3
Identities = 19/103 (18%), Positives = 36/103 (34%), Gaps = 1/103 (0%)
Query: 416 KPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESD 475
K + + ++EQ + E G SK+ KK +G K L S+ D
Sbjct: 303 KLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKK-GKKLKKLKGKKNGLDKDDSDSGDD 361
Query: 476 SENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKAS 518
S++ E+ + + ++ +E+ V A
Sbjct: 362 SDDSDIDGEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPAR 404
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 35.2 bits (81), Expect = 0.48
Identities = 23/76 (30%), Positives = 33/76 (43%), Gaps = 3/76 (3%)
Query: 468 GQSEEESDSENDGKPDENGSQRGSKRSKEFKTQEK--CVKGAGEDENGDRKASKDEQKID 525
G SE+ S N+ P + S +GSK+ K T K + +DE K K QK
Sbjct: 398 GSSEKSLGS-NESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKG 456
Query: 526 GANSAEKSLEDKHGKK 541
S++ + K G K
Sbjct: 457 RDKSSKVPSDSKAGGK 472
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 35.3 bits (81), Expect = 0.50
Identities = 25/161 (15%), Positives = 49/161 (30%), Gaps = 2/161 (1%)
Query: 1094 KKLKPGHRLQRNSKKRHSKISTERSNSNSAPREKSSDAKQAANEEENITGGNDENKTPVA 1153
K+L P N S S S + + + I + ++
Sbjct: 228 KELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHHSKHADSKL 287
Query: 1154 SEVRNKNKKVKGVKEKNPSIVGDEKRRYWWRSPPGGGPNAARH-KVCDDTDFQISWNDPH 1212
+ + + S K S G ++ K ++ S + P
Sbjct: 288 QTI-EVIENHSNKSRPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPK 346
Query: 1213 PPKLTHGTGKSRSKSSHSKQSKRSAENPSCRRMRGQDAFAP 1253
+G+ S S S + + KR++++ S R +DA
Sbjct: 347 SHSSANGSVPSSSVSDNESKQKRASKSSSGARDSKKDASGM 387
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 33.4 bits (76), Expect = 0.56
Identities = 28/97 (28%), Positives = 37/97 (38%), Gaps = 9/97 (9%)
Query: 599 MNEHSDENDLGNNTKESQKQKGHKSKKKKDRK---RKGNGDKRD--PQGNNANQREIRDN 653
M E S E L KES K + KS KKK K +KG K++ P G ++
Sbjct: 1 MKETSYEEALAKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGKIPEHEPNQEV 60
Query: 654 SATLDKTEKESVNNVESKDER----PPSTANNPDGTE 686
+ EKE V V P A P ++
Sbjct: 61 TEVEVIIEKEPVPAVAVAPVPVAVVAPVVAPKPKKSQ 97
>gnl|CDD|236779 PRK10864, PRK10864, putative methyltransferase; Provisional.
Length = 346
Score = 34.8 bits (80), Expect = 0.57
Identities = 24/78 (30%), Positives = 33/78 (42%), Gaps = 4/78 (5%)
Query: 915 GQDKTSAHGRTPSGGHRTPERRRSDGVRNDTRDERNGAYTGDKGGRNRGHSVNSRKDHSF 974
DK + + RT GG R + R+DG R RD+RN K + +V SR
Sbjct: 22 DSDKRTHNPRTGKGGGRPSGKSRADGGRRPARDDRNSQSRDRKWEDSPWRTV-SRAPGDE 80
Query: 975 AGETPRSKGGFD--SYLD 990
E GG S++D
Sbjct: 81 TPEKA-DHGGISGKSFID 97
>gnl|CDD|236641 PRK10019, PRK10019, nickel/cobalt efflux protein RcnA; Provisional.
Length = 279
Score = 34.0 bits (78), Expect = 0.88
Identities = 10/43 (23%), Positives = 16/43 (37%), Gaps = 6/43 (13%)
Query: 1537 QGWNEHSELQNGIDYGRLPIGHQHNHDHRHKHRFNHSSSGGNP 1579
+ W E+ + H H+H+H H H +H G
Sbjct: 114 RNWLENMHHHDH------DHDHDHDHEHHHDHGHHHHHEHGAT 150
>gnl|CDD|226583 COG4098, comFA, Superfamily II DNA/RNA helicase required for DNA
uptake (late competence protein) [DNA replication,
recombination, and repair].
Length = 441
Score = 34.0 bits (78), Expect = 0.92
Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 5/73 (6%)
Query: 298 AQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDL-VINLEIPHDAATYLHRM-GR 355
++DQ R +++F+ GK+ +L+TT + RG+ NVD+ V+ E + L ++ GR
Sbjct: 339 SEDQH-RKEKVEAFRDGKITLLITTTILERGVTFPNVDVFVLGAEHRVFTESALVQIAGR 397
Query: 356 AGRYGTR--GLVI 366
GR R G V+
Sbjct: 398 VGRSLERPTGDVL 410
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 34.3 bits (78), Expect = 0.98
Identities = 19/82 (23%), Positives = 30/82 (36%), Gaps = 3/82 (3%)
Query: 1951 KDGKRYGNAQSFEAARNSNPYEAPMPGAWQPVLEGTSVFDDAEKSSKSGPKSSKAG--PK 2008
K K+ + ++ ++ P E P P G ++ E K G P
Sbjct: 491 KSKKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPG 550
Query: 2009 SSKPGPKSSKPGPKSSKRYPSR 2030
+K G KPGP + PS+
Sbjct: 551 ETKEGEVGKKPGPAKEHK-PSK 571
Score = 31.2 bits (70), Expect = 8.4
Identities = 22/105 (20%), Positives = 35/105 (33%), Gaps = 3/105 (2%)
Query: 1923 SASLPASQIALRDGKGDFKSPRKASHSLKDGKRYGNAQSFEAARNSNPYEAPMPGAWQPV 1982
++ LP ++G+ K S K+G + G + E + P + P P
Sbjct: 516 ASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKI-PT 574
Query: 1983 LEGTSVFDDAEKSSK--SGPKSSKAGPKSSKPGPKSSKPGPKSSK 2025
L F K K PK K + +P S P+
Sbjct: 575 LSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLD 619
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 32.1 bits (73), Expect = 1.3
Identities = 21/103 (20%), Positives = 39/103 (37%), Gaps = 12/103 (11%)
Query: 423 EREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDSENDGKP 482
+ ++ +E ++ E + + S + GA+ G EE++D E D +
Sbjct: 39 AKAQEKEHEKERAEREEAREKKAKISPNALRGGATA---------GHGEEDTDDEEDEED 89
Query: 483 DENGS---QRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQ 522
S Q G K K + + + A E D+ +DE
Sbjct: 90 FATPSAVPQWGKKARKRQRKVIRKLLEAEEQLREDQYDDEDED 132
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions as
a specificity factor essential for the efficient assembly
of Sm proteins on U snRNAs and likely protects cells from
illicit, and potentially deleterious, non-specific
binding of Sm proteins to RNAs.
Length = 264
Score = 33.4 bits (76), Expect = 1.3
Identities = 35/128 (27%), Positives = 50/128 (39%), Gaps = 28/128 (21%)
Query: 1735 SEFDPKRLHSKESSHKRNRKANDFN----------DTCGAVANE-------TFDGEDSKG 1777
E D + K ++K+NR N D+C AV +E T D K
Sbjct: 38 QENDEQNPGKKRKNNKKNRSRKKCNAAPLKEWQVGDSCNAVWSEDGNLYTATITSIDQKR 97
Query: 1778 GSDDENNLIVTLSEAGDSE----VDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSH 1833
G+ +V + G+ E D+ P D DEDA N E+ + +SD +SH
Sbjct: 98 GT-----CVVFYTGYGNEEEQNLADLLTP--PPDVDEDALKEANVNETESSTDESDRSSH 150
Query: 1834 DGESDSHS 1841
E S S
Sbjct: 151 SHEVRSKS 158
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 33.7 bits (77), Expect = 1.4
Identities = 22/149 (14%), Positives = 52/149 (34%), Gaps = 6/149 (4%)
Query: 430 TAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDSENDGKPDENGSQR 489
TKP + ++ ++ KE K + + + + + D KP E ++
Sbjct: 82 KGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPK-DRKPKEEAKEK 140
Query: 490 GSKRSKEFKTQEKCVKGAGEDENGDR-----KASKDEQKIDGANSAEKSLEDKHGKKITN 544
+ KE + ++K + +E R K+ + + +K ++ ++
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAA 200
Query: 545 SFEAVGEIVQNDTETTKENNQTDEVPEEN 573
G+ + D +E + D E
Sbjct: 201 REAVKGKPEEPDVNEEREKEEDDGKDRET 229
>gnl|CDD|237253 PRK12898, secA, preprotein translocase subunit SecA; Reviewed.
Length = 656
Score = 33.4 bits (77), Expect = 1.6
Identities = 37/113 (32%), Positives = 48/113 (42%), Gaps = 22/113 (19%)
Query: 285 LANANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTDLAARGID-------AANVDL- 336
L A + L+ QD A A++ + + RI V T++A RG D AA L
Sbjct: 493 LREAGLPHQVLNAKQD--AEEAAIVARAGQRGRITVATNMAGRGTDIKLEPGVAARGGLH 550
Query: 337 VINLEIPHDAATYLHRM-----GRAGRYGTRGLVITIVSAES--LVKFHSLMG 382
VI E HD+A R+ GR GR G G I+S E L F G
Sbjct: 551 VILTE-RHDSA----RIDRQLAGRCGRQGDPGSYEAILSLEDDLLQSFLGSRG 598
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 33.3 bits (77), Expect = 1.7
Identities = 29/235 (12%), Positives = 66/235 (28%), Gaps = 23/235 (9%)
Query: 415 AKPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEES 474
+ A EQ A E++ + + + AS + E +
Sbjct: 78 RRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAA 137
Query: 475 DSENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGANSAEKSL 534
K E G Q ++ + + + + GDR+ + E + E+
Sbjct: 138 RRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERG 197
Query: 535 EDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKAEVGDLNECESCT 594
D + + D + + + G + + D
Sbjct: 198 RDG------------DDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDAR------ 239
Query: 595 DSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQRE 649
+ + E+ + + + + G + ++ +DR R+G RE
Sbjct: 240 -----GDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPELRE 289
Score = 32.2 bits (74), Expect = 3.9
Identities = 16/107 (14%), Positives = 29/107 (27%), Gaps = 5/107 (4%)
Query: 880 NSSDYQSDSKPYGDRTKRRSRPRKNKENTDILNTLGQDKTSAHGRTPSGGHRTPERRRSD 939
+ + + D +RR + + ++ R G R RR
Sbjct: 157 RADAAERTEEEERDERRRRGDREDRQAEAE-----RGERGRREERGRDGDDRDRRDRREQ 211
Query: 940 GVRNDTRDERNGAYTGDKGGRNRGHSVNSRKDHSFAGETPRSKGGFD 986
G R + R R+G + R + G+ G
Sbjct: 212 GDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGR 258
>gnl|CDD|233032 TIGR00580, mfd, transcription-repair coupling factor (mfd). All
proteins in this family for which functions are known
are DNA-dependent ATPases that function in the process
of transcription-coupled DNA repair in which the repair
of the transcribed strand of actively transcribed genes
is repaired at a higher rate than the repair of
non-transcribed regions of the genome and than the
non-transcribed strand of the same gene. This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University). This family is
closely related to the RecG and UvrB families [DNA
metabolism, DNA replication, recombination, and repair].
Length = 926
Score = 33.5 bits (77), Expect = 1.7
Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 9/59 (15%)
Query: 310 SFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATY----LHRM-GRAGRYGTRG 363
F +G+ ++LV T + GID N N I A + L+++ GR GR +
Sbjct: 707 EFYKGEFQVLVCTTIIETGIDIPNA----NTIIIERADKFGLAQLYQLRGRVGRSKKKA 761
>gnl|CDD|234506 TIGR04221, SecA2_Mycobac, accessory Sec system translocase SecA2,
Actinobacterial type. Members of this family are the
SecA2 subunit of the Mycobacterial type of accessory
secretory system. This family is quite different SecA2
of the Staph/Strep type (TIGR03714).
Length = 762
Score = 33.6 bits (77), Expect = 1.7
Identities = 31/110 (28%), Positives = 42/110 (38%), Gaps = 17/110 (15%)
Query: 317 RILVTTDLAARGID-------AANVDLVINLEIPHDAATYLHRM--------GRAGRYGT 361
+ V+T +A RG D A+ D V L H T HR GRAGR G
Sbjct: 475 AVTVSTQMAGRGTDIRLGGSDEADHDRVAELGGLHVIGTGRHRTARLDNQLRGRAGRQGD 534
Query: 362 RGLVITIVSAE-SLVKFHSLMGEINLDHAFNVGLVPDNLTGDQINWTQRV 410
G + VS E +V + A + G + D ++ QRV
Sbjct: 535 PGSSVFFVSLEDDVVAVGGAGETVPAQPAED-GRIESPRVQDFVDHAQRV 583
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 33.4 bits (76), Expect = 1.7
Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 23/164 (14%)
Query: 414 LAKPLDQAQ-----EREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDG 468
+ D AQ +R++V Q +E+ + + + +E K +
Sbjct: 232 AQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIK 291
Query: 469 QSEEESDSENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRK-ASKDEQKIDGA 527
+++EE+ D K + +E K EK + + R+ ++D QK
Sbjct: 292 KNDEEALKAKDHKAFD--------LKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQ 343
Query: 528 NSAEKSLEDK---------HGKKITNSFEAVGEIVQNDTETTKE 562
A+ + ++ +G K+ + + E+V D +T
Sbjct: 344 VEAQPTSLNEDAIDSSNPVYGLKVVDPITNLSELVLIDLKTEVR 387
>gnl|CDD|239431 cd03315, MLE_like, Muconate lactonizing enzyme (MLE) like subgroup of
the enolase superfamily. Enzymes of this subgroup share
three conserved carboxylate ligands for the essential
divalent metal ion (usually Mg2+), two aspartates and a
glutamate, and residues that can function as general
acid/base catalysts, a Lys-X-Lys motif and another
conserved lysine. Despite these conserved residues, the
members of the MLE subgroup, like muconate lactonizing
enzyme, o-succinylbenzoate synthase (OSBS) and
N-acylamino acid racemase (NAAAR), catalyze different
reactions.
Length = 265
Score = 32.7 bits (75), Expect = 2.1
Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 7/52 (13%)
Query: 1016 DLVVYDDAFDMFSP-----AANNLDEYEIDYVEQPV--DEKKSRRELSQKMD 1060
D + DA ++P A L++ +DYVEQP+ D+ + R L++ D
Sbjct: 129 DAELRVDANRGWTPKQAIRALRALEDLGLDYVEQPLPADDLEGRAALARATD 180
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 32.9 bits (75), Expect = 2.2
Identities = 36/184 (19%), Positives = 65/184 (35%), Gaps = 26/184 (14%)
Query: 1740 KRLHSKESSHKRNRKANDFNDTCGAVANETFDGEDSKGGSDDENNLIVTLSEAGDSEVDV 1799
K K++ + ++A + +D E + ED + + + E D
Sbjct: 148 KGKAKKKTKKSKKKEAKESSDK---DDEEESESEDESKSEESAED-----DSDDEEEEDS 199
Query: 1800 EGE----------NSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNESHF 1849
+ E +S +E+ + S N E T+++ + +S ES S S S
Sbjct: 200 DSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESS-- 257
Query: 1850 GNSRCQPKTAYQSSSAQSEDGTSYKSISWDAYSVIDEPESYDVSWRTCSSESAAREKNSG 1909
+PK SS+ Y S S D ++ + V + R+ G
Sbjct: 258 -PPSKKPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDIDPDQV-----VKKPVKRKNRRG 311
Query: 1910 QRSR 1913
QR+R
Sbjct: 312 QRAR 315
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 33.1 bits (75), Expect = 2.3
Identities = 17/92 (18%), Positives = 33/92 (35%), Gaps = 2/92 (2%)
Query: 1791 EAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNESHFG 1850
E G + N+P D+D +A + + +D D + +E G
Sbjct: 410 EEGRAAAAAAAANAPADQDVEAEAHDEFDQDPGAPAHADRADSDEDDMDEQESGDERADG 469
Query: 1851 NSRCQPKTAYQSSSAQSEDGTSYKSISWDAYS 1882
+Y ++S++ E ++ SW S
Sbjct: 470 EDDSDSSYSYSTTSSEDESDSADD--SWGDES 499
>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
Provisional.
Length = 337
Score = 32.8 bits (75), Expect = 2.3
Identities = 14/64 (21%), Positives = 28/64 (43%), Gaps = 5/64 (7%)
Query: 1789 LSEAGDSEVDVEGE--NSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNE 1846
L EA D + D + + D + ++++ +D S ++ D+ + + KSNE
Sbjct: 276 LGEAIDEDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGA---KSNE 332
Query: 1847 SHFG 1850
H
Sbjct: 333 LHLT 336
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 31.7 bits (72), Expect = 2.9
Identities = 20/64 (31%), Positives = 32/64 (50%)
Query: 425 EDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDSENDGKPDE 484
EDVE ++++SK + T P + +K+ KK KE + S+ E + END K D
Sbjct: 41 EDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDY 100
Query: 485 NGSQ 488
+
Sbjct: 101 ELPK 104
>gnl|CDD|233483 TIGR01587, cas3_core, CRISPR-associated helicase Cas3. This model
represents the highly conserved core region of an
alignment of Cas3, a protein found in association with
CRISPR repeat elements in a broad range of bacteria and
archaea. Cas3 appears to be a helicase, with regions
found by pfam00270 (DEAD/DEAH box helicase) and
pfam00271 (Helicase conserved C-terminal domain). Some
but not all members have an N-terminal HD domain region
(pfam01966) that is not included within this model
[Mobile and extrachromosomal element functions, Other].
Length = 358
Score = 32.4 bits (74), Expect = 2.9
Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 3/55 (5%)
Query: 308 LDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTR 362
L+ K+ + ++V T + +D + D++I P D+ R+GR RYG +
Sbjct: 271 LEEMKKNEKFVIVATQVIEASLD-ISADVMITELAPIDSLI--QRLGRLHRYGRK 322
>gnl|CDD|215597 PLN03137, PLN03137, ATP-dependent DNA helicase; Q4-like;
Provisional.
Length = 1195
Score = 32.6 bits (74), Expect = 3.2
Identities = 26/99 (26%), Positives = 44/99 (44%), Gaps = 1/99 (1%)
Query: 265 FNQC-VIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTD 323
F++C +I+ ++ E + E L A + G+ D R + + ++ I+ T
Sbjct: 679 FDECGIIYCLSRMDCEKVAERLQEFGHKAAFYHGSMDPAQRAFVQKQWSKDEINIICATV 738
Query: 324 LAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTR 362
GI+ +V VI+ +P Y GRAGR G R
Sbjct: 739 AFGMGINKPDVRFVIHHSLPKSIEGYHQECGRAGRDGQR 777
>gnl|CDD|215814 pfam00242, DNA_pol_viral_N, DNA polymerase (viral) N-terminal domain.
Length = 379
Score = 32.1 bits (73), Expect = 3.7
Identities = 15/82 (18%), Positives = 25/82 (30%), Gaps = 3/82 (3%)
Query: 1995 SSKSGPKSSKAGPKSSKPGPKSSKPGPKSSKRYPSRSGSSGLNN---RTEGNSVSHTWEN 2051
+ S + S+ P +P + S SS + R S T E
Sbjct: 262 GQQGRSGSIRGRKHSTTRRPFGVEPSSSGVTTNRASSSSSCFHQSAVRETAYSSLSTSER 321
Query: 2052 HFATGTDRRSHQYPAYGSMSRH 2073
H ++G P S++
Sbjct: 322 HSSSGHAVELRSIPGGSVSSQN 343
>gnl|CDD|114045 pfam05297, Herpes_LMP1, Herpesvirus latent membrane protein 1 (LMP1).
This family consists of several latent membrane protein
1 or LMP1s mostly from Epstein-Barr virus. LMP1 of EBV is
a 62-65 kDa plasma membrane protein possessing six
membrane spanning regions, a short cytoplasmic N-terminus
and a long cytoplasmic carboxy tail of 200 amino acids.
EBV latent membrane protein 1 (LMP1) is essential for
EBV-mediated transformation and has been associated with
several cases of malignancies. EBV-like viruses in
Cynomolgus monkeys (Macaca fascicularis) have been
associated with high lymphoma rates in immunosuppressed
monkeys.
Length = 382
Score = 31.9 bits (72), Expect = 4.2
Identities = 37/142 (26%), Positives = 51/142 (35%), Gaps = 16/142 (11%)
Query: 1725 KHHHSSS--HSQSEFDPKRLHSKESSHKRNRK------ANDFNDTCGAVANETFDGEDSK 1776
+HHH S H Q D H +S+ R A D C G D+
Sbjct: 194 EHHHDDSLPHPQQATDDSG-HESDSNSNEGRHHLLVSGAGDGPPLCSQNLGAPGGGPDN- 251
Query: 1777 GGSDDENNLIVTLSEAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGE 1836
G D +N + G + D +N P D D +N D ++TA N D ++ D
Sbjct: 252 -GPQDPDNT----DDNGPQDPDNTDDNGPQDPDNTDDNGPQDPDNTADNGPQDPDNTDDN 306
Query: 1837 SDSHSVKSNES-HFGNSRCQPK 1857
+ N S GN PK
Sbjct: 307 GPHDPLPHNPSDSAGNDGGPPK 328
>gnl|CDD|114337 pfam05609, LAP1C, Lamina-associated polypeptide 1C (LAP1C). This
family contains rat LAP1C proteins and several
uncharacterized highly related sequences from both mice
and humans. LAP1s (lamina-associated polypeptide 1s) are
type 2 integral membrane proteins with a single
membrane-spanning region of the inner nuclear membrane.
LAP1s bind to both A- and B-type lamins and have a
putative role in the membrane attachment and assembly of
the nuclear lamina.
Length = 465
Score = 31.9 bits (72), Expect = 4.3
Identities = 25/105 (23%), Positives = 37/105 (35%), Gaps = 16/105 (15%)
Query: 1898 SSESAAREKNSGQRSRGSGDKAARKSASLPASQIALRDGKGDFKSPRKASHSLKDGKRYG 1957
SS + +E + S S DK R A L +S AL DF + + L G +
Sbjct: 109 SSSNVTKEPLRSRDSHESSDKVGRADAHLGSSSWALPKSASDFTAHSQQPSVLTTGSQ-- 166
Query: 1958 NAQSFEAARNSNPYEAPMPGAWQPVLEGTSVFDDAEKSSKSGPKS 2002
+AP Q E T + S++G +S
Sbjct: 167 --------------KAPQEWHSQAGRERTRMLRRRILKSEAGNQS 197
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1 protein
homologues. SDA1 is a Saccharomyces cerevisiae protein
which is involved in the control of the actin
cytoskeleton. The protein is essential for cell viability
and is localised in the nucleus.
Length = 317
Score = 31.5 bits (72), Expect = 4.7
Identities = 14/78 (17%), Positives = 31/78 (39%), Gaps = 1/78 (1%)
Query: 1771 DGEDSKGGSDDENNLIVTLSEAGDSEVD-VEGENSPTDEDEDANNSQNDGESTAQNSQSD 1829
G +S D+E V E D E + ++ E+ E D+ + + E+ + +
Sbjct: 90 QGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDS 149
Query: 1830 ENSHDGESDSHSVKSNES 1847
+ E + + + E+
Sbjct: 150 DEELSEEDEEEAAEEEEA 167
>gnl|CDD|200575 cd10951, CE4_ClCDA_like, Catalytic NodB homology domain of
Colletotrichum lindemuthianum chitin deacetylase and
similar proteins. This family is represented by the
chitin deacetylase (endo-chitin de-N-acetylase, ClCDA,
EC 3.5.1.41) from Colletotrichum lindemuthianum (also
known as Glomerella lindemuthiana), which is a member of
the carbohydrate esterase 4 (CE4) superfamily. ClCDA
catalyzes the hydrolysis of N-acetamido groups of
N-acetyl-D-glucosamine residues in chitin, converting it
to chitosan in fungal cell walls. It consists of a
single catalytic domain similar to the deformed
(alpha/beta)8 barrel fold adopted by other CE4
esterases, which encompasses a mononuclear metalloenzyme
employing a conserved His-His-Asp zinc-binding triad
closely associated with the conserved catalytic base
(aspartic acid) and acid (histidine), to carry out
acid/base catalysis. It possesses a highly conserved
substrate-binding groove, with subtle alterations that
influence substrate specificity and subsite affinity.
Unlike its bacterial homologs, ClCDA contains two
intramolecular disulfide bonds that may add stability to
this secreted protein. The family also includes many
uncharacterized deacetylases and hypothetical proteins
mainly from eukaryotes, which show high sequence
similarity to ClCDA.
Length = 197
Score = 31.1 bits (71), Expect = 4.9
Identities = 14/38 (36%), Positives = 21/38 (55%)
Query: 196 LVVSATYSHDNLATLQKYMVDPLLIRPEDATRPLLGVK 233
+ S T+SH +L L + + + EDA R +LGVK
Sbjct: 65 QIASHTWSHPDLTKLSAAQIRDEMTKLEDALRKILGVK 102
>gnl|CDD|130302 TIGR01235, pyruv_carbox, pyruvate carboxylase. This enzyme plays a
role in gluconeogensis but not glycolysis [Energy
metabolism, Glycolysis/gluconeogenesis].
Length = 1143
Score = 32.1 bits (73), Expect = 5.3
Identities = 17/41 (41%), Positives = 23/41 (56%), Gaps = 4/41 (9%)
Query: 753 KACLTNDSVRVQVLMEKLDHVDASVLADWILNKHNNVAALF 793
KA ND V V+ L+E+ H++ +L D KH NV LF
Sbjct: 189 KAAFGNDEVYVEKLIERPRHIEVQLLGD----KHGNVVHLF 225
>gnl|CDD|222258 pfam13604, AAA_30, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. There is a Walker A and
Walker B.
Length = 195
Score = 31.0 bits (71), Expect = 5.5
Identities = 22/80 (27%), Positives = 35/80 (43%), Gaps = 11/80 (13%)
Query: 45 SPIQMAAFPYVLNGYD--AIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTII-LAPTRE 101
S Q A +VL D A+VQ +GTGK+ + A + + +I LAPT +
Sbjct: 3 SAEQREAVRHVLTSGDRVAVVQGPAGTGKT--TSLKAAREAWE---AAGYRVIGLAPTGK 57
Query: 102 IAVQITDVVRSVGAHVKNLH 121
A + + +G + L
Sbjct: 58 AAKVLGE---ELGIEARTLA 74
>gnl|CDD|114448 pfam05722, Ustilago_mating, Ustilago B locus mating-type protein.
This family consists of several Ustilago mating-type
proteins. The b locus of the phytopathogenic fungus
Ustilago maydis encodes a multiallelic recognition
function that controls the ability of the fungus to form
a dikaryon and complete the sexual stage of the life
cycle. The b locus has at least 25 alleles and any
combination of two different alleles, brought together by
mating between haploid cells, allows the fungus to cause
disease and undergo sexual development within the plant.
Length = 239
Score = 31.3 bits (70), Expect = 5.7
Identities = 25/109 (22%), Positives = 48/109 (44%), Gaps = 5/109 (4%)
Query: 1985 GTSVFDDAEKSSKSGPKSSKAGPKSS----KPGPKSS-KPGPKSSKRYPSRSGSSGLNNR 2039
G V+D + KS PK A ++ P K++ PK K P S + +++
Sbjct: 81 GDWVYDLCAANKKSPPKPGMARAVTTAAKRHPARKTAPAAKPKPKKAKPRASKTPSIDST 140
Query: 2040 TEGNSVSHTWENHFATGTDRRSHQYPAYGSMSRHDLYFDESQLLGNEAL 2088
+ +++ T E + D + + GS+ HD + D + +L + +L
Sbjct: 141 LDSSALESTPELSACSTADNSFSSFCSNGSLGHHDPFQDGNDILQSPSL 189
>gnl|CDD|187770 cd09639, Cas3_I, CRISPR/Cas system-associated protein Cas3. CRISPR
(Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; DEAD/DEAH box helicase DNA
helicase cas3'; Often but not always is fused to HD
nuclease domain; signature gene for Type I.
Length = 353
Score = 31.6 bits (72), Expect = 5.8
Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 308 LDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTR 362
L FK+ + ++V T + +D +VD++I P D+ R+GR RYG +
Sbjct: 267 LLEFKKSEKFVIVATQVIEASLD-ISVDVMITELAPIDSLI--QRLGRLHRYGEK 318
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 31.8 bits (72), Expect = 6.1
Identities = 16/84 (19%), Positives = 32/84 (38%), Gaps = 2/84 (2%)
Query: 1793 GDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVK--SNESHFG 1850
G+S D + D DEDA+ N+ ++ + NS + SV S E
Sbjct: 111 GESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIE 170
Query: 1851 NSRCQPKTAYQSSSAQSEDGTSYK 1874
+ Q + ++ + ++ +
Sbjct: 171 EKKGQTENTFKFYNFGNDGEEAAA 194
>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2). Surfeit locus
protein 2 is part of a group of at least six sequence
unrelated genes (Surf-1 to Surf-6). The six Surfeit
genes have been classified as housekeeping genes, being
expressed in all tissue types tested and not containing
a TATA box in their promoter region. The exact function
of SURF2 is unknown.
Length = 244
Score = 31.0 bits (70), Expect = 6.5
Identities = 25/74 (33%), Positives = 36/74 (48%), Gaps = 5/74 (6%)
Query: 562 ENNQTDEVPEENGLVGNPNKAEVGDLNECESCTDSFHMNEHSDENDLGNNTKESQKQKGH 621
E++ +D P E + NP K + GD ++ D M S E K S+KQ G
Sbjct: 168 EDSMSDLYPPELFTLKNPGKEQNGDEDDDFETDDEDEMEVESPELQ----QKRSKKQSGS 223
Query: 622 KSKK-KKDRKRKGN 634
+KK KK+ K+KG
Sbjct: 224 LTKKFKKNHKKKGP 237
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 31.6 bits (71), Expect = 6.8
Identities = 39/168 (23%), Positives = 69/168 (41%), Gaps = 15/168 (8%)
Query: 1728 HSSSHSQSEFDPKRLHSKESSHKRNRKANDFNDTCGAVANETFDGEDSKGGSDDENNLIV 1787
HS + FD + L + N +D ++E FD ED K +D N
Sbjct: 22 HSYEEESAGFDSEELEDNDEQGYSF-GVNSEDDEEID-SDEAFDEEDEKRFADWSFN--- 76
Query: 1788 TLSEAGDSEVDVEGENSP-----TDEDEDANNSQNDGE-STAQNSQSDENSHDGESDSHS 1841
S++G S D + N+ D D+ N+ + + E S + +++ D D+
Sbjct: 77 -ASKSGKSNKDHKNLNNTKEISLNDSDDSVNSDKLENEGSVSSIDENELVDLDTLLDNDQ 135
Query: 1842 VKSNESHFGNSRCQPKTAYQSSSAQSEDGTSYKSISWDAYSVIDEPES 1889
+ NES + + +S ++ S D S +S D+ S I+ +S
Sbjct: 136 PEKNESGNNDHATDKENLLESDASSSNDSESEES---DSESEIESSDS 180
Score = 31.2 bits (70), Expect = 8.7
Identities = 27/124 (21%), Positives = 51/124 (41%), Gaps = 18/124 (14%)
Query: 1728 HSSSHSQSEFDPKRLHSKESSHKRNRKANDFNDTCGAVANETFDGEDSKGG--SDDENNL 1785
++S +S D K L++ + + + D +++G S DEN L
Sbjct: 76 NASKSGKSNKDHKNLNNTKEISLNDSDDS-----------VNSDKLENEGSVSSIDENEL 124
Query: 1786 IVTLSEAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSN 1845
V L D++ + E+ D D N S++ +S+S+E+ + E +S S+
Sbjct: 125 -VDLDTLLDNDQPEKNESGNNDHATDKENLLESDASSSNDSESEESDSESEIES----SD 179
Query: 1846 ESHF 1849
H
Sbjct: 180 SDHD 183
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the maturation
of the 18S rRNA and for 40S ribosome production.
Length = 809
Score = 31.5 bits (72), Expect = 7.0
Identities = 13/62 (20%), Positives = 29/62 (46%)
Query: 1771 DGEDSKGGSDDENNLIVTLSEAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDE 1830
DG D + DD+++L + S+ + + E+ +D+++D + + E + S
Sbjct: 328 DGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAEST 387
Query: 1831 NS 1832
S
Sbjct: 388 RS 389
>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein. This family consists of
Ebola and Marburg virus nucleoproteins. These proteins
are responsible for encapsidation of genomic RNA. It has
been found that nucleoprotein DNA vaccines can offer
protection from the virus.
Length = 717
Score = 31.1 bits (70), Expect = 8.1
Identities = 14/78 (17%), Positives = 31/78 (39%), Gaps = 6/78 (7%)
Query: 615 SQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQRE----IRDNSATLDKT--EKESVNNV 668
S + S+ + +K + +P G + + + +++ D+T + S+ +
Sbjct: 505 SHQPGNRASQDLNNNNQKQEDESTNPIGKTSLRYQELTPVQEEDEPEDQTDDDDSSLPPL 564
Query: 669 ESKDERPPSTANNPDGTE 686
ES D+ D TE
Sbjct: 565 ESDDDPGSDNEQGVDLTE 582
>gnl|CDD|173959 cd08549, G1PDH_related, Glycerol-1-phosphate_dehydrogenase and
related proteins. Bacterial and archeal
glycerol-1-phosphate dehydrogenase-like oxidoreductases.
The proteins have similarity with glycerol-1-phosphate
dehydrogenase (G1PDH). G1PDH plays a role in the
synthesis of phosphoglycerolipids in gram-positive
bacterial species. It catalyzes the reversibly reduction
of dihydroxyacetone phosphate (DHAP) to
glycerol-1-phosphate (G1P) in a NADH-dependent manner.
Its activity requires Ni++ ion. It also contains
archaeal Sn-glycerol-1-phosphate dehydrogenase (Gro1PDH)
that plays an important role in the formation of the
enantiomeric configuration of the glycerophosphate
backbone (sn-glycerol-1-phosphate) of archaeal ether
lipids.
Length = 332
Score = 30.6 bits (69), Expect = 9.7
Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 3/89 (3%)
Query: 254 KKLVQLLSQTPFNQCVIFSNYQLRP--EVICENLANANFGAEYLSGAQDQKA-RLASLDS 310
K++++ L F + V+ + L P + E L + E+L G L S
Sbjct: 41 KEIIERLESNNFTKEVLERDSLLIPDEYELGEVLIKLDKDTEFLLGIGSGTIIDLVKFVS 100
Query: 311 FKRGKVRILVTTDLAARGIDAANVDLVIN 339
FK GK I V T + G ++ L++N
Sbjct: 101 FKVGKPFISVPTAPSMDGYASSVASLIVN 129
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.128 0.373
Gapped
Lambda K H
0.267 0.0781 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 115,132,887
Number of extensions: 11144633
Number of successful extensions: 8087
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7802
Number of HSP's successfully gapped: 193
Length of query: 2355
Length of database: 10,937,602
Length adjustment: 113
Effective length of query: 2242
Effective length of database: 5,925,600
Effective search space: 13285195200
Effective search space used: 13285195200
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 67 (29.5 bits)