RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17912
(779 letters)
>gnl|CDD|240273 PTZ00110, PTZ00110, helicase; Provisional.
Length = 545
Score = 466 bits (1202), Expect = e-157
Identities = 204/428 (47%), Positives = 264/428 (61%), Gaps = 20/428 (4%)
Query: 12 SRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPR-GYSRGGRGGGGSNYGSN------SNLR 64
S Y + + YG R GY G GG S L+
Sbjct: 18 SNNYNSYGPYPDSSNPYGNYQANHQDNYGGFRPGYGNYSGGYGGFGMNSYGSSTLGKRLQ 77
Query: 65 TPNWNSMSMEQINKNLYTPNHLTVSRSQGEVESYLNHHDVTVKGRYYLLRLESRPGKCVP 124
+W S+++ KN Y + + S EV+ ++T+ G+ VP
Sbjct: 78 PIDWKSINLVPFEKNFYKEHPEVSALSSKEVDEIRKEKEITIIA-----------GENVP 126
Query: 125 RPIQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIA 184
+P+ FE +FP YI+K + GF PT IQ QGWPIALSG D++ IA+TGSGKTL ++
Sbjct: 127 KPVVSFEYTSFPDYILKSLKNAGFTEPTPIQVQGWPIALSGRDMIGIAETGSGKTLAFLL 186
Query: 185 PAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQ 244
PAIVH+N+Q LR G+GPIVLVLAPTRELA+QI N FG+++ R +GG PK Q
Sbjct: 187 PAIVHINAQPLLRYGDGPIVLVLAPTRELAEQIREQCNKFGASSKIRNTVAYGGVPKRGQ 246
Query: 245 VKALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIR 304
+ AL+ G EI+IA PGRLID+LE NL R +YLVLDEADRMLDMGFEPQIRKI+ QIR
Sbjct: 247 IYALRRGVEILIACPGRLIDFLESNVTNLRRVTYLVLDEADRMLDMGFEPQIRKIVSQIR 306
Query: 305 PDRQVLMWSATWPKEVQKLAEDFLVDY-VQLNIGSLNPTANHNIVQIVDVCQEHEKDYKL 363
PDRQ LMWSATWPKEVQ LA D + V +N+GSL+ TA HNI Q V V +EHEK KL
Sbjct: 307 PDRQTLMWSATWPKEVQSLARDLCKEEPVHVNVGSLDLTACHNIKQEVFVVEEHEKRGKL 366
Query: 364 QGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRI 423
+ LL +I + K +IFVETK+ AD +T+ +R GW A+ IHG+K Q+ER VLNEF+
Sbjct: 367 KMLLQRIMRD-GDKILIFVETKKGADFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKT 425
Query: 424 GRASILVS 431
G++ I+++
Sbjct: 426 GKSPIMIA 433
Score = 36.7 bits (85), Expect = 0.056
Identities = 19/56 (33%), Positives = 34/56 (60%), Gaps = 2/56 (3%)
Query: 454 ILVSQYKESQQKRDRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 509
IL+ + E+++ D + E R+ L H +K Q+ER VLNEF+ G++ I+++
Sbjct: 380 ILI--FVETKKGADFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433
Score = 35.1 bits (81), Expect = 0.15
Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 3/77 (3%)
Query: 461 ESQQKRDRVLNEF--RIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQER 518
E +KR + L RI R + + ++++ D + E R+ L H +K Q+ER
Sbjct: 358 EEHEKRGK-LKMLLQRIMRDGDKILIFVETKKGADFLTKELRLDGWPALCIHGDKKQEER 416
Query: 519 DRVLNEFRIGRASILVS 535
VLNEF+ G++ I+++
Sbjct: 417 TWVLNEFKTGKSPIMIA 433
Score = 35.1 bits (81), Expect = 0.16
Identities = 16/43 (37%), Positives = 26/43 (60%)
Query: 519 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 561
D + E R+ L H +K Q+ER VLNEF+ G++ I+++
Sbjct: 391 DFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433
Score = 35.1 bits (81), Expect = 0.16
Identities = 16/43 (37%), Positives = 26/43 (60%)
Query: 545 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 587
D + E R+ L H +K Q+ER VLNEF+ G++ I+++
Sbjct: 391 DFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433
Score = 35.1 bits (81), Expect = 0.16
Identities = 16/43 (37%), Positives = 26/43 (60%)
Query: 571 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 613
D + E R+ L H +K Q+ER VLNEF+ G++ I+++
Sbjct: 391 DFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433
Score = 35.1 bits (81), Expect = 0.16
Identities = 16/43 (37%), Positives = 26/43 (60%)
Query: 597 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 639
D + E R+ L H +K Q+ER VLNEF+ G++ I+++
Sbjct: 391 DFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433
Score = 35.1 bits (81), Expect = 0.16
Identities = 16/43 (37%), Positives = 26/43 (60%)
Query: 623 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 665
D + E R+ L H +K Q+ER VLNEF+ G++ I+++
Sbjct: 391 DFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433
>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
replication, recombination, and repair / Transcription /
Translation, ribosomal structure and biogenesis].
Length = 513
Score = 334 bits (858), Expect = e-106
Identities = 112/312 (35%), Positives = 175/312 (56%), Gaps = 7/312 (2%)
Query: 123 VPRPIQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGY 182
+ F P +++ + ++GF+ PT IQ P+ L+G D++ A+TG+GKT +
Sbjct: 24 EEKTPPEFASLGLSPELLQALKDLGFEEPTPIQLAAIPLILAGRDVLGQAQTGTGKTAAF 83
Query: 183 IAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATA-TRVACVFGGAPK 241
+ P + + + L+LAPTRELA QI G RVA V+GG
Sbjct: 84 LLPLLQKILKSVERK---YVSALILAPTRELAVQIAEELRKLGKNLGGLRVAVVYGGVSI 140
Query: 242 GPQVKALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIG 301
Q++AL+ G +IV+ATPGRL+D +++G ++L LVLDEADRMLDMGF I KI+
Sbjct: 141 RKQIEALKRGVDIVVATPGRLLDLIKRGKLDLSGVETLVLDEADRMLDMGFIDDIEKILK 200
Query: 302 QIRPDRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGS-LNPTANHNIVQIVDVCQEHEKD 360
+ PDRQ L++SAT P ++++LA +L D V++ + I Q ++
Sbjct: 201 ALPPDRQTLLFSATMPDDIRELARRYLNDPVEIEVSVEKLERTLKKIKQFYLEV--ESEE 258
Query: 361 YKLQGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNE 420
KL+ LL + E + I+FV TKR +++ S+R +G+ A+HG+ Q+ERDR L +
Sbjct: 259 EKLELLLKLLKDEDEGRVIVFVRTKRLVEELAESLRKRGFKVAALHGDLPQEERDRALEK 318
Query: 421 FRIGRASILVSQ 432
F+ G +LV+
Sbjct: 319 FKDGELRVLVAT 330
Score = 33.6 bits (77), Expect = 0.38
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 562 HYNKSQQERDRVLNEFRIGRASILV 586
H + Q+ERDR L +F+ G +LV
Sbjct: 304 HGDLPQEERDRALEKFKDGELRVLV 328
Score = 33.6 bits (77), Expect = 0.38
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 588 HYNKSQQERDRVLNEFRIGRASILV 612
H + Q+ERDR L +F+ G +LV
Sbjct: 304 HGDLPQEERDRALEKFKDGELRVLV 328
Score = 33.6 bits (77), Expect = 0.38
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 614 HYNKSQQERDRVLNEFRIGRASILV 638
H + Q+ERDR L +F+ G +LV
Sbjct: 304 HGDLPQEERDRALEKFKDGELRVLV 328
Score = 33.6 bits (77), Expect = 0.38
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 640 HYNKSQQERDRVLNEFRIGRASILV 664
H + Q+ERDR L +F+ G +LV
Sbjct: 304 HGDLPQEERDRALEKFKDGELRVLV 328
>gnl|CDD|238167 cd00268, DEADc, DEAD-box helicases. A diverse family of proteins
involved in ATP-dependent RNA unwinding, needed in a
variety of cellular processes including splicing,
ribosome biogenesis and RNA degradation. The name
derives from the sequence of the Walker B motif (motif
II). This domain contains the ATP- binding region.
Length = 203
Score = 307 bits (788), Expect = e-100
Identities = 98/206 (47%), Positives = 134/206 (65%), Gaps = 3/206 (1%)
Query: 130 FEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVH 189
FEE P +++ IY +GF+ PT IQA+ P LSG D++ A+TGSGKT ++ P +
Sbjct: 1 FEELGLSPELLRGIYALGFEKPTPIQARAIPPLLSGRDVIGQAQTGSGKTAAFLIPILEK 60
Query: 190 VNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVKALQ 249
++ GP L+LAPTRELA QI VA G T +V ++GG Q++ L+
Sbjct: 61 LDPSPKKD---GPQALILAPTRELALQIAEVARKLGKHTNLKVVVIYGGTSIDKQIRKLK 117
Query: 250 TGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQV 309
G IV+ATPGRL+D LE+G ++L + YLVLDEADRMLDMGFE QIR+I+ + DRQ
Sbjct: 118 RGPHIVVATPGRLLDLLERGKLDLSKVKYLVLDEADRMLDMGFEDQIREILKLLPKDRQT 177
Query: 310 LMWSATWPKEVQKLAEDFLVDYVQLN 335
L++SAT PKEV+ LA FL + V++
Sbjct: 178 LLFSATMPKEVRDLARKFLRNPVRIL 203
>gnl|CDD|236977 PRK11776, PRK11776, ATP-dependent RNA helicase DbpA; Provisional.
Length = 460
Score = 235 bits (603), Expect = 1e-69
Identities = 112/309 (36%), Positives = 171/309 (55%), Gaps = 25/309 (8%)
Query: 130 FEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKT----LGYIAP 185
F PP ++ + E+G+ T IQAQ P L+G D++A AKTGSGKT LG +
Sbjct: 6 FSTLPLPPALLANLNELGYTEMTPIQAQSLPAILAGKDVIAQAKTGSGKTAAFGLGLLQ- 64
Query: 186 AIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGS-ATAT---RVACVFGGAPK 241
++ +R LVL PTRELA Q VA + A +V + GG P
Sbjct: 65 ---KLDVKRF-----RVQALVLCPTRELADQ---VAKEIRRLARFIPNIKVLTLCGGVPM 113
Query: 242 GPQVKALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIG 301
GPQ+ +L+ GA I++ TPGR++D+L +GT++L + LVLDEADRMLDMGF+ I II
Sbjct: 114 GPQIDSLEHGAHIIVGTPGRILDHLRKGTLDLDALNTLVLDEADRMLDMGFQDAIDAIIR 173
Query: 302 QIRPDRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDY 361
Q RQ L++SAT+P+ + +++ F D V++ + S I Q E+
Sbjct: 174 QAPARRQTLLFSATYPEGIAAISQRFQRDPVEVKVES--THDLPAIEQRFYEVSPDERLP 231
Query: 362 KLQGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEF 421
LQ LL + ++F TK++ ++ ++ +G++A+A+HG+ Q++RD+VL F
Sbjct: 232 ALQRLLLH---HQPESCVVFCNTKKECQEVADALNAQGFSALALHGDLEQRDRDQVLVRF 288
Query: 422 RIGRASILV 430
S+LV
Sbjct: 289 ANRSCSVLV 297
>gnl|CDD|236722 PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional.
Length = 456
Score = 212 bits (541), Expect = 6e-61
Identities = 117/305 (38%), Positives = 174/305 (57%), Gaps = 9/305 (2%)
Query: 130 FEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVH 189
F+ P I++ + E G++ PT IQ Q P L G DL+A A+TG+GKT G+ P + H
Sbjct: 3 FDSLGLSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTGKTAGFTLPLLQH 62
Query: 190 VNSQRPLRSGEGPI-VLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVKAL 248
+ +++P G P+ L+L PTRELA QI D+ R VFGG PQ+ L
Sbjct: 63 LITRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSKYLNIRSLVVFGGVSINPQMMKL 122
Query: 249 QTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQ 308
+ G ++++ATPGRL+D Q + L + LVLDEADRMLDMGF IR+++ ++ RQ
Sbjct: 123 RGGVDVLVATPGRLLDLEHQNAVKLDQVEILVLDEADRMLDMGFIHDIRRVLAKLPAKRQ 182
Query: 309 VLMWSATWPKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYKLQG-LL 367
L++SAT+ +++ LAE L + +++ + N TA+ + Q V H D K + LL
Sbjct: 183 NLLFSATFSDDIKALAEKLLHNPLEIEVARRN-TASEQVTQHV-----HFVDKKRKRELL 236
Query: 368 SQ-IGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRA 426
SQ IG + ++F TK A+ + + G + AIHGNKSQ R R L +F+ G
Sbjct: 237 SQMIGKGNWQQVLVFTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSGDI 296
Query: 427 SILVS 431
+LV+
Sbjct: 297 RVLVA 301
Score = 32.5 bits (74), Expect = 0.88
Identities = 18/53 (33%), Positives = 22/53 (41%), Gaps = 4/53 (7%)
Query: 18 PSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPNWNS 70
P +G RGG G G GG G +G R RG GG+ S P+
Sbjct: 386 PIQNGRQQRGG--GGRGQGGGRGQQQGQPR--RGEGGAKSASAKPAEKPSRRL 434
Score = 31.7 bits (72), Expect = 1.8
Identities = 30/95 (31%), Positives = 48/95 (50%), Gaps = 12/95 (12%)
Query: 448 RIGRASILVSQYKE-SQQKRDRVLNEFRIGRAS---ILVSHYNKSQQERDRV---LNEFR 500
R AS V+Q+ +KR R L IG+ + +LV + +++ + + LN+
Sbjct: 212 RRNTASEQVTQHVHFVDKKRKRELLSQMIGKGNWQQVLV--FTRTKHGANHLAEQLNKDG 269
Query: 501 IGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 535
I A+I H NKSQ R R L +F+ G +LV+
Sbjct: 270 IRSAAI---HGNKSQGARTRALADFKSGDIRVLVA 301
Score = 30.2 bits (68), Expect = 5.0
Identities = 30/93 (32%), Positives = 46/93 (49%), Gaps = 8/93 (8%)
Query: 422 RIGRASILVSQY-NKSQQERDRVLNEFRIGRAS---ILV-SQYKESQQKRDRVLNEFRIG 476
R AS V+Q+ + ++R R L IG+ + +LV ++ K LN+ I
Sbjct: 212 RRNTASEQVTQHVHFVDKKRKRELLSQMIGKGNWQQVLVFTRTKHGANHLAEQLNKDGIR 271
Query: 477 RASILVSHYNKSQQERDRVLNEFRIGRASILVS 509
A+I H NKSQ R R L +F+ G +LV+
Sbjct: 272 SAAI---HGNKSQGARTRALADFKSGDIRVLVA 301
Score = 29.4 bits (66), Expect = 7.7
Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 3/40 (7%)
Query: 522 LNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 561
LN+ I A+I H NKSQ R R L +F+ G +LV+
Sbjct: 265 LNKDGIRSAAI---HGNKSQGARTRALADFKSGDIRVLVA 301
Score = 29.4 bits (66), Expect = 7.7
Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 3/40 (7%)
Query: 548 LNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 587
LN+ I A+I H NKSQ R R L +F+ G +LV+
Sbjct: 265 LNKDGIRSAAI---HGNKSQGARTRALADFKSGDIRVLVA 301
Score = 29.4 bits (66), Expect = 7.7
Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 3/40 (7%)
Query: 574 LNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 613
LN+ I A+I H NKSQ R R L +F+ G +LV+
Sbjct: 265 LNKDGIRSAAI---HGNKSQGARTRALADFKSGDIRVLVA 301
Score = 29.4 bits (66), Expect = 7.7
Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 3/40 (7%)
Query: 600 LNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 639
LN+ I A+I H NKSQ R R L +F+ G +LV+
Sbjct: 265 LNKDGIRSAAI---HGNKSQGARTRALADFKSGDIRVLVA 301
Score = 29.4 bits (66), Expect = 7.7
Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 3/40 (7%)
Query: 626 LNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 665
LN+ I A+I H NKSQ R R L +F+ G +LV+
Sbjct: 265 LNKDGIRSAAI---HGNKSQGARTRALADFKSGDIRVLVA 301
>gnl|CDD|215103 PLN00206, PLN00206, DEAD-box ATP-dependent RNA helicase;
Provisional.
Length = 518
Score = 213 bits (545), Expect = 7e-61
Identities = 113/317 (35%), Positives = 165/317 (52%), Gaps = 10/317 (3%)
Query: 120 GKCVPRPIQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKT 179
G+ VP PI F C PP ++ + G++ PT IQ Q P ALSG L+ A TGSGKT
Sbjct: 113 GEAVPPPILSFSSCGLPPKLLLNLETAGYEFPTPIQMQAIPAALSGRSLLVSADTGSGKT 172
Query: 180 LGYIAPAIVHVNSQRP--LRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFG 237
++ P I + R P+ +VL PTREL Q+E A G + A V G
Sbjct: 173 ASFLVPIISRCCTIRSGHPSEQRNPLAMVLTPTRELCVQVEDQAKVLGKGLPFKTALVVG 232
Query: 238 GAPKGPQVKALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIR 297
G Q+ +Q G E+++ TPGRLID L + I L S LVLDE D ML+ GF Q+
Sbjct: 233 GDAMPQQLYRIQQGVELIVGTPGRLIDLLSKHDIELDNVSVLVLDEVDCMLERGFRDQVM 292
Query: 298 KIIGQIRPDRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGSLN-PT-ANHNIVQIVDVCQ 355
+I Q QVL++SAT EV+K A D + ++IG+ N P A + V+ Q
Sbjct: 293 QIF-QALSQPQVLLFSATVSPEVEKFASSLAKDIILISIGNPNRPNKAVKQLAIWVETKQ 351
Query: 356 EHEKDYKLQGLLSQIGSERTSKTIIFVETKRKADDITRSVR-NKGWAAVAIHGNKSQQER 414
+ +K + + L S+ ++FV ++ AD + ++ G A++IHG KS +ER
Sbjct: 352 KKQKLFDI--LKSK--QHFKPPAVVFVSSRLGADLLANAITVVTGLKALSIHGEKSMKER 407
Query: 415 DRVLNEFRIGRASILVS 431
V+ F +G ++V+
Sbjct: 408 REVMKSFLVGEVPVIVA 424
>gnl|CDD|215832 pfam00270, DEAD, DEAD/DEAH box helicase. Members of this family
include the DEAD and DEAH box helicases. Helicases are
involved in unwinding nucleic acids. The DEAD box
helicases are involved in various aspects of RNA
metabolism, including nuclear transcription, pre mRNA
splicing, ribosome biogenesis, nucleocytoplasmic
transport, translation, RNA decay and organellar gene
expression.
Length = 169
Score = 192 bits (489), Expect = 4e-57
Identities = 75/174 (43%), Positives = 105/174 (60%), Gaps = 7/174 (4%)
Query: 152 TAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTR 211
T IQAQ P LSG D++ A TGSGKTL ++ P + + ++ GP LVLAPTR
Sbjct: 1 TPIQAQAIPAILSGKDVLVQAPTGSGKTLAFLLPILQALLPKKG-----GPQALVLAPTR 55
Query: 212 ELAQQIETVANDFGSATATRVACVFGGAPKGPQVKALQTG-AEIVIATPGRLIDYLEQGT 270
ELA+QI RVA + GG Q + L+ G A+I++ TPGRL+D L +G
Sbjct: 56 ELAEQIYEELKKLFKILGLRVALLTGGTSLKEQARKLKKGKADILVGTPGRLLDLLRRGK 115
Query: 271 INLHRT-SYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQVLMWSATWPKEVQKL 323
+ L + LVLDEA R+LDMGF + +I+ ++ PDRQ+L+ SAT P+ ++ L
Sbjct: 116 LKLLKNLKLLVLDEAHRLLDMGFGDDLEEILSRLPPDRQILLLSATLPRNLEDL 169
>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily.
Length = 201
Score = 180 bits (459), Expect = 2e-52
Identities = 78/210 (37%), Positives = 118/210 (56%), Gaps = 11/210 (5%)
Query: 143 IYEMGFQAPTAIQAQGWPIALSG-CDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEG 201
I + GF+ Q + LSG D++ A TGSGKTL + PA+ L+ G+G
Sbjct: 1 IEKFGFEPLRPYQKEAIEALLSGLRDVILAAPTGSGKTLAALLPALEA------LKRGKG 54
Query: 202 PIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVKALQTG-AEIVIATPG 260
VLVL PTRELA+Q G + +V ++GG K Q++ L++G +I++ TPG
Sbjct: 55 GRVLVLVPTRELAEQWAEELKKLGPSLGLKVVGLYGGDSKREQLRKLESGKTDILVTTPG 114
Query: 261 RLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQVLMWSATWPKEV 320
RL+D LE ++L ++LDEA R+LD GF Q+ K++ + + Q+L+ SAT P+E+
Sbjct: 115 RLLDLLENDKLSLSNVDLVILDEAHRLLDGGFGDQLEKLLKLLPKNVQLLLLSATPPEEI 174
Query: 321 QKLAEDFLVDYVQLNIGSLNPTANHNIVQI 350
+ L E FL D V +++G T I Q
Sbjct: 175 ENLLELFLNDPVFIDVG---FTPLEPIEQF 201
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 185 bits (472), Expect = 2e-51
Identities = 96/305 (31%), Positives = 151/305 (49%), Gaps = 9/305 (2%)
Query: 130 FEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVH 189
F E +++ + + G+ PTAIQA+ P AL G D++ A TG+GKT ++ PA+ H
Sbjct: 3 FSELELDESLLEALQDKGYTRPTAIQAEAIPPALDGRDVLGSAPTGTGKTAAFLLPALQH 62
Query: 190 VNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVKALQ 249
+ P R P +L+L PTRELA Q+ A + T +A + GG +
Sbjct: 63 L-LDFPRRKSGPPRILILTPTRELAMQVADQARELAKHTHLDIATITGGVAYMNHAEVFS 121
Query: 250 TGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQV 309
+IV+ATPGRL+ Y+++ + L+LDEADRMLDMGF I I + R +Q
Sbjct: 122 ENQDIVVATPGRLLQYIKEENFDCRAVETLILDEADRMLDMGFAQDIETIAAETRWRKQT 181
Query: 310 LMWSATWPKE-VQKLAEDFLVDYVQLNIGSLNPTANH--NIVQIVDVCQEHEKDYKLQGL 366
L++SAT + VQ AE L D V++ P+ I Q + E L L
Sbjct: 182 LLFSATLEGDAVQDFAERLLNDPVEVEA---EPSRRERKKIHQWYYRADDLEHKTAL--L 236
Query: 367 LSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRA 426
+ +++I+FV T+ + ++ +R G + G Q +R+ + GR
Sbjct: 237 CHLLKQPEVTRSIVFVRTRERVHELAGWLRKAGINCCYLEGEMVQAKRNEAIKRLTDGRV 296
Query: 427 SILVS 431
++LV+
Sbjct: 297 NVLVA 301
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 180 bits (458), Expect = 2e-49
Identities = 107/308 (34%), Positives = 160/308 (51%), Gaps = 11/308 (3%)
Query: 130 FEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVH 189
F + N P +M I+++GF T IQAQ L+G D + A+TG+GKT ++ I
Sbjct: 89 FHDFNLAPELMHAIHDLGFPYCTPIQAQVLGYTLAGHDAIGRAQTGTGKTAAFLISIINQ 148
Query: 190 VNSQRPLRS---GEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVK 246
+ P + GE P L++APTREL QI A T V GG Q+K
Sbjct: 149 LLQTPPPKERYMGE-PRALIIAPTRELVVQIAKDAAALTKYTGLNVMTFVGGMDFDKQLK 207
Query: 247 ALQTG-AEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRP 305
L+ +I++ATPGRL+D+ ++G ++L +VLDEADRMLDMGF PQ+R+II Q
Sbjct: 208 QLEARFCDILVATPGRLLDFNQRGEVHLDMVEVMVLDEADRMLDMGFIPQVRQIIRQTPR 267
Query: 306 --DRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYKL 363
+RQ L++SAT+ +V LA+ + D + I N A+ + Q V +K L
Sbjct: 268 KEERQTLLFSATFTDDVMNLAKQWTTDPAIVEIEPEN-VASDTVEQHVYAVAGSDKYKLL 326
Query: 364 QGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRI 423
L++Q ER ++F K + I + G A + G+ Q +R + L FR
Sbjct: 327 YNLVTQNPWER---VMVFANRKDEVRRIEERLVKDGINAAQLSGDVPQHKRIKTLEGFRE 383
Query: 424 GRASILVS 431
G+ +LV+
Sbjct: 384 GKIRVLVA 391
>gnl|CDD|235307 PRK04537, PRK04537, ATP-dependent RNA helicase RhlB; Provisional.
Length = 572
Score = 173 bits (441), Expect = 3e-46
Identities = 115/308 (37%), Positives = 175/308 (56%), Gaps = 11/308 (3%)
Query: 130 FEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVH 189
F + P ++ + GF T IQA P+AL G D+ A+TG+GKTL ++ A+++
Sbjct: 11 FSSFDLHPALLAGLESAGFTRCTPIQALTLPVALPGGDVAGQAQTGTGKTLAFLV-AVMN 69
Query: 190 VNSQRPL---RSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVK 246
RP R E P L+LAPTRELA QI A FG+ R A V+GG Q +
Sbjct: 70 RLLSRPALADRKPEDPRALILAPTRELAIQIHKDAVKFGADLGLRFALVYGGVDYDKQRE 129
Query: 247 ALQTGAEIVIATPGRLIDYLEQ-GTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQI-- 303
LQ G +++IATPGRLIDY++Q ++LH VLDEADRM D+GF IR ++ ++
Sbjct: 130 LLQQGVDVIIATPGRLIDYVKQHKVVSLHACEICVLDEADRMFDLGFIKDIRFLLRRMPE 189
Query: 304 RPDRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYKL 363
R RQ L++SAT V +LA + + + +L + + TA + Q + + EK L
Sbjct: 190 RGTRQTLLFSATLSHRVLELAYEHMNEPEKLVVETETITAAR-VRQRIYFPADEEKQTLL 248
Query: 364 QGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRI 423
GLLS+ SE ++T++FV TK + + R++ G+ + G+ Q++R+ +LN F+
Sbjct: 249 LGLLSR--SE-GARTMVFVNTKAFVERVARTLERHGYRVGVLSGDVPQKKRESLLNRFQK 305
Query: 424 GRASILVS 431
G+ ILV+
Sbjct: 306 GQLEILVA 313
Score = 33.0 bits (75), Expect = 0.63
Identities = 13/41 (31%), Positives = 14/41 (34%), Gaps = 2/41 (4%)
Query: 26 RGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTP 66
R G R G G G + GGR G G R P
Sbjct: 431 RRGGGRSGPGGGS-RSGSV-GGGGRRDGAGADGKPRPRRKP 469
Score = 29.9 bits (67), Expect = 6.6
Identities = 8/31 (25%), Positives = 8/31 (25%)
Query: 23 GAPRGGAPRGGGGRGGYGAPRGYSRGGRGGG 53
G R G G G R G G
Sbjct: 433 GGGRSGPGGGSRSGSVGGGGRRDGAGADGKP 463
>gnl|CDD|185609 PTZ00424, PTZ00424, helicase 45; Provisional.
Length = 401
Score = 166 bits (423), Expect = 4e-45
Identities = 96/306 (31%), Positives = 161/306 (52%), Gaps = 10/306 (3%)
Query: 127 IQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPA 186
+ F+ +++ IY GF+ P+AIQ +G L G D + A++G+GKT ++ A
Sbjct: 27 VDSFDALKLNEDLLRGIYSYGFEKPSAIQQRGIKPILDGYDTIGQAQSGTGKTATFVIAA 86
Query: 187 IVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRV-ACVFGGAPKGPQV 245
+ ++ L + + L+LAPTRELAQQI+ V G R ACV GG +
Sbjct: 87 LQLIDYD--LNACQ---ALILAPTRELAQQIQKVVLALGDYLKVRCHACV-GGTVVRDDI 140
Query: 246 KALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRP 305
L+ G +V+ TPGR+ D +++ + + +LDEAD ML GF+ QI + ++ P
Sbjct: 141 NKLKAGVHMVVGTPGRVYDMIDKRHLRVDDLKLFILDEADEMLSRGFKGQIYDVFKKLPP 200
Query: 306 DRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYKLQG 365
D QV ++SAT P E+ +L F+ D ++ + T I Q V E E ++K
Sbjct: 201 DVQVALFSATMPNEILELTTKFMRDPKRILVKKDELTL-EGIRQFY-VAVEKE-EWKFDT 257
Query: 366 LLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGR 425
L + ++ II+ T+RK D +T+ + + + +HG+ Q++RD ++ EFR G
Sbjct: 258 LCDLYETLTITQAIIYCNTRRKVDYLTKKMHERDFTVSCMHGDMDQKDRDLIMREFRSGS 317
Query: 426 ASILVS 431
+L++
Sbjct: 318 TRVLIT 323
>gnl|CDD|235314 PRK04837, PRK04837, ATP-dependent RNA helicase RhlB; Provisional.
Length = 423
Score = 166 bits (423), Expect = 6e-45
Identities = 102/317 (32%), Positives = 159/317 (50%), Gaps = 28/317 (8%)
Query: 128 QHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAI 187
Q F + P +++ + + GF T IQA P+ L+G D+ A+TG+GKT+ ++
Sbjct: 8 QKFSDFALHPQVVEALEKKGFHNCTPIQALALPLTLAGRDVAGQAQTGTGKTMAFLTATF 67
Query: 188 VHV--NSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQV 245
++ + R P L++APTRELA QI A AT ++ +GG Q+
Sbjct: 68 HYLLSHPAPEDRKVNQPRALIMAPTRELAVQIHADAEPLAQATGLKLGLAYGGDGYDKQL 127
Query: 246 KALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRP 305
K L++G +I+I T GRLIDY +Q INL +VLDEADRM D+GF IR + ++ P
Sbjct: 128 KVLESGVDILIGTTGRLIDYAKQNHINLGAIQVVVLDEADRMFDLGFIKDIRWLFRRMPP 187
Query: 306 --DRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEH----EK 359
R +++SAT V++LA + + + + + T H I +E
Sbjct: 188 ANQRLNMLFSATLSYRVRELAFEHMNNPEYVEVEPEQKTG-HRI-------KEELFYPSN 239
Query: 360 DYKLQGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIH------GNKSQQE 413
+ K++ L + I E + IIF TK + ++I G A H G+ +Q++
Sbjct: 240 EEKMRLLQTLIEEEWPDRAIIFANTKHRCEEIW------GHLAADGHRVGLLTGDVAQKK 293
Query: 414 RDRVLNEFRIGRASILV 430
R R+L EF G ILV
Sbjct: 294 RLRILEEFTRGDLDILV 310
>gnl|CDD|236941 PRK11634, PRK11634, ATP-dependent RNA helicase DeaD; Provisional.
Length = 629
Score = 164 bits (417), Expect = 1e-42
Identities = 97/297 (32%), Positives = 172/297 (57%), Gaps = 16/297 (5%)
Query: 139 IMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRS 198
I++ + ++G++ P+ IQA+ P L+G D++ +A+TGSGKT + P + +++ + L++
Sbjct: 17 ILEALNDLGYEKPSPIQAECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLHNLDPE--LKA 74
Query: 199 GEGPIVLVLAPTRELAQQIETVANDFGS-ATATRVACVFGGAPKGPQVKALQTGAEIVIA 257
P +LVLAPTRELA Q+ DF V ++GG Q++AL+ G +IV+
Sbjct: 75 ---PQILVLAPTRELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQIVVG 131
Query: 258 TPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQVLMWSATWP 317
TPGRL+D+L++GT++L + S LVLDEAD ML MGF + I+ QI Q ++SAT P
Sbjct: 132 TPGRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGHQTALFSATMP 191
Query: 318 KEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDY---KLQGLLSQIGSER 374
+ ++++ F+ + ++ I S ++ D+ Q + + K + L+ + +E
Sbjct: 192 EAIRRITRRFMKEPQEVRIQS-------SVTTRPDISQSYWTVWGMRKNEALVRFLEAED 244
Query: 375 TSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRASILVS 431
IIFV TK ++ ++ G+ + A++G+ +Q R++ L + GR IL++
Sbjct: 245 FDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQALREQTLERLKDGRLDILIA 301
>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
of proteins involved in ATP-dependent RNA or DNA
unwinding. This domain contains the ATP-binding region.
Length = 144
Score = 144 bits (364), Expect = 4e-40
Identities = 61/150 (40%), Positives = 84/150 (56%), Gaps = 7/150 (4%)
Query: 167 DLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGS 226
D++ A TGSGKTL + P + L S +G VLVLAPTRELA Q+ +
Sbjct: 2 DVLLAAPTGSGKTLAALLPILEL------LDSLKGGQVLVLAPTRELANQVAERLKELFG 55
Query: 227 ATATRVACVFGGAPKGPQVKALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADR 286
+V + GG Q K L +IV+ TPGRL+D LE+ ++L + L+LDEA R
Sbjct: 56 EGI-KVGYLIGGTSIKQQEKLLSGKTDIVVGTPGRLLDELERLKLSLKKLDLLILDEAHR 114
Query: 287 MLDMGFEPQIRKIIGQIRPDRQVLMWSATW 316
+L+ GF KI+ ++ DRQVL+ SAT
Sbjct: 115 LLNQGFGLLGLKILLKLPKDRQVLLLSATP 144
>gnl|CDD|238034 cd00079, HELICc, Helicase superfamily c-terminal domain; associated
with DEXDc-, DEAD-, and DEAH-box proteins, yeast
initiation factor 4A, Ski2p, and Hepatitis C virus NS3
helicases; this domain is found in a wide variety of
helicases and helicase related proteins; may not be an
autonomously folding unit, but an integral part of the
helicase; 4 helicase superfamilies at present according
to the organization of their signature motifs; all
helicases share the ability to unwind nucleic acid
duplexes with a distinct directional polarity; they
utilize the free energy from nucleoside triphosphate
hydrolysis to fuel their translocation along DNA,
unwinding the duplex in the process.
Length = 131
Score = 69.6 bits (171), Expect = 4e-14
Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 6/88 (6%)
Query: 346 NIVQIVDVCQEHEKDYKLQGLLSQIGSERTS--KTIIFVETKRKADDITRSVRNKGWAAV 403
I Q V +D KL+ LL + K +IF +K+ D++ +R G
Sbjct: 1 PIKQYVLPV----EDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVA 56
Query: 404 AIHGNKSQQERDRVLNEFRIGRASILVS 431
A+HG+ SQ+ER+ VL +FR G +LV+
Sbjct: 57 ALHGDGSQEEREEVLKDFREGEIVVLVA 84
Score = 38.8 bits (91), Expect = 0.002
Identities = 19/82 (23%), Positives = 33/82 (40%), Gaps = 8/82 (9%)
Query: 433 YNKSQQERDRVLNEF-----RIGRASILVSQYKESQQKRDRVLNEFRIGRASILVSHYNK 487
+ E+ L E + G ++ K+ D + R + H +
Sbjct: 6 VLPVEDEKLEALLELLKEHLKKGGKVLIFCPSKK---MLDELAELLRKPGIKVAALHGDG 62
Query: 488 SQQERDRVLNEFRIGRASILVS 509
SQ+ER+ VL +FR G +LV+
Sbjct: 63 SQEEREEVLKDFREGEIVVLVA 84
Score = 38.0 bits (89), Expect = 0.004
Identities = 18/80 (22%), Positives = 35/80 (43%), Gaps = 5/80 (6%)
Query: 459 YKESQQKRDRVLNEFRIGRAS---ILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQ 515
+K + +L + +L+ + S++ D + R + H + SQ
Sbjct: 7 LPVEDEKLEALLELLKEHLKKGGKVLI--FCPSKKMLDELAELLRKPGIKVAALHGDGSQ 64
Query: 516 QERDRVLNEFRIGRASILVS 535
+ER+ VL +FR G +LV+
Sbjct: 65 EEREEVLKDFREGEIVVLVA 84
Score = 37.2 bits (87), Expect = 0.006
Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 2/69 (2%)
Query: 493 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQERDRVLNEFR 552
+L E +L+ + S++ D + R + H + SQ+ER+ VL +FR
Sbjct: 18 LELLKEHLKKGGKVLI--FCPSKKMLDELAELLRKPGIKVAALHGDGSQEEREEVLKDFR 75
Query: 553 IGRASILVS 561
G +LV+
Sbjct: 76 EGEIVVLVA 84
Score = 37.2 bits (87), Expect = 0.006
Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 2/69 (2%)
Query: 519 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQERDRVLNEFR 578
+L E +L+ + S++ D + R + H + SQ+ER+ VL +FR
Sbjct: 18 LELLKEHLKKGGKVLI--FCPSKKMLDELAELLRKPGIKVAALHGDGSQEEREEVLKDFR 75
Query: 579 IGRASILVS 587
G +LV+
Sbjct: 76 EGEIVVLVA 84
Score = 37.2 bits (87), Expect = 0.006
Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 2/69 (2%)
Query: 545 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQERDRVLNEFR 604
+L E +L+ + S++ D + R + H + SQ+ER+ VL +FR
Sbjct: 18 LELLKEHLKKGGKVLI--FCPSKKMLDELAELLRKPGIKVAALHGDGSQEEREEVLKDFR 75
Query: 605 IGRASILVS 613
G +LV+
Sbjct: 76 EGEIVVLVA 84
Score = 37.2 bits (87), Expect = 0.006
Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 2/69 (2%)
Query: 571 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQERDRVLNEFR 630
+L E +L+ + S++ D + R + H + SQ+ER+ VL +FR
Sbjct: 18 LELLKEHLKKGGKVLI--FCPSKKMLDELAELLRKPGIKVAALHGDGSQEEREEVLKDFR 75
Query: 631 IGRASILVS 639
G +LV+
Sbjct: 76 EGEIVVLVA 84
Score = 37.2 bits (87), Expect = 0.006
Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 2/69 (2%)
Query: 597 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQERDRVLNEFR 656
+L E +L+ + S++ D + R + H + SQ+ER+ VL +FR
Sbjct: 18 LELLKEHLKKGGKVLI--FCPSKKMLDELAELLRKPGIKVAALHGDGSQEEREEVLKDFR 75
Query: 657 IGRASILVS 665
G +LV+
Sbjct: 76 EGEIVVLVA 84
>gnl|CDD|223989 COG1061, SSL2, DNA or RNA helicases of superfamily II
[Transcription / DNA replication, recombination, and
repair].
Length = 442
Score = 60.5 bits (147), Expect = 2e-09
Identities = 64/336 (19%), Positives = 104/336 (30%), Gaps = 87/336 (25%)
Query: 169 VAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSAT 228
V + TG+GKT +A + + LVL PT+EL Q F
Sbjct: 59 VIVLPTGAGKT--VVAAEAIAELKRS---------TLVLVPTKELLDQWAEALKKFLLLN 107
Query: 229 ATRVACVFGGAPKGPQVKALQTGAEIVIATPGRLID--YLEQGTINLH--RTSYLVLDEA 284
++GG K A++ +AT + Q ++ DE
Sbjct: 108 DEIG--IYGGGEKEL------EPAKVTVAT----VQTLARRQLLDEFLGNEFGLIIFDEV 155
Query: 285 DRMLDMGFEPQIRKIIGQIRPDRQVLMWSATWPKEVQK---------------------L 323
+ + R+I+ + L +AT +E +
Sbjct: 156 HHLPAPSY----RRILELLSAAYPRLGLTATPEREDGGRIGDLFDLIGPIVYEVSLKELI 211
Query: 324 AEDFLVDYV----------------------QLNIGSLNPTANHNIVQIVDVCQEHEKDY 361
E +L Y + T K
Sbjct: 212 DEGYLAPYKYVEIKVTLTEDEEREYAKESARFRELLRARGTLRAENEARRIAIASERKIA 271
Query: 362 KLQGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEF 421
++GLL + R KT+IF A +I + G AI G ++ER+ +L F
Sbjct: 272 AVRGLLLKH--ARGDKTLIFASDVEHAYEIAKLFLAPGIVE-AITGETPKEEREAILERF 328
Query: 422 RIGRASILVSQYNKSQQERDRVLNE-FRIGRASILV 456
R G +LV+ +VL+E I A +L+
Sbjct: 329 RTGGIKVLVTV---------KVLDEGVDIPDADVLI 355
Score = 29.3 bits (66), Expect = 8.3
Identities = 24/136 (17%), Positives = 47/136 (34%), Gaps = 19/136 (13%)
Query: 543 ERDRVLNEFRIGRASILVSHYNKSQQE--RDRVLNEFRIGRASILVSHYNKSQQERDRVL 600
E R R + ++ ++ + R +L R + I S + +
Sbjct: 245 ELLRARGTLRAENEARRIAIASERKIAAVRGLLLKHARGDKTLIFAS--DVEHA--YEIA 300
Query: 601 NEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQERDRVLNE-FRIGR 659
F ++ ++ER+ +L FR G +LV+ +VL+E I
Sbjct: 301 KLFLAPGIVEAITG-ETPKEEREAILERFRTGGIKVLVT---------VKVLDEGVDIPD 350
Query: 660 AS--ILVSDFDILRQW 673
A I++ R +
Sbjct: 351 ADVLIILRPTGSRRLF 366
>gnl|CDD|224122 COG1201, Lhr, Lhr-like helicases [General function prediction
only].
Length = 814
Score = 60.7 bits (148), Expect = 2e-09
Identities = 74/335 (22%), Positives = 125/335 (37%), Gaps = 70/335 (20%)
Query: 135 FPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQR 194
P + + F + T Q P SG +++ IA TGSGKT P I + S
Sbjct: 8 LDPRVREWFKR-KFTSLTPPQRYAIPEIHSGENVLIIAPTGSGKTEAAFLPVINELLSLG 66
Query: 195 PLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVKALQTGAEI 254
+ +G L ++P + L I + VA G P+ + K L+ I
Sbjct: 67 KGKLEDGIYALYISPLKALNNDIRRRLEEPLRELGIEVAVRHGDTPQSEKQKMLKNPPHI 126
Query: 255 VIATP---------GRLIDYLEQGTINLHRTSYLVLDE------ADR--MLDMGFEPQIR 297
+I TP + + L ++ Y+++DE + R L + E ++R
Sbjct: 127 LITTPESLAILLNSPKFRELLR----DVR---YVIVDEIHALAESKRGVQLALSLE-RLR 178
Query: 298 KIIGQIRPDRQVLMWSATW--PKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDV-- 353
++ G + R L SAT P+EV K FLV + +IVDV
Sbjct: 179 ELAGDFQ--RIGL--SATVGPPEEVAK----FLV-------------GFGDPCEIVDVSA 217
Query: 354 ----------------CQEHEKDYKLQGLLSQIGSERTSKTIIFVETKRKADDITRSVRN 397
E + + + RT T+IF T+ A+ + ++
Sbjct: 218 AKKLEIKVISPVEDLIYDEELWAALYERIAELVKKHRT--TLIFTNTRSGAERLAFRLKK 275
Query: 398 KGWAAVAIH-GNKSQQERDRVLNEFRIGRASILVS 431
G + +H G+ S++ R V + G +V+
Sbjct: 276 LGPDIIEVHHGSLSRELRLEVEERLKEGELKAVVA 310
>gnl|CDD|224126 COG1205, COG1205, Distinct helicase family with a unique C-terminal
domain including a metal-binding cysteine cluster
[General function prediction only].
Length = 851
Score = 53.2 bits (128), Expect = 4e-07
Identities = 60/320 (18%), Positives = 115/320 (35%), Gaps = 67/320 (20%)
Query: 155 QAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELA 214
Q + G ++V TGSGKT ++ P + H+ LR L+L PT LA
Sbjct: 75 QVDALRLIREGRNVVVTTGTGSGKTESFLLPILDHL-----LRDPSAR-ALLLYPTNALA 128
Query: 215 QQIETVANDFGSATATRVAC-VFGGAPKGPQVKALQTG-AEIVIATPGRLIDYLEQGTIN 272
+ S +V + G + +A+ +I++ P D L +
Sbjct: 129 NDQAERLRELISDLPGKVTFGRYTGDTPPEERRAIIRNPPDILLTNP----DMLHYLLLR 184
Query: 273 LH--------RTSYLVLDEA---------------DRMLDMGFEPQIRKIIGQIRPDRQV 309
H YLV+DE R+L + + Q+
Sbjct: 185 NHDAWLWLLRNLKYLVVDELHTYRGVQGSEVALLLRRLLRR---------LRRYGSPLQI 235
Query: 310 LMWSATW--PKEVQKLAEDFLVDYVQLNI-GSLNPTANHNIV----QIVDVCQEHEKDYK 362
+ SAT P E AE+ ++ + +P V I ++ + +
Sbjct: 236 ICTSATLANPGE---FAEELFGRDFEVPVDEDGSPRGLRYFVRREPPIRELAESIRRSAL 292
Query: 363 LQGLLSQIGS---ERTSKTIIFVETKRKADDITRSVRN-------KGWAAVAIH-GNKSQ 411
+ L+ + + +T++F ++++ + + S R K AV+ + +
Sbjct: 293 AE--LATLAALLVRNGIQTLVFFRSRKQVELLYLSPRRRLVREGGKLLDAVSTYRAGLHR 350
Query: 412 QERDRVLNEFRIGRASILVS 431
+ER R+ EF+ G +++
Sbjct: 351 EERRRIEAEFKEGELLGVIA 370
>gnl|CDD|223588 COG0514, RecQ, Superfamily II DNA helicase [DNA replication,
recombination, and repair].
Length = 590
Score = 52.7 bits (127), Expect = 5e-07
Identities = 64/287 (22%), Positives = 121/287 (42%), Gaps = 50/287 (17%)
Query: 163 LSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQ-QIET-- 219
LSG D + + TG GK+L Y PA++ EG LV++P L + Q++
Sbjct: 30 LSGKDTLVVMPTGGGKSLCYQIPALL----------LEGL-TLVVSPLISLMKDQVDQLE 78
Query: 220 ---VANDFGSATATRVACVFGGAPKGPQVKALQTGA-EIVIATPGRL-----IDYLEQGT 270
+ + ++T +R + + L++G +++ +P RL ++ L++
Sbjct: 79 AAGIRAAYLNSTLSREE-------RQQVLNQLKSGQLKLLYISPERLMSPRFLELLKRLP 131
Query: 271 INLHRTSYLVLDEADRMLDMG--FEPQIRKIIGQIRPD--RQVLMW-SATWPKEVQK--L 323
I+L + +DEA + G F P R++ G++R ++ +AT V+
Sbjct: 132 ISL-----VAIDEAHCISQWGHDFRPDYRRL-GRLRAGLPNPPVLALTATATPRVRDDIR 185
Query: 324 AEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYKLQGLLSQIGSERTSKTIIFVE 383
+ L D GS + V E L+ + + + II+
Sbjct: 186 EQLGLQDANIF-RGSFDRPNLALKVVEKG---EPSDQL---AFLATVLPQLSKSGIIYCL 238
Query: 384 TKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRASILV 430
T++K +++ +R G +A A H S +ER+RV F ++V
Sbjct: 239 TRKKVEELAEWLRKNGISAGAYHAGLSNEERERVQQAFLNDEIKVMV 285
>gnl|CDD|224123 COG1202, COG1202, Superfamily II helicase, archaea-specific
[General function prediction only].
Length = 830
Score = 52.5 bits (126), Expect = 7e-07
Identities = 72/292 (24%), Positives = 116/292 (39%), Gaps = 48/292 (16%)
Query: 163 LSGCDLVAIAKTGSGKTL-GYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVA 221
L G +L+ ++ T SGKTL G +A R L G+ + LV P LA Q
Sbjct: 230 LEGENLLVVSATASGKTLIGELA------GIPRLLSGGKKMLFLV--PLVALANQKY--- 278
Query: 222 NDF-------GSATATRVACVFGGAPKGPQVKALQTGAEIVIAT-PGRLIDYLEQGTINL 273
DF G A RV + P V A+I++ T G IDYL + +L
Sbjct: 279 EDFKERYSKLGLKVAIRVGMSRIKTREEPVVVDTSPDADIIVGTYEG--IDYLLRTGKDL 336
Query: 274 HRTSYLVLDEADRMLDMGFEPQIRKIIGQIR---PDRQVLMWSATW--PKEVQKLAEDFL 328
+V+DE + D P++ +IG++R P Q + SAT P+E+ K L
Sbjct: 337 GDIGTVVIDEIHTLEDEERGPRLDGLIGRLRYLFPGAQFIYLSATVGNPEELAKKLGAKL 396
Query: 329 VDYVQLNIGSLNPTANHNIVQI---VDVCQEHEKDYKLQGLLSQIGSERTSK------TI 379
V Y + V + + + + + + L + S TI
Sbjct: 397 VLY------------DERPVPLERHLVFARNESEKWDIIARLVKREFSTESSKGYRGQTI 444
Query: 380 IFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRASILVS 431
+F ++R+ ++ ++ KG A H +ER V F + +V+
Sbjct: 445 VFTYSRRRCHELADALTGKGLKAAPYHAGLPYKERKSVERAFAAQELAAVVT 496
>gnl|CDD|129701 TIGR00614, recQ_fam, ATP-dependent DNA helicase, RecQ family. All
proteins in this family for which functions are known
are 3'-5' DNA-DNA helicases. These proteins are used for
recombination, recombinational repair, and possibly
maintenance of chromosome stability. This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 470
Score = 49.8 bits (119), Expect = 4e-06
Identities = 67/276 (24%), Positives = 112/276 (40%), Gaps = 42/276 (15%)
Query: 162 ALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTREL----AQQI 217
L G D + TG GK+L Y PA+ I LV++P L Q+
Sbjct: 23 VLLGRDCFVVMPTGGGKSLCYQLPALC-----------SDGITLVISPLISLMEDQVLQL 71
Query: 218 ET--VANDF--GSATATRVACVFGGAPKGPQVKALQTGAEIVIATPGRLIDYLEQGTINL 273
+ + F S + + V K ++K L E + RL+ LE
Sbjct: 72 KASGIPATFLNSSQSKEQQKNVLTDL-KDGKIKLLYVTPEKC-SASNRLLQTLE----ER 125
Query: 274 HRTSYLVLDEADRMLDMG--FEPQIRKIIGQIR---PDRQVLMWSATWPKEVQKLAEDFL 328
+ + +DEA + G F P + + G ++ P+ ++ +AT V++ L
Sbjct: 126 KGITLIAVDEAHCISQWGHDFRPDYKAL-GSLKQKFPNVPIMALTATASPSVREDILRQL 184
Query: 329 -VDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYK-LQGLLSQIGSERTSKT-IIFVETK 385
+ Q+ S + N+ V K K L+ LL I E K+ II+ ++
Sbjct: 185 NLKNPQIFCTSFD---RPNLYYEV-----RRKTPKILEDLLRFIRKEFKGKSGIIYCPSR 236
Query: 386 RKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEF 421
+K++ +T S++N G AA A H RD V ++F
Sbjct: 237 KKSEQVTASLQNLGIAAGAYHAGLEISARDDVHHKF 272
>gnl|CDD|234365 TIGR03817, DECH_helic, helicase/secretion neighborhood putative
DEAH-box helicase. A conserved gene neighborhood widely
spread in the Actinobacteria contains this
uncharacterized DEAH-box family helicase encoded
convergently towards an operon of genes for protein
homologous to type II secretion and pilus formation
proteins. The context suggests that this helicase may
play a role in conjugal transfer of DNA.
Length = 742
Score = 49.7 bits (119), Expect = 5e-06
Identities = 45/182 (24%), Positives = 67/182 (36%), Gaps = 26/182 (14%)
Query: 111 YLLRLESRPGKCVPRPIQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVA 170
++ L +R G+ P P P ++ + G P QA+ +A +G +V
Sbjct: 3 HVEHLPARAGRTAPWPAW------AHPDVVAALEAAGIHRPWQHQARAAELAHAGRHVVV 56
Query: 171 IAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATAT 230
T SGK+L Y P + L L LAPT+ LA +
Sbjct: 57 ATGTASGKSLAYQLPVL------SALADDPRATALYLAPTKALAADQLRAVRELT-LRGV 109
Query: 231 RVACVFGGAPKGPQVKALQTGAEIVIATPGRLIDYLEQGTINLH--------RTSYLVLD 282
R A G P + A + A V+ P D L +G + H R Y+V+D
Sbjct: 110 RPATYDGDTPTEERRWAREH-ARYVLTNP----DMLHRGILPSHARWARFLRRLRYVVID 164
Query: 283 EA 284
E
Sbjct: 165 EC 166
>gnl|CDD|130456 TIGR01389, recQ, ATP-dependent DNA helicase RecQ. The
ATP-dependent DNA helicase RecQ of E. coli is about 600
residues long. This model represents bacterial proteins
with a high degree of similarity in domain architecture
and in primary sequence to E. coli RecQ. The model
excludes eukaryotic and archaeal proteins with RecQ-like
regions, as well as more distantly related bacterial
helicases related to RecQ [DNA metabolism, DNA
replication, recombination, and repair].
Length = 591
Score = 47.4 bits (113), Expect = 2e-05
Identities = 61/289 (21%), Positives = 119/289 (41%), Gaps = 54/289 (18%)
Query: 163 LSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQ-QIETVA 221
L G D++ + TG GK+L Y PA+ L G + +V++P L + Q++ +
Sbjct: 26 LDGRDVLVVMPTGGGKSLCYQVPAL--------LLKG---LTVVISPLISLMKDQVDQL- 73
Query: 222 NDFGSATATRVACVFGGAPKGPQ---VKALQTG-AEIVIATPGRL-----IDYLEQGTIN 272
A A + Q KAL G +++ P RL ++ L++ I
Sbjct: 74 ----RAAGVAAAYLNSTLSAKEQQDIEKALVNGELKLLYVAPERLEQDYFLNMLQRIPIA 129
Query: 273 LHRTSYLVLDEADRMLDMG--FEPQIRKI--IGQIRPDRQVLMWSATWPKEVQKLAEDFL 328
L + +DEA + G F P+ +++ + + P + +AT E ++
Sbjct: 130 L-----VAVDEAHCVSQWGHDFRPEYQRLGSLAERFPQVPRIALTATADAETRQ------ 178
Query: 329 VDYV-QLNIGSLN----PTANHNIV-QIVDVCQEHEKDYKLQGLLSQIGSERTSKTIIFV 382
D L + N N+ +V +K+ K + LL + R II+
Sbjct: 179 -DIRELLRLADANEFITSFDRPNLRFSVV------KKNNKQKFLLDYLKKHRGQSGIIYA 231
Query: 383 ETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRASILVS 431
+++K +++ + ++G +A+A H S + R +F ++V+
Sbjct: 232 SSRKKVEELAERLESQGISALAYHAGLSNKVRAENQEDFLYDDVKVMVA 280
>gnl|CDD|201125 pfam00271, Helicase_C, Helicase conserved C-terminal domain. The
Prosite family is restricted to DEAD/H helicases,
whereas this domain family is found in a wide variety of
helicases and helicase related proteins. It may be that
this is not an autonomously folding unit, but an
integral part of the helicase.
Length = 78
Score = 40.2 bits (95), Expect = 2e-04
Identities = 14/36 (38%), Positives = 23/36 (63%)
Query: 396 RNKGWAAVAIHGNKSQQERDRVLNEFRIGRASILVS 431
R G +HG SQ+ER+ +L +FR G++ +LV+
Sbjct: 4 RKPGIKVARLHGGLSQEEREEILEDFRNGKSKVLVA 39
Score = 29.4 bits (67), Expect = 1.5
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 479 SILVSHYNKSQQERDRVLNEFRIGRASILVS 509
+ H SQ+ER+ +L +FR G++ +LV+
Sbjct: 9 KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39
Score = 29.4 bits (67), Expect = 1.5
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 505 SILVSHYNKSQQERDRVLNEFRIGRASILVS 535
+ H SQ+ER+ +L +FR G++ +LV+
Sbjct: 9 KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39
Score = 29.4 bits (67), Expect = 1.5
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 531 SILVSHYNKSQQERDRVLNEFRIGRASILVS 561
+ H SQ+ER+ +L +FR G++ +LV+
Sbjct: 9 KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39
Score = 29.4 bits (67), Expect = 1.5
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 557 SILVSHYNKSQQERDRVLNEFRIGRASILVS 587
+ H SQ+ER+ +L +FR G++ +LV+
Sbjct: 9 KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39
Score = 29.4 bits (67), Expect = 1.5
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 583 SILVSHYNKSQQERDRVLNEFRIGRASILVS 613
+ H SQ+ER+ +L +FR G++ +LV+
Sbjct: 9 KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39
Score = 29.4 bits (67), Expect = 1.5
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 609 SILVSHYNKSQQERDRVLNEFRIGRASILVS 639
+ H SQ+ER+ +L +FR G++ +LV+
Sbjct: 9 KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39
Score = 29.4 bits (67), Expect = 1.5
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 635 SILVSHYNKSQQERDRVLNEFRIGRASILVS 665
+ H SQ+ER+ +L +FR G++ +LV+
Sbjct: 9 KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39
Score = 27.9 bits (63), Expect = 4.3
Identities = 10/24 (41%), Positives = 18/24 (75%)
Query: 434 NKSQQERDRVLNEFRIGRASILVS 457
SQ+ER+ +L +FR G++ +LV+
Sbjct: 16 GLSQEEREEILEDFRNGKSKVLVA 39
>gnl|CDD|240291 PTZ00146, PTZ00146, fibrillarin; Provisional.
Length = 293
Score = 43.6 bits (103), Expect = 2e-04
Identities = 20/41 (48%), Positives = 20/41 (48%), Gaps = 1/41 (2%)
Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSN 56
GR GG GG GG G G G RG GGRGGGG
Sbjct: 13 GRGGGGGGGRGGGGRGGGRGGGR-GRGRGGGGGGRGGGGGG 52
Score = 41.6 bits (98), Expect = 0.001
Identities = 17/41 (41%), Positives = 17/41 (41%), Gaps = 1/41 (2%)
Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSN 56
G G G RGGG G G G RGG GGGG
Sbjct: 16 GGGGGGRGGGGRGGGRGGGRGRGRGGGGG-GRGGGGGGGPG 55
Score = 40.5 bits (95), Expect = 0.002
Identities = 19/40 (47%), Positives = 19/40 (47%), Gaps = 1/40 (2%)
Query: 15 YGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGG 54
G GG RGG GG G GG G RG RG GGG
Sbjct: 4 GGFGGGRGGG-RGGGGGGGRGGGGRGGGRGGGRGRGRGGG 42
Score = 37.8 bits (88), Expect = 0.018
Identities = 17/36 (47%), Positives = 17/36 (47%), Gaps = 3/36 (8%)
Query: 20 YSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGS 55
G RGG G GG GG G G GGRGGG
Sbjct: 4 GGFGGGRGG---GRGGGGGGGRGGGGRGGGRGGGRG 36
Score = 37.4 bits (87), Expect = 0.024
Identities = 17/38 (44%), Positives = 17/38 (44%), Gaps = 2/38 (5%)
Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGG 53
GR G RGG G GRGG G RG GG G
Sbjct: 21 GRGGGGRGGGRGG--GRGRGRGGGGGGRGGGGGGGPGK 56
Score = 37.0 bits (86), Expect = 0.024
Identities = 17/37 (45%), Positives = 17/37 (45%)
Query: 22 GGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYG 58
G GG GG G GG G G RGG GGG G
Sbjct: 2 MGGGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRG 38
Score = 32.0 bits (73), Expect = 1.0
Identities = 14/26 (53%), Positives = 15/26 (57%)
Query: 34 GGRGGYGAPRGYSRGGRGGGGSNYGS 59
G GG+G RG RGG GGGG G
Sbjct: 1 GMGGGFGGGRGGGRGGGGGGGRGGGG 26
Score = 32.0 bits (73), Expect = 1.1
Identities = 15/33 (45%), Positives = 15/33 (45%)
Query: 26 RGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYG 58
G GGG GG G G RGG G GG G
Sbjct: 1 GMGGGFGGGRGGGRGGGGGGGRGGGGRGGGRGG 33
>gnl|CDD|197757 smart00490, HELICc, helicase superfamily c-terminal domain.
Length = 82
Score = 39.5 bits (93), Expect = 4e-04
Identities = 12/43 (27%), Positives = 26/43 (60%)
Query: 389 DDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRASILVS 431
+++ ++ G +HG SQ+ER+ +L++F G+ +LV+
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43
Score = 29.5 bits (67), Expect = 1.4
Identities = 10/43 (23%), Positives = 22/43 (51%)
Query: 467 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 509
+ + + + H SQ+ER+ +L++F G+ +LV+
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43
Score = 29.5 bits (67), Expect = 1.4
Identities = 10/43 (23%), Positives = 22/43 (51%)
Query: 493 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 535
+ + + + H SQ+ER+ +L++F G+ +LV+
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43
Score = 29.5 bits (67), Expect = 1.4
Identities = 10/43 (23%), Positives = 22/43 (51%)
Query: 519 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 561
+ + + + H SQ+ER+ +L++F G+ +LV+
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43
Score = 29.5 bits (67), Expect = 1.4
Identities = 10/43 (23%), Positives = 22/43 (51%)
Query: 545 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 587
+ + + + H SQ+ER+ +L++F G+ +LV+
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43
Score = 29.5 bits (67), Expect = 1.4
Identities = 10/43 (23%), Positives = 22/43 (51%)
Query: 571 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 613
+ + + + H SQ+ER+ +L++F G+ +LV+
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43
Score = 29.5 bits (67), Expect = 1.4
Identities = 10/43 (23%), Positives = 22/43 (51%)
Query: 597 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 639
+ + + + H SQ+ER+ +L++F G+ +LV+
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43
Score = 29.5 bits (67), Expect = 1.4
Identities = 10/43 (23%), Positives = 22/43 (51%)
Query: 623 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 665
+ + + + H SQ+ER+ +L++F G+ +LV+
Sbjct: 1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43
>gnl|CDD|182933 PRK11057, PRK11057, ATP-dependent DNA helicase RecQ; Provisional.
Length = 607
Score = 42.0 bits (99), Expect = 0.001
Identities = 71/286 (24%), Positives = 115/286 (40%), Gaps = 66/286 (23%)
Query: 162 ALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQ-QIET- 219
LSG D + + TG GK+L Y PA+V G + LV++P L + Q++
Sbjct: 37 VLSGRDCLVVMPTGGGKSLCYQIPALV--------LDG---LTLVVSPLISLMKDQVDQL 85
Query: 220 ----VANDFGSATATR--VACVFGGAPKGPQVKALQTGAEIVIATPGRLI--DYLEQGTI 271
VA ++T TR V G G Q+K ++ P RL+ ++LE
Sbjct: 86 LANGVAAACLNSTQTREQQLEVMAGCRTG-QIK-------LLYIAPERLMMDNFLEH--- 134
Query: 272 NLH--RTSYLVLDEADRMLDMG--FEPQIRKIIGQIR---PDRQVLMWSATWPKEVQKLA 324
L + L +DEA + G F P+ +GQ+R P + +AT ++
Sbjct: 135 -LAHWNPALLAVDEAHCISQWGHDFRPEYAA-LGQLRQRFPTLPFMALTATADDTTRQ-- 190
Query: 325 EDFLVDYVQL--------NIGSLN-PTANHNIVQIVDVCQEHEKDYKLQGLLSQIGSERT 375
D V+L I S + P + +V EK L L+ + +R
Sbjct: 191 -----DIVRLLGLNDPLIQISSFDRPNIRYTLV---------EKFKPLDQLMRYVQEQRG 236
Query: 376 SKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEF 421
II+ ++ K +D ++++G +A A H R V F
Sbjct: 237 KSGIIYCNSRAKVEDTAARLQSRGISAAAYHAGLDNDVRADVQEAF 282
>gnl|CDD|224036 COG1111, MPH1, ERCC4-like helicases [DNA replication,
recombination, and repair].
Length = 542
Score = 41.2 bits (97), Expect = 0.002
Identities = 41/126 (32%), Positives = 56/126 (44%), Gaps = 11/126 (8%)
Query: 162 ALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTREL-AQQIETV 220
AL LV + TG GKT IA A+V N R G VL LAPT+ L Q E
Sbjct: 27 ALFKNTLVVLP-TGLGKTF--IA-AMVIANRLR----WFGGKVLFLAPTKPLVLQHAEFC 78
Query: 221 ANDFGSATATRVACVFGGAPKGPQVKALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLV 280
G +A + G + + ++ +ATP + + L+ G I+L S L+
Sbjct: 79 RKVTG-IPEDEIAALTGEVRPEEREELWAKK-KVFVATPQVVENDLKAGRIDLDDVSLLI 136
Query: 281 LDEADR 286
DEA R
Sbjct: 137 FDEAHR 142
Score = 30.0 bits (68), Expect = 6.4
Identities = 23/86 (26%), Positives = 44/86 (51%), Gaps = 10/86 (11%)
Query: 356 EHEKDYKLQGLL-SQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAI---HGNK-- 409
EH K KL+ +L Q+ S+ I+F E + A++I ++ G A ++
Sbjct: 346 EHPKLEKLREILKEQLEKNGDSRVIVFTEYRDTAEEIVNFLKKIGIKARVRFIGQASREG 405
Query: 410 ----SQQERDRVLNEFRIGRASILVS 431
SQ+E+ ++++FR G ++LV+
Sbjct: 406 DKGMSQKEQKEIIDQFRKGEYNVLVA 431
>gnl|CDD|219320 pfam07172, GRP, Glycine rich protein family. This family of
proteins includes several glycine rich proteins as well
as two nodulins 16 and 24. The family also contains
proteins that are induced in response to various
stresses.
Length = 91
Score = 37.4 bits (87), Expect = 0.003
Identities = 20/45 (44%), Positives = 20/45 (44%), Gaps = 2/45 (4%)
Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSN 60
G Y GG GG GGG GG G G G GGGG YG
Sbjct: 48 GGGGYGGGG--GGGYGGGGYYGGGGGYGGGGGGYPGGGGGGYGGG 90
Score = 36.3 bits (84), Expect = 0.008
Identities = 16/41 (39%), Positives = 16/41 (39%), Gaps = 3/41 (7%)
Query: 22 GGAPRGGAPRGGGGR---GGYGAPRGYSRGGRGGGGSNYGS 59
G GG GGGG GGY G GG GG G
Sbjct: 45 YGGGGGGYGGGGGGGYGGGGYYGGGGGYGGGGGGYPGGGGG 85
Score = 33.2 bits (76), Expect = 0.069
Identities = 19/39 (48%), Positives = 19/39 (48%), Gaps = 3/39 (7%)
Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGG 54
G Y GG GG GGG GGY G GG GGGG
Sbjct: 56 GGGGYGGGGYYGGGGGYGGGGGGYPGGGG---GGYGGGG 91
Score = 32.4 bits (74), Expect = 0.16
Identities = 16/31 (51%), Positives = 17/31 (54%), Gaps = 2/31 (6%)
Query: 30 PRGGGGRGGY--GAPRGYSRGGRGGGGSNYG 58
+ GGG GGY G GY GG GGG YG
Sbjct: 43 DKYGGGGGGYGGGGGGGYGGGGYYGGGGGYG 73
>gnl|CDD|224125 COG1204, COG1204, Superfamily II helicase [General function
prediction only].
Length = 766
Score = 40.1 bits (94), Expect = 0.005
Identities = 30/126 (23%), Positives = 50/126 (39%), Gaps = 13/126 (10%)
Query: 160 PIALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIET 219
LS +++ A TGSGKTL I + L G G +V + P + LA++
Sbjct: 42 KGLLSDENVLISAPTGSGKTL------IALLAILSTLLEGGGKVVYI-VPLKALAEEKYE 94
Query: 220 VANDFGSATATRVACVFGGAPKGPQVKALQTGAEIVIATPGRLIDYLEQGTINLHR-TSY 278
+ RV G + A ++++ TP +L D L + +
Sbjct: 95 EFSRL-EELGIRVGISTGDYDLDDERLA---RYDVIVTTPEKL-DSLTRKRPSWIEEVDL 149
Query: 279 LVLDEA 284
+V+DE
Sbjct: 150 VVIDEI 155
>gnl|CDD|226808 COG4371, COG4371, Predicted membrane protein [Function unknown].
Length = 334
Score = 38.8 bits (90), Expect = 0.008
Identities = 19/37 (51%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
Query: 19 SYSGGAPRGGAPRGGG-GRGGYGAPRGYSRGGRGGGG 54
YSGG P GG GGG GG+G P GG GGG
Sbjct: 67 GYSGGGPSGGGYSGGGYSGGGFGFPFIIPGGGGGGGF 103
Score = 35.7 bits (82), Expect = 0.085
Identities = 14/34 (41%), Positives = 16/34 (47%)
Query: 21 SGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGG 54
SGG GG+ R G + G S GG GGG
Sbjct: 49 SGGRIGGGSFRAPSGYSRGYSGGGPSGGGYSGGG 82
Score = 34.1 bits (78), Expect = 0.28
Identities = 17/42 (40%), Positives = 18/42 (42%), Gaps = 1/42 (2%)
Query: 18 PSYSGGAPRGGAPRGGGGRGGYGAP-RGYSRGGRGGGGSNYG 58
P A R G GGG RGYS GG GGG + G
Sbjct: 40 PLSVAAAARSGGRIGGGSFRAPSGYSRGYSGGGPSGGGYSGG 81
>gnl|CDD|221784 pfam12810, Gly_rich, Glycine rich protein. This family of proteins
is greatly expanded in Trichomonas vaginalis. The
proteins are composed of several glycine rich motifs
interspersed through the sequence. Although many
proteins have been annotated by similarity in the family
these annotations given the biased composition of the
sequences these are unlikely to be functionally
relevant.
Length = 248
Score = 38.3 bits (90), Expect = 0.010
Identities = 22/53 (41%), Positives = 26/53 (49%)
Query: 15 YGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPN 67
+GG GG+ GGGG GGY G G GGGGS+Y S + PN
Sbjct: 154 GASSGGNGGFGYGGSGNGGGGGGGYFGGGGGHYAGGGGGGSSYISGKDGCIPN 206
Score = 31.0 bits (71), Expect = 1.8
Identities = 10/29 (34%), Positives = 14/29 (48%)
Query: 34 GGRGGYGAPRGYSRGGRGGGGSNYGSNSN 62
GG+G + +GG GGG NS+
Sbjct: 56 GGKGEDNSSNNMVKGGYNGGGDGGNDNSS 84
>gnl|CDD|222478 pfam13966, zf-RVT, zinc-binding in reverse transcriptase. This
domain would appear to be a zinc-binding region of a
putative reverse transcriptase.
Length = 86
Score = 35.3 bits (82), Expect = 0.014
Identities = 15/49 (30%), Positives = 20/49 (40%), Gaps = 7/49 (14%)
Query: 704 LW-VTLNRIRTKFGKCAANLHKWGLKSSAACD-CGAEKQTILHIIQDCP 750
W NR+ TK L K G + C CG E++T H+ C
Sbjct: 37 AWLAAWNRLLTK-----DRLQKRGWQLPNRCVLCGQEEETRDHLFFHCS 80
>gnl|CDD|220915 pfam10961, DUF2763, Protein of unknown function (DUF2763). This
eukaryotic family of proteins has no known function.
Length = 91
Score = 34.7 bits (80), Expect = 0.020
Identities = 15/41 (36%), Positives = 16/41 (39%), Gaps = 3/41 (7%)
Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSN 56
R + S G G GG G G R R G GGG S
Sbjct: 48 YRKNSSSGKRW---DGGRGGPGPPGGGRRMGRIGGGGGPSR 85
Score = 27.4 bits (61), Expect = 8.3
Identities = 12/28 (42%), Positives = 12/28 (42%), Gaps = 1/28 (3%)
Query: 22 GGAPRGGAPRGGGGRGGYGAPRGYSRGG 49
G P GG G G GG G R GG
Sbjct: 65 PGPPGGGRRMGRIGGGG-GPSRPPMAGG 91
>gnl|CDD|224120 COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA
replication, recombination, and repair].
Length = 654
Score = 37.5 bits (87), Expect = 0.028
Identities = 27/109 (24%), Positives = 42/109 (38%), Gaps = 33/109 (30%)
Query: 110 YYLLRLESRPGKCVPRPIQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLV 169
+ PG PRP Q + + + E AL G + +
Sbjct: 3 DDGYLAVAFPGFE-PRPEQRE--------MAEAVAE----------------ALKGGEGL 37
Query: 170 AI-AKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQI 217
I A TG+GKTL Y+ PA+ + EG V++ T+ L +Q+
Sbjct: 38 LIEAPTGTGKTLAYLLPALAY-------AREEGKKVIISTRTKALQEQL 79
>gnl|CDD|234478 TIGR04121, DEXH_lig_assoc, DEXH box helicase, DNA
ligase-associated. Members of this protein family are
DEAD/DEAH box helicases found associated with a
bacterial ATP-dependent DNA ligase, part of a four-gene
system that occurs in about 12 % of prokaryotic
reference genomes. The actual motif in this family is
DE[VILW]H.
Length = 803
Score = 37.1 bits (87), Expect = 0.042
Identities = 25/70 (35%), Positives = 33/70 (47%), Gaps = 4/70 (5%)
Query: 150 APTAIQAQGWPIALSGCDLVAIAKTGSGKTL-GYIAPAIVHVNSQRPLRSGEGPIVLVLA 208
P Q + W AL G + IA TGSGKTL G++ I ++P +G L +
Sbjct: 13 TPRPFQLEMWAAALEGRSGLLIAPTGSGKTLAGFLPSLIDLAGPEKP---KKGLHTLYIT 69
Query: 209 PTRELAQQIE 218
P R LA I
Sbjct: 70 PLRALAVDIA 79
>gnl|CDD|233251 TIGR01054, rgy, reverse gyrase. This model describes reverse
gyrase, found in both archaeal and bacterial
thermophiles. This enzyme, a fusion of a type I
topoisomerase domain and a helicase domain, introduces
positive supercoiling to increase the melting
temperature of DNA double strands. Generally, these
gyrases are encoded as a single polypeptide. An
exception was found in Methanopyrus kandleri, where
enzyme is split within the topoisomerase domain,
yielding a heterodimer of gene products designated RgyB
and RgyA [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1171
Score = 36.3 bits (84), Expect = 0.070
Identities = 58/302 (19%), Positives = 103/302 (34%), Gaps = 60/302 (19%)
Query: 170 AIAKTGSGKT-LGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQ----IETVANDF 224
IA TG GKT G + +R ++ PT L Q I ++A
Sbjct: 98 IIAPTGVGKTTFGLAMSLFLAKKGKR---------CYIILPTTLLVIQVAEKISSLAEKA 148
Query: 225 GSATATRVACVFGGAPKGPQVKALQ----TGAEIVIATPGRLIDYLEQGTINLH-RTSYL 279
G T + P + + ++ +I+I T L + L + ++
Sbjct: 149 GVGTV-NIGAYHSRLPTKEKKEFMERIENGDFDILITTTMFLSKNYD----ELGPKFDFI 203
Query: 280 VLDEADRMLD-----------MGF-EPQIRKIIGQIRPDRQVLMWSATWPKEVQKLAEDF 327
+D+ D +L +GF E I K IR ++ + A
Sbjct: 204 FVDDVDALLKASKNVDKLLKLLGFSEELIEKAWKLIRLRLKLYRALHAKKRLELLEAIPG 263
Query: 328 LVDYVQLNIGS----------------LNPTANHNIVQIVDVCQEHEKDYKLQGLLSQIG 371
L + S L + +V + +D L+ L +I
Sbjct: 264 KKRGC-LIVSSATGRPRGKRAKLFRELLGFEVGGGSDTLRNVVDVYVEDEDLKETLLEIV 322
Query: 372 SERTSKTIIFVET---KRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRASI 428
+ + I++V K KA++I + N G AVA H K +++ ++ F G +
Sbjct: 323 KKLGTGGIVYVSIDYGKEKAEEIAEFLENHGVKAVAYHATKPKEDYEK----FAEGEIDV 378
Query: 429 LV 430
L+
Sbjct: 379 LI 380
>gnl|CDD|219420 pfam07466, DUF1517, Protein of unknown function (DUF1517). This
family consists of several hypothetical glycine rich
plant and bacterial proteins of around 300 residues in
length. The function of this family is unknown.
Length = 280
Score = 35.4 bits (82), Expect = 0.097
Identities = 18/48 (37%), Positives = 23/48 (47%), Gaps = 4/48 (8%)
Query: 15 YGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRG----GRGGGGSNYG 58
+ PS S +PR +P GGG G G G+ G GGGG +G
Sbjct: 13 FRAPSRSSSSPRSSSPGGGGYYGSPGGGFGFPFLIPFFGFGGGGGLFG 60
Score = 32.3 bits (74), Expect = 0.75
Identities = 15/49 (30%), Positives = 17/49 (34%), Gaps = 5/49 (10%)
Query: 4 RSPDMFELSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGG 52
R+P SR P S G GGG G + GG GG
Sbjct: 14 RAP-----SRSSSSPRSSSPGGGGYYGSPGGGFGFPFLIPFFGFGGGGG 57
>gnl|CDD|233069 TIGR00643, recG, ATP-dependent DNA helicase RecG. [DNA metabolism,
DNA replication, recombination, and repair].
Length = 630
Score = 35.4 bits (82), Expect = 0.13
Identities = 37/150 (24%), Positives = 61/150 (40%), Gaps = 29/150 (19%)
Query: 175 GSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVAC 234
GSGKTL + + + G V ++APT LA+Q + + VA
Sbjct: 266 GSGKTLVAALAMLAAIEA--------GYQVALMAPTEILAEQHYNSLRNLLAPLGIEVAL 317
Query: 235 VFG---GAPKGPQVKALQTGA-EIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDM 290
+ G G + ++ + +G +V+ T L Q + R + +++DE R
Sbjct: 318 LTGSLKGKRRKELLETIASGQIHLVVGTHA-----LIQEKVEFKRLALVIIDEQHR---- 368
Query: 291 GFEPQIRKII-----GQIRPDRQVLMWSAT 315
F + RK + G P VL+ SAT
Sbjct: 369 -FGVEQRKKLREKGQGGFTPH--VLVMSAT 395
Score = 31.5 bits (72), Expect = 1.8
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 405 IHGNKSQQERDRVLNEFRIGRASILVS 431
+HG E++ V+ EFR G ILV+
Sbjct: 488 LHGRMKSDEKEAVMEEFREGEVDILVA 514
>gnl|CDD|180777 PRK06958, PRK06958, single-stranded DNA-binding protein;
Provisional.
Length = 182
Score = 34.0 bits (78), Expect = 0.13
Identities = 17/44 (38%), Positives = 17/44 (38%)
Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
GR GG G GGG GG G G GGGG G
Sbjct: 110 GRGGSGGGGGGGDEGGYGGGGGGGGGGYGGESRSGGGGGRASGG 153
Score = 34.0 bits (78), Expect = 0.17
Identities = 17/41 (41%), Positives = 19/41 (46%), Gaps = 3/41 (7%)
Query: 19 SYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
Y GG GG GG R G G G G GGGG+ G+
Sbjct: 125 GYGGGGGGGGGGYGGESRSGGG---GGRASGGGGGGAGGGA 162
Score = 32.5 bits (74), Expect = 0.52
Identities = 18/36 (50%), Positives = 20/36 (55%), Gaps = 4/36 (11%)
Query: 26 RGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNS 61
RGG+ GGGG GG GY GG GGGG G +
Sbjct: 111 RGGS--GGGGGGGDEG--GYGGGGGGGGGGYGGESR 142
Score = 31.7 bits (72), Expect = 0.93
Identities = 18/54 (33%), Positives = 19/54 (35%)
Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPNWN 69
G GG GG GGG G + G R GGGG G S P
Sbjct: 118 GGGGDEGGYGGGGGGGGGGYGGESRSGGGGGRASGGGGGGAGGGASRPSAPAGG 171
Score = 31.3 bits (71), Expect = 1.2
Identities = 16/38 (42%), Positives = 17/38 (44%)
Query: 22 GGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
GG G GGG GGYG G GG GG + G
Sbjct: 109 GGRGGSGGGGGGGDEGGYGGGGGGGGGGYGGESRSGGG 146
Score = 30.9 bits (70), Expect = 1.7
Identities = 18/41 (43%), Positives = 18/41 (43%), Gaps = 3/41 (7%)
Query: 15 YGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGS 55
YG GG GG R GGG G G GG GGG S
Sbjct: 126 YGGGGGGGGGGYGGESRSGGGGGRAS---GGGGGGAGGGAS 163
>gnl|CDD|236092 PRK07772, PRK07772, single-stranded DNA-binding protein;
Provisional.
Length = 186
Score = 33.5 bits (77), Expect = 0.21
Identities = 18/36 (50%), Positives = 19/36 (52%), Gaps = 2/36 (5%)
Query: 24 APRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
A RGG GGG GG+G G S GG GGGG
Sbjct: 120 ASRGGGGGGGG--GGFGGGGGGSGGGGGGGGGGGAP 153
Score = 33.5 bits (77), Expect = 0.25
Identities = 20/48 (41%), Positives = 23/48 (47%), Gaps = 3/48 (6%)
Query: 26 RGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPN---WNS 70
GG GGGG GG G G GG GGGG+ G + P W+S
Sbjct: 123 GGGGGGGGGGFGGGGGGSGGGGGGGGGGGAPGGGGAQASAPADDPWSS 170
Score = 31.9 bits (73), Expect = 0.70
Identities = 19/43 (44%), Positives = 19/43 (44%)
Query: 17 RPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
R S GG GG GGGG G G G GG GGG S
Sbjct: 119 RASRGGGGGGGGGGFGGGGGGSGGGGGGGGGGGAPGGGGAQAS 161
Score = 28.8 bits (65), Expect = 7.1
Identities = 16/35 (45%), Positives = 17/35 (48%), Gaps = 3/35 (8%)
Query: 29 APRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNL 63
A RGGGG GG G G+ GG G GG G
Sbjct: 120 ASRGGGGGGGGG---GFGGGGGGSGGGGGGGGGGG 151
>gnl|CDD|203709 pfam07652, Flavi_DEAD, Flavivirus DEAD domain.
Length = 146
Score = 33.1 bits (76), Expect = 0.25
Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 7/44 (15%)
Query: 175 GSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIE 218
G+GKT + P +V R LR+ LVLAPTR + ++E
Sbjct: 12 GAGKTRKVL-PELVRECIDRRLRT------LVLAPTRVVLAEME 48
>gnl|CDD|221759 pfam12764, Gly-rich_Ago1, Glycine-rich region of argonaut. This
domain is often found at the very N-terminal of
argonaut-like proteins.
Length = 102
Score = 31.9 bits (72), Expect = 0.31
Identities = 17/36 (47%), Positives = 18/36 (50%), Gaps = 2/36 (5%)
Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRG 51
G P Y G RGG P+ GGRGG G RG G
Sbjct: 2 GPPEY-QGRGRGGPPQ-QGGRGGGGGGRGGGSTGGP 35
>gnl|CDD|164795 PHA00370, III, attachment protein.
Length = 297
Score = 33.7 bits (77), Expect = 0.32
Identities = 24/72 (33%), Positives = 29/72 (40%), Gaps = 9/72 (12%)
Query: 19 SYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPN------WNSMS 72
S +GG GG GG G G G G GG GGGS GS T ++
Sbjct: 94 SDTGGDTGGGNTGGGSGGGDTGGSGG---GGSDGGGSEGGSTGKSLTKEGVGAGDFDYPK 150
Query: 73 MEQINKNLYTPN 84
M NK+ T +
Sbjct: 151 MANANKDALTED 162
Score = 29.5 bits (66), Expect = 7.6
Identities = 16/42 (38%), Positives = 16/42 (38%), Gaps = 1/42 (2%)
Query: 18 PSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
GG G GG GG G G S GG GG GS
Sbjct: 82 KDGDGGGTGEGGSDTGGDTGG-GNTGGGSGGGDTGGSGGGGS 122
>gnl|CDD|218292 pfam04851, ResIII, Type III restriction enzyme, res subunit.
Length = 100
Score = 31.8 bits (73), Expect = 0.33
Identities = 14/50 (28%), Positives = 24/50 (48%), Gaps = 8/50 (16%)
Query: 174 TGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVAND 223
TGSGKT A A++ ++ + VL + P ++L +Q + D
Sbjct: 27 TGSGKT--LTAAALIARLAKGKKK------VLFVVPRKDLLEQALVIIID 68
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 34.1 bits (79), Expect = 0.36
Identities = 43/127 (33%), Positives = 63/127 (49%), Gaps = 13/127 (10%)
Query: 162 ALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVA 221
AL LV + TG GKT IA + V ++R L G VL+LAPT+ L +Q
Sbjct: 27 ALKKNTLV-VLPTGLGKTA--IA---LLVIAER-LHKKGGK-VLILAPTKPLVEQHAEFF 78
Query: 222 NDFGSATATRVACVFGGAPKGPQVKALQTGAEIVIATPGRLI--DYLEQGTINLHRTSYL 279
F + ++ VF G + L A++++ATP ++I D + G I+L S L
Sbjct: 79 RKFLNIPEEKIV-VFTGEVSPEKRAELWEKAKVIVATP-QVIENDLIA-GRISLEDVSLL 135
Query: 280 VLDEADR 286
+ DEA R
Sbjct: 136 IFDEAHR 142
Score = 32.9 bits (76), Expect = 0.86
Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 9/85 (10%)
Query: 356 EHEKDYKLQGLLS-QIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHG--NK--- 409
EH K KL+ ++ Q+G S+ I+F + + A+ I + +G AV G +K
Sbjct: 345 EHPKLEKLREIVKEQLGKNPDSRIIVFTQYRDTAEKIVDLLEKEGIKAVRFVGQASKDGD 404
Query: 410 ---SQQERDRVLNEFRIGRASILVS 431
SQ+E+ +L++FR G ++LVS
Sbjct: 405 KGMSQKEQIEILDKFRAGEFNVLVS 429
>gnl|CDD|184200 PRK13640, cbiO, cobalt transporter ATP-binding subunit;
Provisional.
Length = 282
Score = 33.2 bits (76), Expect = 0.38
Identities = 18/61 (29%), Positives = 32/61 (52%), Gaps = 4/61 (6%)
Query: 279 LVLDEADRMLDMGFEPQIRKIIGQIRPDRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGS 338
++LDE+ MLD + QI K+I +++ + + S T + +A+ LV L+ G
Sbjct: 165 IILDESTSMLDPAGKEQILKLIRKLKKKNNLTVISITHDIDEANMADQVLV----LDDGK 220
Query: 339 L 339
L
Sbjct: 221 L 221
>gnl|CDD|235782 PRK06341, PRK06341, single-stranded DNA-binding protein;
Provisional.
Length = 166
Score = 32.5 bits (74), Expect = 0.40
Identities = 19/50 (38%), Positives = 23/50 (46%), Gaps = 1/50 (2%)
Query: 11 LSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSN 60
L+ + GR GG GG GGG G G RG R GGG N+ +
Sbjct: 111 LTMLDGRGE-GGGGGGGGDDGGGGDFGSSGPSRGGPRPASSGGGGNFSRD 159
>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2. Arsenite is a
carcinogenic compound which can act as a co-mutagen by
inhibiting DNA repair. Arsenite-resistance protein 2 is
thought to play a role in arsenite resistance.
Length = 211
Score = 32.9 bits (75), Expect = 0.43
Identities = 20/63 (31%), Positives = 22/63 (34%), Gaps = 12/63 (19%)
Query: 15 YGRPSYSGGAPRGGAPR---GGGGRGGYGAPRGYSRGGRG--------GGGSNYGSNSNL 63
Y + P G PR G GRGG P GGRG GG N +L
Sbjct: 150 YPPQTPQALMP-YGQPRPPMMGYGRGGPPFPPNQYGGGRGNYDEFRGQGGYYGKPRNRDL 208
Query: 64 RTP 66
P
Sbjct: 209 DGP 211
Score = 30.9 bits (70), Expect = 2.0
Identities = 19/56 (33%), Positives = 19/56 (33%), Gaps = 18/56 (32%)
Query: 22 GGAPRGGAPRGGG-------GRGGYGAPR----GYSRGGR-------GGGGSNYGS 59
P G AP G YG PR GY RGG GGG NY
Sbjct: 137 KPDPGGLAPGLPGYPPQTPQALMPYGQPRPPMMGYGRGGPPFPPNQYGGGRGNYDE 192
>gnl|CDD|221690 pfam12650, DUF3784, Domain of unknown function (DUF3784). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and archaea.
Proteins in this family are typically between 96 and 110
amino acids in length.
Length = 96
Score = 31.0 bits (71), Expect = 0.47
Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 421 FRIGRASILVSQYN-KSQQERDRVLNEFRIGRA 452
G+ S L+S YN S++E+++ +E ++ +
Sbjct: 14 LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45
Score = 31.0 bits (71), Expect = 0.52
Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 473 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 504
G+ S L+S YN S++E+++ +E ++ +
Sbjct: 14 LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45
Score = 31.0 bits (71), Expect = 0.52
Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 499 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 530
G+ S L+S YN S++E+++ +E ++ +
Sbjct: 14 LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45
Score = 31.0 bits (71), Expect = 0.52
Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 525 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 556
G+ S L+S YN S++E+++ +E ++ +
Sbjct: 14 LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45
Score = 31.0 bits (71), Expect = 0.52
Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 551 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 582
G+ S L+S YN S++E+++ +E ++ +
Sbjct: 14 LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45
Score = 31.0 bits (71), Expect = 0.52
Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 577 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 608
G+ S L+S YN S++E+++ +E ++ +
Sbjct: 14 LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45
Score = 31.0 bits (71), Expect = 0.52
Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 603 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 634
G+ S L+S YN S++E+++ +E ++ +
Sbjct: 14 LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45
Score = 31.0 bits (71), Expect = 0.52
Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 629 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 660
G+ S L+S YN S++E+++ +E ++ +
Sbjct: 14 LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45
>gnl|CDD|224429 COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase
type [General function prediction only].
Length = 271
Score = 32.7 bits (75), Expect = 0.56
Identities = 17/57 (29%), Positives = 23/57 (40%), Gaps = 3/57 (5%)
Query: 1 MVMRSPDMFELSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGR--GGGGS 55
M RS + + G+ G GG GG + G+S GG GGGG+
Sbjct: 212 MFGRSLYLGRQPDRWLNGVLGRRRRSSGS-GGSGGSGGGSSGGGFSGGGGSSGGGGA 267
Score = 31.6 bits (72), Expect = 1.3
Identities = 17/48 (35%), Positives = 18/48 (37%), Gaps = 5/48 (10%)
Query: 15 YGRPSYSGGA---PRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
GR R G GG GG G G S GG GGG + G
Sbjct: 219 LGRQPDRWLNGVLGRRRRSSGSGGSGGSG--GGSSGGGFSGGGGSSGG 264
Score = 30.8 bits (70), Expect = 2.8
Identities = 17/45 (37%), Positives = 17/45 (37%), Gaps = 5/45 (11%)
Query: 11 LSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGS 55
L R G GG GGG GG G S GG G GS
Sbjct: 231 LGRRRRSSGSGGSGGSGGGSSGGGFSGG-----GGSSGGGGASGS 270
>gnl|CDD|221518 pfam12301, CD99L2, CD99 antigen like protein 2. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 165 and 237 amino acids in length.
CD99L2 and CD99 are involved in trans-endothelial
migration of neutrophils in vitro and in the recruitment
of neutrophils into inflamed peritoneum.
Length = 154
Score = 31.7 bits (72), Expect = 0.70
Identities = 15/59 (25%), Positives = 17/59 (28%), Gaps = 11/59 (18%)
Query: 14 VYGRPSYSGGAPRGGAPRGGGG-----------RGGYGAPRGYSRGGRGGGGSNYGSNS 61
+P G GGG GGY +G GG GGGG
Sbjct: 46 AGPKPKPRPQPDPGNPGGDGGGFSDSDLADVAGDGGYKPDKGKGGGGGGGGGGGTDGGP 104
Score = 29.4 bits (66), Expect = 4.6
Identities = 14/35 (40%), Positives = 17/35 (48%)
Query: 10 ELSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRG 44
+L+ V G Y +GG GGGG G G P G
Sbjct: 72 DLADVAGDGGYKPDKGKGGGGGGGGGGGTDGGPEG 106
>gnl|CDD|221477 pfam12235, FXR1P_C, Fragile X-related 1 protein C terminal. This
domain family is found in eukaryotes, and is typically
between 126 and 160 amino acids in length. The family
is found in association with pfam05641, pfam00013. This
family is the C terminal region of the fragile X
related 1 protein FXR1P. FXR1P contains two KH domains
and a RGG box that are characteristic motifs in
RNA-binding proteins as well as nuclear localization
and export signals. FXR1P is thought to regulate mRNA
transport and translation.
Length = 79
Score = 30.1 bits (68), Expect = 0.77
Identities = 17/50 (34%), Positives = 19/50 (38%), Gaps = 1/50 (2%)
Query: 22 GGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS-NSNLRTPNWNS 70
G R R G GY RG GGG N S +S L P+ N
Sbjct: 19 GAGSRPPPSRPGDKEKGYLTDDSGGRGRGGGGKGNSNSISSVLDDPDSNP 68
>gnl|CDD|236966 PRK11747, dinG, ATP-dependent DNA helicase DinG; Provisional.
Length = 697
Score = 32.5 bits (75), Expect = 1.1
Identities = 11/20 (55%), Positives = 12/20 (60%)
Query: 168 LVAIAKTGSGKTLGYIAPAI 187
LV A TG GKTL Y+ I
Sbjct: 52 LVIEAGTGVGKTLSYLLAGI 71
>gnl|CDD|217818 pfam03964, Chorion_2, Chorion family 2. The chorion genes of
Drosophila are amplified in response to developmental
signals in the follicle cells of the ovary.
Length = 96
Score = 29.9 bits (67), Expect = 1.1
Identities = 15/47 (31%), Positives = 18/47 (38%), Gaps = 2/47 (4%)
Query: 15 YGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNS 61
YG G A + G GYG RG GR G + S+S
Sbjct: 2 YGGVGVGGYAYQ--VQPAGYYGPGYGRGRGRGGYGRRGYLDSSPSDS 46
>gnl|CDD|109979 pfam00944, Peptidase_S3, Alphavirus core protein. Also known as
coat protein C and capsid protein C. This makes the
literature very confusing. Alphaviruses consist of a
nucleoprotein core, a lipid membrane which envelopes the
core, and glycoprotein spikes protruding from the lipid
membrane.
Length = 157
Score = 31.0 bits (70), Expect = 1.3
Identities = 21/59 (35%), Positives = 27/59 (45%), Gaps = 5/59 (8%)
Query: 15 YGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGG---GGSNYGSNSNLRTPNWNS 70
+G YS G R P+G GG+G G P + G GG+N GS + L WN
Sbjct: 86 HGAVQYSNG--RFTVPKGVGGKGDSGRPILDNTGRVVAIVLGGANEGSRTALSVVTWNK 142
>gnl|CDD|236148 PRK08074, PRK08074, bifunctional ATP-dependent DNA helicase/DNA
polymerase III subunit epsilon; Validated.
Length = 928
Score = 32.2 bits (74), Expect = 1.4
Identities = 23/93 (24%), Positives = 35/93 (37%), Gaps = 34/93 (36%)
Query: 135 FPPY--------IMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAI-AKTGSGKTLGYIAP 185
P Y +MK++Y AL + I A TG+GK+L Y+ P
Sbjct: 253 MPKYEKREGQQEMMKEVYT----------------ALRDSEHALIEAGTGTGKSLAYLLP 296
Query: 186 AIVH-VNSQRPLRSGEGPIVLVLAPTRELAQQI 217
A + P V++ T +L QQ+
Sbjct: 297 AAYFAKKKEEP--------VVISTYTIQLQQQL 321
>gnl|CDD|112992 pfam04202, Mfp-3, Foot protein 3. Mytilus foot protein-3 (Mfp-3)
is a highly polymorphic protein family located in the
byssal adhesive plaques of blue mussels.
Length = 71
Score = 29.2 bits (65), Expect = 1.4
Identities = 16/38 (42%), Positives = 17/38 (44%), Gaps = 1/38 (2%)
Query: 15 YGRPSYSGGAPRGGAPRGGGGRGGY-GAPRGYSRGGRG 51
YG P GG G R G GG G G+ RG RG
Sbjct: 32 YGPPRRYGGGYYNGYNRYARGYGGNKGWNNGWKRGRRG 69
>gnl|CDD|215597 PLN03137, PLN03137, ATP-dependent DNA helicase; Q4-like;
Provisional.
Length = 1195
Score = 32.2 bits (73), Expect = 1.5
Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 12/56 (21%)
Query: 163 LSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQ-QI 217
+SG D+ + TG GK+L Y PA++ I LV++P L Q QI
Sbjct: 473 MSGYDVFVLMPTGGGKSLTYQLPALIC-----------PGITLVISPLVSLIQDQI 517
>gnl|CDD|220138 pfam09195, Endonuc-BglII, Restriction endonuclease BglII. Members
of this family are predominantly found in prokaryotic
restriction endonuclease BglII, and adopt a structure
consisting of an alpha/beta core containing a
six-stranded beta-sheet surrounded by five
alpha-helices, two of which are involved in
homodimerisation of the endonuclease. They recognise the
double-stranded DNA sequence AGATCT and cleave after
A-1, resulting in specific double-stranded fragments
with terminal 5'-phosphates.
Length = 155
Score = 30.7 bits (70), Expect = 1.6
Identities = 23/98 (23%), Positives = 36/98 (36%), Gaps = 8/98 (8%)
Query: 478 ASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQ-QERDRVL-------NEFRIGR 529
IL S + + +E + VL F + A + S N+S + + NE RI
Sbjct: 3 EEILASDFPQEWEEIEEVLLAFPLTLAELKASGGNRSDGTQLYKDELVAKGWKNEPRISL 62
Query: 530 ASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQ 567
I +K ++RV E + G K Q
Sbjct: 63 DGIEEESPHKIDFVKNRVALEVQFGNYDFFYRDLLKFQ 100
Score = 30.7 bits (70), Expect = 1.6
Identities = 23/98 (23%), Positives = 36/98 (36%), Gaps = 8/98 (8%)
Query: 530 ASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQ-QERDRVL-------NEFRIGR 581
IL S + + +E + VL F + A + S N+S + + NE RI
Sbjct: 3 EEILASDFPQEWEEIEEVLLAFPLTLAELKASGGNRSDGTQLYKDELVAKGWKNEPRISL 62
Query: 582 ASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQ 619
I +K ++RV E + G K Q
Sbjct: 63 DGIEEESPHKIDFVKNRVALEVQFGNYDFFYRDLLKFQ 100
>gnl|CDD|236794 PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional.
Length = 681
Score = 31.7 bits (73), Expect = 2.0
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 405 IHGNKSQQERDRVLNEFRIGRASILVS 431
+HG E+D V+ F+ G ILV+
Sbjct: 511 LHGRMKPAEKDAVMAAFKAGEIDILVA 537
>gnl|CDD|233398 TIGR01407, dinG_rel, DnaQ family exonuclease/DinG family helicase,
putative. This model represents a family of proteins in
Gram-positive bacteria. The N-terminal region of about
200 amino acids resembles the epsilon subunit of E. coli
DNA polymerase III and the homologous region of the
Gram-positive type DNA polymerase III alpha subunit. The
epsilon subunit contains an exonuclease domain. The
remainder of this protein family resembles a predicted
ATP-dependent helicase, the DNA damage-inducible protein
DinG of E. coli [DNA metabolism, DNA replication,
recombination, and repair].
Length = 850
Score = 31.7 bits (72), Expect = 2.0
Identities = 16/46 (34%), Positives = 27/46 (58%), Gaps = 8/46 (17%)
Query: 172 AKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQI 217
A TG+GKTLGY+ PA+ + +++P V++ T+ L Q+
Sbjct: 271 APTGTGKTLGYLLPALYYAITEKP--------VVISTNTKVLQSQL 308
>gnl|CDD|218152 pfam04568, IATP, Mitochondrial ATPase inhibitor, IATP. ATP
synthase inhibitor prevents the enzyme from switching
to ATP hydrolysis during collapse of the
electrochemical gradient, for example during oxygen
deprivation ATP synthase inhibitor forms a one to one
complex with the F1 ATPase, possibly by binding at the
alpha-beta interface. It is thought to inhibit ATP
synthesis by preventing the release of ATP. The minimum
inhibitory region for bovine inhibitor is from residues
39 to 72. The inhibitor has two oligomeric states,
dimer (the active state) and tetramer. At low pH, the
inhibitor forms a dimer via antiparallel coiled coil
interactions between the C terminal regions of two
monomers. At high pH, the inhibitor forms tetramers and
higher oligomers by coiled coil interactions involving
the N terminus and inhibitory region, thus preventing
the inhibitory activity.
Length = 90
Score = 28.9 bits (65), Expect = 2.7
Identities = 12/23 (52%), Positives = 12/23 (52%)
Query: 16 GRPSYSGGAPRGGAPRGGGGRGG 38
R S S G GA RGGGG G
Sbjct: 23 ARRSMSEGDTGSGAGRGGGGEAG 45
>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
and repair].
Length = 1187
Score = 31.2 bits (71), Expect = 2.8
Identities = 65/328 (19%), Positives = 115/328 (35%), Gaps = 94/328 (28%)
Query: 163 LSGCDLVAIAKTGSGKT-LGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQ----I 217
+ G IA TG GKT G + + +R V ++ PT L +Q +
Sbjct: 95 VRGKSFAIIAPTGVGKTTFGLLMSLYLAKKGKR---------VYIIVPTTTLVRQVYERL 145
Query: 218 ETVANDFGSATATRVACVFGGAPKGPQVKAL-----QTGAEIVIATPGRLIDYLEQGTIN 272
+ A D GS V V+ A + + +I+I T L E
Sbjct: 146 KKFAEDAGS---LDVLVVYHSALPTKEKEEALERIESGDFDILITTSQFLSKRFE----E 198
Query: 273 LHRT--SYLVLDEAD----------RMLDM-GFE-------PQIRKIIGQIRPDRQVLMW 312
L + ++ +D+ D R+L + GF ++ K+ ++ +++
Sbjct: 199 LSKLKFDFIFVDDVDAILKASKNVDRLLRLLGFSEEVIESAYELIKLRRKLYGEKRAERV 258
Query: 313 SATWPKEVQKLAEDFLVDYVQLNIGSL---NPTAN------------------------H 345
+E + E + +G L + T
Sbjct: 259 -----REELREVEREREKK-RRKLGILVVSSATGKPRGSRLKLFRELLGFEVGSGGEGLR 312
Query: 346 NIVQIVDVCQEHEKDYKLQGLLSQIGSERTSKTIIFVET---KRKADDITRSVRNKGWAA 402
NI VD+ E E K+ L+ ++G +IFV + KA+++ +R+ G A
Sbjct: 313 NI---VDIYVESESLEKVVELVKKLG----DGGLIFVPIDYGREKAEELAEYLRSHGINA 365
Query: 403 VAIHGNKSQQERDRVLNEFRIGRASILV 430
IH E++ L +F G +LV
Sbjct: 366 ELIH-----AEKEEALEDFEEGEVDVLV 388
>gnl|CDD|215598 PLN03138, PLN03138, Protein TOC75; Provisional.
Length = 796
Score = 31.0 bits (70), Expect = 2.9
Identities = 17/50 (34%), Positives = 19/50 (38%)
Query: 5 SPDMFELSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGG 54
+ L S A GG GGGG GG+G G GG GG
Sbjct: 56 TSSSASLVANGAVALLSASAISGGGGGGGGGFGGFGGGGGGGGGGGGGWR 105
>gnl|CDD|178680 PLN03134, PLN03134, glycine-rich RNA-binding protein 4;
Provisional.
Length = 144
Score = 29.6 bits (66), Expect = 3.1
Identities = 18/30 (60%), Positives = 18/30 (60%), Gaps = 1/30 (3%)
Query: 26 RGGAPRGGGGRGGY-GAPRGYSRGGRGGGG 54
R APR GG GGY G GY GG GGGG
Sbjct: 114 RPSAPRAYGGGGGYSGGGGGYGGGGDGGGG 143
>gnl|CDD|197548 smart00157, PRP, Major prion protein. The prion protein is a major
component of scrapie-associated fibrils in
Creutzfeldt-Jakob disease, kuru, Gerstmann-Straussler
syndrome and bovine spongiform encephalopathy.
Length = 218
Score = 30.2 bits (68), Expect = 3.2
Identities = 32/115 (27%), Positives = 41/115 (35%), Gaps = 27/115 (23%)
Query: 20 YSGGAPRGGAPRGGGG----RGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPNWNSMSMEQ 75
Y G P GGG GG+G P G G GGG G ++ WN S +
Sbjct: 28 YPPQGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQGGGTH---NQWNKPSKPK 84
Query: 76 INKNLYTPNHLTVSRSQGEVESYLNHHDVTVKGRYYLLRLESRPGKCVPRPIQHF 130
N H+ + + G V L G Y L G + RP+ HF
Sbjct: 85 TNM-----KHVAGAAAAGAVVGGL--------GGYML-------GSAMSRPLIHF 119
>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
mechanisms].
Length = 733
Score = 30.9 bits (70), Expect = 3.3
Identities = 47/313 (15%), Positives = 97/313 (30%), Gaps = 70/313 (22%)
Query: 164 SGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQI----ET 219
+V A TG GKT + A+ ++ + L V+ + P R + + + +
Sbjct: 213 RSLLVVLEAPTGYGKTEASLILALALLDEKIKL----KSRVIYVLPFRTIIEDMYRRAKE 268
Query: 220 VANDFGSATATRVACVFGGAPK----GPQVKAL------QTGAEIVIATPG-RLIDYL-- 266
+ F + + + P L + ++++A I L
Sbjct: 269 IFGLFSV----IGKSLHSSSKEPLLLEPDQDILLTLTTNDSYKKLLLALIVVTPIQILIF 324
Query: 267 -----EQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIG----QIRPDRQVLMWSATWP 317
+ + L TS ++LDE D E + ++ VL+ SAT P
Sbjct: 325 SVKGFKFEFLALLLTSLVILDEVHLYAD---ETMLAALLALLEALAEAGVPVLLMSATLP 381
Query: 318 KEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYKL-----QGLLSQIGS 372
+ ++ L + V E + L +
Sbjct: 382 PFL----KEKLKKALG----------KGREVVENAKFCPKEDEPGLKRKERVDVEDGPQE 427
Query: 373 ERTSKTIIFVE----------TKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFR 422
E V+ T +A ++ ++ KG + +H + ++R+ E +
Sbjct: 428 ELIELISEEVKEGKKVLVIVNTVDRAIELYEKLKEKGPKVLLLHSRFTLKDREEKERELK 487
Query: 423 ----IGRASILVS 431
I+V+
Sbjct: 488 KLFKQNEGFIVVA 500
>gnl|CDD|218811 pfam05918, API5, Apoptosis inhibitory protein 5 (API5). This
family consists of apoptosis inhibitory protein 5 (API5)
sequences from several organisms. Apoptosis or
programmed cell death is a physiological form of cell
death that occurs in embryonic development and organ
formation. It is characterized by biochemical and
morphological changes such as DNA fragmentation and cell
volume shrinkage. API5 is an anti apoptosis gene located
in human chromosome 11, whose expression prevents the
programmed cell death that occurs upon the deprivation
of growth factors.
Length = 543
Score = 30.4 bits (68), Expect = 3.7
Identities = 13/36 (36%), Positives = 13/36 (36%), Gaps = 1/36 (2%)
Query: 25 PRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSN 60
G G G G R RG GGGG G N
Sbjct: 505 KSGEGISKVGQSYG-GRGRTRGRGRGGGGGRGRGYN 539
Score = 30.0 bits (67), Expect = 6.2
Identities = 16/44 (36%), Positives = 18/44 (40%), Gaps = 5/44 (11%)
Query: 5 SPDMFELSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRG 48
E G+ G RG GGGGRG RGY+RG
Sbjct: 503 VNKSGEGISKVGQSYGGRGRTRGRGRGGGGGRG-----RGYNRG 541
>gnl|CDD|222005 pfam13245, AAA_19, Part of AAA domain.
Length = 73
Score = 28.1 bits (63), Expect = 3.9
Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 8/56 (14%)
Query: 175 GSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATAT 230
G+GKT A I + L +G VLV+APT A+++ A A+
Sbjct: 20 GTGKT-ATAAAIIARL-----LAAGRS--VLVVAPTGRAARRLRERLAALDLADAS 67
>gnl|CDD|216868 pfam02084, Bindin, Bindin.
Length = 239
Score = 29.9 bits (67), Expect = 4.1
Identities = 16/47 (34%), Positives = 17/47 (36%)
Query: 20 YSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTP 66
Y+ G GGG G GAP G GG GG G G
Sbjct: 31 YANQGMGGPVGGGGGPGAGGGAPGGPVGGGGGGSGGPPGGGEVAGEA 77
Score = 29.9 bits (67), Expect = 4.9
Identities = 16/39 (41%), Positives = 16/39 (41%), Gaps = 1/39 (2%)
Query: 22 GGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSN 60
GG GG G GG GG G G GGG YG
Sbjct: 186 GGGGYGGGGGGMGG-GGGGGMGEMGGMGPQGGGGGYGGM 223
Score = 29.1 bits (65), Expect = 7.2
Identities = 14/39 (35%), Positives = 15/39 (38%), Gaps = 2/39 (5%)
Query: 22 GGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSN 60
GG G GG G G G GY G G G+ Y
Sbjct: 199 GGGGGGMGEMGGMGPQGGGG--GYGGMGAPGQGNAYNQG 235
>gnl|CDD|240255 PTZ00070, PTZ00070, 40S ribosomal protein S2; Provisional.
Length = 257
Score = 30.0 bits (68), Expect = 4.5
Identities = 13/32 (40%), Positives = 13/32 (40%)
Query: 24 APRGGAPRGGGGRGGYGAPRGYSRGGRGGGGS 55
G G GRGG G RG RGG G
Sbjct: 5 RGGEGRGFGRRGRGGRGRGRGRGRGGEGEEKE 36
>gnl|CDD|226927 COG4555, NatA, ABC-type Na+ transport system, ATPase component
[Energy production and conversion / Inorganic ion
transport and metabolism].
Length = 245
Score = 29.7 bits (67), Expect = 4.9
Identities = 16/57 (28%), Positives = 31/57 (54%)
Query: 273 LHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQVLMWSATWPKEVQKLAEDFLV 329
+H S LVLDE LD+ + I Q++ + + +++S+ +EV+ L + +V
Sbjct: 149 VHDPSILVLDEPTSGLDIRTRRKFHDFIKQLKNEGRAVIFSSHIMQEVEALCDRVIV 205
>gnl|CDD|233063 TIGR00631, uvrb, excinuclease ABC, B subunit. All proteins in this
family for wich functions are known are DNA helicases
that function in the nucleotide excision repair and are
endonucleases that make the 3' incision next to DNA
damage. They are part of a pathway requiring UvrA, UvrB,
UvrC, and UvrD homologs. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 655
Score = 30.0 bits (68), Expect = 5.3
Identities = 19/74 (25%), Positives = 36/74 (48%), Gaps = 2/74 (2%)
Query: 359 KDYKLQGLLSQIG--SERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDR 416
D ++ LLS+I R + ++ TK+ A+D+T ++ G +H ER
Sbjct: 424 TDGQVDDLLSEIRQRVARNERVLVTTLTKKMAEDLTDYLKELGIKVRYLHSEIDTLERVE 483
Query: 417 VLNEFRIGRASILV 430
++ + R+G +LV
Sbjct: 484 IIRDLRLGEFDVLV 497
>gnl|CDD|239467 cd03370, NADH_oxidase, NADPH_oxidase. Nitroreductase family
containing NADH oxidase and other, uncharacterized
proteins that are similar to nitroreductase.
Nitroreductase catalyzes the reduction of nitroaromatic
compounds such as nitrotoluenes, nitrofurans and
nitroimidazoles. This process requires NAD(P)H as
electron donor in an obligatory two-electron transfer
and uses FMN as cofactor. The enzyme is typically a
homodimer. Members of this family are also called NADH
dehydrogenase, oxygen-insensitive NAD(P)H nitrogenase or
dihydropteridine reductase.
Length = 156
Score = 28.9 bits (65), Expect = 5.6
Identities = 11/39 (28%), Positives = 21/39 (53%)
Query: 125 RPIQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIAL 163
R I+ F+ P +++++ E +AP+A Q W I +
Sbjct: 9 RSIRQFDTDPIPEDLLRRLLEAALRAPSAFNLQPWRIVV 47
>gnl|CDD|225203 COG2321, COG2321, Predicted metalloprotease [General function
prediction only].
Length = 295
Score = 29.7 bits (67), Expect = 5.8
Identities = 14/39 (35%), Positives = 15/39 (38%)
Query: 3 MRSPDMFELSRVYGRPSYSGGAPRGGAPRGGGGRGGYGA 41
MR E V R S G+ RGG P G G G
Sbjct: 1 MRWRGRRESDNVEDRRGRSSGSVRGGGPGLRLGGGIGGL 39
>gnl|CDD|224271 COG1352, CheR, Methylase of chemotaxis methyl-accepting proteins
[Cell motility and secretion / Signal transduction
mechanisms].
Length = 268
Score = 29.6 bits (67), Expect = 6.1
Identities = 10/58 (17%), Positives = 17/58 (29%), Gaps = 2/58 (3%)
Query: 446 EFRIGRASILVSQYKESQQKR--DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRI 501
R I YK + R R L + + ++ +E L+ I
Sbjct: 9 ALIYQRTGIDFDNYKRTLVYRRLSRRLRKLGLKNFEEYLNLLESDSEELQAFLDALTI 66
>gnl|CDD|227493 COG5164, SPT5, Transcription elongation factor [Transcription].
Length = 607
Score = 30.0 bits (67), Expect = 6.4
Identities = 15/55 (27%), Positives = 22/55 (40%), Gaps = 1/55 (1%)
Query: 18 PSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPNWNSMS 72
P+Y+ G GA G GY P G+ G + ++ G + NW S
Sbjct: 434 PNYATGGIAAGAAATSSGLSGYRTP-GWKDGPKTPALNDPGIDWGDEKDNWYKGS 487
Score = 29.6 bits (66), Expect = 7.8
Identities = 12/46 (26%), Positives = 15/46 (32%), Gaps = 1/46 (2%)
Query: 26 RGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS-NSNLRTPNWNS 70
RG + Y+ GG G + S S RTP W
Sbjct: 417 RGLSKPLRYSEAIGSKTPNYATGGIAAGAAATSSGLSGYRTPGWKD 462
>gnl|CDD|237541 PRK13881, PRK13881, conjugal transfer protein TrbI; Provisional.
Length = 472
Score = 29.7 bits (67), Expect = 7.3
Identities = 14/40 (35%), Positives = 17/40 (42%)
Query: 23 GAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSN 62
G G A GGG G GAP+ GGG ++Y
Sbjct: 218 GVGGGMAAPAGGGMGAGGAPQLLQTSTSGGGRNSYAQFGG 257
>gnl|CDD|147458 pfam05268, GP38, Phage tail fibre adhesin Gp38. This family
contains several Gp38 proteins from T-even-like phages.
Gp38, together with a second phage protein, gp57,
catalyzes the organisation of gp37 but is absent from
the phage particle. Gp37 is responsible for receptor
recognition.
Length = 261
Score = 29.4 bits (66), Expect = 7.4
Identities = 17/44 (38%), Positives = 19/44 (43%), Gaps = 3/44 (6%)
Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
G GG GG P G GG+ G G + GGGS GS
Sbjct: 169 GNGVCGGG---GGRPFGAGGKSGSHMSGGNASLTAPGGGSGTGS 209
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 29.3 bits (66), Expect = 7.6
Identities = 18/96 (18%), Positives = 39/96 (40%), Gaps = 20/96 (20%)
Query: 406 HGNKSQQERDRVLNEFRIGRASILVSQYNKSQQERDRVLNEFRIGRASILVSQYKESQQK 465
H N + + + NE + L+ QY +Q +++L K+ +Q+
Sbjct: 126 HSNPNHKLCE--ENEELREKLKELIEQYELREQHFEKLLKT-------------KDLEQQ 170
Query: 466 RDRVLNEFRIGRASILVSH-YNKSQQERDRVLNEFR 500
LN+ ++ +A K ++E++ +L E
Sbjct: 171 ----LNDAKLEQAQEQAEQEQEKHKREKEILLKEAA 202
>gnl|CDD|237537 PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional.
Length = 440
Score = 29.5 bits (67), Expect = 8.2
Identities = 14/34 (41%), Positives = 15/34 (44%), Gaps = 6/34 (17%)
Query: 21 SGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGG 54
+ A RGGA G GA YS G GG G
Sbjct: 306 AAAAARGGAAAAG------GASSAYSAGAAGGSG 333
>gnl|CDD|224121 COG1200, RecG, RecG-like helicase [DNA replication, recombination,
and repair / Transcription].
Length = 677
Score = 29.5 bits (67), Expect = 8.4
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 405 IHGNKSQQERDRVLNEFRIGRASILVS 431
+HG E+D V+ F+ G ILV+
Sbjct: 513 VHGRMKPAEKDAVMEAFKEGEIDILVA 539
>gnl|CDD|182211 PRK10051, csgA, major curlin subunit; Provisional.
Length = 151
Score = 28.3 bits (63), Expect = 8.4
Identities = 18/43 (41%), Positives = 20/43 (46%), Gaps = 12/43 (27%)
Query: 21 SGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNL 63
SG A G P+ GGG G GGGG+N G NS L
Sbjct: 15 SGSALAGVVPQYGGG------------GNHGGGGNNSGPNSTL 45
>gnl|CDD|237655 PRK14279, PRK14279, chaperone protein DnaJ; Provisional.
Length = 392
Score = 29.3 bits (66), Expect = 9.6
Identities = 17/47 (36%), Positives = 21/47 (44%), Gaps = 9/47 (19%)
Query: 17 RPSYSGGAPRGGAPRGGGGRGGYGAPRG---------YSRGGRGGGG 54
R ++GG G GGGG GG+G + GRGGGG
Sbjct: 75 RRLFAGGGFGGRRFDGGGGFGGFGTGGDGAEFNLNDLFDAAGRGGGG 121
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.135 0.396
Gapped
Lambda K H
0.267 0.0781 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 39,207,899
Number of extensions: 3860166
Number of successful extensions: 5013
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4674
Number of HSP's successfully gapped: 242
Length of query: 779
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 674
Effective length of database: 6,280,432
Effective search space: 4233011168
Effective search space used: 4233011168
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (27.9 bits)