RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17912
         (779 letters)



>gnl|CDD|240273 PTZ00110, PTZ00110, helicase; Provisional.
          Length = 545

 Score =  466 bits (1202), Expect = e-157
 Identities = 204/428 (47%), Positives = 264/428 (61%), Gaps = 20/428 (4%)

Query: 12  SRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPR-GYSRGGRGGGGSNYGSN------SNLR 64
           S  Y        +           +  YG  R GY     G GG    S         L+
Sbjct: 18  SNNYNSYGPYPDSSNPYGNYQANHQDNYGGFRPGYGNYSGGYGGFGMNSYGSSTLGKRLQ 77

Query: 65  TPNWNSMSMEQINKNLYTPNHLTVSRSQGEVESYLNHHDVTVKGRYYLLRLESRPGKCVP 124
             +W S+++    KN Y  +    + S  EV+      ++T+             G+ VP
Sbjct: 78  PIDWKSINLVPFEKNFYKEHPEVSALSSKEVDEIRKEKEITIIA-----------GENVP 126

Query: 125 RPIQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIA 184
           +P+  FE  +FP YI+K +   GF  PT IQ QGWPIALSG D++ IA+TGSGKTL ++ 
Sbjct: 127 KPVVSFEYTSFPDYILKSLKNAGFTEPTPIQVQGWPIALSGRDMIGIAETGSGKTLAFLL 186

Query: 185 PAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQ 244
           PAIVH+N+Q  LR G+GPIVLVLAPTRELA+QI    N FG+++  R    +GG PK  Q
Sbjct: 187 PAIVHINAQPLLRYGDGPIVLVLAPTRELAEQIREQCNKFGASSKIRNTVAYGGVPKRGQ 246

Query: 245 VKALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIR 304
           + AL+ G EI+IA PGRLID+LE    NL R +YLVLDEADRMLDMGFEPQIRKI+ QIR
Sbjct: 247 IYALRRGVEILIACPGRLIDFLESNVTNLRRVTYLVLDEADRMLDMGFEPQIRKIVSQIR 306

Query: 305 PDRQVLMWSATWPKEVQKLAEDFLVDY-VQLNIGSLNPTANHNIVQIVDVCQEHEKDYKL 363
           PDRQ LMWSATWPKEVQ LA D   +  V +N+GSL+ TA HNI Q V V +EHEK  KL
Sbjct: 307 PDRQTLMWSATWPKEVQSLARDLCKEEPVHVNVGSLDLTACHNIKQEVFVVEEHEKRGKL 366

Query: 364 QGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRI 423
           + LL +I  +   K +IFVETK+ AD +T+ +R  GW A+ IHG+K Q+ER  VLNEF+ 
Sbjct: 367 KMLLQRIMRD-GDKILIFVETKKGADFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKT 425

Query: 424 GRASILVS 431
           G++ I+++
Sbjct: 426 GKSPIMIA 433



 Score = 36.7 bits (85), Expect = 0.056
 Identities = 19/56 (33%), Positives = 34/56 (60%), Gaps = 2/56 (3%)

Query: 454 ILVSQYKESQQKRDRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 509
           IL+  + E+++  D +  E R+     L  H +K Q+ER  VLNEF+ G++ I+++
Sbjct: 380 ILI--FVETKKGADFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433



 Score = 35.1 bits (81), Expect = 0.15
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 3/77 (3%)

Query: 461 ESQQKRDRVLNEF--RIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQER 518
           E  +KR + L     RI R    +  + ++++  D +  E R+     L  H +K Q+ER
Sbjct: 358 EEHEKRGK-LKMLLQRIMRDGDKILIFVETKKGADFLTKELRLDGWPALCIHGDKKQEER 416

Query: 519 DRVLNEFRIGRASILVS 535
             VLNEF+ G++ I+++
Sbjct: 417 TWVLNEFKTGKSPIMIA 433



 Score = 35.1 bits (81), Expect = 0.16
 Identities = 16/43 (37%), Positives = 26/43 (60%)

Query: 519 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 561
           D +  E R+     L  H +K Q+ER  VLNEF+ G++ I+++
Sbjct: 391 DFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433



 Score = 35.1 bits (81), Expect = 0.16
 Identities = 16/43 (37%), Positives = 26/43 (60%)

Query: 545 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 587
           D +  E R+     L  H +K Q+ER  VLNEF+ G++ I+++
Sbjct: 391 DFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433



 Score = 35.1 bits (81), Expect = 0.16
 Identities = 16/43 (37%), Positives = 26/43 (60%)

Query: 571 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 613
           D +  E R+     L  H +K Q+ER  VLNEF+ G++ I+++
Sbjct: 391 DFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433



 Score = 35.1 bits (81), Expect = 0.16
 Identities = 16/43 (37%), Positives = 26/43 (60%)

Query: 597 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 639
           D +  E R+     L  H +K Q+ER  VLNEF+ G++ I+++
Sbjct: 391 DFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433



 Score = 35.1 bits (81), Expect = 0.16
 Identities = 16/43 (37%), Positives = 26/43 (60%)

Query: 623 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 665
           D +  E R+     L  H +K Q+ER  VLNEF+ G++ I+++
Sbjct: 391 DFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKSPIMIA 433


>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
           replication, recombination, and repair / Transcription /
           Translation, ribosomal structure and biogenesis].
          Length = 513

 Score =  334 bits (858), Expect = e-106
 Identities = 112/312 (35%), Positives = 175/312 (56%), Gaps = 7/312 (2%)

Query: 123 VPRPIQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGY 182
             +    F      P +++ + ++GF+ PT IQ    P+ L+G D++  A+TG+GKT  +
Sbjct: 24  EEKTPPEFASLGLSPELLQALKDLGFEEPTPIQLAAIPLILAGRDVLGQAQTGTGKTAAF 83

Query: 183 IAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATA-TRVACVFGGAPK 241
           + P +  +      +       L+LAPTRELA QI       G      RVA V+GG   
Sbjct: 84  LLPLLQKILKSVERK---YVSALILAPTRELAVQIAEELRKLGKNLGGLRVAVVYGGVSI 140

Query: 242 GPQVKALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIG 301
             Q++AL+ G +IV+ATPGRL+D +++G ++L     LVLDEADRMLDMGF   I KI+ 
Sbjct: 141 RKQIEALKRGVDIVVATPGRLLDLIKRGKLDLSGVETLVLDEADRMLDMGFIDDIEKILK 200

Query: 302 QIRPDRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGS-LNPTANHNIVQIVDVCQEHEKD 360
            + PDRQ L++SAT P ++++LA  +L D V++ +           I Q         ++
Sbjct: 201 ALPPDRQTLLFSATMPDDIRELARRYLNDPVEIEVSVEKLERTLKKIKQFYLEV--ESEE 258

Query: 361 YKLQGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNE 420
            KL+ LL  +  E   + I+FV TKR  +++  S+R +G+   A+HG+  Q+ERDR L +
Sbjct: 259 EKLELLLKLLKDEDEGRVIVFVRTKRLVEELAESLRKRGFKVAALHGDLPQEERDRALEK 318

Query: 421 FRIGRASILVSQ 432
           F+ G   +LV+ 
Sbjct: 319 FKDGELRVLVAT 330



 Score = 33.6 bits (77), Expect = 0.38
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 562 HYNKSQQERDRVLNEFRIGRASILV 586
           H +  Q+ERDR L +F+ G   +LV
Sbjct: 304 HGDLPQEERDRALEKFKDGELRVLV 328



 Score = 33.6 bits (77), Expect = 0.38
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 588 HYNKSQQERDRVLNEFRIGRASILV 612
           H +  Q+ERDR L +F+ G   +LV
Sbjct: 304 HGDLPQEERDRALEKFKDGELRVLV 328



 Score = 33.6 bits (77), Expect = 0.38
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 614 HYNKSQQERDRVLNEFRIGRASILV 638
           H +  Q+ERDR L +F+ G   +LV
Sbjct: 304 HGDLPQEERDRALEKFKDGELRVLV 328



 Score = 33.6 bits (77), Expect = 0.38
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 640 HYNKSQQERDRVLNEFRIGRASILV 664
           H +  Q+ERDR L +F+ G   +LV
Sbjct: 304 HGDLPQEERDRALEKFKDGELRVLV 328


>gnl|CDD|238167 cd00268, DEADc, DEAD-box helicases. A diverse family of proteins
           involved in ATP-dependent RNA unwinding, needed in a
           variety of cellular processes including splicing,
           ribosome biogenesis and RNA degradation. The name
           derives from the sequence of the Walker  B motif (motif
           II). This domain contains the ATP- binding region.
          Length = 203

 Score =  307 bits (788), Expect = e-100
 Identities = 98/206 (47%), Positives = 134/206 (65%), Gaps = 3/206 (1%)

Query: 130 FEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVH 189
           FEE    P +++ IY +GF+ PT IQA+  P  LSG D++  A+TGSGKT  ++ P +  
Sbjct: 1   FEELGLSPELLRGIYALGFEKPTPIQARAIPPLLSGRDVIGQAQTGSGKTAAFLIPILEK 60

Query: 190 VNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVKALQ 249
           ++         GP  L+LAPTRELA QI  VA   G  T  +V  ++GG     Q++ L+
Sbjct: 61  LDPSPKKD---GPQALILAPTRELALQIAEVARKLGKHTNLKVVVIYGGTSIDKQIRKLK 117

Query: 250 TGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQV 309
            G  IV+ATPGRL+D LE+G ++L +  YLVLDEADRMLDMGFE QIR+I+  +  DRQ 
Sbjct: 118 RGPHIVVATPGRLLDLLERGKLDLSKVKYLVLDEADRMLDMGFEDQIREILKLLPKDRQT 177

Query: 310 LMWSATWPKEVQKLAEDFLVDYVQLN 335
           L++SAT PKEV+ LA  FL + V++ 
Sbjct: 178 LLFSATMPKEVRDLARKFLRNPVRIL 203


>gnl|CDD|236977 PRK11776, PRK11776, ATP-dependent RNA helicase DbpA; Provisional.
          Length = 460

 Score =  235 bits (603), Expect = 1e-69
 Identities = 112/309 (36%), Positives = 171/309 (55%), Gaps = 25/309 (8%)

Query: 130 FEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKT----LGYIAP 185
           F     PP ++  + E+G+   T IQAQ  P  L+G D++A AKTGSGKT    LG +  
Sbjct: 6   FSTLPLPPALLANLNELGYTEMTPIQAQSLPAILAGKDVIAQAKTGSGKTAAFGLGLLQ- 64

Query: 186 AIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGS-ATAT---RVACVFGGAPK 241
               ++ +R          LVL PTRELA Q   VA +    A      +V  + GG P 
Sbjct: 65  ---KLDVKRF-----RVQALVLCPTRELADQ---VAKEIRRLARFIPNIKVLTLCGGVPM 113

Query: 242 GPQVKALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIG 301
           GPQ+ +L+ GA I++ TPGR++D+L +GT++L   + LVLDEADRMLDMGF+  I  II 
Sbjct: 114 GPQIDSLEHGAHIIVGTPGRILDHLRKGTLDLDALNTLVLDEADRMLDMGFQDAIDAIIR 173

Query: 302 QIRPDRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDY 361
           Q    RQ L++SAT+P+ +  +++ F  D V++ + S        I Q        E+  
Sbjct: 174 QAPARRQTLLFSATYPEGIAAISQRFQRDPVEVKVES--THDLPAIEQRFYEVSPDERLP 231

Query: 362 KLQGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEF 421
            LQ LL      +    ++F  TK++  ++  ++  +G++A+A+HG+  Q++RD+VL  F
Sbjct: 232 ALQRLLLH---HQPESCVVFCNTKKECQEVADALNAQGFSALALHGDLEQRDRDQVLVRF 288

Query: 422 RIGRASILV 430
                S+LV
Sbjct: 289 ANRSCSVLV 297


>gnl|CDD|236722 PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional.
          Length = 456

 Score =  212 bits (541), Expect = 6e-61
 Identities = 117/305 (38%), Positives = 174/305 (57%), Gaps = 9/305 (2%)

Query: 130 FEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVH 189
           F+     P I++ + E G++ PT IQ Q  P  L G DL+A A+TG+GKT G+  P + H
Sbjct: 3   FDSLGLSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTGKTAGFTLPLLQH 62

Query: 190 VNSQRPLRSGEGPI-VLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVKAL 248
           + +++P   G  P+  L+L PTRELA QI     D+      R   VFGG    PQ+  L
Sbjct: 63  LITRQPHAKGRRPVRALILTPTRELAAQIGENVRDYSKYLNIRSLVVFGGVSINPQMMKL 122

Query: 249 QTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQ 308
           + G ++++ATPGRL+D   Q  + L +   LVLDEADRMLDMGF   IR+++ ++   RQ
Sbjct: 123 RGGVDVLVATPGRLLDLEHQNAVKLDQVEILVLDEADRMLDMGFIHDIRRVLAKLPAKRQ 182

Query: 309 VLMWSATWPKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYKLQG-LL 367
            L++SAT+  +++ LAE  L + +++ +   N TA+  + Q V     H  D K +  LL
Sbjct: 183 NLLFSATFSDDIKALAEKLLHNPLEIEVARRN-TASEQVTQHV-----HFVDKKRKRELL 236

Query: 368 SQ-IGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRA 426
           SQ IG     + ++F  TK  A+ +   +   G  + AIHGNKSQ  R R L +F+ G  
Sbjct: 237 SQMIGKGNWQQVLVFTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSGDI 296

Query: 427 SILVS 431
            +LV+
Sbjct: 297 RVLVA 301



 Score = 32.5 bits (74), Expect = 0.88
 Identities = 18/53 (33%), Positives = 22/53 (41%), Gaps = 4/53 (7%)

Query: 18  PSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPNWNS 70
           P  +G   RGG   G G  GG G  +G  R  RG GG+   S      P+   
Sbjct: 386 PIQNGRQQRGG--GGRGQGGGRGQQQGQPR--RGEGGAKSASAKPAEKPSRRL 434



 Score = 31.7 bits (72), Expect = 1.8
 Identities = 30/95 (31%), Positives = 48/95 (50%), Gaps = 12/95 (12%)

Query: 448 RIGRASILVSQYKE-SQQKRDRVLNEFRIGRAS---ILVSHYNKSQQERDRV---LNEFR 500
           R   AS  V+Q+     +KR R L    IG+ +   +LV  + +++   + +   LN+  
Sbjct: 212 RRNTASEQVTQHVHFVDKKRKRELLSQMIGKGNWQQVLV--FTRTKHGANHLAEQLNKDG 269

Query: 501 IGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 535
           I  A+I   H NKSQ  R R L +F+ G   +LV+
Sbjct: 270 IRSAAI---HGNKSQGARTRALADFKSGDIRVLVA 301



 Score = 30.2 bits (68), Expect = 5.0
 Identities = 30/93 (32%), Positives = 46/93 (49%), Gaps = 8/93 (8%)

Query: 422 RIGRASILVSQY-NKSQQERDRVLNEFRIGRAS---ILV-SQYKESQQKRDRVLNEFRIG 476
           R   AS  V+Q+ +   ++R R L    IG+ +   +LV ++ K         LN+  I 
Sbjct: 212 RRNTASEQVTQHVHFVDKKRKRELLSQMIGKGNWQQVLVFTRTKHGANHLAEQLNKDGIR 271

Query: 477 RASILVSHYNKSQQERDRVLNEFRIGRASILVS 509
            A+I   H NKSQ  R R L +F+ G   +LV+
Sbjct: 272 SAAI---HGNKSQGARTRALADFKSGDIRVLVA 301



 Score = 29.4 bits (66), Expect = 7.7
 Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 3/40 (7%)

Query: 522 LNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 561
           LN+  I  A+I   H NKSQ  R R L +F+ G   +LV+
Sbjct: 265 LNKDGIRSAAI---HGNKSQGARTRALADFKSGDIRVLVA 301



 Score = 29.4 bits (66), Expect = 7.7
 Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 3/40 (7%)

Query: 548 LNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 587
           LN+  I  A+I   H NKSQ  R R L +F+ G   +LV+
Sbjct: 265 LNKDGIRSAAI---HGNKSQGARTRALADFKSGDIRVLVA 301



 Score = 29.4 bits (66), Expect = 7.7
 Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 3/40 (7%)

Query: 574 LNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 613
           LN+  I  A+I   H NKSQ  R R L +F+ G   +LV+
Sbjct: 265 LNKDGIRSAAI---HGNKSQGARTRALADFKSGDIRVLVA 301



 Score = 29.4 bits (66), Expect = 7.7
 Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 3/40 (7%)

Query: 600 LNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 639
           LN+  I  A+I   H NKSQ  R R L +F+ G   +LV+
Sbjct: 265 LNKDGIRSAAI---HGNKSQGARTRALADFKSGDIRVLVA 301



 Score = 29.4 bits (66), Expect = 7.7
 Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 3/40 (7%)

Query: 626 LNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 665
           LN+  I  A+I   H NKSQ  R R L +F+ G   +LV+
Sbjct: 265 LNKDGIRSAAI---HGNKSQGARTRALADFKSGDIRVLVA 301


>gnl|CDD|215103 PLN00206, PLN00206, DEAD-box ATP-dependent RNA helicase;
           Provisional.
          Length = 518

 Score =  213 bits (545), Expect = 7e-61
 Identities = 113/317 (35%), Positives = 165/317 (52%), Gaps = 10/317 (3%)

Query: 120 GKCVPRPIQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKT 179
           G+ VP PI  F  C  PP ++  +   G++ PT IQ Q  P ALSG  L+  A TGSGKT
Sbjct: 113 GEAVPPPILSFSSCGLPPKLLLNLETAGYEFPTPIQMQAIPAALSGRSLLVSADTGSGKT 172

Query: 180 LGYIAPAIVHVNSQRP--LRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFG 237
             ++ P I    + R         P+ +VL PTREL  Q+E  A   G     + A V G
Sbjct: 173 ASFLVPIISRCCTIRSGHPSEQRNPLAMVLTPTRELCVQVEDQAKVLGKGLPFKTALVVG 232

Query: 238 GAPKGPQVKALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIR 297
           G     Q+  +Q G E+++ TPGRLID L +  I L   S LVLDE D ML+ GF  Q+ 
Sbjct: 233 GDAMPQQLYRIQQGVELIVGTPGRLIDLLSKHDIELDNVSVLVLDEVDCMLERGFRDQVM 292

Query: 298 KIIGQIRPDRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGSLN-PT-ANHNIVQIVDVCQ 355
           +I  Q     QVL++SAT   EV+K A     D + ++IG+ N P  A   +   V+  Q
Sbjct: 293 QIF-QALSQPQVLLFSATVSPEVEKFASSLAKDIILISIGNPNRPNKAVKQLAIWVETKQ 351

Query: 356 EHEKDYKLQGLLSQIGSERTSKTIIFVETKRKADDITRSVR-NKGWAAVAIHGNKSQQER 414
           + +K + +  L S+         ++FV ++  AD +  ++    G  A++IHG KS +ER
Sbjct: 352 KKQKLFDI--LKSK--QHFKPPAVVFVSSRLGADLLANAITVVTGLKALSIHGEKSMKER 407

Query: 415 DRVLNEFRIGRASILVS 431
             V+  F +G   ++V+
Sbjct: 408 REVMKSFLVGEVPVIVA 424


>gnl|CDD|215832 pfam00270, DEAD, DEAD/DEAH box helicase.  Members of this family
           include the DEAD and DEAH box helicases. Helicases are
           involved in unwinding nucleic acids. The DEAD box
           helicases are involved in various aspects of RNA
           metabolism, including nuclear transcription, pre mRNA
           splicing, ribosome biogenesis, nucleocytoplasmic
           transport, translation, RNA decay and organellar gene
           expression.
          Length = 169

 Score =  192 bits (489), Expect = 4e-57
 Identities = 75/174 (43%), Positives = 105/174 (60%), Gaps = 7/174 (4%)

Query: 152 TAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTR 211
           T IQAQ  P  LSG D++  A TGSGKTL ++ P +  +  ++      GP  LVLAPTR
Sbjct: 1   TPIQAQAIPAILSGKDVLVQAPTGSGKTLAFLLPILQALLPKKG-----GPQALVLAPTR 55

Query: 212 ELAQQIETVANDFGSATATRVACVFGGAPKGPQVKALQTG-AEIVIATPGRLIDYLEQGT 270
           ELA+QI             RVA + GG     Q + L+ G A+I++ TPGRL+D L +G 
Sbjct: 56  ELAEQIYEELKKLFKILGLRVALLTGGTSLKEQARKLKKGKADILVGTPGRLLDLLRRGK 115

Query: 271 INLHRT-SYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQVLMWSATWPKEVQKL 323
           + L +    LVLDEA R+LDMGF   + +I+ ++ PDRQ+L+ SAT P+ ++ L
Sbjct: 116 LKLLKNLKLLVLDEAHRLLDMGFGDDLEEILSRLPPDRQILLLSATLPRNLEDL 169


>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily. 
          Length = 201

 Score =  180 bits (459), Expect = 2e-52
 Identities = 78/210 (37%), Positives = 118/210 (56%), Gaps = 11/210 (5%)

Query: 143 IYEMGFQAPTAIQAQGWPIALSG-CDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEG 201
           I + GF+     Q +     LSG  D++  A TGSGKTL  + PA+        L+ G+G
Sbjct: 1   IEKFGFEPLRPYQKEAIEALLSGLRDVILAAPTGSGKTLAALLPALEA------LKRGKG 54

Query: 202 PIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVKALQTG-AEIVIATPG 260
             VLVL PTRELA+Q        G +   +V  ++GG  K  Q++ L++G  +I++ TPG
Sbjct: 55  GRVLVLVPTRELAEQWAEELKKLGPSLGLKVVGLYGGDSKREQLRKLESGKTDILVTTPG 114

Query: 261 RLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQVLMWSATWPKEV 320
           RL+D LE   ++L     ++LDEA R+LD GF  Q+ K++  +  + Q+L+ SAT P+E+
Sbjct: 115 RLLDLLENDKLSLSNVDLVILDEAHRLLDGGFGDQLEKLLKLLPKNVQLLLLSATPPEEI 174

Query: 321 QKLAEDFLVDYVQLNIGSLNPTANHNIVQI 350
           + L E FL D V +++G    T    I Q 
Sbjct: 175 ENLLELFLNDPVFIDVG---FTPLEPIEQF 201


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score =  185 bits (472), Expect = 2e-51
 Identities = 96/305 (31%), Positives = 151/305 (49%), Gaps = 9/305 (2%)

Query: 130 FEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVH 189
           F E      +++ + + G+  PTAIQA+  P AL G D++  A TG+GKT  ++ PA+ H
Sbjct: 3   FSELELDESLLEALQDKGYTRPTAIQAEAIPPALDGRDVLGSAPTGTGKTAAFLLPALQH 62

Query: 190 VNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVKALQ 249
           +    P R    P +L+L PTRELA Q+   A +    T   +A + GG       +   
Sbjct: 63  L-LDFPRRKSGPPRILILTPTRELAMQVADQARELAKHTHLDIATITGGVAYMNHAEVFS 121

Query: 250 TGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQV 309
              +IV+ATPGRL+ Y+++   +      L+LDEADRMLDMGF   I  I  + R  +Q 
Sbjct: 122 ENQDIVVATPGRLLQYIKEENFDCRAVETLILDEADRMLDMGFAQDIETIAAETRWRKQT 181

Query: 310 LMWSATWPKE-VQKLAEDFLVDYVQLNIGSLNPTANH--NIVQIVDVCQEHEKDYKLQGL 366
           L++SAT   + VQ  AE  L D V++      P+      I Q      + E    L  L
Sbjct: 182 LLFSATLEGDAVQDFAERLLNDPVEVEA---EPSRRERKKIHQWYYRADDLEHKTAL--L 236

Query: 367 LSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRA 426
              +     +++I+FV T+ +  ++   +R  G     + G   Q +R+  +     GR 
Sbjct: 237 CHLLKQPEVTRSIVFVRTRERVHELAGWLRKAGINCCYLEGEMVQAKRNEAIKRLTDGRV 296

Query: 427 SILVS 431
           ++LV+
Sbjct: 297 NVLVA 301


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score =  180 bits (458), Expect = 2e-49
 Identities = 107/308 (34%), Positives = 160/308 (51%), Gaps = 11/308 (3%)

Query: 130 FEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVH 189
           F + N  P +M  I+++GF   T IQAQ     L+G D +  A+TG+GKT  ++   I  
Sbjct: 89  FHDFNLAPELMHAIHDLGFPYCTPIQAQVLGYTLAGHDAIGRAQTGTGKTAAFLISIINQ 148

Query: 190 VNSQRPLRS---GEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVK 246
           +    P +    GE P  L++APTREL  QI   A      T   V    GG     Q+K
Sbjct: 149 LLQTPPPKERYMGE-PRALIIAPTRELVVQIAKDAAALTKYTGLNVMTFVGGMDFDKQLK 207

Query: 247 ALQTG-AEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRP 305
            L+    +I++ATPGRL+D+ ++G ++L     +VLDEADRMLDMGF PQ+R+II Q   
Sbjct: 208 QLEARFCDILVATPGRLLDFNQRGEVHLDMVEVMVLDEADRMLDMGFIPQVRQIIRQTPR 267

Query: 306 --DRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYKL 363
             +RQ L++SAT+  +V  LA+ +  D   + I   N  A+  + Q V      +K   L
Sbjct: 268 KEERQTLLFSATFTDDVMNLAKQWTTDPAIVEIEPEN-VASDTVEQHVYAVAGSDKYKLL 326

Query: 364 QGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRI 423
             L++Q   ER    ++F   K +   I   +   G  A  + G+  Q +R + L  FR 
Sbjct: 327 YNLVTQNPWER---VMVFANRKDEVRRIEERLVKDGINAAQLSGDVPQHKRIKTLEGFRE 383

Query: 424 GRASILVS 431
           G+  +LV+
Sbjct: 384 GKIRVLVA 391


>gnl|CDD|235307 PRK04537, PRK04537, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 572

 Score =  173 bits (441), Expect = 3e-46
 Identities = 115/308 (37%), Positives = 175/308 (56%), Gaps = 11/308 (3%)

Query: 130 FEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVH 189
           F   +  P ++  +   GF   T IQA   P+AL G D+   A+TG+GKTL ++  A+++
Sbjct: 11  FSSFDLHPALLAGLESAGFTRCTPIQALTLPVALPGGDVAGQAQTGTGKTLAFLV-AVMN 69

Query: 190 VNSQRPL---RSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVK 246
               RP    R  E P  L+LAPTRELA QI   A  FG+    R A V+GG     Q +
Sbjct: 70  RLLSRPALADRKPEDPRALILAPTRELAIQIHKDAVKFGADLGLRFALVYGGVDYDKQRE 129

Query: 247 ALQTGAEIVIATPGRLIDYLEQ-GTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQI-- 303
            LQ G +++IATPGRLIDY++Q   ++LH     VLDEADRM D+GF   IR ++ ++  
Sbjct: 130 LLQQGVDVIIATPGRLIDYVKQHKVVSLHACEICVLDEADRMFDLGFIKDIRFLLRRMPE 189

Query: 304 RPDRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYKL 363
           R  RQ L++SAT    V +LA + + +  +L + +   TA   + Q +    + EK   L
Sbjct: 190 RGTRQTLLFSATLSHRVLELAYEHMNEPEKLVVETETITAAR-VRQRIYFPADEEKQTLL 248

Query: 364 QGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRI 423
            GLLS+  SE  ++T++FV TK   + + R++   G+    + G+  Q++R+ +LN F+ 
Sbjct: 249 LGLLSR--SE-GARTMVFVNTKAFVERVARTLERHGYRVGVLSGDVPQKKRESLLNRFQK 305

Query: 424 GRASILVS 431
           G+  ILV+
Sbjct: 306 GQLEILVA 313



 Score = 33.0 bits (75), Expect = 0.63
 Identities = 13/41 (31%), Positives = 14/41 (34%), Gaps = 2/41 (4%)

Query: 26  RGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTP 66
           R G  R G G G   +      GGR  G    G     R P
Sbjct: 431 RRGGGRSGPGGGS-RSGSV-GGGGRRDGAGADGKPRPRRKP 469



 Score = 29.9 bits (67), Expect = 6.6
 Identities = 8/31 (25%), Positives = 8/31 (25%)

Query: 23  GAPRGGAPRGGGGRGGYGAPRGYSRGGRGGG 53
           G  R G   G       G  R    G  G  
Sbjct: 433 GGGRSGPGGGSRSGSVGGGGRRDGAGADGKP 463


>gnl|CDD|185609 PTZ00424, PTZ00424, helicase 45; Provisional.
          Length = 401

 Score =  166 bits (423), Expect = 4e-45
 Identities = 96/306 (31%), Positives = 161/306 (52%), Gaps = 10/306 (3%)

Query: 127 IQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPA 186
           +  F+       +++ IY  GF+ P+AIQ +G    L G D +  A++G+GKT  ++  A
Sbjct: 27  VDSFDALKLNEDLLRGIYSYGFEKPSAIQQRGIKPILDGYDTIGQAQSGTGKTATFVIAA 86

Query: 187 IVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRV-ACVFGGAPKGPQV 245
           +  ++    L + +    L+LAPTRELAQQI+ V    G     R  ACV GG      +
Sbjct: 87  LQLIDYD--LNACQ---ALILAPTRELAQQIQKVVLALGDYLKVRCHACV-GGTVVRDDI 140

Query: 246 KALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRP 305
             L+ G  +V+ TPGR+ D +++  + +      +LDEAD ML  GF+ QI  +  ++ P
Sbjct: 141 NKLKAGVHMVVGTPGRVYDMIDKRHLRVDDLKLFILDEADEMLSRGFKGQIYDVFKKLPP 200

Query: 306 DRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYKLQG 365
           D QV ++SAT P E+ +L   F+ D  ++ +     T    I Q   V  E E ++K   
Sbjct: 201 DVQVALFSATMPNEILELTTKFMRDPKRILVKKDELTL-EGIRQFY-VAVEKE-EWKFDT 257

Query: 366 LLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGR 425
           L     +   ++ II+  T+RK D +T+ +  + +    +HG+  Q++RD ++ EFR G 
Sbjct: 258 LCDLYETLTITQAIIYCNTRRKVDYLTKKMHERDFTVSCMHGDMDQKDRDLIMREFRSGS 317

Query: 426 ASILVS 431
             +L++
Sbjct: 318 TRVLIT 323


>gnl|CDD|235314 PRK04837, PRK04837, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 423

 Score =  166 bits (423), Expect = 6e-45
 Identities = 102/317 (32%), Positives = 159/317 (50%), Gaps = 28/317 (8%)

Query: 128 QHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAI 187
           Q F +    P +++ + + GF   T IQA   P+ L+G D+   A+TG+GKT+ ++    
Sbjct: 8   QKFSDFALHPQVVEALEKKGFHNCTPIQALALPLTLAGRDVAGQAQTGTGKTMAFLTATF 67

Query: 188 VHV--NSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQV 245
            ++  +     R    P  L++APTRELA QI   A     AT  ++   +GG     Q+
Sbjct: 68  HYLLSHPAPEDRKVNQPRALIMAPTRELAVQIHADAEPLAQATGLKLGLAYGGDGYDKQL 127

Query: 246 KALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRP 305
           K L++G +I+I T GRLIDY +Q  INL     +VLDEADRM D+GF   IR +  ++ P
Sbjct: 128 KVLESGVDILIGTTGRLIDYAKQNHINLGAIQVVVLDEADRMFDLGFIKDIRWLFRRMPP 187

Query: 306 --DRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEH----EK 359
              R  +++SAT    V++LA + + +   + +     T  H I       +E       
Sbjct: 188 ANQRLNMLFSATLSYRVRELAFEHMNNPEYVEVEPEQKTG-HRI-------KEELFYPSN 239

Query: 360 DYKLQGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIH------GNKSQQE 413
           + K++ L + I  E   + IIF  TK + ++I       G  A   H      G+ +Q++
Sbjct: 240 EEKMRLLQTLIEEEWPDRAIIFANTKHRCEEIW------GHLAADGHRVGLLTGDVAQKK 293

Query: 414 RDRVLNEFRIGRASILV 430
           R R+L EF  G   ILV
Sbjct: 294 RLRILEEFTRGDLDILV 310


>gnl|CDD|236941 PRK11634, PRK11634, ATP-dependent RNA helicase DeaD; Provisional.
          Length = 629

 Score =  164 bits (417), Expect = 1e-42
 Identities = 97/297 (32%), Positives = 172/297 (57%), Gaps = 16/297 (5%)

Query: 139 IMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRS 198
           I++ + ++G++ P+ IQA+  P  L+G D++ +A+TGSGKT  +  P + +++ +  L++
Sbjct: 17  ILEALNDLGYEKPSPIQAECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLHNLDPE--LKA 74

Query: 199 GEGPIVLVLAPTRELAQQIETVANDFGS-ATATRVACVFGGAPKGPQVKALQTGAEIVIA 257
              P +LVLAPTRELA Q+     DF        V  ++GG     Q++AL+ G +IV+ 
Sbjct: 75  ---PQILVLAPTRELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQIVVG 131

Query: 258 TPGRLIDYLEQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQVLMWSATWP 317
           TPGRL+D+L++GT++L + S LVLDEAD ML MGF   +  I+ QI    Q  ++SAT P
Sbjct: 132 TPGRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGHQTALFSATMP 191

Query: 318 KEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDY---KLQGLLSQIGSER 374
           + ++++   F+ +  ++ I S       ++    D+ Q +   +   K + L+  + +E 
Sbjct: 192 EAIRRITRRFMKEPQEVRIQS-------SVTTRPDISQSYWTVWGMRKNEALVRFLEAED 244

Query: 375 TSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRASILVS 431
               IIFV TK    ++  ++   G+ + A++G+ +Q  R++ L   + GR  IL++
Sbjct: 245 FDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQALREQTLERLKDGRLDILIA 301


>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
           of proteins involved in ATP-dependent RNA or DNA
           unwinding. This domain contains the ATP-binding region.
          Length = 144

 Score =  144 bits (364), Expect = 4e-40
 Identities = 61/150 (40%), Positives = 84/150 (56%), Gaps = 7/150 (4%)

Query: 167 DLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGS 226
           D++  A TGSGKTL  + P +        L S +G  VLVLAPTRELA Q+     +   
Sbjct: 2   DVLLAAPTGSGKTLAALLPILEL------LDSLKGGQVLVLAPTRELANQVAERLKELFG 55

Query: 227 ATATRVACVFGGAPKGPQVKALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLVLDEADR 286
               +V  + GG     Q K L    +IV+ TPGRL+D LE+  ++L +   L+LDEA R
Sbjct: 56  EGI-KVGYLIGGTSIKQQEKLLSGKTDIVVGTPGRLLDELERLKLSLKKLDLLILDEAHR 114

Query: 287 MLDMGFEPQIRKIIGQIRPDRQVLMWSATW 316
           +L+ GF     KI+ ++  DRQVL+ SAT 
Sbjct: 115 LLNQGFGLLGLKILLKLPKDRQVLLLSATP 144


>gnl|CDD|238034 cd00079, HELICc, Helicase superfamily c-terminal domain; associated
           with DEXDc-, DEAD-, and DEAH-box proteins, yeast
           initiation factor 4A, Ski2p, and Hepatitis C virus NS3
           helicases; this domain is found in a wide variety of
           helicases and helicase related proteins; may not be an
           autonomously folding unit, but an integral part of the
           helicase; 4 helicase superfamilies at present according
           to the organization of their signature motifs; all
           helicases share the ability to unwind nucleic acid
           duplexes with a distinct directional polarity; they
           utilize the free energy from nucleoside triphosphate
           hydrolysis to fuel their translocation along DNA,
           unwinding the duplex in the process.
          Length = 131

 Score = 69.6 bits (171), Expect = 4e-14
 Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 6/88 (6%)

Query: 346 NIVQIVDVCQEHEKDYKLQGLLSQIGSERTS--KTIIFVETKRKADDITRSVRNKGWAAV 403
            I Q V       +D KL+ LL  +        K +IF  +K+  D++   +R  G    
Sbjct: 1   PIKQYVLPV----EDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVA 56

Query: 404 AIHGNKSQQERDRVLNEFRIGRASILVS 431
           A+HG+ SQ+ER+ VL +FR G   +LV+
Sbjct: 57  ALHGDGSQEEREEVLKDFREGEIVVLVA 84



 Score = 38.8 bits (91), Expect = 0.002
 Identities = 19/82 (23%), Positives = 33/82 (40%), Gaps = 8/82 (9%)

Query: 433 YNKSQQERDRVLNEF-----RIGRASILVSQYKESQQKRDRVLNEFRIGRASILVSHYNK 487
               + E+   L E      + G   ++    K+     D +    R     +   H + 
Sbjct: 6   VLPVEDEKLEALLELLKEHLKKGGKVLIFCPSKK---MLDELAELLRKPGIKVAALHGDG 62

Query: 488 SQQERDRVLNEFRIGRASILVS 509
           SQ+ER+ VL +FR G   +LV+
Sbjct: 63  SQEEREEVLKDFREGEIVVLVA 84



 Score = 38.0 bits (89), Expect = 0.004
 Identities = 18/80 (22%), Positives = 35/80 (43%), Gaps = 5/80 (6%)

Query: 459 YKESQQKRDRVLNEFRIGRAS---ILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQ 515
                +K + +L   +        +L+  +  S++  D +    R     +   H + SQ
Sbjct: 7   LPVEDEKLEALLELLKEHLKKGGKVLI--FCPSKKMLDELAELLRKPGIKVAALHGDGSQ 64

Query: 516 QERDRVLNEFRIGRASILVS 535
           +ER+ VL +FR G   +LV+
Sbjct: 65  EEREEVLKDFREGEIVVLVA 84



 Score = 37.2 bits (87), Expect = 0.006
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 2/69 (2%)

Query: 493 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQERDRVLNEFR 552
             +L E       +L+  +  S++  D +    R     +   H + SQ+ER+ VL +FR
Sbjct: 18  LELLKEHLKKGGKVLI--FCPSKKMLDELAELLRKPGIKVAALHGDGSQEEREEVLKDFR 75

Query: 553 IGRASILVS 561
            G   +LV+
Sbjct: 76  EGEIVVLVA 84



 Score = 37.2 bits (87), Expect = 0.006
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 2/69 (2%)

Query: 519 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQERDRVLNEFR 578
             +L E       +L+  +  S++  D +    R     +   H + SQ+ER+ VL +FR
Sbjct: 18  LELLKEHLKKGGKVLI--FCPSKKMLDELAELLRKPGIKVAALHGDGSQEEREEVLKDFR 75

Query: 579 IGRASILVS 587
            G   +LV+
Sbjct: 76  EGEIVVLVA 84



 Score = 37.2 bits (87), Expect = 0.006
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 2/69 (2%)

Query: 545 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQERDRVLNEFR 604
             +L E       +L+  +  S++  D +    R     +   H + SQ+ER+ VL +FR
Sbjct: 18  LELLKEHLKKGGKVLI--FCPSKKMLDELAELLRKPGIKVAALHGDGSQEEREEVLKDFR 75

Query: 605 IGRASILVS 613
            G   +LV+
Sbjct: 76  EGEIVVLVA 84



 Score = 37.2 bits (87), Expect = 0.006
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 2/69 (2%)

Query: 571 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQERDRVLNEFR 630
             +L E       +L+  +  S++  D +    R     +   H + SQ+ER+ VL +FR
Sbjct: 18  LELLKEHLKKGGKVLI--FCPSKKMLDELAELLRKPGIKVAALHGDGSQEEREEVLKDFR 75

Query: 631 IGRASILVS 639
            G   +LV+
Sbjct: 76  EGEIVVLVA 84



 Score = 37.2 bits (87), Expect = 0.006
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 2/69 (2%)

Query: 597 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQERDRVLNEFR 656
             +L E       +L+  +  S++  D +    R     +   H + SQ+ER+ VL +FR
Sbjct: 18  LELLKEHLKKGGKVLI--FCPSKKMLDELAELLRKPGIKVAALHGDGSQEEREEVLKDFR 75

Query: 657 IGRASILVS 665
            G   +LV+
Sbjct: 76  EGEIVVLVA 84


>gnl|CDD|223989 COG1061, SSL2, DNA or RNA helicases of superfamily II
           [Transcription / DNA replication, recombination, and
           repair].
          Length = 442

 Score = 60.5 bits (147), Expect = 2e-09
 Identities = 64/336 (19%), Positives = 104/336 (30%), Gaps = 87/336 (25%)

Query: 169 VAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSAT 228
           V +  TG+GKT   +A   +    +           LVL PT+EL  Q       F    
Sbjct: 59  VIVLPTGAGKT--VVAAEAIAELKRS---------TLVLVPTKELLDQWAEALKKFLLLN 107

Query: 229 ATRVACVFGGAPKGPQVKALQTGAEIVIATPGRLID--YLEQGTINLH--RTSYLVLDEA 284
                 ++GG  K          A++ +AT    +      Q            ++ DE 
Sbjct: 108 DEIG--IYGGGEKEL------EPAKVTVAT----VQTLARRQLLDEFLGNEFGLIIFDEV 155

Query: 285 DRMLDMGFEPQIRKIIGQIRPDRQVLMWSATWPKEVQK---------------------L 323
             +    +    R+I+  +      L  +AT  +E                        +
Sbjct: 156 HHLPAPSY----RRILELLSAAYPRLGLTATPEREDGGRIGDLFDLIGPIVYEVSLKELI 211

Query: 324 AEDFLVDYV----------------------QLNIGSLNPTANHNIVQIVDVCQEHEKDY 361
            E +L  Y                          +     T                K  
Sbjct: 212 DEGYLAPYKYVEIKVTLTEDEEREYAKESARFRELLRARGTLRAENEARRIAIASERKIA 271

Query: 362 KLQGLLSQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEF 421
            ++GLL +    R  KT+IF      A +I +     G    AI G   ++ER+ +L  F
Sbjct: 272 AVRGLLLKH--ARGDKTLIFASDVEHAYEIAKLFLAPGIVE-AITGETPKEEREAILERF 328

Query: 422 RIGRASILVSQYNKSQQERDRVLNE-FRIGRASILV 456
           R G   +LV+          +VL+E   I  A +L+
Sbjct: 329 RTGGIKVLVTV---------KVLDEGVDIPDADVLI 355



 Score = 29.3 bits (66), Expect = 8.3
 Identities = 24/136 (17%), Positives = 47/136 (34%), Gaps = 19/136 (13%)

Query: 543 ERDRVLNEFRIGRASILVSHYNKSQQE--RDRVLNEFRIGRASILVSHYNKSQQERDRVL 600
           E  R     R    +  ++  ++ +    R  +L   R  +  I  S  +        + 
Sbjct: 245 ELLRARGTLRAENEARRIAIASERKIAAVRGLLLKHARGDKTLIFAS--DVEHA--YEIA 300

Query: 601 NEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQQERDRVLNE-FRIGR 659
             F        ++     ++ER+ +L  FR G   +LV+          +VL+E   I  
Sbjct: 301 KLFLAPGIVEAITG-ETPKEEREAILERFRTGGIKVLVT---------VKVLDEGVDIPD 350

Query: 660 AS--ILVSDFDILRQW 673
           A   I++      R +
Sbjct: 351 ADVLIILRPTGSRRLF 366


>gnl|CDD|224122 COG1201, Lhr, Lhr-like helicases [General function prediction
           only].
          Length = 814

 Score = 60.7 bits (148), Expect = 2e-09
 Identities = 74/335 (22%), Positives = 125/335 (37%), Gaps = 70/335 (20%)

Query: 135 FPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQR 194
             P + +      F + T  Q    P   SG +++ IA TGSGKT     P I  + S  
Sbjct: 8   LDPRVREWFKR-KFTSLTPPQRYAIPEIHSGENVLIIAPTGSGKTEAAFLPVINELLSLG 66

Query: 195 PLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVACVFGGAPKGPQVKALQTGAEI 254
             +  +G   L ++P + L   I     +        VA   G  P+  + K L+    I
Sbjct: 67  KGKLEDGIYALYISPLKALNNDIRRRLEEPLRELGIEVAVRHGDTPQSEKQKMLKNPPHI 126

Query: 255 VIATP---------GRLIDYLEQGTINLHRTSYLVLDE------ADR--MLDMGFEPQIR 297
           +I TP          +  + L     ++    Y+++DE      + R   L +  E ++R
Sbjct: 127 LITTPESLAILLNSPKFRELLR----DVR---YVIVDEIHALAESKRGVQLALSLE-RLR 178

Query: 298 KIIGQIRPDRQVLMWSATW--PKEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDV-- 353
           ++ G  +  R  L  SAT   P+EV K    FLV                +  +IVDV  
Sbjct: 179 ELAGDFQ--RIGL--SATVGPPEEVAK----FLV-------------GFGDPCEIVDVSA 217

Query: 354 ----------------CQEHEKDYKLQGLLSQIGSERTSKTIIFVETKRKADDITRSVRN 397
                             E       + +   +   RT  T+IF  T+  A+ +   ++ 
Sbjct: 218 AKKLEIKVISPVEDLIYDEELWAALYERIAELVKKHRT--TLIFTNTRSGAERLAFRLKK 275

Query: 398 KGWAAVAIH-GNKSQQERDRVLNEFRIGRASILVS 431
            G   + +H G+ S++ R  V    + G    +V+
Sbjct: 276 LGPDIIEVHHGSLSRELRLEVEERLKEGELKAVVA 310


>gnl|CDD|224126 COG1205, COG1205, Distinct helicase family with a unique C-terminal
           domain including a metal-binding cysteine cluster
           [General function prediction only].
          Length = 851

 Score = 53.2 bits (128), Expect = 4e-07
 Identities = 60/320 (18%), Positives = 115/320 (35%), Gaps = 67/320 (20%)

Query: 155 QAQGWPIALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELA 214
           Q     +   G ++V    TGSGKT  ++ P + H+     LR       L+L PT  LA
Sbjct: 75  QVDALRLIREGRNVVVTTGTGSGKTESFLLPILDHL-----LRDPSAR-ALLLYPTNALA 128

Query: 215 QQIETVANDFGSATATRVAC-VFGGAPKGPQVKALQTG-AEIVIATPGRLIDYLEQGTIN 272
                   +  S    +V    + G     + +A+     +I++  P    D L    + 
Sbjct: 129 NDQAERLRELISDLPGKVTFGRYTGDTPPEERRAIIRNPPDILLTNP----DMLHYLLLR 184

Query: 273 LH--------RTSYLVLDEA---------------DRMLDMGFEPQIRKIIGQIRPDRQV 309
            H           YLV+DE                 R+L           + +     Q+
Sbjct: 185 NHDAWLWLLRNLKYLVVDELHTYRGVQGSEVALLLRRLLRR---------LRRYGSPLQI 235

Query: 310 LMWSATW--PKEVQKLAEDFLVDYVQLNI-GSLNPTANHNIV----QIVDVCQEHEKDYK 362
           +  SAT   P E    AE+      ++ +    +P      V     I ++ +   +   
Sbjct: 236 ICTSATLANPGE---FAEELFGRDFEVPVDEDGSPRGLRYFVRREPPIRELAESIRRSAL 292

Query: 363 LQGLLSQIGS---ERTSKTIIFVETKRKADDITRSVRN-------KGWAAVAIH-GNKSQ 411
            +  L+ + +       +T++F  ++++ + +  S R        K   AV+ +     +
Sbjct: 293 AE--LATLAALLVRNGIQTLVFFRSRKQVELLYLSPRRRLVREGGKLLDAVSTYRAGLHR 350

Query: 412 QERDRVLNEFRIGRASILVS 431
           +ER R+  EF+ G    +++
Sbjct: 351 EERRRIEAEFKEGELLGVIA 370


>gnl|CDD|223588 COG0514, RecQ, Superfamily II DNA helicase [DNA replication,
           recombination, and repair].
          Length = 590

 Score = 52.7 bits (127), Expect = 5e-07
 Identities = 64/287 (22%), Positives = 121/287 (42%), Gaps = 50/287 (17%)

Query: 163 LSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQ-QIET-- 219
           LSG D + +  TG GK+L Y  PA++           EG   LV++P   L + Q++   
Sbjct: 30  LSGKDTLVVMPTGGGKSLCYQIPALL----------LEGL-TLVVSPLISLMKDQVDQLE 78

Query: 220 ---VANDFGSATATRVACVFGGAPKGPQVKALQTGA-EIVIATPGRL-----IDYLEQGT 270
              +   + ++T +R         +   +  L++G  +++  +P RL     ++ L++  
Sbjct: 79  AAGIRAAYLNSTLSREE-------RQQVLNQLKSGQLKLLYISPERLMSPRFLELLKRLP 131

Query: 271 INLHRTSYLVLDEADRMLDMG--FEPQIRKIIGQIRPD--RQVLMW-SATWPKEVQK--L 323
           I+L     + +DEA  +   G  F P  R++ G++R       ++  +AT    V+    
Sbjct: 132 ISL-----VAIDEAHCISQWGHDFRPDYRRL-GRLRAGLPNPPVLALTATATPRVRDDIR 185

Query: 324 AEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYKLQGLLSQIGSERTSKTIIFVE 383
            +  L D      GS +       V       E          L+ +  + +   II+  
Sbjct: 186 EQLGLQDANIF-RGSFDRPNLALKVVEKG---EPSDQL---AFLATVLPQLSKSGIIYCL 238

Query: 384 TKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRASILV 430
           T++K +++   +R  G +A A H   S +ER+RV   F      ++V
Sbjct: 239 TRKKVEELAEWLRKNGISAGAYHAGLSNEERERVQQAFLNDEIKVMV 285


>gnl|CDD|224123 COG1202, COG1202, Superfamily II helicase, archaea-specific
           [General function prediction only].
          Length = 830

 Score = 52.5 bits (126), Expect = 7e-07
 Identities = 72/292 (24%), Positives = 116/292 (39%), Gaps = 48/292 (16%)

Query: 163 LSGCDLVAIAKTGSGKTL-GYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVA 221
           L G +L+ ++ T SGKTL G +A         R L  G+  + LV  P   LA Q     
Sbjct: 230 LEGENLLVVSATASGKTLIGELA------GIPRLLSGGKKMLFLV--PLVALANQKY--- 278

Query: 222 NDF-------GSATATRVACVFGGAPKGPQVKALQTGAEIVIAT-PGRLIDYLEQGTINL 273
            DF       G   A RV        + P V      A+I++ T  G  IDYL +   +L
Sbjct: 279 EDFKERYSKLGLKVAIRVGMSRIKTREEPVVVDTSPDADIIVGTYEG--IDYLLRTGKDL 336

Query: 274 HRTSYLVLDEADRMLDMGFEPQIRKIIGQIR---PDRQVLMWSATW--PKEVQKLAEDFL 328
                +V+DE   + D    P++  +IG++R   P  Q +  SAT   P+E+ K     L
Sbjct: 337 GDIGTVVIDEIHTLEDEERGPRLDGLIGRLRYLFPGAQFIYLSATVGNPEELAKKLGAKL 396

Query: 329 VDYVQLNIGSLNPTANHNIVQI---VDVCQEHEKDYKLQGLLSQIGSERTSK------TI 379
           V Y            +   V +   +   +   + + +   L +      S       TI
Sbjct: 397 VLY------------DERPVPLERHLVFARNESEKWDIIARLVKREFSTESSKGYRGQTI 444

Query: 380 IFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRASILVS 431
           +F  ++R+  ++  ++  KG  A   H     +ER  V   F     + +V+
Sbjct: 445 VFTYSRRRCHELADALTGKGLKAAPYHAGLPYKERKSVERAFAAQELAAVVT 496


>gnl|CDD|129701 TIGR00614, recQ_fam, ATP-dependent DNA helicase, RecQ family.  All
           proteins in this family for which functions are known
           are 3'-5' DNA-DNA helicases. These proteins are used for
           recombination, recombinational repair, and possibly
           maintenance of chromosome stability. This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 470

 Score = 49.8 bits (119), Expect = 4e-06
 Identities = 67/276 (24%), Positives = 112/276 (40%), Gaps = 42/276 (15%)

Query: 162 ALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTREL----AQQI 217
            L G D   +  TG GK+L Y  PA+               I LV++P   L      Q+
Sbjct: 23  VLLGRDCFVVMPTGGGKSLCYQLPALC-----------SDGITLVISPLISLMEDQVLQL 71

Query: 218 ET--VANDF--GSATATRVACVFGGAPKGPQVKALQTGAEIVIATPGRLIDYLEQGTINL 273
           +   +   F   S +  +   V     K  ++K L    E   +   RL+  LE      
Sbjct: 72  KASGIPATFLNSSQSKEQQKNVLTDL-KDGKIKLLYVTPEKC-SASNRLLQTLE----ER 125

Query: 274 HRTSYLVLDEADRMLDMG--FEPQIRKIIGQIR---PDRQVLMWSATWPKEVQKLAEDFL 328
              + + +DEA  +   G  F P  + + G ++   P+  ++  +AT    V++     L
Sbjct: 126 KGITLIAVDEAHCISQWGHDFRPDYKAL-GSLKQKFPNVPIMALTATASPSVREDILRQL 184

Query: 329 -VDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYK-LQGLLSQIGSERTSKT-IIFVETK 385
            +   Q+   S +     N+   V       K  K L+ LL  I  E   K+ II+  ++
Sbjct: 185 NLKNPQIFCTSFD---RPNLYYEV-----RRKTPKILEDLLRFIRKEFKGKSGIIYCPSR 236

Query: 386 RKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEF 421
           +K++ +T S++N G AA A H       RD V ++F
Sbjct: 237 KKSEQVTASLQNLGIAAGAYHAGLEISARDDVHHKF 272


>gnl|CDD|234365 TIGR03817, DECH_helic, helicase/secretion neighborhood putative
           DEAH-box helicase.  A conserved gene neighborhood widely
           spread in the Actinobacteria contains this
           uncharacterized DEAH-box family helicase encoded
           convergently towards an operon of genes for protein
           homologous to type II secretion and pilus formation
           proteins. The context suggests that this helicase may
           play a role in conjugal transfer of DNA.
          Length = 742

 Score = 49.7 bits (119), Expect = 5e-06
 Identities = 45/182 (24%), Positives = 67/182 (36%), Gaps = 26/182 (14%)

Query: 111 YLLRLESRPGKCVPRPIQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLVA 170
           ++  L +R G+  P P          P ++  +   G   P   QA+   +A +G  +V 
Sbjct: 3   HVEHLPARAGRTAPWPAW------AHPDVVAALEAAGIHRPWQHQARAAELAHAGRHVVV 56

Query: 171 IAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATAT 230
              T SGK+L Y  P +        L        L LAPT+ LA        +       
Sbjct: 57  ATGTASGKSLAYQLPVL------SALADDPRATALYLAPTKALAADQLRAVRELT-LRGV 109

Query: 231 RVACVFGGAPKGPQVKALQTGAEIVIATPGRLIDYLEQGTINLH--------RTSYLVLD 282
           R A   G  P   +  A +  A  V+  P    D L +G +  H        R  Y+V+D
Sbjct: 110 RPATYDGDTPTEERRWAREH-ARYVLTNP----DMLHRGILPSHARWARFLRRLRYVVID 164

Query: 283 EA 284
           E 
Sbjct: 165 EC 166


>gnl|CDD|130456 TIGR01389, recQ, ATP-dependent DNA helicase RecQ.  The
           ATP-dependent DNA helicase RecQ of E. coli is about 600
           residues long. This model represents bacterial proteins
           with a high degree of similarity in domain architecture
           and in primary sequence to E. coli RecQ. The model
           excludes eukaryotic and archaeal proteins with RecQ-like
           regions, as well as more distantly related bacterial
           helicases related to RecQ [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 591

 Score = 47.4 bits (113), Expect = 2e-05
 Identities = 61/289 (21%), Positives = 119/289 (41%), Gaps = 54/289 (18%)

Query: 163 LSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQ-QIETVA 221
           L G D++ +  TG GK+L Y  PA+        L  G   + +V++P   L + Q++ + 
Sbjct: 26  LDGRDVLVVMPTGGGKSLCYQVPAL--------LLKG---LTVVISPLISLMKDQVDQL- 73

Query: 222 NDFGSATATRVACVFGGAPKGPQ---VKALQTG-AEIVIATPGRL-----IDYLEQGTIN 272
                A     A +        Q    KAL  G  +++   P RL     ++ L++  I 
Sbjct: 74  ----RAAGVAAAYLNSTLSAKEQQDIEKALVNGELKLLYVAPERLEQDYFLNMLQRIPIA 129

Query: 273 LHRTSYLVLDEADRMLDMG--FEPQIRKI--IGQIRPDRQVLMWSATWPKEVQKLAEDFL 328
           L     + +DEA  +   G  F P+ +++  + +  P    +  +AT   E ++      
Sbjct: 130 L-----VAVDEAHCVSQWGHDFRPEYQRLGSLAERFPQVPRIALTATADAETRQ------ 178

Query: 329 VDYV-QLNIGSLN----PTANHNIV-QIVDVCQEHEKDYKLQGLLSQIGSERTSKTIIFV 382
            D    L +   N         N+   +V      +K+ K + LL  +   R    II+ 
Sbjct: 179 -DIRELLRLADANEFITSFDRPNLRFSVV------KKNNKQKFLLDYLKKHRGQSGIIYA 231

Query: 383 ETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRASILVS 431
            +++K +++   + ++G +A+A H   S + R     +F      ++V+
Sbjct: 232 SSRKKVEELAERLESQGISALAYHAGLSNKVRAENQEDFLYDDVKVMVA 280


>gnl|CDD|201125 pfam00271, Helicase_C, Helicase conserved C-terminal domain.  The
           Prosite family is restricted to DEAD/H helicases,
           whereas this domain family is found in a wide variety of
           helicases and helicase related proteins. It may be that
           this is not an autonomously folding unit, but an
           integral part of the helicase.
          Length = 78

 Score = 40.2 bits (95), Expect = 2e-04
 Identities = 14/36 (38%), Positives = 23/36 (63%)

Query: 396 RNKGWAAVAIHGNKSQQERDRVLNEFRIGRASILVS 431
           R  G     +HG  SQ+ER+ +L +FR G++ +LV+
Sbjct: 4   RKPGIKVARLHGGLSQEEREEILEDFRNGKSKVLVA 39



 Score = 29.4 bits (67), Expect = 1.5
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 479 SILVSHYNKSQQERDRVLNEFRIGRASILVS 509
            +   H   SQ+ER+ +L +FR G++ +LV+
Sbjct: 9   KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39



 Score = 29.4 bits (67), Expect = 1.5
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 505 SILVSHYNKSQQERDRVLNEFRIGRASILVS 535
            +   H   SQ+ER+ +L +FR G++ +LV+
Sbjct: 9   KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39



 Score = 29.4 bits (67), Expect = 1.5
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 531 SILVSHYNKSQQERDRVLNEFRIGRASILVS 561
            +   H   SQ+ER+ +L +FR G++ +LV+
Sbjct: 9   KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39



 Score = 29.4 bits (67), Expect = 1.5
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 557 SILVSHYNKSQQERDRVLNEFRIGRASILVS 587
            +   H   SQ+ER+ +L +FR G++ +LV+
Sbjct: 9   KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39



 Score = 29.4 bits (67), Expect = 1.5
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 583 SILVSHYNKSQQERDRVLNEFRIGRASILVS 613
            +   H   SQ+ER+ +L +FR G++ +LV+
Sbjct: 9   KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39



 Score = 29.4 bits (67), Expect = 1.5
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 609 SILVSHYNKSQQERDRVLNEFRIGRASILVS 639
            +   H   SQ+ER+ +L +FR G++ +LV+
Sbjct: 9   KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39



 Score = 29.4 bits (67), Expect = 1.5
 Identities = 11/31 (35%), Positives = 20/31 (64%)

Query: 635 SILVSHYNKSQQERDRVLNEFRIGRASILVS 665
            +   H   SQ+ER+ +L +FR G++ +LV+
Sbjct: 9   KVARLHGGLSQEEREEILEDFRNGKSKVLVA 39



 Score = 27.9 bits (63), Expect = 4.3
 Identities = 10/24 (41%), Positives = 18/24 (75%)

Query: 434 NKSQQERDRVLNEFRIGRASILVS 457
             SQ+ER+ +L +FR G++ +LV+
Sbjct: 16  GLSQEEREEILEDFRNGKSKVLVA 39


>gnl|CDD|240291 PTZ00146, PTZ00146, fibrillarin; Provisional.
          Length = 293

 Score = 43.6 bits (103), Expect = 2e-04
 Identities = 20/41 (48%), Positives = 20/41 (48%), Gaps = 1/41 (2%)

Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSN 56
          GR    GG   GG   GG G G  G  RG   GGRGGGG  
Sbjct: 13 GRGGGGGGGRGGGGRGGGRGGGR-GRGRGGGGGGRGGGGGG 52



 Score = 41.6 bits (98), Expect = 0.001
 Identities = 17/41 (41%), Positives = 17/41 (41%), Gaps = 1/41 (2%)

Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSN 56
          G      G    G  RGGG   G G   G  RGG GGGG  
Sbjct: 16 GGGGGGRGGGGRGGGRGGGRGRGRGGGGG-GRGGGGGGGPG 55



 Score = 40.5 bits (95), Expect = 0.002
 Identities = 19/40 (47%), Positives = 19/40 (47%), Gaps = 1/40 (2%)

Query: 15 YGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGG 54
           G     GG  RGG   GG G GG G  RG  RG   GGG
Sbjct: 4  GGFGGGRGGG-RGGGGGGGRGGGGRGGGRGGGRGRGRGGG 42



 Score = 37.8 bits (88), Expect = 0.018
 Identities = 17/36 (47%), Positives = 17/36 (47%), Gaps = 3/36 (8%)

Query: 20 YSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGS 55
             G  RGG   G GG GG G   G   GGRGGG  
Sbjct: 4  GGFGGGRGG---GRGGGGGGGRGGGGRGGGRGGGRG 36



 Score = 37.4 bits (87), Expect = 0.024
 Identities = 17/38 (44%), Positives = 17/38 (44%), Gaps = 2/38 (5%)

Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGG 53
          GR     G  RGG    G GRGG G  RG   GG  G 
Sbjct: 21 GRGGGGRGGGRGG--GRGRGRGGGGGGRGGGGGGGPGK 56



 Score = 37.0 bits (86), Expect = 0.024
 Identities = 17/37 (45%), Positives = 17/37 (45%)

Query: 22 GGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYG 58
           G   GG   GG G GG G   G  RGG  GGG   G
Sbjct: 2  MGGGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRG 38



 Score = 32.0 bits (73), Expect = 1.0
 Identities = 14/26 (53%), Positives = 15/26 (57%)

Query: 34 GGRGGYGAPRGYSRGGRGGGGSNYGS 59
          G  GG+G  RG  RGG GGGG   G 
Sbjct: 1  GMGGGFGGGRGGGRGGGGGGGRGGGG 26



 Score = 32.0 bits (73), Expect = 1.1
 Identities = 15/33 (45%), Positives = 15/33 (45%)

Query: 26 RGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYG 58
            G   GGG  GG G   G  RGG G GG   G
Sbjct: 1  GMGGGFGGGRGGGRGGGGGGGRGGGGRGGGRGG 33


>gnl|CDD|197757 smart00490, HELICc, helicase superfamily c-terminal domain. 
          Length = 82

 Score = 39.5 bits (93), Expect = 4e-04
 Identities = 12/43 (27%), Positives = 26/43 (60%)

Query: 389 DDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRASILVS 431
           +++   ++  G     +HG  SQ+ER+ +L++F  G+  +LV+
Sbjct: 1   EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43



 Score = 29.5 bits (67), Expect = 1.4
 Identities = 10/43 (23%), Positives = 22/43 (51%)

Query: 467 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 509
           + +    +     +   H   SQ+ER+ +L++F  G+  +LV+
Sbjct: 1   EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43



 Score = 29.5 bits (67), Expect = 1.4
 Identities = 10/43 (23%), Positives = 22/43 (51%)

Query: 493 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 535
           + +    +     +   H   SQ+ER+ +L++F  G+  +LV+
Sbjct: 1   EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43



 Score = 29.5 bits (67), Expect = 1.4
 Identities = 10/43 (23%), Positives = 22/43 (51%)

Query: 519 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 561
           + +    +     +   H   SQ+ER+ +L++F  G+  +LV+
Sbjct: 1   EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43



 Score = 29.5 bits (67), Expect = 1.4
 Identities = 10/43 (23%), Positives = 22/43 (51%)

Query: 545 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 587
           + +    +     +   H   SQ+ER+ +L++F  G+  +LV+
Sbjct: 1   EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43



 Score = 29.5 bits (67), Expect = 1.4
 Identities = 10/43 (23%), Positives = 22/43 (51%)

Query: 571 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 613
           + +    +     +   H   SQ+ER+ +L++F  G+  +LV+
Sbjct: 1   EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43



 Score = 29.5 bits (67), Expect = 1.4
 Identities = 10/43 (23%), Positives = 22/43 (51%)

Query: 597 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 639
           + +    +     +   H   SQ+ER+ +L++F  G+  +LV+
Sbjct: 1   EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43



 Score = 29.5 bits (67), Expect = 1.4
 Identities = 10/43 (23%), Positives = 22/43 (51%)

Query: 623 DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRIGRASILVS 665
           + +    +     +   H   SQ+ER+ +L++F  G+  +LV+
Sbjct: 1   EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVA 43


>gnl|CDD|182933 PRK11057, PRK11057, ATP-dependent DNA helicase RecQ; Provisional.
          Length = 607

 Score = 42.0 bits (99), Expect = 0.001
 Identities = 71/286 (24%), Positives = 115/286 (40%), Gaps = 66/286 (23%)

Query: 162 ALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQ-QIET- 219
            LSG D + +  TG GK+L Y  PA+V          G   + LV++P   L + Q++  
Sbjct: 37  VLSGRDCLVVMPTGGGKSLCYQIPALV--------LDG---LTLVVSPLISLMKDQVDQL 85

Query: 220 ----VANDFGSATATR--VACVFGGAPKGPQVKALQTGAEIVIATPGRLI--DYLEQGTI 271
               VA    ++T TR     V  G   G Q+K       ++   P RL+  ++LE    
Sbjct: 86  LANGVAAACLNSTQTREQQLEVMAGCRTG-QIK-------LLYIAPERLMMDNFLEH--- 134

Query: 272 NLH--RTSYLVLDEADRMLDMG--FEPQIRKIIGQIR---PDRQVLMWSATWPKEVQKLA 324
            L     + L +DEA  +   G  F P+    +GQ+R   P    +  +AT     ++  
Sbjct: 135 -LAHWNPALLAVDEAHCISQWGHDFRPEYAA-LGQLRQRFPTLPFMALTATADDTTRQ-- 190

Query: 325 EDFLVDYVQL--------NIGSLN-PTANHNIVQIVDVCQEHEKDYKLQGLLSQIGSERT 375
                D V+L         I S + P   + +V         EK   L  L+  +  +R 
Sbjct: 191 -----DIVRLLGLNDPLIQISSFDRPNIRYTLV---------EKFKPLDQLMRYVQEQRG 236

Query: 376 SKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEF 421
              II+  ++ K +D    ++++G +A A H       R  V   F
Sbjct: 237 KSGIIYCNSRAKVEDTAARLQSRGISAAAYHAGLDNDVRADVQEAF 282


>gnl|CDD|224036 COG1111, MPH1, ERCC4-like helicases [DNA replication,
           recombination, and repair].
          Length = 542

 Score = 41.2 bits (97), Expect = 0.002
 Identities = 41/126 (32%), Positives = 56/126 (44%), Gaps = 11/126 (8%)

Query: 162 ALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTREL-AQQIETV 220
           AL    LV +  TG GKT   IA A+V  N  R      G  VL LAPT+ L  Q  E  
Sbjct: 27  ALFKNTLVVLP-TGLGKTF--IA-AMVIANRLR----WFGGKVLFLAPTKPLVLQHAEFC 78

Query: 221 ANDFGSATATRVACVFGGAPKGPQVKALQTGAEIVIATPGRLIDYLEQGTINLHRTSYLV 280
               G      +A + G      + +      ++ +ATP  + + L+ G I+L   S L+
Sbjct: 79  RKVTG-IPEDEIAALTGEVRPEEREELWAKK-KVFVATPQVVENDLKAGRIDLDDVSLLI 136

Query: 281 LDEADR 286
            DEA R
Sbjct: 137 FDEAHR 142



 Score = 30.0 bits (68), Expect = 6.4
 Identities = 23/86 (26%), Positives = 44/86 (51%), Gaps = 10/86 (11%)

Query: 356 EHEKDYKLQGLL-SQIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAI---HGNK-- 409
           EH K  KL+ +L  Q+     S+ I+F E +  A++I   ++  G  A        ++  
Sbjct: 346 EHPKLEKLREILKEQLEKNGDSRVIVFTEYRDTAEEIVNFLKKIGIKARVRFIGQASREG 405

Query: 410 ----SQQERDRVLNEFRIGRASILVS 431
               SQ+E+  ++++FR G  ++LV+
Sbjct: 406 DKGMSQKEQKEIIDQFRKGEYNVLVA 431


>gnl|CDD|219320 pfam07172, GRP, Glycine rich protein family.  This family of
          proteins includes several glycine rich proteins as well
          as two nodulins 16 and 24. The family also contains
          proteins that are induced in response to various
          stresses.
          Length = 91

 Score = 37.4 bits (87), Expect = 0.003
 Identities = 20/45 (44%), Positives = 20/45 (44%), Gaps = 2/45 (4%)

Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSN 60
          G   Y GG   GG   GGG  GG G   G   G  GGGG  YG  
Sbjct: 48 GGGGYGGGG--GGGYGGGGYYGGGGGYGGGGGGYPGGGGGGYGGG 90



 Score = 36.3 bits (84), Expect = 0.008
 Identities = 16/41 (39%), Positives = 16/41 (39%), Gaps = 3/41 (7%)

Query: 22 GGAPRGGAPRGGGGR---GGYGAPRGYSRGGRGGGGSNYGS 59
           G   GG   GGGG    GGY    G   GG GG     G 
Sbjct: 45 YGGGGGGYGGGGGGGYGGGGYYGGGGGYGGGGGGYPGGGGG 85



 Score = 33.2 bits (76), Expect = 0.069
 Identities = 19/39 (48%), Positives = 19/39 (48%), Gaps = 3/39 (7%)

Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGG 54
          G   Y GG   GG    GGG GGY    G   GG GGGG
Sbjct: 56 GGGGYGGGGYYGGGGGYGGGGGGYPGGGG---GGYGGGG 91



 Score = 32.4 bits (74), Expect = 0.16
 Identities = 16/31 (51%), Positives = 17/31 (54%), Gaps = 2/31 (6%)

Query: 30 PRGGGGRGGY--GAPRGYSRGGRGGGGSNYG 58
           + GGG GGY  G   GY  GG  GGG  YG
Sbjct: 43 DKYGGGGGGYGGGGGGGYGGGGYYGGGGGYG 73


>gnl|CDD|224125 COG1204, COG1204, Superfamily II helicase [General function
           prediction only].
          Length = 766

 Score = 40.1 bits (94), Expect = 0.005
 Identities = 30/126 (23%), Positives = 50/126 (39%), Gaps = 13/126 (10%)

Query: 160 PIALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIET 219
              LS  +++  A TGSGKTL      I  +     L  G G +V +  P + LA++   
Sbjct: 42  KGLLSDENVLISAPTGSGKTL------IALLAILSTLLEGGGKVVYI-VPLKALAEEKYE 94

Query: 220 VANDFGSATATRVACVFGGAPKGPQVKALQTGAEIVIATPGRLIDYLEQGTINLHR-TSY 278
             +        RV    G      +  A     ++++ TP +L D L +   +       
Sbjct: 95  EFSRL-EELGIRVGISTGDYDLDDERLA---RYDVIVTTPEKL-DSLTRKRPSWIEEVDL 149

Query: 279 LVLDEA 284
           +V+DE 
Sbjct: 150 VVIDEI 155


>gnl|CDD|226808 COG4371, COG4371, Predicted membrane protein [Function unknown].
          Length = 334

 Score = 38.8 bits (90), Expect = 0.008
 Identities = 19/37 (51%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 19  SYSGGAPRGGAPRGGG-GRGGYGAPRGYSRGGRGGGG 54
            YSGG P GG   GGG   GG+G P     GG GGG 
Sbjct: 67  GYSGGGPSGGGYSGGGYSGGGFGFPFIIPGGGGGGGF 103



 Score = 35.7 bits (82), Expect = 0.085
 Identities = 14/34 (41%), Positives = 16/34 (47%)

Query: 21 SGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGG 54
          SGG   GG+ R   G     +  G S GG  GGG
Sbjct: 49 SGGRIGGGSFRAPSGYSRGYSGGGPSGGGYSGGG 82



 Score = 34.1 bits (78), Expect = 0.28
 Identities = 17/42 (40%), Positives = 18/42 (42%), Gaps = 1/42 (2%)

Query: 18 PSYSGGAPRGGAPRGGGGRGGYGAP-RGYSRGGRGGGGSNYG 58
          P     A R G   GGG         RGYS GG  GGG + G
Sbjct: 40 PLSVAAAARSGGRIGGGSFRAPSGYSRGYSGGGPSGGGYSGG 81


>gnl|CDD|221784 pfam12810, Gly_rich, Glycine rich protein.  This family of proteins
           is greatly expanded in Trichomonas vaginalis. The
           proteins are composed of several glycine rich motifs
           interspersed through the sequence. Although many
           proteins have been annotated by similarity in the family
           these annotations given the biased composition of the
           sequences these are unlikely to be functionally
           relevant.
          Length = 248

 Score = 38.3 bits (90), Expect = 0.010
 Identities = 22/53 (41%), Positives = 26/53 (49%)

Query: 15  YGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPN 67
                 +GG   GG+  GGGG GGY    G    G GGGGS+Y S  +   PN
Sbjct: 154 GASSGGNGGFGYGGSGNGGGGGGGYFGGGGGHYAGGGGGGSSYISGKDGCIPN 206



 Score = 31.0 bits (71), Expect = 1.8
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query: 34 GGRGGYGAPRGYSRGGRGGGGSNYGSNSN 62
          GG+G   +     +GG  GGG     NS+
Sbjct: 56 GGKGEDNSSNNMVKGGYNGGGDGGNDNSS 84


>gnl|CDD|222478 pfam13966, zf-RVT, zinc-binding in reverse transcriptase.  This
           domain would appear to be a zinc-binding region of a
           putative reverse transcriptase.
          Length = 86

 Score = 35.3 bits (82), Expect = 0.014
 Identities = 15/49 (30%), Positives = 20/49 (40%), Gaps = 7/49 (14%)

Query: 704 LW-VTLNRIRTKFGKCAANLHKWGLKSSAACD-CGAEKQTILHIIQDCP 750
            W    NR+ TK       L K G +    C  CG E++T  H+   C 
Sbjct: 37  AWLAAWNRLLTK-----DRLQKRGWQLPNRCVLCGQEEETRDHLFFHCS 80


>gnl|CDD|220915 pfam10961, DUF2763, Protein of unknown function (DUF2763).  This
          eukaryotic family of proteins has no known function.
          Length = 91

 Score = 34.7 bits (80), Expect = 0.020
 Identities = 15/41 (36%), Positives = 16/41 (39%), Gaps = 3/41 (7%)

Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSN 56
           R + S G        G GG G  G  R   R G GGG S 
Sbjct: 48 YRKNSSSGKRW---DGGRGGPGPPGGGRRMGRIGGGGGPSR 85



 Score = 27.4 bits (61), Expect = 8.3
 Identities = 12/28 (42%), Positives = 12/28 (42%), Gaps = 1/28 (3%)

Query: 22 GGAPRGGAPRGGGGRGGYGAPRGYSRGG 49
           G P GG   G  G GG G  R    GG
Sbjct: 65 PGPPGGGRRMGRIGGGG-GPSRPPMAGG 91


>gnl|CDD|224120 COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA
           replication, recombination, and repair].
          Length = 654

 Score = 37.5 bits (87), Expect = 0.028
 Identities = 27/109 (24%), Positives = 42/109 (38%), Gaps = 33/109 (30%)

Query: 110 YYLLRLESRPGKCVPRPIQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIALSGCDLV 169
                  + PG   PRP Q          + + + E                AL G + +
Sbjct: 3   DDGYLAVAFPGFE-PRPEQRE--------MAEAVAE----------------ALKGGEGL 37

Query: 170 AI-AKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQI 217
            I A TG+GKTL Y+ PA+ +          EG  V++   T+ L +Q+
Sbjct: 38  LIEAPTGTGKTLAYLLPALAY-------AREEGKKVIISTRTKALQEQL 79


>gnl|CDD|234478 TIGR04121, DEXH_lig_assoc, DEXH box helicase, DNA
           ligase-associated.  Members of this protein family are
           DEAD/DEAH box helicases found associated with a
           bacterial ATP-dependent DNA ligase, part of a four-gene
           system that occurs in about 12 % of prokaryotic
           reference genomes. The actual motif in this family is
           DE[VILW]H.
          Length = 803

 Score = 37.1 bits (87), Expect = 0.042
 Identities = 25/70 (35%), Positives = 33/70 (47%), Gaps = 4/70 (5%)

Query: 150 APTAIQAQGWPIALSGCDLVAIAKTGSGKTL-GYIAPAIVHVNSQRPLRSGEGPIVLVLA 208
            P   Q + W  AL G   + IA TGSGKTL G++   I     ++P    +G   L + 
Sbjct: 13  TPRPFQLEMWAAALEGRSGLLIAPTGSGKTLAGFLPSLIDLAGPEKP---KKGLHTLYIT 69

Query: 209 PTRELAQQIE 218
           P R LA  I 
Sbjct: 70  PLRALAVDIA 79


>gnl|CDD|233251 TIGR01054, rgy, reverse gyrase.  This model describes reverse
           gyrase, found in both archaeal and bacterial
           thermophiles. This enzyme, a fusion of a type I
           topoisomerase domain and a helicase domain, introduces
           positive supercoiling to increase the melting
           temperature of DNA double strands. Generally, these
           gyrases are encoded as a single polypeptide. An
           exception was found in Methanopyrus kandleri, where
           enzyme is split within the topoisomerase domain,
           yielding a heterodimer of gene products designated RgyB
           and RgyA [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 1171

 Score = 36.3 bits (84), Expect = 0.070
 Identities = 58/302 (19%), Positives = 103/302 (34%), Gaps = 60/302 (19%)

Query: 170 AIAKTGSGKT-LGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQ----IETVANDF 224
            IA TG GKT  G      +    +R           ++ PT  L  Q    I ++A   
Sbjct: 98  IIAPTGVGKTTFGLAMSLFLAKKGKR---------CYIILPTTLLVIQVAEKISSLAEKA 148

Query: 225 GSATATRVACVFGGAPKGPQVKALQ----TGAEIVIATPGRLIDYLEQGTINLH-RTSYL 279
           G  T   +       P   + + ++       +I+I T   L    +     L  +  ++
Sbjct: 149 GVGTV-NIGAYHSRLPTKEKKEFMERIENGDFDILITTTMFLSKNYD----ELGPKFDFI 203

Query: 280 VLDEADRMLD-----------MGF-EPQIRKIIGQIRPDRQVLMWSATWPKEVQKLAEDF 327
            +D+ D +L            +GF E  I K    IR   ++        +     A   
Sbjct: 204 FVDDVDALLKASKNVDKLLKLLGFSEELIEKAWKLIRLRLKLYRALHAKKRLELLEAIPG 263

Query: 328 LVDYVQLNIGS----------------LNPTANHNIVQIVDVCQEHEKDYKLQGLLSQIG 371
                 L + S                L          + +V   + +D  L+  L +I 
Sbjct: 264 KKRGC-LIVSSATGRPRGKRAKLFRELLGFEVGGGSDTLRNVVDVYVEDEDLKETLLEIV 322

Query: 372 SERTSKTIIFVET---KRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFRIGRASI 428
            +  +  I++V     K KA++I   + N G  AVA H  K +++ ++    F  G   +
Sbjct: 323 KKLGTGGIVYVSIDYGKEKAEEIAEFLENHGVKAVAYHATKPKEDYEK----FAEGEIDV 378

Query: 429 LV 430
           L+
Sbjct: 379 LI 380


>gnl|CDD|219420 pfam07466, DUF1517, Protein of unknown function (DUF1517).  This
          family consists of several hypothetical glycine rich
          plant and bacterial proteins of around 300 residues in
          length. The function of this family is unknown.
          Length = 280

 Score = 35.4 bits (82), Expect = 0.097
 Identities = 18/48 (37%), Positives = 23/48 (47%), Gaps = 4/48 (8%)

Query: 15 YGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRG----GRGGGGSNYG 58
          +  PS S  +PR  +P GGG  G  G   G+       G GGGG  +G
Sbjct: 13 FRAPSRSSSSPRSSSPGGGGYYGSPGGGFGFPFLIPFFGFGGGGGLFG 60



 Score = 32.3 bits (74), Expect = 0.75
 Identities = 15/49 (30%), Positives = 17/49 (34%), Gaps = 5/49 (10%)

Query: 4  RSPDMFELSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGG 52
          R+P     SR    P  S     G     GGG G       +  GG GG
Sbjct: 14 RAP-----SRSSSSPRSSSPGGGGYYGSPGGGFGFPFLIPFFGFGGGGG 57


>gnl|CDD|233069 TIGR00643, recG, ATP-dependent DNA helicase RecG.  [DNA metabolism,
           DNA replication, recombination, and repair].
          Length = 630

 Score = 35.4 bits (82), Expect = 0.13
 Identities = 37/150 (24%), Positives = 61/150 (40%), Gaps = 29/150 (19%)

Query: 175 GSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATATRVAC 234
           GSGKTL      +  + +        G  V ++APT  LA+Q      +  +     VA 
Sbjct: 266 GSGKTLVAALAMLAAIEA--------GYQVALMAPTEILAEQHYNSLRNLLAPLGIEVAL 317

Query: 235 VFG---GAPKGPQVKALQTGA-EIVIATPGRLIDYLEQGTINLHRTSYLVLDEADRMLDM 290
           + G   G  +   ++ + +G   +V+ T       L Q  +   R + +++DE  R    
Sbjct: 318 LTGSLKGKRRKELLETIASGQIHLVVGTHA-----LIQEKVEFKRLALVIIDEQHR---- 368

Query: 291 GFEPQIRKII-----GQIRPDRQVLMWSAT 315
            F  + RK +     G   P   VL+ SAT
Sbjct: 369 -FGVEQRKKLREKGQGGFTPH--VLVMSAT 395



 Score = 31.5 bits (72), Expect = 1.8
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 405 IHGNKSQQERDRVLNEFRIGRASILVS 431
           +HG     E++ V+ EFR G   ILV+
Sbjct: 488 LHGRMKSDEKEAVMEEFREGEVDILVA 514


>gnl|CDD|180777 PRK06958, PRK06958, single-stranded DNA-binding protein;
           Provisional.
          Length = 182

 Score = 34.0 bits (78), Expect = 0.13
 Identities = 17/44 (38%), Positives = 17/44 (38%)

Query: 16  GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
           GR    GG   G     GGG GG G   G      GGGG   G 
Sbjct: 110 GRGGSGGGGGGGDEGGYGGGGGGGGGGYGGESRSGGGGGRASGG 153



 Score = 34.0 bits (78), Expect = 0.17
 Identities = 17/41 (41%), Positives = 19/41 (46%), Gaps = 3/41 (7%)

Query: 19  SYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
            Y GG   GG   GG  R G G   G    G GGGG+  G+
Sbjct: 125 GYGGGGGGGGGGYGGESRSGGG---GGRASGGGGGGAGGGA 162



 Score = 32.5 bits (74), Expect = 0.52
 Identities = 18/36 (50%), Positives = 20/36 (55%), Gaps = 4/36 (11%)

Query: 26  RGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNS 61
           RGG+  GGGG GG     GY  GG GGGG   G + 
Sbjct: 111 RGGS--GGGGGGGDEG--GYGGGGGGGGGGYGGESR 142



 Score = 31.7 bits (72), Expect = 0.93
 Identities = 18/54 (33%), Positives = 19/54 (35%)

Query: 16  GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPNWN 69
           G     GG   GG   GGG  G   +  G  R   GGGG   G  S    P   
Sbjct: 118 GGGGDEGGYGGGGGGGGGGYGGESRSGGGGGRASGGGGGGAGGGASRPSAPAGG 171



 Score = 31.3 bits (71), Expect = 1.2
 Identities = 16/38 (42%), Positives = 17/38 (44%)

Query: 22  GGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
           GG    G   GGG  GGYG   G   GG GG   + G 
Sbjct: 109 GGRGGSGGGGGGGDEGGYGGGGGGGGGGYGGESRSGGG 146



 Score = 30.9 bits (70), Expect = 1.7
 Identities = 18/41 (43%), Positives = 18/41 (43%), Gaps = 3/41 (7%)

Query: 15  YGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGS 55
           YG     GG   GG  R GGG G      G   GG GGG S
Sbjct: 126 YGGGGGGGGGGYGGESRSGGGGGRAS---GGGGGGAGGGAS 163


>gnl|CDD|236092 PRK07772, PRK07772, single-stranded DNA-binding protein;
           Provisional.
          Length = 186

 Score = 33.5 bits (77), Expect = 0.21
 Identities = 18/36 (50%), Positives = 19/36 (52%), Gaps = 2/36 (5%)

Query: 24  APRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
           A RGG   GGG  GG+G   G S GG GGGG     
Sbjct: 120 ASRGGGGGGGG--GGFGGGGGGSGGGGGGGGGGGAP 153



 Score = 33.5 bits (77), Expect = 0.25
 Identities = 20/48 (41%), Positives = 23/48 (47%), Gaps = 3/48 (6%)

Query: 26  RGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPN---WNS 70
            GG   GGGG GG G   G   GG GGGG+  G  +    P    W+S
Sbjct: 123 GGGGGGGGGGFGGGGGGSGGGGGGGGGGGAPGGGGAQASAPADDPWSS 170



 Score = 31.9 bits (73), Expect = 0.70
 Identities = 19/43 (44%), Positives = 19/43 (44%)

Query: 17  RPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
           R S  GG   GG   GGGG G  G   G   GG  GGG    S
Sbjct: 119 RASRGGGGGGGGGGFGGGGGGSGGGGGGGGGGGAPGGGGAQAS 161



 Score = 28.8 bits (65), Expect = 7.1
 Identities = 16/35 (45%), Positives = 17/35 (48%), Gaps = 3/35 (8%)

Query: 29  APRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNL 63
           A RGGGG GG G   G+  GG G GG   G     
Sbjct: 120 ASRGGGGGGGGG---GFGGGGGGSGGGGGGGGGGG 151


>gnl|CDD|203709 pfam07652, Flavi_DEAD, Flavivirus DEAD domain. 
          Length = 146

 Score = 33.1 bits (76), Expect = 0.25
 Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 7/44 (15%)

Query: 175 GSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIE 218
           G+GKT   + P +V     R LR+      LVLAPTR +  ++E
Sbjct: 12  GAGKTRKVL-PELVRECIDRRLRT------LVLAPTRVVLAEME 48


>gnl|CDD|221759 pfam12764, Gly-rich_Ago1, Glycine-rich region of argonaut.  This
          domain is often found at the very N-terminal of
          argonaut-like proteins.
          Length = 102

 Score = 31.9 bits (72), Expect = 0.31
 Identities = 17/36 (47%), Positives = 18/36 (50%), Gaps = 2/36 (5%)

Query: 16 GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRG 51
          G P Y  G  RGG P+  GGRGG G  RG    G  
Sbjct: 2  GPPEY-QGRGRGGPPQ-QGGRGGGGGGRGGGSTGGP 35


>gnl|CDD|164795 PHA00370, III, attachment protein.
          Length = 297

 Score = 33.7 bits (77), Expect = 0.32
 Identities = 24/72 (33%), Positives = 29/72 (40%), Gaps = 9/72 (12%)

Query: 19  SYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPN------WNSMS 72
           S +GG   GG   GG G G  G   G   GG  GGGS  GS     T        ++   
Sbjct: 94  SDTGGDTGGGNTGGGSGGGDTGGSGG---GGSDGGGSEGGSTGKSLTKEGVGAGDFDYPK 150

Query: 73  MEQINKNLYTPN 84
           M   NK+  T +
Sbjct: 151 MANANKDALTED 162



 Score = 29.5 bits (66), Expect = 7.6
 Identities = 16/42 (38%), Positives = 16/42 (38%), Gaps = 1/42 (2%)

Query: 18  PSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
               GG    G    GG  GG G   G S GG  GG    GS
Sbjct: 82  KDGDGGGTGEGGSDTGGDTGG-GNTGGGSGGGDTGGSGGGGS 122


>gnl|CDD|218292 pfam04851, ResIII, Type III restriction enzyme, res subunit. 
          Length = 100

 Score = 31.8 bits (73), Expect = 0.33
 Identities = 14/50 (28%), Positives = 24/50 (48%), Gaps = 8/50 (16%)

Query: 174 TGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVAND 223
           TGSGKT    A A++   ++   +      VL + P ++L +Q   +  D
Sbjct: 27  TGSGKT--LTAAALIARLAKGKKK------VLFVVPRKDLLEQALVIIID 68


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 34.1 bits (79), Expect = 0.36
 Identities = 43/127 (33%), Positives = 63/127 (49%), Gaps = 13/127 (10%)

Query: 162 ALSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVA 221
           AL    LV +  TG GKT   IA   + V ++R L    G  VL+LAPT+ L +Q     
Sbjct: 27  ALKKNTLV-VLPTGLGKTA--IA---LLVIAER-LHKKGGK-VLILAPTKPLVEQHAEFF 78

Query: 222 NDFGSATATRVACVFGGAPKGPQVKALQTGAEIVIATPGRLI--DYLEQGTINLHRTSYL 279
             F +    ++  VF G     +   L   A++++ATP ++I  D +  G I+L   S L
Sbjct: 79  RKFLNIPEEKIV-VFTGEVSPEKRAELWEKAKVIVATP-QVIENDLIA-GRISLEDVSLL 135

Query: 280 VLDEADR 286
           + DEA R
Sbjct: 136 IFDEAHR 142



 Score = 32.9 bits (76), Expect = 0.86
 Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 9/85 (10%)

Query: 356 EHEKDYKLQGLLS-QIGSERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHG--NK--- 409
           EH K  KL+ ++  Q+G    S+ I+F + +  A+ I   +  +G  AV   G  +K   
Sbjct: 345 EHPKLEKLREIVKEQLGKNPDSRIIVFTQYRDTAEKIVDLLEKEGIKAVRFVGQASKDGD 404

Query: 410 ---SQQERDRVLNEFRIGRASILVS 431
              SQ+E+  +L++FR G  ++LVS
Sbjct: 405 KGMSQKEQIEILDKFRAGEFNVLVS 429


>gnl|CDD|184200 PRK13640, cbiO, cobalt transporter ATP-binding subunit;
           Provisional.
          Length = 282

 Score = 33.2 bits (76), Expect = 0.38
 Identities = 18/61 (29%), Positives = 32/61 (52%), Gaps = 4/61 (6%)

Query: 279 LVLDEADRMLDMGFEPQIRKIIGQIRPDRQVLMWSATWPKEVQKLAEDFLVDYVQLNIGS 338
           ++LDE+  MLD   + QI K+I +++    + + S T   +   +A+  LV    L+ G 
Sbjct: 165 IILDESTSMLDPAGKEQILKLIRKLKKKNNLTVISITHDIDEANMADQVLV----LDDGK 220

Query: 339 L 339
           L
Sbjct: 221 L 221


>gnl|CDD|235782 PRK06341, PRK06341, single-stranded DNA-binding protein;
           Provisional.
          Length = 166

 Score = 32.5 bits (74), Expect = 0.40
 Identities = 19/50 (38%), Positives = 23/50 (46%), Gaps = 1/50 (2%)

Query: 11  LSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSN 60
           L+ + GR    GG   GG   GGG  G  G  RG  R    GGG N+  +
Sbjct: 111 LTMLDGRGE-GGGGGGGGDDGGGGDFGSSGPSRGGPRPASSGGGGNFSRD 159


>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2.  Arsenite is a
           carcinogenic compound which can act as a co-mutagen by
           inhibiting DNA repair. Arsenite-resistance protein 2 is
           thought to play a role in arsenite resistance.
          Length = 211

 Score = 32.9 bits (75), Expect = 0.43
 Identities = 20/63 (31%), Positives = 22/63 (34%), Gaps = 12/63 (19%)

Query: 15  YGRPSYSGGAPRGGAPR---GGGGRGGYGAPRGYSRGGRG--------GGGSNYGSNSNL 63
           Y   +     P  G PR    G GRGG   P     GGRG        GG      N +L
Sbjct: 150 YPPQTPQALMP-YGQPRPPMMGYGRGGPPFPPNQYGGGRGNYDEFRGQGGYYGKPRNRDL 208

Query: 64  RTP 66
             P
Sbjct: 209 DGP 211



 Score = 30.9 bits (70), Expect = 2.0
 Identities = 19/56 (33%), Positives = 19/56 (33%), Gaps = 18/56 (32%)

Query: 22  GGAPRGGAPRGGG-------GRGGYGAPR----GYSRGGR-------GGGGSNYGS 59
              P G AP   G           YG PR    GY RGG        GGG  NY  
Sbjct: 137 KPDPGGLAPGLPGYPPQTPQALMPYGQPRPPMMGYGRGGPPFPPNQYGGGRGNYDE 192


>gnl|CDD|221690 pfam12650, DUF3784, Domain of unknown function (DUF3784).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 96 and 110
           amino acids in length.
          Length = 96

 Score = 31.0 bits (71), Expect = 0.47
 Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 421 FRIGRASILVSQYN-KSQQERDRVLNEFRIGRA 452
              G+ S L+S YN  S++E+++  +E ++ + 
Sbjct: 14  LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45



 Score = 31.0 bits (71), Expect = 0.52
 Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 473 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 504
              G+ S L+S YN  S++E+++  +E ++ + 
Sbjct: 14  LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45



 Score = 31.0 bits (71), Expect = 0.52
 Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 499 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 530
              G+ S L+S YN  S++E+++  +E ++ + 
Sbjct: 14  LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45



 Score = 31.0 bits (71), Expect = 0.52
 Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 525 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 556
              G+ S L+S YN  S++E+++  +E ++ + 
Sbjct: 14  LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45



 Score = 31.0 bits (71), Expect = 0.52
 Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 551 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 582
              G+ S L+S YN  S++E+++  +E ++ + 
Sbjct: 14  LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45



 Score = 31.0 bits (71), Expect = 0.52
 Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 577 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 608
              G+ S L+S YN  S++E+++  +E ++ + 
Sbjct: 14  LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45



 Score = 31.0 bits (71), Expect = 0.52
 Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 603 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 634
              G+ S L+S YN  S++E+++  +E ++ + 
Sbjct: 14  LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45



 Score = 31.0 bits (71), Expect = 0.52
 Identities = 9/33 (27%), Positives = 20/33 (60%), Gaps = 2/33 (6%)

Query: 629 FRIGRASILVSHYN-KSQQERDRVLNEFRIGRA 660
              G+ S L+S YN  S++E+++  +E ++ + 
Sbjct: 14  LLKGKGSFLISGYNTMSKEEKEKY-DEKKLCKF 45


>gnl|CDD|224429 COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase
           type [General function prediction only].
          Length = 271

 Score = 32.7 bits (75), Expect = 0.56
 Identities = 17/57 (29%), Positives = 23/57 (40%), Gaps = 3/57 (5%)

Query: 1   MVMRSPDMFELSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGR--GGGGS 55
           M  RS  +      +            G+  G GG GG  +  G+S GG   GGGG+
Sbjct: 212 MFGRSLYLGRQPDRWLNGVLGRRRRSSGS-GGSGGSGGGSSGGGFSGGGGSSGGGGA 267



 Score = 31.6 bits (72), Expect = 1.3
 Identities = 17/48 (35%), Positives = 18/48 (37%), Gaps = 5/48 (10%)

Query: 15  YGRPSYSGGA---PRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
            GR           R     G GG GG G   G S GG  GGG + G 
Sbjct: 219 LGRQPDRWLNGVLGRRRRSSGSGGSGGSG--GGSSGGGFSGGGGSSGG 264



 Score = 30.8 bits (70), Expect = 2.8
 Identities = 17/45 (37%), Positives = 17/45 (37%), Gaps = 5/45 (11%)

Query: 11  LSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGS 55
           L R        G    GG   GGG  GG     G S GG G  GS
Sbjct: 231 LGRRRRSSGSGGSGGSGGGSSGGGFSGG-----GGSSGGGGASGS 270


>gnl|CDD|221518 pfam12301, CD99L2, CD99 antigen like protein 2.  This family of
           proteins is found in eukaryotes. Proteins in this family
           are typically between 165 and 237 amino acids in length.
           CD99L2 and CD99 are involved in trans-endothelial
           migration of neutrophils in vitro and in the recruitment
           of neutrophils into inflamed peritoneum.
          Length = 154

 Score = 31.7 bits (72), Expect = 0.70
 Identities = 15/59 (25%), Positives = 17/59 (28%), Gaps = 11/59 (18%)

Query: 14  VYGRPSYSGGAPRGGAPRGGGG-----------RGGYGAPRGYSRGGRGGGGSNYGSNS 61
              +P        G     GGG            GGY   +G   GG GGGG       
Sbjct: 46  AGPKPKPRPQPDPGNPGGDGGGFSDSDLADVAGDGGYKPDKGKGGGGGGGGGGGTDGGP 104



 Score = 29.4 bits (66), Expect = 4.6
 Identities = 14/35 (40%), Positives = 17/35 (48%)

Query: 10  ELSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRG 44
           +L+ V G   Y     +GG   GGGG G  G P G
Sbjct: 72  DLADVAGDGGYKPDKGKGGGGGGGGGGGTDGGPEG 106


>gnl|CDD|221477 pfam12235, FXR1P_C, Fragile X-related 1 protein C terminal.  This
          domain family is found in eukaryotes, and is typically
          between 126 and 160 amino acids in length. The family
          is found in association with pfam05641, pfam00013. This
          family is the C terminal region of the fragile X
          related 1 protein FXR1P. FXR1P contains two KH domains
          and a RGG box that are characteristic motifs in
          RNA-binding proteins as well as nuclear localization
          and export signals. FXR1P is thought to regulate mRNA
          transport and translation.
          Length = 79

 Score = 30.1 bits (68), Expect = 0.77
 Identities = 17/50 (34%), Positives = 19/50 (38%), Gaps = 1/50 (2%)

Query: 22 GGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS-NSNLRTPNWNS 70
          G   R    R G    GY       RG  GGG  N  S +S L  P+ N 
Sbjct: 19 GAGSRPPPSRPGDKEKGYLTDDSGGRGRGGGGKGNSNSISSVLDDPDSNP 68


>gnl|CDD|236966 PRK11747, dinG, ATP-dependent DNA helicase DinG; Provisional.
          Length = 697

 Score = 32.5 bits (75), Expect = 1.1
 Identities = 11/20 (55%), Positives = 12/20 (60%)

Query: 168 LVAIAKTGSGKTLGYIAPAI 187
           LV  A TG GKTL Y+   I
Sbjct: 52  LVIEAGTGVGKTLSYLLAGI 71


>gnl|CDD|217818 pfam03964, Chorion_2, Chorion family 2.  The chorion genes of
          Drosophila are amplified in response to developmental
          signals in the follicle cells of the ovary.
          Length = 96

 Score = 29.9 bits (67), Expect = 1.1
 Identities = 15/47 (31%), Positives = 18/47 (38%), Gaps = 2/47 (4%)

Query: 15 YGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNS 61
          YG     G A +      G    GYG  RG    GR G   +  S+S
Sbjct: 2  YGGVGVGGYAYQ--VQPAGYYGPGYGRGRGRGGYGRRGYLDSSPSDS 46


>gnl|CDD|109979 pfam00944, Peptidase_S3, Alphavirus core protein.  Also known as
           coat protein C and capsid protein C. This makes the
           literature very confusing. Alphaviruses consist of a
           nucleoprotein core, a lipid membrane which envelopes the
           core, and glycoprotein spikes protruding from the lipid
           membrane.
          Length = 157

 Score = 31.0 bits (70), Expect = 1.3
 Identities = 21/59 (35%), Positives = 27/59 (45%), Gaps = 5/59 (8%)

Query: 15  YGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGG---GGSNYGSNSNLRTPNWNS 70
           +G   YS G  R   P+G GG+G  G P   + G       GG+N GS + L    WN 
Sbjct: 86  HGAVQYSNG--RFTVPKGVGGKGDSGRPILDNTGRVVAIVLGGANEGSRTALSVVTWNK 142


>gnl|CDD|236148 PRK08074, PRK08074, bifunctional ATP-dependent DNA helicase/DNA
           polymerase III subunit epsilon; Validated.
          Length = 928

 Score = 32.2 bits (74), Expect = 1.4
 Identities = 23/93 (24%), Positives = 35/93 (37%), Gaps = 34/93 (36%)

Query: 135 FPPY--------IMKKIYEMGFQAPTAIQAQGWPIALSGCDLVAI-AKTGSGKTLGYIAP 185
            P Y        +MK++Y                 AL   +   I A TG+GK+L Y+ P
Sbjct: 253 MPKYEKREGQQEMMKEVYT----------------ALRDSEHALIEAGTGTGKSLAYLLP 296

Query: 186 AIVH-VNSQRPLRSGEGPIVLVLAPTRELAQQI 217
           A       + P        V++   T +L QQ+
Sbjct: 297 AAYFAKKKEEP--------VVISTYTIQLQQQL 321


>gnl|CDD|112992 pfam04202, Mfp-3, Foot protein 3.  Mytilus foot protein-3 (Mfp-3)
          is a highly polymorphic protein family located in the
          byssal adhesive plaques of blue mussels.
          Length = 71

 Score = 29.2 bits (65), Expect = 1.4
 Identities = 16/38 (42%), Positives = 17/38 (44%), Gaps = 1/38 (2%)

Query: 15 YGRPSYSGGAPRGGAPRGGGGRGGY-GAPRGYSRGGRG 51
          YG P   GG    G  R   G GG  G   G+ RG RG
Sbjct: 32 YGPPRRYGGGYYNGYNRYARGYGGNKGWNNGWKRGRRG 69


>gnl|CDD|215597 PLN03137, PLN03137, ATP-dependent DNA helicase; Q4-like;
           Provisional.
          Length = 1195

 Score = 32.2 bits (73), Expect = 1.5
 Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 12/56 (21%)

Query: 163 LSGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQ-QI 217
           +SG D+  +  TG GK+L Y  PA++              I LV++P   L Q QI
Sbjct: 473 MSGYDVFVLMPTGGGKSLTYQLPALIC-----------PGITLVISPLVSLIQDQI 517


>gnl|CDD|220138 pfam09195, Endonuc-BglII, Restriction endonuclease BglII.  Members
           of this family are predominantly found in prokaryotic
           restriction endonuclease BglII, and adopt a structure
           consisting of an alpha/beta core containing a
           six-stranded beta-sheet surrounded by five
           alpha-helices, two of which are involved in
           homodimerisation of the endonuclease. They recognise the
           double-stranded DNA sequence AGATCT and cleave after
           A-1, resulting in specific double-stranded fragments
           with terminal 5'-phosphates.
          Length = 155

 Score = 30.7 bits (70), Expect = 1.6
 Identities = 23/98 (23%), Positives = 36/98 (36%), Gaps = 8/98 (8%)

Query: 478 ASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQ-QERDRVL-------NEFRIGR 529
             IL S + +  +E + VL  F +  A +  S  N+S   +  +         NE RI  
Sbjct: 3   EEILASDFPQEWEEIEEVLLAFPLTLAELKASGGNRSDGTQLYKDELVAKGWKNEPRISL 62

Query: 530 ASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQ 567
             I     +K    ++RV  E + G          K Q
Sbjct: 63  DGIEEESPHKIDFVKNRVALEVQFGNYDFFYRDLLKFQ 100



 Score = 30.7 bits (70), Expect = 1.6
 Identities = 23/98 (23%), Positives = 36/98 (36%), Gaps = 8/98 (8%)

Query: 530 ASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQ-QERDRVL-------NEFRIGR 581
             IL S + +  +E + VL  F +  A +  S  N+S   +  +         NE RI  
Sbjct: 3   EEILASDFPQEWEEIEEVLLAFPLTLAELKASGGNRSDGTQLYKDELVAKGWKNEPRISL 62

Query: 582 ASILVSHYNKSQQERDRVLNEFRIGRASILVSHYNKSQ 619
             I     +K    ++RV  E + G          K Q
Sbjct: 63  DGIEEESPHKIDFVKNRVALEVQFGNYDFFYRDLLKFQ 100


>gnl|CDD|236794 PRK10917, PRK10917, ATP-dependent DNA helicase RecG; Provisional.
          Length = 681

 Score = 31.7 bits (73), Expect = 2.0
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 405 IHGNKSQQERDRVLNEFRIGRASILVS 431
           +HG     E+D V+  F+ G   ILV+
Sbjct: 511 LHGRMKPAEKDAVMAAFKAGEIDILVA 537


>gnl|CDD|233398 TIGR01407, dinG_rel, DnaQ family exonuclease/DinG family helicase,
           putative.  This model represents a family of proteins in
           Gram-positive bacteria. The N-terminal region of about
           200 amino acids resembles the epsilon subunit of E. coli
           DNA polymerase III and the homologous region of the
           Gram-positive type DNA polymerase III alpha subunit. The
           epsilon subunit contains an exonuclease domain. The
           remainder of this protein family resembles a predicted
           ATP-dependent helicase, the DNA damage-inducible protein
           DinG of E. coli [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 850

 Score = 31.7 bits (72), Expect = 2.0
 Identities = 16/46 (34%), Positives = 27/46 (58%), Gaps = 8/46 (17%)

Query: 172 AKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQI 217
           A TG+GKTLGY+ PA+ +  +++P        V++   T+ L  Q+
Sbjct: 271 APTGTGKTLGYLLPALYYAITEKP--------VVISTNTKVLQSQL 308


>gnl|CDD|218152 pfam04568, IATP, Mitochondrial ATPase inhibitor, IATP.  ATP
          synthase inhibitor prevents the enzyme from switching
          to ATP hydrolysis during collapse of the
          electrochemical gradient, for example during oxygen
          deprivation ATP synthase inhibitor forms a one to one
          complex with the F1 ATPase, possibly by binding at the
          alpha-beta interface. It is thought to inhibit ATP
          synthesis by preventing the release of ATP. The minimum
          inhibitory region for bovine inhibitor is from residues
          39 to 72. The inhibitor has two oligomeric states,
          dimer (the active state) and tetramer. At low pH, the
          inhibitor forms a dimer via antiparallel coiled coil
          interactions between the C terminal regions of two
          monomers. At high pH, the inhibitor forms tetramers and
          higher oligomers by coiled coil interactions involving
          the N terminus and inhibitory region, thus preventing
          the inhibitory activity.
          Length = 90

 Score = 28.9 bits (65), Expect = 2.7
 Identities = 12/23 (52%), Positives = 12/23 (52%)

Query: 16 GRPSYSGGAPRGGAPRGGGGRGG 38
           R S S G    GA RGGGG  G
Sbjct: 23 ARRSMSEGDTGSGAGRGGGGEAG 45


>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
           and repair].
          Length = 1187

 Score = 31.2 bits (71), Expect = 2.8
 Identities = 65/328 (19%), Positives = 115/328 (35%), Gaps = 94/328 (28%)

Query: 163 LSGCDLVAIAKTGSGKT-LGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQ----I 217
           + G     IA TG GKT  G +    +    +R         V ++ PT  L +Q    +
Sbjct: 95  VRGKSFAIIAPTGVGKTTFGLLMSLYLAKKGKR---------VYIIVPTTTLVRQVYERL 145

Query: 218 ETVANDFGSATATRVACVFGGAPKGPQVKAL-----QTGAEIVIATPGRLIDYLEQGTIN 272
           +  A D GS     V  V+  A    + +           +I+I T   L    E     
Sbjct: 146 KKFAEDAGS---LDVLVVYHSALPTKEKEEALERIESGDFDILITTSQFLSKRFE----E 198

Query: 273 LHRT--SYLVLDEAD----------RMLDM-GFE-------PQIRKIIGQIRPDRQVLMW 312
           L +    ++ +D+ D          R+L + GF         ++ K+  ++  +++    
Sbjct: 199 LSKLKFDFIFVDDVDAILKASKNVDRLLRLLGFSEEVIESAYELIKLRRKLYGEKRAERV 258

Query: 313 SATWPKEVQKLAEDFLVDYVQLNIGSL---NPTAN------------------------H 345
                +E  +  E       +  +G L   + T                           
Sbjct: 259 -----REELREVEREREKK-RRKLGILVVSSATGKPRGSRLKLFRELLGFEVGSGGEGLR 312

Query: 346 NIVQIVDVCQEHEKDYKLQGLLSQIGSERTSKTIIFVET---KRKADDITRSVRNKGWAA 402
           NI   VD+  E E   K+  L+ ++G       +IFV     + KA+++   +R+ G  A
Sbjct: 313 NI---VDIYVESESLEKVVELVKKLG----DGGLIFVPIDYGREKAEELAEYLRSHGINA 365

Query: 403 VAIHGNKSQQERDRVLNEFRIGRASILV 430
             IH      E++  L +F  G   +LV
Sbjct: 366 ELIH-----AEKEEALEDFEEGEVDVLV 388


>gnl|CDD|215598 PLN03138, PLN03138, Protein TOC75; Provisional.
          Length = 796

 Score = 31.0 bits (70), Expect = 2.9
 Identities = 17/50 (34%), Positives = 19/50 (38%)

Query: 5   SPDMFELSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGG 54
           +     L         S  A  GG   GGGG GG+G   G   GG GG  
Sbjct: 56  TSSSASLVANGAVALLSASAISGGGGGGGGGFGGFGGGGGGGGGGGGGWR 105


>gnl|CDD|178680 PLN03134, PLN03134, glycine-rich RNA-binding protein 4;
           Provisional.
          Length = 144

 Score = 29.6 bits (66), Expect = 3.1
 Identities = 18/30 (60%), Positives = 18/30 (60%), Gaps = 1/30 (3%)

Query: 26  RGGAPRGGGGRGGY-GAPRGYSRGGRGGGG 54
           R  APR  GG GGY G   GY  GG GGGG
Sbjct: 114 RPSAPRAYGGGGGYSGGGGGYGGGGDGGGG 143


>gnl|CDD|197548 smart00157, PRP, Major prion protein.  The prion protein is a major
           component of scrapie-associated fibrils in
           Creutzfeldt-Jakob disease, kuru, Gerstmann-Straussler
           syndrome and bovine spongiform encephalopathy.
          Length = 218

 Score = 30.2 bits (68), Expect = 3.2
 Identities = 32/115 (27%), Positives = 41/115 (35%), Gaps = 27/115 (23%)

Query: 20  YSGGAPRGGAPRGGGG----RGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPNWNSMSMEQ 75
           Y       G P GGG      GG+G P G   G   GGG   G  ++     WN  S  +
Sbjct: 28  YPPQGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQGGGTH---NQWNKPSKPK 84

Query: 76  INKNLYTPNHLTVSRSQGEVESYLNHHDVTVKGRYYLLRLESRPGKCVPRPIQHF 130
            N       H+  + + G V   L        G Y L       G  + RP+ HF
Sbjct: 85  TNM-----KHVAGAAAAGAVVGGL--------GGYML-------GSAMSRPLIHF 119


>gnl|CDD|224124 COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense
           mechanisms].
          Length = 733

 Score = 30.9 bits (70), Expect = 3.3
 Identities = 47/313 (15%), Positives = 97/313 (30%), Gaps = 70/313 (22%)

Query: 164 SGCDLVAIAKTGSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQI----ET 219
               +V  A TG GKT   +  A+  ++ +  L       V+ + P R + + +    + 
Sbjct: 213 RSLLVVLEAPTGYGKTEASLILALALLDEKIKL----KSRVIYVLPFRTIIEDMYRRAKE 268

Query: 220 VANDFGSATATRVACVFGGAPK----GPQVKAL------QTGAEIVIATPG-RLIDYL-- 266
           +   F          +   + +     P    L       +  ++++A      I  L  
Sbjct: 269 IFGLFSV----IGKSLHSSSKEPLLLEPDQDILLTLTTNDSYKKLLLALIVVTPIQILIF 324

Query: 267 -----EQGTINLHRTSYLVLDEADRMLDMGFEPQIRKIIG----QIRPDRQVLMWSATWP 317
                +   + L  TS ++LDE     D   E  +  ++            VL+ SAT P
Sbjct: 325 SVKGFKFEFLALLLTSLVILDEVHLYAD---ETMLAALLALLEALAEAGVPVLLMSATLP 381

Query: 318 KEVQKLAEDFLVDYVQLNIGSLNPTANHNIVQIVDVCQEHEKDYKL-----QGLLSQIGS 372
             +    ++ L   +               V         E +  L       +      
Sbjct: 382 PFL----KEKLKKALG----------KGREVVENAKFCPKEDEPGLKRKERVDVEDGPQE 427

Query: 373 ERTSKTIIFVE----------TKRKADDITRSVRNKGWAAVAIHGNKSQQERDRVLNEFR 422
           E        V+          T  +A ++   ++ KG   + +H   + ++R+    E +
Sbjct: 428 ELIELISEEVKEGKKVLVIVNTVDRAIELYEKLKEKGPKVLLLHSRFTLKDREEKERELK 487

Query: 423 ----IGRASILVS 431
                    I+V+
Sbjct: 488 KLFKQNEGFIVVA 500


>gnl|CDD|218811 pfam05918, API5, Apoptosis inhibitory protein 5 (API5).  This
           family consists of apoptosis inhibitory protein 5 (API5)
           sequences from several organisms. Apoptosis or
           programmed cell death is a physiological form of cell
           death that occurs in embryonic development and organ
           formation. It is characterized by biochemical and
           morphological changes such as DNA fragmentation and cell
           volume shrinkage. API5 is an anti apoptosis gene located
           in human chromosome 11, whose expression prevents the
           programmed cell death that occurs upon the deprivation
           of growth factors.
          Length = 543

 Score = 30.4 bits (68), Expect = 3.7
 Identities = 13/36 (36%), Positives = 13/36 (36%), Gaps = 1/36 (2%)

Query: 25  PRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSN 60
             G      G   G G  R   RG  GGGG   G N
Sbjct: 505 KSGEGISKVGQSYG-GRGRTRGRGRGGGGGRGRGYN 539



 Score = 30.0 bits (67), Expect = 6.2
 Identities = 16/44 (36%), Positives = 18/44 (40%), Gaps = 5/44 (11%)

Query: 5   SPDMFELSRVYGRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRG 48
                E     G+     G  RG    GGGGRG     RGY+RG
Sbjct: 503 VNKSGEGISKVGQSYGGRGRTRGRGRGGGGGRG-----RGYNRG 541


>gnl|CDD|222005 pfam13245, AAA_19, Part of AAA domain. 
          Length = 73

 Score = 28.1 bits (63), Expect = 3.9
 Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 8/56 (14%)

Query: 175 GSGKTLGYIAPAIVHVNSQRPLRSGEGPIVLVLAPTRELAQQIETVANDFGSATAT 230
           G+GKT    A  I  +     L +G    VLV+APT   A+++         A A+
Sbjct: 20  GTGKT-ATAAAIIARL-----LAAGRS--VLVVAPTGRAARRLRERLAALDLADAS 67


>gnl|CDD|216868 pfam02084, Bindin, Bindin. 
          Length = 239

 Score = 29.9 bits (67), Expect = 4.1
 Identities = 16/47 (34%), Positives = 17/47 (36%)

Query: 20 YSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTP 66
          Y+     G    GGG   G GAP G   GG GG G   G        
Sbjct: 31 YANQGMGGPVGGGGGPGAGGGAPGGPVGGGGGGSGGPPGGGEVAGEA 77



 Score = 29.9 bits (67), Expect = 4.9
 Identities = 16/39 (41%), Positives = 16/39 (41%), Gaps = 1/39 (2%)

Query: 22  GGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSN 60
           GG   GG   G GG GG G        G  GGG  YG  
Sbjct: 186 GGGGYGGGGGGMGG-GGGGGMGEMGGMGPQGGGGGYGGM 223



 Score = 29.1 bits (65), Expect = 7.2
 Identities = 14/39 (35%), Positives = 15/39 (38%), Gaps = 2/39 (5%)

Query: 22  GGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSN 60
           GG   G    GG G  G G   GY   G  G G+ Y   
Sbjct: 199 GGGGGGMGEMGGMGPQGGGG--GYGGMGAPGQGNAYNQG 235


>gnl|CDD|240255 PTZ00070, PTZ00070, 40S ribosomal protein S2; Provisional.
          Length = 257

 Score = 30.0 bits (68), Expect = 4.5
 Identities = 13/32 (40%), Positives = 13/32 (40%)

Query: 24 APRGGAPRGGGGRGGYGAPRGYSRGGRGGGGS 55
              G   G  GRGG G  RG  RGG G    
Sbjct: 5  RGGEGRGFGRRGRGGRGRGRGRGRGGEGEEKE 36


>gnl|CDD|226927 COG4555, NatA, ABC-type Na+ transport system, ATPase component
           [Energy production and conversion / Inorganic ion
           transport and metabolism].
          Length = 245

 Score = 29.7 bits (67), Expect = 4.9
 Identities = 16/57 (28%), Positives = 31/57 (54%)

Query: 273 LHRTSYLVLDEADRMLDMGFEPQIRKIIGQIRPDRQVLMWSATWPKEVQKLAEDFLV 329
           +H  S LVLDE    LD+    +    I Q++ + + +++S+   +EV+ L +  +V
Sbjct: 149 VHDPSILVLDEPTSGLDIRTRRKFHDFIKQLKNEGRAVIFSSHIMQEVEALCDRVIV 205


>gnl|CDD|233063 TIGR00631, uvrb, excinuclease ABC, B subunit.  All proteins in this
           family for wich functions are known are DNA helicases
           that function in the nucleotide excision repair and are
           endonucleases that make the 3' incision next to DNA
           damage. They are part of a pathway requiring UvrA, UvrB,
           UvrC, and UvrD homologs. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 655

 Score = 30.0 bits (68), Expect = 5.3
 Identities = 19/74 (25%), Positives = 36/74 (48%), Gaps = 2/74 (2%)

Query: 359 KDYKLQGLLSQIG--SERTSKTIIFVETKRKADDITRSVRNKGWAAVAIHGNKSQQERDR 416
            D ++  LLS+I     R  + ++   TK+ A+D+T  ++  G     +H      ER  
Sbjct: 424 TDGQVDDLLSEIRQRVARNERVLVTTLTKKMAEDLTDYLKELGIKVRYLHSEIDTLERVE 483

Query: 417 VLNEFRIGRASILV 430
           ++ + R+G   +LV
Sbjct: 484 IIRDLRLGEFDVLV 497


>gnl|CDD|239467 cd03370, NADH_oxidase, NADPH_oxidase. Nitroreductase family
           containing NADH oxidase and other, uncharacterized
           proteins that are similar to nitroreductase.
           Nitroreductase catalyzes the reduction of nitroaromatic
           compounds such as nitrotoluenes, nitrofurans and
           nitroimidazoles. This process requires NAD(P)H as
           electron donor in an obligatory two-electron transfer
           and uses FMN as cofactor.  The enzyme is typically a
           homodimer. Members of this family are also called NADH
           dehydrogenase, oxygen-insensitive NAD(P)H nitrogenase or
           dihydropteridine reductase.
          Length = 156

 Score = 28.9 bits (65), Expect = 5.6
 Identities = 11/39 (28%), Positives = 21/39 (53%)

Query: 125 RPIQHFEECNFPPYIMKKIYEMGFQAPTAIQAQGWPIAL 163
           R I+ F+    P  +++++ E   +AP+A   Q W I +
Sbjct: 9   RSIRQFDTDPIPEDLLRRLLEAALRAPSAFNLQPWRIVV 47


>gnl|CDD|225203 COG2321, COG2321, Predicted metalloprotease [General function
          prediction only].
          Length = 295

 Score = 29.7 bits (67), Expect = 5.8
 Identities = 14/39 (35%), Positives = 15/39 (38%)

Query: 3  MRSPDMFELSRVYGRPSYSGGAPRGGAPRGGGGRGGYGA 41
          MR     E   V  R   S G+ RGG P    G G  G 
Sbjct: 1  MRWRGRRESDNVEDRRGRSSGSVRGGGPGLRLGGGIGGL 39


>gnl|CDD|224271 COG1352, CheR, Methylase of chemotaxis methyl-accepting proteins
           [Cell motility and secretion / Signal transduction
           mechanisms].
          Length = 268

 Score = 29.6 bits (67), Expect = 6.1
 Identities = 10/58 (17%), Positives = 17/58 (29%), Gaps = 2/58 (3%)

Query: 446 EFRIGRASILVSQYKESQQKR--DRVLNEFRIGRASILVSHYNKSQQERDRVLNEFRI 501
                R  I    YK +   R   R L +  +      ++      +E    L+   I
Sbjct: 9   ALIYQRTGIDFDNYKRTLVYRRLSRRLRKLGLKNFEEYLNLLESDSEELQAFLDALTI 66


>gnl|CDD|227493 COG5164, SPT5, Transcription elongation factor [Transcription].
          Length = 607

 Score = 30.0 bits (67), Expect = 6.4
 Identities = 15/55 (27%), Positives = 22/55 (40%), Gaps = 1/55 (1%)

Query: 18  PSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNLRTPNWNSMS 72
           P+Y+ G    GA     G  GY  P G+  G +    ++ G +      NW   S
Sbjct: 434 PNYATGGIAAGAAATSSGLSGYRTP-GWKDGPKTPALNDPGIDWGDEKDNWYKGS 487



 Score = 29.6 bits (66), Expect = 7.8
 Identities = 12/46 (26%), Positives = 15/46 (32%), Gaps = 1/46 (2%)

Query: 26  RGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS-NSNLRTPNWNS 70
           RG +               Y+ GG   G +   S  S  RTP W  
Sbjct: 417 RGLSKPLRYSEAIGSKTPNYATGGIAAGAAATSSGLSGYRTPGWKD 462


>gnl|CDD|237541 PRK13881, PRK13881, conjugal transfer protein TrbI; Provisional.
          Length = 472

 Score = 29.7 bits (67), Expect = 7.3
 Identities = 14/40 (35%), Positives = 17/40 (42%)

Query: 23  GAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSN 62
           G   G A   GGG G  GAP+       GGG ++Y     
Sbjct: 218 GVGGGMAAPAGGGMGAGGAPQLLQTSTSGGGRNSYAQFGG 257


>gnl|CDD|147458 pfam05268, GP38, Phage tail fibre adhesin Gp38.  This family
           contains several Gp38 proteins from T-even-like phages.
           Gp38, together with a second phage protein, gp57,
           catalyzes the organisation of gp37 but is absent from
           the phage particle. Gp37 is responsible for receptor
           recognition.
          Length = 261

 Score = 29.4 bits (66), Expect = 7.4
 Identities = 17/44 (38%), Positives = 19/44 (43%), Gaps = 3/44 (6%)

Query: 16  GRPSYSGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGS 59
           G     GG   GG P G GG+ G     G +     GGGS  GS
Sbjct: 169 GNGVCGGG---GGRPFGAGGKSGSHMSGGNASLTAPGGGSGTGS 209


>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein.  Taxilin
           contains an extraordinarily long coiled-coil domain in
           its C-terminal half and is ubiquitously expressed. It is
           a novel binding partner of several syntaxin family
           members and is possibly involved in Ca2+-dependent
           exocytosis in neuroendocrine cells. Gamma-taxilin,
           described as leucine zipper protein Factor Inhibiting
           ATF4-mediated Transcription (FIAT), localises to the
           nucleus in osteoblasts and dimerises with ATF4 to form
           inactive dimers, thus inhibiting ATF4-mediated
           transcription.
          Length = 309

 Score = 29.3 bits (66), Expect = 7.6
 Identities = 18/96 (18%), Positives = 39/96 (40%), Gaps = 20/96 (20%)

Query: 406 HGNKSQQERDRVLNEFRIGRASILVSQYNKSQQERDRVLNEFRIGRASILVSQYKESQQK 465
           H N + +  +   NE    +   L+ QY   +Q  +++L               K+ +Q+
Sbjct: 126 HSNPNHKLCE--ENEELREKLKELIEQYELREQHFEKLLKT-------------KDLEQQ 170

Query: 466 RDRVLNEFRIGRASILVSH-YNKSQQERDRVLNEFR 500
               LN+ ++ +A         K ++E++ +L E  
Sbjct: 171 ----LNDAKLEQAQEQAEQEQEKHKREKEILLKEAA 202


>gnl|CDD|237537 PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional.
          Length = 440

 Score = 29.5 bits (67), Expect = 8.2
 Identities = 14/34 (41%), Positives = 15/34 (44%), Gaps = 6/34 (17%)

Query: 21  SGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGG 54
           +  A RGGA   G      GA   YS G  GG G
Sbjct: 306 AAAAARGGAAAAG------GASSAYSAGAAGGSG 333


>gnl|CDD|224121 COG1200, RecG, RecG-like helicase [DNA replication, recombination,
           and repair / Transcription].
          Length = 677

 Score = 29.5 bits (67), Expect = 8.4
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 405 IHGNKSQQERDRVLNEFRIGRASILVS 431
           +HG     E+D V+  F+ G   ILV+
Sbjct: 513 VHGRMKPAEKDAVMEAFKEGEIDILVA 539


>gnl|CDD|182211 PRK10051, csgA, major curlin subunit; Provisional.
          Length = 151

 Score = 28.3 bits (63), Expect = 8.4
 Identities = 18/43 (41%), Positives = 20/43 (46%), Gaps = 12/43 (27%)

Query: 21 SGGAPRGGAPRGGGGRGGYGAPRGYSRGGRGGGGSNYGSNSNL 63
          SG A  G  P+ GGG            G  GGGG+N G NS L
Sbjct: 15 SGSALAGVVPQYGGG------------GNHGGGGNNSGPNSTL 45


>gnl|CDD|237655 PRK14279, PRK14279, chaperone protein DnaJ; Provisional.
          Length = 392

 Score = 29.3 bits (66), Expect = 9.6
 Identities = 17/47 (36%), Positives = 21/47 (44%), Gaps = 9/47 (19%)

Query: 17  RPSYSGGAPRGGAPRGGGGRGGYGAPRG---------YSRGGRGGGG 54
           R  ++GG   G    GGGG GG+G             +   GRGGGG
Sbjct: 75  RRLFAGGGFGGRRFDGGGGFGGFGTGGDGAEFNLNDLFDAAGRGGGG 121


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.135    0.396 

Gapped
Lambda     K      H
   0.267   0.0781    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 39,207,899
Number of extensions: 3860166
Number of successful extensions: 5013
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4674
Number of HSP's successfully gapped: 242
Length of query: 779
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 674
Effective length of database: 6,280,432
Effective search space: 4233011168
Effective search space used: 4233011168
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (27.9 bits)