RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8497
         (2355 letters)



>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
           replication, recombination, and repair / Transcription /
           Translation, ribosomal structure and biogenesis].
          Length = 513

 Score =  311 bits (799), Expect = 5e-93
 Identities = 142/507 (28%), Positives = 235/507 (46%), Gaps = 37/507 (7%)

Query: 28  IEKPIRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLP 87
           +   + Q L++ GF + +PIQ+AA P +L G D + Q+++GTGK+  F++  L K++K  
Sbjct: 36  LSPELLQALKDLGFEEPTPIQLAAIPLILAGRDVLGQAQTGTGKTAAFLLPLLQKILKSV 95

Query: 88  -VQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGT----QVERPKRPVQIVV 142
             +    +ILAPTRE+AVQI + +R +G ++  L V    GG     Q+E  KR V IVV
Sbjct: 96  ERKYVSALILAPTRELAVQIAEELRKLGKNLGGLRVAVVYGGVSIRKQIEALKRGVDIVV 155

Query: 143 GSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSATY 202
            +PGR+  +IK   L++  V   ++DEAD++++ GF++DI  I   LPP +Q L+ SAT 
Sbjct: 156 ATPGRLLDLIKRGKLDLSGVETLVLDEADRMLDMGFIDDIEKILKALPPDRQTLLFSATM 215

Query: 203 SHDNLATLQKYMVDPLLIR--PEDATRPLLGVKQLVALIPECKNPSLRYVDEEKKLVQLL 260
             D     ++Y+ DP+ I    E   R L  +KQ      E ++   +     + L++LL
Sbjct: 216 PDDIRELARRYLNDPVEIEVSVEKLERTLKKIKQFYL---EVESEEEK----LELLLKLL 268

Query: 261 SQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRILV 320
                 + ++F   +   E + E+L    F    L G   Q+ R  +L+ FK G++R+LV
Sbjct: 269 KDEDEGRVIVFVRTKRLVEELAESLRKRGFKVAALHGDLPQEERDRALEKFKDGELRVLV 328

Query: 321 TTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIVSA-ESLVKFHS 379
            TD+AARG+D  +V  VIN ++P D   Y+HR+GR GR G +G+ I+ V+  E + K   
Sbjct: 329 ATDVAARGLDIPDVSHVINYDLPLDPEDYVHRIGRTGRAGRKGVAISFVTEEEEVKKLKR 388

Query: 380 LMGEINLDHAFNVGLVPDNLTGDQINWTQRVQTLLAKPLDQAQEREDVEQTAEESSKVEN 439
           +                      ++     +     +P D    +       EES   + 
Sbjct: 389 IE----------------KRLERKLPSAVLLPLD--EPEDAKLLKTTRPGLEEESDISDE 430

Query: 440 TKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDSENDGKPDENGSQRGSKRSKEFKT 499
            K L+ +    +   G     +K     G+    +       P+       S        
Sbjct: 431 IKKLKSSKKALLRGLGVRFTLSKLLANLGKEIPGAGDAVTIDPELERRSPNSAD----DI 486

Query: 500 QEKCVKGAGEDENGDRKASKDEQKIDG 526
           +      +   E    K      K+  
Sbjct: 487 EYILKGLSYRAEERTAKNEAANIKLQS 513


>gnl|CDD|185609 PTZ00424, PTZ00424, helicase 45; Provisional.
          Length = 401

 Score =  235 bits (601), Expect = 7e-68
 Identities = 136/384 (35%), Positives = 215/384 (55%), Gaps = 27/384 (7%)

Query: 9   ARTKDVLSNIDVL---FENDVLIEKPIRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQS 65
           A T  + SN D +   F+   L E  +R G+   GF K S IQ      +L+GYD I Q+
Sbjct: 14  ASTGTIESNYDEIVDSFDALKLNEDLLR-GIYSYGFEKPSAIQQRGIKPILDGYDTIGQA 72

Query: 66  KSGTGKSIVFVVAALNKLIKLPVQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYF 125
           +SGTGK+  FV+AAL +LI   +   Q +ILAPTRE+A QI  VV ++G ++K +     
Sbjct: 73  QSGTGKTATFVIAAL-QLIDYDLNACQALILAPTRELAQQIQKVVLALGDYLK-VRCHAC 130

Query: 126 IGGTQV----ERPKRPVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVED 181
           +GGT V     + K  V +VVG+PGR+  MI  ++L +D ++LFI+DEAD++++ GF   
Sbjct: 131 VGGTVVRDDINKLKAGVHMVVGTPGRVYDMIDKRHLRVDDLKLFILDEADEMLSRGFKGQ 190

Query: 182 ITWIYSQLPPMKQMLVVSATYSHDNLATLQKYMVDP--LLIRPEDATRPLLGVKQLVALI 239
           I  ++ +LPP  Q+ + SAT  ++ L    K+M DP  +L++ ++ T  L G++Q    +
Sbjct: 191 IYDVFKKLPPDVQVALFSATMPNEILELTTKFMRDPKRILVKKDELT--LEGIRQFYVAV 248

Query: 240 PECKNPSLRYVDEEKK---LVQLLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLS 296
            +          EE K   L  L       Q +I+ N + + + + + +   +F    + 
Sbjct: 249 EK----------EEWKFDTLCDLYETLTITQAIIYCNTRRKVDYLTKKMHERDFTVSCMH 298

Query: 297 GAQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRA 356
           G  DQK R   +  F+ G  R+L+TTDL ARGID   V LVIN ++P     Y+HR+GR+
Sbjct: 299 GDMDQKDRDLIMREFRSGSTRVLITTDLLARGIDVQQVSLVINYDLPASPENYIHRIGRS 358

Query: 357 GRYGTRGLVITIVSAESLVKFHSL 380
           GR+G +G+ I  V+ + + +   +
Sbjct: 359 GRFGRKGVAINFVTPDDIEQLKEI 382


>gnl|CDD|238167 cd00268, DEADc, DEAD-box helicases. A diverse family of proteins
           involved in ATP-dependent RNA unwinding, needed in a
           variety of cellular processes including splicing,
           ribosome biogenesis and RNA degradation. The name
           derives from the sequence of the Walker  B motif (motif
           II). This domain contains the ATP- binding region.
          Length = 203

 Score =  205 bits (525), Expect = 3e-60
 Identities = 75/205 (36%), Positives = 121/205 (59%), Gaps = 7/205 (3%)

Query: 22  FENDVLIEKPIRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALN 81
           FE ++ +   + +G+   GF K +PIQ  A P +L+G D I Q+++G+GK+  F++  L 
Sbjct: 1   FE-ELGLSPELLRGIYALGFEKPTPIQARAIPPLLSGRDVIGQAQTGSGKTAAFLIPILE 59

Query: 82  KLIKLP-VQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGT----QVERPKR 136
           KL   P    PQ +ILAPTRE+A+QI +V R +G H  NL V    GGT    Q+ + KR
Sbjct: 60  KLDPSPKKDGPQALILAPTRELALQIAEVARKLGKH-TNLKVVVIYGGTSIDKQIRKLKR 118

Query: 137 PVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQML 196
              IVV +PGR+  +++   L++  V+  ++DEAD++++ GF + I  I   LP  +Q L
Sbjct: 119 GPHIVVATPGRLLDLLERGKLDLSKVKYLVLDEADRMLDMGFEDQIREILKLLPKDRQTL 178

Query: 197 VVSATYSHDNLATLQKYMVDPLLIR 221
           + SAT   +     +K++ +P+ I 
Sbjct: 179 LFSATMPKEVRDLARKFLRNPVRIL 203


>gnl|CDD|236977 PRK11776, PRK11776, ATP-dependent RNA helicase DbpA; Provisional.
          Length = 460

 Score =  210 bits (537), Expect = 1e-58
 Identities = 110/356 (30%), Positives = 183/356 (51%), Gaps = 40/356 (11%)

Query: 34  QGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKL-IKLPVQQPQ 92
             L E G+ + +PIQ  + P +L G D I Q+K+G+GK+  F +  L KL +K    + Q
Sbjct: 17  ANLNELGYTEMTPIQAQSLPAILAGKDVIAQAKTGSGKTAAFGLGLLQKLDVKRF--RVQ 74

Query: 93  TIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGTQVERPKRP--------VQIVVGS 144
            ++L PTRE+A Q+   +R +   + N+ V    GG     P  P          I+VG+
Sbjct: 75  ALVLCPTRELADQVAKEIRRLARFIPNIKVLTLCGGV----PMGPQIDSLEHGAHIIVGT 130

Query: 145 PGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSATYSH 204
           PGRI   ++   L++D++   ++DEAD++++ GF + I  I  Q P  +Q L+ SATY  
Sbjct: 131 PGRILDHLRKGTLDLDALNTLVLDEADRMLDMGFQDAIDAIIRQAPARRQTLLFSATYP- 189

Query: 205 DNLATL-QKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLR---Y-VDEEKK---L 256
           + +A + Q++  DP+ ++ E                     P++    Y V  +++   L
Sbjct: 190 EGIAAISQRFQRDPVEVKVESTH----------------DLPAIEQRFYEVSPDERLPAL 233

Query: 257 VQLLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKV 316
            +LL       CV+F N +   + + + L    F A  L G  +Q+ R   L  F     
Sbjct: 234 QRLLLHHQPESCVVFCNTKKECQEVADALNAQGFSALALHGDLEQRDRDQVLVRFANRSC 293

Query: 317 RILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIVSAE 372
            +LV TD+AARG+D   ++ VIN E+  D   ++HR+GR GR G++GL +++V+ E
Sbjct: 294 SVLVATDVAARGLDIKALEAVINYELARDPEVHVHRIGRTGRAGSKGLALSLVAPE 349


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score =  197 bits (501), Expect = 6e-54
 Identities = 107/352 (30%), Positives = 189/352 (53%), Gaps = 22/352 (6%)

Query: 32  IRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQ-- 89
           +   + + GF   +PIQ     Y L G+DAI ++++GTGK+  F+++ +N+L++ P    
Sbjct: 98  LMHAIHDLGFPYCTPIQAQVLGYTLAGHDAIGRAQTGTGKTAAFLISIINQLLQTPPPKE 157

Query: 90  ----QPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGTQVERPKRPVQ-----I 140
               +P+ +I+APTRE+ VQI     ++  +   L+V  F+GG   ++  + ++     I
Sbjct: 158 RYMGEPRALIIAPTRELVVQIAKDAAALTKYT-GLNVMTFVGGMDFDKQLKQLEARFCDI 216

Query: 141 VVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPM--KQMLVV 198
           +V +PGR+    +   +++D V + ++DEAD++++ GF+  +  I  Q P    +Q L+ 
Sbjct: 217 LVATPGRLLDFNQRGEVHLDMVEVMVLDEADRMLDMGFIPQVRQIIRQTPRKEERQTLLF 276

Query: 199 SATYSHDNLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEKKLVQ 258
           SAT++ D +   +++  DP ++  E        V+Q V  +           D+ K L  
Sbjct: 277 SATFTDDVMNLAKQWTTDPAIVEIEPENVASDTVEQHVYAVAGS--------DKYKLLYN 328

Query: 259 LLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRI 318
           L++Q P+ + ++F+N +     I E L      A  LSG   Q  R+ +L+ F+ GK+R+
Sbjct: 329 LVTQNPWERVMVFANRKDEVRRIEERLVKDGINAAQLSGDVPQHKRIKTLEGFREGKIRV 388

Query: 319 LVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIVS 370
           LV TD+A RGI    +  VIN  +P D   Y+HR+GR GR G  G+ I+   
Sbjct: 389 LVATDVAGRGIHIDGISHVINFTLPEDPDDYVHRIGRTGRAGASGVSISFAG 440


>gnl|CDD|235314 PRK04837, PRK04837, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 423

 Score =  185 bits (471), Expect = 2e-50
 Identities = 121/361 (33%), Positives = 196/361 (54%), Gaps = 33/361 (9%)

Query: 25  DVLIEKPIRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLI 84
           D  +   + + L++ GF   +PIQ  A P  L G D   Q+++GTGK++ F+ A  + L+
Sbjct: 12  DFALHPQVVEALEKKGFHNCTPIQALALPLTLAGRDVAGQAQTGTGKTMAFLTATFHYLL 71

Query: 85  KLP------VQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGG----TQVERP 134
             P      V QP+ +I+APTRE+AVQI      +      L +    GG     Q++  
Sbjct: 72  SHPAPEDRKVNQPRALIMAPTRELAVQIHADAEPLAQAT-GLKLGLAYGGDGYDKQLKVL 130

Query: 135 KRPVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQ 194
           +  V I++G+ GR+    K  ++N+ ++++ ++DEAD++ + GF++DI W++ ++PP  Q
Sbjct: 131 ESGVDILIGTTGRLIDYAKQNHINLGAIQVVVLDEADRMFDLGFIKDIRWLFRRMPPANQ 190

Query: 195 ML--VVSATYSH--DNLATLQKYMVDP--LLIRPEDATRPLLGVKQLVALIPECKNPSLR 248
            L  + SAT S+    LA   ++M +P  + + PE  T     +K+      E   PS  
Sbjct: 191 RLNMLFSATLSYRVRELAF--EHMNNPEYVEVEPEQKTGHR--IKE------ELFYPS-- 238

Query: 249 YVDEEK-KLVQ-LLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLA 306
             +EEK +L+Q L+ +   ++ +IF+N + R E I  +LA        L+G   QK RL 
Sbjct: 239 --NEEKMRLLQTLIEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLR 296

Query: 307 SLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVI 366
            L+ F RG + ILV TD+AARG+    V  V N ++P D   Y+HR+GR GR G  G  I
Sbjct: 297 ILEEFTRGDLDILVATDVAARGLHIPAVTHVFNYDLPDDCEDYVHRIGRTGRAGASGHSI 356

Query: 367 T 367
           +
Sbjct: 357 S 357


>gnl|CDD|236941 PRK11634, PRK11634, ATP-dependent RNA helicase DeaD; Provisional.
          Length = 629

 Score =  187 bits (476), Expect = 2e-49
 Identities = 107/351 (30%), Positives = 192/351 (54%), Gaps = 17/351 (4%)

Query: 25  DVLIEKPIRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLI 84
           D+ ++ PI + L + G+ K SPIQ    P++LNG D +  +++G+GK+  F +  L+  +
Sbjct: 10  DLGLKAPILEALNDLGYEKPSPIQAECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLHN-L 68

Query: 85  KLPVQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGG----TQVERPKRPVQI 140
              ++ PQ ++LAPTRE+AVQ+ + +     H++ ++V    GG     Q+   ++  QI
Sbjct: 69  DPELKAPQILVLAPTRELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQI 128

Query: 141 VVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSA 200
           VVG+PGR+   +K   L++  +   ++DEAD+++  GF+ED+  I +Q+P   Q  + SA
Sbjct: 129 VVGTPGRLLDHLKRGTLDLSKLSGLVLDEADEMLRMGFIEDVETIMAQIPEGHQTALFSA 188

Query: 201 TYSHDNLATLQKYMVDPLLIRPEDA--TRPLLGVKQLVALIPECKNPSLRYVDEEKKLVQ 258
           T         +++M +P  +R + +  TRP +               ++  + + + LV+
Sbjct: 189 TMPEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYW----------TVWGMRKNEALVR 238

Query: 259 LLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRI 318
            L    F+  +IF   +     + E L    + +  L+G  +Q  R  +L+  K G++ I
Sbjct: 239 FLEAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQALREQTLERLKDGRLDI 298

Query: 319 LVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIV 369
           L+ TD+AARG+D   + LV+N +IP D+ +Y+HR+GR GR G  G  +  V
Sbjct: 299 LIATDVAARGLDVERISLVVNYDIPMDSESYVHRIGRTGRAGRAGRALLFV 349


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score =  177 bits (450), Expect = 2e-47
 Identities = 121/369 (32%), Positives = 180/369 (48%), Gaps = 44/369 (11%)

Query: 36  LQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQ---QPQ 92
           LQ+ G+ + + IQ  A P  L+G D +  + +GTGK+  F++ AL  L+  P +    P+
Sbjct: 16  LQDKGYTRPTAIQAEAIPPALDGRDVLGSAPTGTGKTAAFLLPALQHLLDFPRRKSGPPR 75

Query: 93  TIILAPTREIAVQITDVVRSVGAHVKNLHVDYFI--GG----TQVERPKRPVQIVVGSPG 146
            +IL PTRE+A+Q+ D  R +  H    H+D     GG       E       IVV +PG
Sbjct: 76  ILILTPTRELAMQVADQARELAKH---THLDIATITGGVAYMNHAEVFSENQDIVVATPG 132

Query: 147 RIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSAT----- 201
           R+ Q IK +  +  +V   I+DEAD++++ GF +DI  I ++    KQ L+ SAT     
Sbjct: 133 RLLQYIKEENFDCRAVETLILDEADRMLDMGFAQDIETIAAETRWRKQTLLFSATLEGDA 192

Query: 202 ---YSHDNLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEKK--- 255
              ++   L        DP+ +  E + R    + Q             R  D E K   
Sbjct: 193 VQDFAERLLN-------DPVEVEAEPSRRERKKIHQWY----------YRADDLEHKTAL 235

Query: 256 LVQLLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGK 315
           L  LL Q    + ++F   + R   +   L  A     YL G   Q  R  ++     G+
Sbjct: 236 LCHLLKQPEVTRSIVFVRTRERVHELAGWLRKAGINCCYLEGEMVQAKRNEAIKRLTDGR 295

Query: 316 VRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIVSAESLV 375
           V +LV TD+AARGID  +V  VIN ++P  A TYLHR+GR GR G +G  I++V A    
Sbjct: 296 VNVLVATDVAARGIDIDDVSHVINFDMPRSADTYLHRIGRTGRAGRKGTAISLVEAHD-- 353

Query: 376 KFHSLMGEI 384
             H L+G+I
Sbjct: 354 --HLLLGKI 360


>gnl|CDD|235307 PRK04537, PRK04537, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 572

 Score =  176 bits (447), Expect = 5e-46
 Identities = 115/348 (33%), Positives = 189/348 (54%), Gaps = 26/348 (7%)

Query: 35  GLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPV------ 88
           GL+  GF + +PIQ    P  L G D   Q+++GTGK++ F+VA +N+L+  P       
Sbjct: 23  GLESAGFTRCTPIQALTLPVALPGGDVAGQAQTGTGKTLAFLVAVMNRLLSRPALADRKP 82

Query: 89  QQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGT----QVERPKRPVQIVVGS 144
           + P+ +ILAPTRE+A+QI       GA +  L      GG     Q E  ++ V +++ +
Sbjct: 83  EDPRALILAPTRELAIQIHKDAVKFGADL-GLRFALVYGGVDYDKQRELLQQGVDVIIAT 141

Query: 145 PGRIKQMIKL-KYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPP--MKQMLVVSAT 201
           PGR+   +K  K +++ +  + ++DEAD++ + GF++DI ++  ++P    +Q L+ SAT
Sbjct: 142 PGRLIDYVKQHKVVSLHACEICVLDEADRMFDLGFIKDIRFLLRRMPERGTRQTLLFSAT 201

Query: 202 YSHDNLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEKK--LVQL 259
            SH  L    ++M +P  +  E  T     V+Q +        P+    DEEK+  L+ L
Sbjct: 202 LSHRVLELAYEHMNEPEKLVVETETITAARVRQRIYF------PA----DEEKQTLLLGL 251

Query: 260 LSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRIL 319
           LS++   + ++F N +   E +   L    +    LSG   QK R + L+ F++G++ IL
Sbjct: 252 LSRSEGARTMVFVNTKAFVERVARTLERHGYRVGVLSGDVPQKKRESLLNRFQKGQLEIL 311

Query: 320 VTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVIT 367
           V TD+AARG+    V  V N ++P DA  Y+HR+GR  R G  G  I+
Sbjct: 312 VATDVAARGLHIDGVKYVYNYDLPFDAEDYVHRIGRTARLGEEGDAIS 359


>gnl|CDD|236722 PRK10590, PRK10590, ATP-dependent RNA helicase RhlE; Provisional.
          Length = 456

 Score =  169 bits (429), Expect = 1e-44
 Identities = 112/365 (30%), Positives = 190/365 (52%), Gaps = 28/365 (7%)

Query: 32  IRQGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQ-- 89
           I + + E G+ + +PIQ  A P VL G D +  +++GTGK+  F +  L  LI       
Sbjct: 12  ILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTGKTAGFTLPLLQHLITRQPHAK 71

Query: 90  --QP-QTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGT----QVERPKRPVQIVV 142
             +P + +IL PTRE+A QI + VR    ++ N+      GG     Q+ + +  V ++V
Sbjct: 72  GRRPVRALILTPTRELAAQIGENVRDYSKYL-NIRSLVVFGGVSINPQMMKLRGGVDVLV 130

Query: 143 GSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSATY 202
            +PGR+  +     + +D V + ++DEAD++++ GF+ DI  + ++LP  +Q L+ SAT+
Sbjct: 131 ATPGRLLDLEHQNAVKLDQVEILVLDEADRMLDMGFIHDIRRVLAKLPAKRQNLLFSATF 190

Query: 203 SHDNLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEKK---LVQL 259
           S D  A  +K + +PL I           V Q V            +VD+++K   L Q+
Sbjct: 191 SDDIKALAEKLLHNPLEIEVARRNTASEQVTQHV-----------HFVDKKRKRELLSQM 239

Query: 260 LSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRIL 319
           + +  + Q ++F+  +     + E L      +  + G + Q AR  +L  FK G +R+L
Sbjct: 240 IGKGNWQQVLVFTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSGDIRVL 299

Query: 320 VTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIVSAESLVKFHS 379
           V TD+AARG+D   +  V+N E+P+    Y+HR+GR GR    G  +++V     V  H 
Sbjct: 300 VATDIAARGLDIEELPHVVNYELPNVPEDYVHRIGRTGRAAATGEALSLVC----VDEHK 355

Query: 380 LMGEI 384
           L+ +I
Sbjct: 356 LLRDI 360


>gnl|CDD|215832 pfam00270, DEAD, DEAD/DEAH box helicase.  Members of this family
           include the DEAD and DEAH box helicases. Helicases are
           involved in unwinding nucleic acids. The DEAD box
           helicases are involved in various aspects of RNA
           metabolism, including nuclear transcription, pre mRNA
           splicing, ribosome biogenesis, nucleocytoplasmic
           transport, translation, RNA decay and organellar gene
           expression.
          Length = 169

 Score =  155 bits (393), Expect = 4e-43
 Identities = 58/167 (34%), Positives = 101/167 (60%), Gaps = 8/167 (4%)

Query: 45  SPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTIILAPTREIAV 104
           +PIQ  A P +L+G D +VQ+ +G+GK++ F++  L  L+      PQ ++LAPTRE+A 
Sbjct: 1   TPIQAQAIPAILSGKDVLVQAPTGSGKTLAFLLPILQALLPKK-GGPQALVLAPTRELAE 59

Query: 105 QITDVVRSVGAHVKNLHVDYFIGGTQVER-----PKRPVQIVVGSPGRIKQMIKLKYLNM 159
           QI + ++ +      L V    GGT ++       K    I+VG+PGR+  +++   L +
Sbjct: 60  QIYEELKKLFKI-LGLRVALLTGGTSLKEQARKLKKGKADILVGTPGRLLDLLRRGKLKL 118

Query: 160 DS-VRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSATYSHD 205
              ++L ++DEA +L++ GF +D+  I S+LPP +Q+L++SAT   +
Sbjct: 119 LKNLKLLVLDEAHRLLDMGFGDDLEEILSRLPPDRQILLLSATLPRN 165


>gnl|CDD|214692 smart00487, DEXDc, DEAD-like helicases superfamily. 
          Length = 201

 Score =  150 bits (381), Expect = 5e-41
 Identities = 57/205 (27%), Positives = 105/205 (51%), Gaps = 11/205 (5%)

Query: 36  LQENGFVKESPIQMAAFPYVLNGY-DAIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTI 94
           +++ GF    P Q  A   +L+G  D I+ + +G+GK++  ++ AL  L +      + +
Sbjct: 1   IEKFGFEPLRPYQKEAIEALLSGLRDVILAAPTGSGKTLAALLPALEALKRGK--GGRVL 58

Query: 95  ILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGTQVERPKR-----PVQIVVGSPGRIK 149
           +L PTRE+A Q  + ++ +G     L V    GG       R        I+V +PGR+ 
Sbjct: 59  VLVPTRELAEQWAEELKKLGPS-LGLKVVGLYGGDSKREQLRKLESGKTDILVTTPGRLL 117

Query: 150 QMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSATYSHDNLAT 209
            +++   L++ +V L I+DEA +L++ GF + +  +   LP   Q+L++SAT   +    
Sbjct: 118 DLLENDKLSLSNVDLVILDEAHRLLDGGFGDQLEKLLKLLPKNVQLLLLSATPPEEIENL 177

Query: 210 LQKYMVDPLLIRPEDATRPLLGVKQ 234
           L+ ++ DP+ I       PL  ++Q
Sbjct: 178 LELFLNDPVFIDVGF--TPLEPIEQ 200


>gnl|CDD|240273 PTZ00110, PTZ00110, helicase; Provisional.
          Length = 545

 Score =  144 bits (364), Expect = 2e-35
 Identities = 112/366 (30%), Positives = 184/366 (50%), Gaps = 41/366 (11%)

Query: 36  LQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQQ----P 91
           L+  GF + +PIQ+  +P  L+G D I  +++G+GK++ F++ A+  +   P+ +    P
Sbjct: 145 LKNAGFTEPTPIQVQGWPIALSGRDMIGIAETGSGKTLAFLLPAIVHINAQPLLRYGDGP 204

Query: 92  QTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGTQVERPKRP--------VQIVVG 143
             ++LAPTRE+A QI +     GA  K +      GG     PKR         V+I++ 
Sbjct: 205 IVLVLAPTRELAEQIREQCNKFGASSK-IRNTVAYGGV----PKRGQIYALRRGVEILIA 259

Query: 144 SPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSATYS 203
            PGR+   ++    N+  V   ++DEAD++++ GF   I  I SQ+ P +Q L+ SAT+ 
Sbjct: 260 CPGRLIDFLESNVTNLRRVTYLVLDEADRMLDMGFEPQIRKIVSQIRPDRQTLMWSATWP 319

Query: 204 HD--NLA-TLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEK--KLVQ 258
            +  +LA  L K     + +   D T     +KQ V ++ E           EK  KL  
Sbjct: 320 KEVQSLARDLCKEEPVHVNVGSLDLT-ACHNIKQEVFVVEE----------HEKRGKLKM 368

Query: 259 LLSQ--TPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKV 316
           LL +     ++ +IF   +   + + + L    + A  + G + Q+ R   L+ FK GK 
Sbjct: 369 LLQRIMRDGDKILIFVETKKGADFLTKELRLDGWPALCIHGDKKQEERTWVLNEFKTGKS 428

Query: 317 RILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIV------S 370
            I++ TD+A+RG+D  +V  VIN + P+    Y+HR+GR GR G +G   T +       
Sbjct: 429 PIMIATDVASRGLDVKDVKYVINFDFPNQIEDYVHRIGRTGRAGAKGASYTFLTPDKYRL 488

Query: 371 AESLVK 376
           A  LVK
Sbjct: 489 ARDLVK 494


>gnl|CDD|215103 PLN00206, PLN00206, DEAD-box ATP-dependent RNA helicase;
           Provisional.
          Length = 518

 Score =  139 bits (351), Expect = 4e-34
 Identities = 104/363 (28%), Positives = 180/363 (49%), Gaps = 29/363 (7%)

Query: 34  QGLQENGFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQ---- 89
             L+  G+   +PIQM A P  L+G   +V + +G+GK+  F+V  +++   +       
Sbjct: 134 LNLETAGYEFPTPIQMQAIPAALSGRSLLVSADTGSGKTASFLVPIISRCCTIRSGHPSE 193

Query: 90  --QPQTIILAPTREIAVQITDVVRSVGAHVKNL--HVDYFIGG----TQVERPKRPVQIV 141
              P  ++L PTRE+ VQ+ D  + +G   K L       +GG     Q+ R ++ V+++
Sbjct: 194 QRNPLAMVLTPTRELCVQVEDQAKVLG---KGLPFKTALVVGGDAMPQQLYRIQQGVELI 250

Query: 142 VGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVSAT 201
           VG+PGR+  ++    + +D+V + ++DE D ++  GF + +  I+  L    Q+L+ SAT
Sbjct: 251 VGTPGRLIDLLSKHDIELDNVSVLVLDEVDCMLERGFRDQVMQIFQALS-QPQVLLFSAT 309

Query: 202 YSHDNLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEKKLVQLL- 260
            S +          D +LI   +  RP   VKQL   + E K        +++KL  +L 
Sbjct: 310 VSPEVEKFASSLAKDIILISIGNPNRPNKAVKQLAIWV-ETKQ-------KKQKLFDILK 361

Query: 261 SQTPFNQ-CVIFSNYQLRPEVICENLANANFGAEYLS--GAQDQKARLASLDSFKRGKVR 317
           S+  F    V+F + +L  +++   +     G + LS  G +  K R   + SF  G+V 
Sbjct: 362 SKQHFKPPAVVFVSSRLGADLLANAITVVT-GLKALSIHGEKSMKERREVMKSFLVGEVP 420

Query: 318 ILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTRGLVITIVSAESLVKF 377
           ++V T +  RG+D   V  VI  ++P+    Y+H++GRA R G +G  I  V+ E    F
Sbjct: 421 VIVATGVLGRGVDLLRVRQVIIFDMPNTIKEYIHQIGRASRMGEKGTAIVFVNEEDRNLF 480

Query: 378 HSL 380
             L
Sbjct: 481 PEL 483


>gnl|CDD|238005 cd00046, DEXDc, DEAD-like helicases superfamily. A diverse family
           of proteins involved in ATP-dependent RNA or DNA
           unwinding. This domain contains the ATP-binding region.
          Length = 144

 Score =  125 bits (315), Expect = 7e-33
 Identities = 48/146 (32%), Positives = 82/146 (56%), Gaps = 8/146 (5%)

Query: 60  DAIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTIILAPTREIAVQITDVVRSVGAHVKN 119
           D ++ + +G+GK++  ++  L  L  L  +  Q ++LAPTRE+A Q+ + ++ +      
Sbjct: 2   DVLLAAPTGSGKTLAALLPILELLDSL--KGGQVLVLAPTRELANQVAERLKELFGEGIK 59

Query: 120 LHVDYFIGGT----QVERPKRPVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLIN 175
             V Y IGGT    Q +       IVVG+PGR+   ++   L++  + L I+DEA +L+N
Sbjct: 60  --VGYLIGGTSIKQQEKLLSGKTDIVVGTPGRLLDELERLKLSLKKLDLLILDEAHRLLN 117

Query: 176 TGFVEDITWIYSQLPPMKQMLVVSAT 201
            GF      I  +LP  +Q+L++SAT
Sbjct: 118 QGFGLLGLKILLKLPKDRQVLLLSAT 143


>gnl|CDD|238034 cd00079, HELICc, Helicase superfamily c-terminal domain; associated
           with DEXDc-, DEAD-, and DEAH-box proteins, yeast
           initiation factor 4A, Ski2p, and Hepatitis C virus NS3
           helicases; this domain is found in a wide variety of
           helicases and helicase related proteins; may not be an
           autonomously folding unit, but an integral part of the
           helicase; 4 helicase superfamilies at present according
           to the organization of their signature motifs; all
           helicases share the ability to unwind nucleic acid
           duplexes with a distinct directional polarity; they
           utilize the free energy from nucleoside triphosphate
           hydrolysis to fuel their translocation along DNA,
           unwinding the duplex in the process.
          Length = 131

 Score =  113 bits (286), Expect = 4e-29
 Identities = 43/126 (34%), Positives = 68/126 (53%), Gaps = 4/126 (3%)

Query: 247 LRYVDEEKK--LVQLLSQ--TPFNQCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQK 302
           +  V++EK   L++LL +      + +IF   +   + + E L         L G   Q+
Sbjct: 6   VLPVEDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGDGSQE 65

Query: 303 ARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTR 362
            R   L  F+ G++ +LV TD+ ARGID  NV +VIN ++P   ++YL R+GRAGR G +
Sbjct: 66  EREEVLKDFREGEIVVLVATDVIARGIDLPNVSVVINYDLPWSPSSYLQRIGRAGRAGQK 125

Query: 363 GLVITI 368
           G  I +
Sbjct: 126 GTAILL 131



 Score = 30.3 bits (69), Expect = 4.8
 Identities = 16/57 (28%), Positives = 33/57 (57%), Gaps = 5/57 (8%)

Query: 78  AALNKLIKLPVQQP-QTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGTQVER 133
            AL +L+K  +++  + +I  P++++  ++ +++R  G  V  LH D    G+Q ER
Sbjct: 15  EALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGD----GSQEER 67


>gnl|CDD|197757 smart00490, HELICc, helicase superfamily c-terminal domain. 
          Length = 82

 Score = 93.4 bits (233), Expect = 1e-22
 Identities = 35/82 (42%), Positives = 47/82 (57%)

Query: 279 EVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVI 338
           E + E L         L G   Q+ R   LD F  GK+++LV TD+A RG+D   VDLVI
Sbjct: 1   EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKVLVATDVAERGLDLPGVDLVI 60

Query: 339 NLEIPHDAATYLHRMGRAGRYG 360
             ++P   A+Y+ R+GRAGR G
Sbjct: 61  IYDLPWSPASYIQRIGRAGRAG 82


>gnl|CDD|201125 pfam00271, Helicase_C, Helicase conserved C-terminal domain.  The
           Prosite family is restricted to DEAD/H helicases,
           whereas this domain family is found in a wide variety of
           helicases and helicase related proteins. It may be that
           this is not an autonomously folding unit, but an
           integral part of the helicase.
          Length = 78

 Score = 90.3 bits (225), Expect = 1e-21
 Identities = 33/76 (43%), Positives = 47/76 (61%)

Query: 285 LANANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPH 344
           L         L G   Q+ R   L+ F+ GK ++LV TD+A RGID  +V+LVIN ++P 
Sbjct: 3   LRKPGIKVARLHGGLSQEEREEILEDFRNGKSKVLVATDVAGRGIDLPDVNLVINYDLPW 62

Query: 345 DAATYLHRMGRAGRYG 360
           + A+Y+ R+GRAGR G
Sbjct: 63  NPASYIQRIGRAGRAG 78


>gnl|CDD|224122 COG1201, Lhr, Lhr-like helicases [General function prediction
           only].
          Length = 814

 Score = 60.4 bits (147), Expect = 1e-08
 Identities = 76/375 (20%), Positives = 145/375 (38%), Gaps = 93/375 (24%)

Query: 46  PIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTIILA----PTRE 101
           P Q  A P + +G + ++ + +G+GK+    +  +N+L+ L   + +  I A    P + 
Sbjct: 25  PPQRYAIPEIHSGENVLIIAPTGSGKTEAAFLPVINELLSLGKGKLEDGIYALYISPLKA 84

Query: 102 IAVQI----TDVVRSVGAHVKNLHVDYFIGGTQVERP---KRPVQIVVGSP--------- 145
           +   I     + +R +G  V   H D      Q E+    K P  I++ +P         
Sbjct: 85  LNNDIRRRLEEPLRELGIEVAVRHGDT----PQSEKQKMLKNPPHILITTPESLAILLNS 140

Query: 146 GRIKQMIKLKYLNMDSVRLFIMDEADKLINT--GFVEDITWIYSQLPPMKQMLVVSATYS 203
            + +++++        VR  I+DE   L  +  G          QL      L       
Sbjct: 141 PKFRELLR-------DVRYVIVDEIHALAESKRG---------VQL-----AL------- 172

Query: 204 HDNLATLQKYMVDPLLI-------RPEDATRPLLG--VKQLVALIPECKNPSLRYVDEE- 253
             +L  L++   D   I        PE+  + L+G      +  +   K   ++ +    
Sbjct: 173 --SLERLRELAGDFQRIGLSATVGPPEEVAKFLVGFGDPCEIVDVSAAKKLEIKVISPVE 230

Query: 254 -------------KKLVQLLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYLS--GA 298
                        +++ +L+ +      +IF+N +   E +   L             G+
Sbjct: 231 DLIYDEELWAALYERIAELVKKH--RTTLIFTNTRSGAERLAFRL-KKLGPDIIEVHHGS 287

Query: 299 QDQKARLASLDSFKRGKVRILVTT---DLAARGIDAANVDLVINLEIPHDAATYLHRMGR 355
             ++ RL   +  K G+++ +V T   +L   GID  ++DLVI L  P     +L R+GR
Sbjct: 288 LSRELRLEVEERLKEGELKAVVATSSLEL---GIDIGDIDLVIQLGSPKSVNRFLQRIGR 344

Query: 356 AGRYGT---RGLVIT 367
           AG       +G++I 
Sbjct: 345 AGHRLGEVSKGIIIA 359


>gnl|CDD|182933 PRK11057, PRK11057, ATP-dependent DNA helicase RecQ; Provisional.
          Length = 607

 Score = 55.9 bits (135), Expect = 2e-07
 Identities = 92/359 (25%), Positives = 150/359 (41%), Gaps = 64/359 (17%)

Query: 29  EKPIRQGLQEN-GFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLP 87
           E   +Q LQE  G+ +  P Q      VL+G D +V   +G GKS+ + + AL       
Sbjct: 10  ESLAKQVLQETFGYQQFRPGQQEIIDAVLSGRDCLVVMPTGGGKSLCYQIPAL------- 62

Query: 88  VQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIGGTQVERPKRPVQIVVGSPGR 147
           V    T++++P   +     D + + G           +  TQ     R  Q+ V +  R
Sbjct: 63  VLDGLTLVVSPLISLMKDQVDQLLANGVAAA------CLNSTQ----TREQQLEVMAGCR 112

Query: 148 IKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMKQMLVVS-----ATY 202
             Q IKL         L+I  E  +L+   F+E +        P   +L V      + +
Sbjct: 113 TGQ-IKL---------LYIAPE--RLMMDNFLEHLA----HWNPA--LLAVDEAHCISQW 154

Query: 203 SHD------NLATLQKYMVD----PLLIRPEDATR----PLLGVKQLVALIPECKNPSLR 248
            HD       L  L++         L    +D TR     LLG+   +  I     P++R
Sbjct: 155 GHDFRPEYAALGQLRQRFPTLPFMALTATADDTTRQDIVRLLGLNDPLIQISSFDRPNIR 214

Query: 249 Y--VDEEKKLVQLLS--QTPFNQC-VIFSNYQLRPEVICENLANANFG-AEYLSGAQ-DQ 301
           Y  V++ K L QL+   Q    +  +I+ N + + E     L +     A Y +G   D 
Sbjct: 215 YTLVEKFKPLDQLMRYVQEQRGKSGIIYCNSRAKVEDTAARLQSRGISAAAYHAGLDNDV 274

Query: 302 KARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYG 360
           +A +   ++F+R  ++I+V T     GI+  NV  V++ +IP +  +Y    GRAGR G
Sbjct: 275 RADVQ--EAFQRDDLQIVVATVAFGMGINKPNVRFVVHFDIPRNIESYYQETGRAGRDG 331


>gnl|CDD|223989 COG1061, SSL2, DNA or RNA helicases of superfamily II
           [Transcription / DNA replication, recombination, and
           repair].
          Length = 442

 Score = 55.1 bits (133), Expect = 3e-07
 Identities = 58/342 (16%), Positives = 117/342 (34%), Gaps = 54/342 (15%)

Query: 54  YVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTIILAPTREIAVQITDVVRSV 113
                   ++   +G GK++V    A+ +L +       T++L PT+E+  Q  + ++  
Sbjct: 51  NRRTERRGVIVLPTGAGKTVVAA-EAIAELKR------STLVLVPTKELLDQWAEALK-- 101

Query: 114 GAHVKNLHVDYFIGGTQVERPKRPVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKL 173
              +         GG + E     V +        +Q++     N     L I DE   L
Sbjct: 102 -KFLLLNDEIGIYGGGEKELEPAKVTVATVQTLARRQLLDEFLGN--EFGLIIFDEVHHL 158

Query: 174 INTGFVEDITWIYSQLPPMKQMLVVSAT--YSHDNLATLQKYMVDPLL--IRPEDATRP- 228
               +      I   L      L ++AT              ++ P++  +  ++     
Sbjct: 159 PAPSYRR----ILELLSAAYPRLGLTATPEREDGGRIGDLFDLIGPIVYEVSLKELIDEG 214

Query: 229 -LLGVKQLVALIPECKNPSLRYVDEEKKLVQLLSQTPFN--------------------- 266
            L   K +   +   ++    Y  E  +  +LL                           
Sbjct: 215 YLAPYKYVEIKVTLTEDEEREYAKESARFRELLRARGTLRAENEARRIAIASERKIAAVR 274

Query: 267 ----------QCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKV 316
                     + +IF++       I +    A    E ++G   ++ R A L+ F+ G +
Sbjct: 275 GLLLKHARGDKTLIFASDVEHAYEIAKLF-LAPGIVEAITGETPKEEREAILERFRTGGI 333

Query: 317 RILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGR 358
           ++LVT  +   G+D  + D++I L        ++ R+GR  R
Sbjct: 334 KVLVTVKVLDEGVDIPDADVLIILRPTGSRRLFIQRLGRGLR 375


>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
          Length = 773

 Score = 52.6 bits (127), Expect = 2e-06
 Identities = 40/154 (25%), Positives = 69/154 (44%), Gaps = 23/154 (14%)

Query: 232 VKQLVALIPECK--NPSLRYVDEEKKLVQLLSQT----PFNQCVIFSNYQLRPEVICENL 285
            ++ V    E    +P L       KL +++ +     P ++ ++F+ Y+   E I + L
Sbjct: 332 FRKAVRKAKELDIEHPKLE------KLREIVKEQLGKNPDSRIIVFTQYRDTAEKIVDLL 385

Query: 286 ANANFGAEYLSGAQD--------QKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLV 337
                 A    G           QK ++  LD F+ G+  +LV+T +A  G+D  +VDLV
Sbjct: 386 EKEGIKAVRFVGQASKDGDKGMSQKEQIEILDKFRAGEFNVLVSTSVAEEGLDIPSVDLV 445

Query: 338 INLE-IPHDAATYLHRMGRAGRYGTRGLVITIVS 370
           I  E +P      + R GR GR    G V+ +++
Sbjct: 446 IFYEPVP-SEIRSIQRKGRTGR-QEEGRVVVLIA 477


>gnl|CDD|224036 COG1111, MPH1, ERCC4-like helicases [DNA replication,
           recombination, and repair].
          Length = 542

 Score = 50.4 bits (121), Expect = 1e-05
 Identities = 24/70 (34%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 301 QKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYG 360
           QK +   +D F++G+  +LV T +   G+D   VDLVI  E        + R GR GR  
Sbjct: 411 QKEQKEIIDQFRKGEYNVLVATSVGEEGLDIPEVDLVIFYEPVPSEIRSIQRKGRTGR-K 469

Query: 361 TRGLVITIVS 370
            +G V+ +V+
Sbjct: 470 RKGRVVVLVT 479



 Score = 33.1 bits (76), Expect = 2.0
 Identities = 24/106 (22%), Positives = 47/106 (44%), Gaps = 7/106 (6%)

Query: 68  GTGKSIVFVVAALNKLIKLPVQQPQTIILAPTREIAVQITDVVRSVGAHVKNLHVDYFIG 127
           G GK+ +  +   N+L        + + LAPT+ + +Q  +  R V   +    +    G
Sbjct: 39  GLGKTFIAAMVIANRLRWFG---GKVLFLAPTKPLVLQHAEFCRKVTG-IPEDEIAALTG 94

Query: 128 GTQVERPKR---PVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEA 170
             + E  +      ++ V +P  ++  +K   +++D V L I DEA
Sbjct: 95  EVRPEEREELWAKKKVFVATPQVVENDLKAGRIDLDDVSLLIFDEA 140


>gnl|CDD|129701 TIGR00614, recQ_fam, ATP-dependent DNA helicase, RecQ family.  All
           proteins in this family for which functions are known
           are 3'-5' DNA-DNA helicases. These proteins are used for
           recombination, recombinational repair, and possibly
           maintenance of chromosome stability. This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 470

 Score = 50.2 bits (120), Expect = 1e-05
 Identities = 73/343 (21%), Positives = 129/343 (37%), Gaps = 63/343 (18%)

Query: 46  PIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAAL--NKLIKLPVQQPQTIILAPTREIA 103
           P+Q+     VL G D  V   +G GKS+ + + AL  + +         T++++P   + 
Sbjct: 14  PVQLEVINAVLLGRDCFVVMPTGGGKSLCYQLPALCSDGI---------TLVISPLISLM 64

Query: 104 VQITDVVRSVGAHVKNLHVDYFIGGTQVERPKRPVQIVVGSPGRIKQMIKLKYLN----M 159
                 +++ G          F+  +Q +  ++ V   +         IKL Y+      
Sbjct: 65  EDQVLQLKASGIPAT------FLNSSQSKEQQKNVLTDLKDG-----KIKLLYVTPEKCS 113

Query: 160 DSVRLFIMDEADKLINTGFVEDITWI----------YSQLPPMKQML----VVSATYSHD 205
            S RL    E  K I    V++   I          Y  L  +KQ      +++ T +  
Sbjct: 114 ASNRLLQTLEERKGITLIAVDEAHCISQWGHDFRPDYKALGSLKQKFPNVPIMALTATAS 173

Query: 206 NLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRYVDEEKK------LVQL 259
                         +R ED  R  L +K           P+L Y    K       L++ 
Sbjct: 174 P------------SVR-EDILR-QLNLKNPQIFCTSFDRPNLYYEVRRKTPKILEDLLRF 219

Query: 260 L-SQTPFNQCVIFSNYQLRPEVICENLANANFGA-EYLSGAQDQKARLASLDSFKRGKVR 317
           +  +      +I+   + + E +  +L N    A  Y +G +   AR      F+R +++
Sbjct: 220 IRKEFKGKSGIIYCPSRKKSEQVTASLQNLGIAAGAYHAGLEI-SARDDVHHKFQRDEIQ 278

Query: 318 ILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYG 360
           ++V T     GI+  +V  VI+  +P    +Y    GRAGR G
Sbjct: 279 VVVATVAFGMGINKPDVRFVIHYSLPKSMESYYQESGRAGRDG 321


>gnl|CDD|130456 TIGR01389, recQ, ATP-dependent DNA helicase RecQ.  The
           ATP-dependent DNA helicase RecQ of E. coli is about 600
           residues long. This model represents bacterial proteins
           with a high degree of similarity in domain architecture
           and in primary sequence to E. coli RecQ. The model
           excludes eukaryotic and archaeal proteins with RecQ-like
           regions, as well as more distantly related bacterial
           helicases related to RecQ [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 591

 Score = 47.4 bits (113), Expect = 8e-05
 Identities = 77/352 (21%), Positives = 132/352 (37%), Gaps = 56/352 (15%)

Query: 32  IRQGLQEN-GFVKESPIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAAL---------N 81
            +Q L+   G+    P Q     +VL+G D +V   +G GKS+ + V AL         +
Sbjct: 1   AQQVLKRTFGYDDFRPGQEEIISHVLDGRDVLVVMPTGGGKSLCYQVPALLLKGLTVVIS 60

Query: 82  KLIKLPVQQPQTIILAPTREIAV-------QITDVVRS-VGAHVKNLHVDYFIGGTQVER 133
            LI L   Q   +  A      +       +  D+ ++ V   +K L+V           
Sbjct: 61  PLISLMKDQVDQLRAAGVAAAYLNSTLSAKEQQDIEKALVNGELKLLYV----------- 109

Query: 134 PKRPVQIVVGSPGRIKQMIKLKYLNMDSVRLFIMDEADKLINTGFVEDITWIYSQLPPMK 193
                     +P R++Q   L  L    + L  +DEA  +          W +   P  +
Sbjct: 110 ----------APERLEQDYFLNMLQRIPIALVAVDEAHCVSQ--------WGHDFRPEYQ 151

Query: 194 QMLVVSATYSHDNLATLQKYMVDPLLIRPEDATRPLLGVKQLVALIPECKNPSLRY--VD 251
           ++  ++  +       L                R LL +      I     P+LR+  V 
Sbjct: 152 RLGSLAERFPQVPRIALTATADAET----RQDIRELLRLADANEFITSFDRPNLRFSVVK 207

Query: 252 EEKKLVQLLS--QTPFNQC-VIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASL 308
           +  K   LL   +    Q  +I+++ + + E + E L +    A         K R  + 
Sbjct: 208 KNNKQKFLLDYLKKHRGQSGIIYASSRKKVEELAERLESQGISALAYHAGLSNKVRAENQ 267

Query: 309 DSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYG 360
           + F    V+++V T+    GID  NV  VI+ ++P +  +Y    GRAGR G
Sbjct: 268 EDFLYDDVKVMVATNAFGMGIDKPNVRFVIHYDMPGNLESYYQEAGRAGRDG 319


>gnl|CDD|235514 PRK05580, PRK05580, primosome assembly protein PriA; Validated.
          Length = 679

 Score = 47.1 bits (113), Expect = 1e-04
 Identities = 28/82 (34%), Positives = 33/82 (40%), Gaps = 21/82 (25%)

Query: 302 KARLAS-LDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLH--------R 352
           K  L   L  F RG+  IL+ T + A+G D  NV LV  L    DA   L         R
Sbjct: 466 KGALEQLLAQFARGEADILIGTQMLAKGHDFPNVTLVGVL----DADLGLFSPDFRASER 521

Query: 353 M--------GRAGRYGTRGLVI 366
                    GRAGR    G V+
Sbjct: 522 TFQLLTQVAGRAGRAEKPGEVL 543


>gnl|CDD|233040 TIGR00595, priA, primosomal protein N'.  All proteins in this
           family for which functions are known are components of
           the primosome which is involved in replication, repair,
           and recombination.This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 505

 Score = 46.2 bits (110), Expect = 2e-04
 Identities = 25/78 (32%), Positives = 30/78 (38%), Gaps = 12/78 (15%)

Query: 301 QKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLV------INLEIP------HDAAT 348
           + A  A L+ F  GK  IL+ T + A+G    NV LV        L  P           
Sbjct: 298 KGAHEALLNQFANGKADILIGTQMIAKGHHFPNVTLVGVLDADSGLHSPDFRAAERGFQL 357

Query: 349 YLHRMGRAGRYGTRGLVI 366
                GRAGR    G VI
Sbjct: 358 LTQVAGRAGRAEDPGQVI 375


>gnl|CDD|234478 TIGR04121, DEXH_lig_assoc, DEXH box helicase, DNA
           ligase-associated.  Members of this protein family are
           DEAD/DEAH box helicases found associated with a
           bacterial ATP-dependent DNA ligase, part of a four-gene
           system that occurs in about 12 % of prokaryotic
           reference genomes. The actual motif in this family is
           DE[VILW]H.
          Length = 803

 Score = 46.4 bits (111), Expect = 2e-04
 Identities = 85/371 (22%), Positives = 160/371 (43%), Gaps = 68/371 (18%)

Query: 46  PIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKL--PVQQPQTIILAPTREIA 103
           P Q+  +   L G   ++ + +G+GK++   + +L  L     P +   T+ + P R +A
Sbjct: 16  PFQLEMWAAALEGRSGLLIAPTGSGKTLAGFLPSLIDLAGPEKPKKGLHTLYITPLRALA 75

Query: 104 VQITDVVRSVGAHVKNLHVDYFIG---G--TQVERP---KRPVQIVVGSPGRIKQMIKLK 155
           V   D+ R++ A ++ L +   +    G  +  ER    K+P  I++ +P  +  ++   
Sbjct: 76  V---DIARNLQAPIEELGLPIRVETRTGDTSSSERARQRKKPPDILLTTPESLALLLSYP 132

Query: 156 YLNMDSVRLF------IMDEADKLINT--G-FVE-DITWIYSQLPPMKQMLVVSATYSHD 205
               D+ RLF      ++DE  +L  +  G  +E  +  +  +L P  +   +SAT    
Sbjct: 133 ----DAARLFKDLRCVVVDEWHELAGSKRGDQLELALARL-RRLAPGLRRWGLSATIG-- 185

Query: 206 NLATLQKYMV-----DPLLIRPEDATRPLLGVKQLVALIPEC--KNP-----SLRYVDEE 253
           NL   ++ ++       +L+R +      +     ++L+PE   + P      LR + E 
Sbjct: 186 NLEEARRVLLGVGGAPAVLVRGKLPKAIPV-----ISLLPESEERFPWAGHLGLRALPE- 239

Query: 254 KKLVQLLSQTPFNQCVIFSNYQLRPEVICENLANANFGAEYL------SGAQDQKARLAS 307
             +   + Q      ++F+N + + E+  + L  AN   E+        G+ D++ R   
Sbjct: 240 --VYAEIDQA--RTTLVFTNTRSQAELWFQALWEAN--PEFALPIALHHGSLDREQRRWV 293

Query: 308 LDSFKRGKVRILVTT---DLAARGIDAANVDLVINLEIPHDAATYLHRMGRAG-RYGTRG 363
             +   G++R +V T   DL   G+D   VDLVI +  P   A  L R GR+  R G   
Sbjct: 294 EAAMAAGRLRAVVCTSSLDL---GVDFGPVDLVIQIGSPKGVARLLQRAGRSNHRPG-EP 349

Query: 364 LVITIVSAESL 374
               +V    L
Sbjct: 350 SRALLVPTNRL 360


>gnl|CDD|223588 COG0514, RecQ, Superfamily II DNA helicase [DNA replication,
           recombination, and repair].
          Length = 590

 Score = 45.0 bits (107), Expect = 5e-04
 Identities = 24/92 (26%), Positives = 39/92 (42%)

Query: 279 EVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVI 338
           E + E L      A         + R     +F   +++++V T+    GID  +V  VI
Sbjct: 244 EELAEWLRKNGISAGAYHAGLSNEERERVQQAFLNDEIKVMVATNAFGMGIDKPDVRFVI 303

Query: 339 NLEIPHDAATYLHRMGRAGRYGTRGLVITIVS 370
           + ++P    +Y    GRAGR G     I + S
Sbjct: 304 HYDLPGSIESYYQETGRAGRDGLPAEAILLYS 335


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 44.4 bits (105), Expect = 8e-04
 Identities = 45/252 (17%), Positives = 67/252 (26%), Gaps = 21/252 (8%)

Query: 1804 SPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNESHFGNSRCQPKTAYQSS 1863
            +          S  D         S   S   +  S  ++   S        P  A  S 
Sbjct: 101  AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDL-SEMLRPVGSPGPPPAASPPAAGASP 159

Query: 1864 SAQSEDGTSYKSISWDAYSVIDEPESYDVSWRTCSSESAAREKNSGQRSRGSGDKAARKS 1923
            +A + D  S +  +    S  +E            S   A    S   +  S     R S
Sbjct: 160  AAVASDAASSRQAA-LPLSSPEETA-------RAPSSPPAEPPPSTPPAAASPRPPRRSS 211

Query: 1924 -ASLPASQIALRDGK-GDFKSPRKASHSLKDGKRYGNAQSFEAARNSNPYEAPMPGA--- 1978
              S  AS  A   G+     +   +S S        ++       N  P   P P     
Sbjct: 212  PISASASSPAPAPGRSAADDAGASSSDSSS----SESSGCGWGPENECPLPRPAPITLPT 267

Query: 1979 --WQPVLEGTSVFDDAEKSSKSGPKSSKAGPKSSKPGPKSSKPGPKSSKRYPSRSGSSGL 2036
              W+              SS S P+     P  S PG     P    +    S S  S  
Sbjct: 268  RIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPG-SGPAPSSPRASSSSSSSRESSS 326

Query: 2037 NNRTEGNSVSHT 2048
            ++ +  +  S  
Sbjct: 327  SSTSSSSESSRG 338



 Score = 33.6 bits (77), Expect = 1.5
 Identities = 40/239 (16%), Positives = 64/239 (26%), Gaps = 31/239 (12%)

Query: 1898 SSESAAREKNSGQRSRGSGDKAARKSASLPASQIALRDGKGDFKSPRKASHSLKDGKRYG 1957
            SS S +     G  +     + A  +      + +  +G      P  +S S    +R  
Sbjct: 239  SSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSS--PRERSP 296

Query: 1958 NAQSFEAARNSNPYEAPMPGAWQPVLEGTSVFDDA-EKSSKSGPKSSKAGPKSSKPGPKS 2016
            +         S+P   P P + +     +S  + +   +S S   S  A         +S
Sbjct: 297  SPSP------SSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRS 350

Query: 2017 SKPGPKSSKRYPSRSGSSGLNNRTEGNSVSHTWENHFATGTDRRSHQYPAYGSMSRHDLY 2076
              P        PS        +R   +  +          T RR+    A  +  R    
Sbjct: 351  PSPSRPPPPADPSSPRKRPRPSRAPSSPAASA-----GRPTRRRARAAVAGRARRRD--- 402

Query: 2077 FDESQLLGNEALGRGEIRGSGGGFASRVSSVGPANESGLVVAG--WNRCDREGHDPARG 2133
                          G  R S    A   S    A    L  +G  W         P  G
Sbjct: 403  -------ATGRFPAGRPRPSPLD-AGAASGAFYARYPLLTPSGEPWPGSP----PPPPG 449


>gnl|CDD|224119 COG1198, PriA, Primosomal protein N' (replication factor Y) -
           superfamily II helicase [DNA replication, recombination,
           and repair].
          Length = 730

 Score = 43.5 bits (103), Expect = 0.001
 Identities = 28/83 (33%), Positives = 34/83 (40%), Gaps = 21/83 (25%)

Query: 306 ASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHR------------- 352
             LD F  G+  IL+ T + A+G D  NV LV  L    DA T L               
Sbjct: 525 DLLDQFANGEADILIGTQMIAKGHDFPNVTLVGVL----DADTGLGSPDFRASERTFQLL 580

Query: 353 ---MGRAGRYGTRGLVITIVSAE 372
               GRAGR G  G V+ I +  
Sbjct: 581 MQVAGRAGRAGKPGEVV-IQTYN 602


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 43.4 bits (102), Expect = 0.002
 Identities = 29/132 (21%), Positives = 50/132 (37%), Gaps = 11/132 (8%)

Query: 1746 ESSHKRNRKANDFNDTCGAVANET---FDGEDSKGGSDD---ENNLIVTLSEAGDSEVDV 1799
            +       K +D  D     A+E     D E  +   D+   E N  +      D   D+
Sbjct: 3996 DEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDL 4055

Query: 1800 EGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNESHFGNSRCQPKTA 1859
              ++   +ED    N Q + EST    +SDE    GE     V  +++   + +   K+ 
Sbjct: 4056 AEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGE-----VPEDQAIDNHPKMDAKST 4110

Query: 1860 YQSSSAQSEDGT 1871
            + S+ A  E+  
Sbjct: 4111 FASAEADEENTD 4122



 Score = 39.6 bits (92), Expect = 0.029
 Identities = 58/329 (17%), Positives = 119/329 (36%), Gaps = 37/329 (11%)

Query: 422  QEREDVEQTAEESSKVENTKPLRG-APSKEVTKKGASKE--GTKWNLTDGQSEEES-DSE 477
             E +D+E+ A E      +      A   E    G +K+   ++   +D + E +  D E
Sbjct: 3832 NEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEE 3891

Query: 478  NDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGANSAEKSLEDK 537
             +  P++  +    K   E   +E  ++   E ++ ++ A+ +E  +       K+LEDK
Sbjct: 3892 VNDIPEDLSNSLNEKLWDE-PNEEDLLET--EQKSNEQSAANNESDLVSKEDDNKALEDK 3948

Query: 538  HGKKITNSFEAVGEIVQNDTETTKENNQTDEVPEENG--LVGNPNKAEVGDLNECESCTD 595
                         +  +++ E + +    DE+  +        P + E  DL E      
Sbjct: 3949 D-----------RQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPE------ 3991

Query: 596  SFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDP-QGNNANQREIR--- 651
               ++E   +    ++ ++   +   ++K++ D ++       DP + NN    +I+   
Sbjct: 3992 DLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDD 4051

Query: 652  -------DNSATLDKTEKESVNNVESKDERPPSTANNPDGTEWEDIPSDTIPNDETSNDN 704
                   D     D  E+    N ES ++   S      G   ED   D  P  +  +  
Sbjct: 4052 FSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTF 4111

Query: 705  KLTPVDFDYINDFVNQTPSETGDRPLVHG 733
                 D +  +  +     E G+   V G
Sbjct: 4112 ASAEADEENTDKGIVGENEELGEEDGVRG 4140


>gnl|CDD|137505 PRK09751, PRK09751, putative ATP-dependent helicase Lhr;
           Provisional.
          Length = 1490

 Score = 43.4 bits (102), Expect = 0.002
 Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 2/62 (3%)

Query: 296 SGAQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGR 355
           S +++Q+A      + K G++R +V T     GID   VDLVI +  P   A+ L R+GR
Sbjct: 310 SVSKEQRAITEQ--ALKSGELRCVVATSSLELGIDMGAVDLVIQVATPLSVASGLQRIGR 367

Query: 356 AG 357
           AG
Sbjct: 368 AG 369


>gnl|CDD|224126 COG1205, COG1205, Distinct helicase family with a unique C-terminal
           domain including a metal-binding cysteine cluster
           [General function prediction only].
          Length = 851

 Score = 42.4 bits (100), Expect = 0.003
 Identities = 40/180 (22%), Positives = 65/180 (36%), Gaps = 17/180 (9%)

Query: 294 YLSGAQDQKARLASLDSFKRGKVRILVTTDLAAR-GIDAANVDLVINLEIP-HDAATYLH 351
           Y +G   ++ R    + FK G++  ++ T  A   GID  ++D VI    P     ++  
Sbjct: 344 YRAGLHREERRRIEAE-FKEGELLGVIAT-NALELGIDIGSLDAVIAYGYPGVSVLSFRQ 401

Query: 352 RMGRAGRYGTRGLVITIVSAESLVKFHSLMGEINLDHAFNVGLVPDNLTGDQINWTQRVQ 411
           R GRAGR G   LV+ ++ ++ L  ++    E         G V      D   +     
Sbjct: 402 RAGRAGRRGQESLVLVVLRSDPLDSYYLRHPE--ELLETGFGPVESVRVDDNNEYLLDGH 459

Query: 412 TLLA-----------KPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEG 460
            L A            P D + E E       E          R +      ++GAS+  
Sbjct: 460 LLCADAELPLKDRELLPSDGSLEEEIARSLVGEGQLHHENDKYRPSSRGLCPREGASQVS 519



 Score = 32.8 bits (75), Expect = 2.9
 Identities = 15/56 (26%), Positives = 32/56 (57%), Gaps = 2/56 (3%)

Query: 48  QMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTIILAPTREIA 103
           Q+ A   +  G + +V + +G+GK+  F++  L+ L++ P    + ++L PT  +A
Sbjct: 75  QVDALRLIREGRNVVVTTGTGSGKTESFLLPILDHLLRDP--SARALLLYPTNALA 128


>gnl|CDD|224125 COG1204, COG1204, Superfamily II helicase [General function
           prediction only].
          Length = 766

 Score = 42.0 bits (99), Expect = 0.005
 Identities = 25/134 (18%), Positives = 63/134 (47%), Gaps = 16/134 (11%)

Query: 46  PIQMAAFPYVLNGYDAIVQSKSGTGKSIVFVVAALNKLIKL--PVQQPQTIILAPTR--- 100
           P Q A    +L+  + ++ + +G+GK+++ ++A L+ L++    V     + + P +   
Sbjct: 35  PQQEAVEKGLLSDENVLISAPTGSGKTLIALLAILSTLLEGGGKV-----VYIVPLKALA 89

Query: 101 -EIAVQITDVVRSVGAHVKNLHVDYFIGGTQVERPKRPVQIVVGSPGRIKQMIKLKYLNM 159
            E   + + +   +G  V     DY +   ++ R      ++V +P ++  + + +   +
Sbjct: 90  EEKYEEFSRLEE-LGIRVGISTGDYDLDDERLARY----DVIVTTPEKLDSLTRKRPSWI 144

Query: 160 DSVRLFIMDEADKL 173
           + V L ++DE   L
Sbjct: 145 EEVDLVVIDEIHLL 158



 Score = 32.7 bits (75), Expect = 3.1
 Identities = 20/77 (25%), Positives = 29/77 (37%), Gaps = 11/77 (14%)

Query: 304 RLASLDSFKRGKVRILVTTDLAARG---------IDAANVDLVINLEIPHDAATYLHRMG 354
           R    D+F++GK+++LV+T   A G         I            +       L   G
Sbjct: 329 RQLVEDAFRKGKIKVLVSTPTLAAGVNLPARTVIIKDTRRYDPKGGIVDIPVLDVLQMAG 388

Query: 355 RAGR--YGTRGLVITIV 369
           RAGR  Y   G  I + 
Sbjct: 389 RAGRPGYDDYGEAIILA 405


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 40.9 bits (95), Expect = 0.011
 Identities = 48/267 (17%), Positives = 97/267 (36%), Gaps = 20/267 (7%)

Query: 414  LAKPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEE 473
              K  ++A++ ++ ++ AEE+ K +  K       K+  +   + E  K      ++EE 
Sbjct: 1462 AKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEA 1521

Query: 474  SDSENDGKPDENGSQRGSKRSKEFKTQEKC-----VKGAGEDENGDRKASKDEQKIDGAN 528
              ++   K +E      +K+++E K  ++      +K A E +  +     +E K     
Sbjct: 1522 KKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALR 1581

Query: 529  SAE--KSLEDKHGKKITNSFEAV----GEIVQNDTETTKENNQTDEVPEENGLVGNPNKA 582
             AE  K  E+   +++   +E       E  +   E   +  +  +  EE   V    K 
Sbjct: 1582 KAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK 1641

Query: 583  EVGDLNECESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQG 642
            E  +  + E         E     +        + +K  + KKK +  +K   D++    
Sbjct: 1642 EAEEKKKAE---------ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692

Query: 643  NNANQREIRDNSATLDKTEKESVNNVE 669
                + E    +  L K E E     E
Sbjct: 1693 ALKKEAEEAKKAEELKKKEAEEKKKAE 1719



 Score = 37.0 bits (85), Expect = 0.15
 Identities = 59/280 (21%), Positives = 117/280 (41%), Gaps = 28/280 (10%)

Query: 415  AKPLDQAQEREDVEQT-------------AEESSKVENTKPLRGAPSKEVTKKGASKEGT 461
            A+ L +A+E++  E+              AEE+ K E  +        E  KK  ++E  
Sbjct: 1554 AEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAK 1613

Query: 462  KWNLTDGQSEEESDSENDGKPDENGSQRGS---KRSKEFKTQEKCVKGAGEDENGDRKAS 518
            K      ++EE   +E + K  E   ++ +   K+++E K  E+  K    +E   +KA 
Sbjct: 1614 KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEA--KKAE 1671

Query: 519  KDEQKIDGANSAEKSLEDKHGKKITNSFEA--VGEIVQNDTETTKENNQTDEVPEENGLV 576
            +D++K + A  AE+  +         + EA    E+ + + E  K+  +  +  EEN + 
Sbjct: 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKI- 1730

Query: 577  GNPNKAEVGDLNECESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGD 636
                KAE       E    +    +  +E     + K+ +++K  + +K+K+   +   D
Sbjct: 1731 ----KAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELD 1786

Query: 637  KRDPQGNNANQREIRD---NSATLDKTEKESVNNVESKDE 673
            + D +      ++I+D   N A + +  KE    +    E
Sbjct: 1787 EEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKE 1826



 Score = 35.9 bits (82), Expect = 0.40
 Identities = 48/237 (20%), Positives = 88/237 (37%), Gaps = 5/237 (2%)

Query: 416  KPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESD 475
            K  D+A++ E+ ++  E   K E  K    A  K    K  +    K      ++ E + 
Sbjct: 1290 KKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349

Query: 476  SENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGANSAEKSLE 535
            +E +   DE  +      + E K +E   K     +  + K   DE K       +K+ E
Sbjct: 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409

Query: 536  DKHG---KKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKAE--VGDLNEC 590
             K     KK  +  +   E  +   E  K+  +  +  E         KAE       E 
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEA 1469

Query: 591  ESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQ 647
            +   ++    E + + D      E  K+K  ++KK  + K+K +  K+  +   A++
Sbjct: 1470 KKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526



 Score = 35.5 bits (81), Expect = 0.45
 Identities = 59/256 (23%), Positives = 102/256 (39%), Gaps = 25/256 (9%)

Query: 420  QAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDSEND 479
            +A  + + E+ AEE+ K E+ K        E  KK A +          ++EEE ++E  
Sbjct: 1203 EAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAK--------KAEEERNNEEI 1254

Query: 480  GKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGANSAEKSLEDKHG 539
             K +E      ++R    K +E   + A E      K +++++K D A  AE+  +    
Sbjct: 1255 RKFEEARMAHFARRQAAIKAEEA--RKADE-----LKKAEEKKKADEAKKAEEKKKADEA 1307

Query: 540  KKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKAEVGDLNECESCTDSFHM 599
            KK     +   E  +   E  K+ +   +  EE        KA      E E+  D    
Sbjct: 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEA------KKAAEAAKAEAEAAADEAEA 1361

Query: 600  NEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQREIRDNSATLDK 659
             E   E       KE  K+K   +KKK + K+K +  K+  + +     E++  +A   K
Sbjct: 1362 AEEKAEAA--EKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419

Query: 660  TE--KESVNNVESKDE 673
             +  K+     +  DE
Sbjct: 1420 ADEAKKKAEEKKKADE 1435



 Score = 34.3 bits (78), Expect = 0.98
 Identities = 53/269 (19%), Positives = 104/269 (38%), Gaps = 16/269 (5%)

Query: 408  QRVQTLLAKPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTD 467
            ++      K  +  ++ +++++ A    K +  K       K    K  ++E  K +   
Sbjct: 1391 KKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAK 1450

Query: 468  GQSEEESDSENDGKPDENG--SQRGSKRSKEFKTQEKCVKGAGEDENGD---RKASKDEQ 522
             ++EE   +E   K  E    +    K+++E K  ++  K A E +      +KA++ ++
Sbjct: 1451 KKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKK 1510

Query: 523  KIDGANSAEKSLEDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEV--PEENGLVGNPN 580
            K D A  AE++ +    KK   + +A       + +   E  + +E+   EE        
Sbjct: 1511 KADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAK 1570

Query: 581  KAEVGDLNECESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDP 640
            KAE  D N      +     E +   ++    +E +K K  ++KK ++ K K    K+  
Sbjct: 1571 KAE-EDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK-- 1627

Query: 641  QGNNANQREIRDNSATLDKTEKESVNNVE 669
                    E +     L K E E     E
Sbjct: 1628 ------AEEEKKKVEQLKKKEAEEKKKAE 1650



 Score = 32.8 bits (74), Expect = 2.8
 Identities = 49/251 (19%), Positives = 99/251 (39%), Gaps = 14/251 (5%)

Query: 414  LAKPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEE 473
              K  ++ ++ ++ ++ AEE+ K +  K       K    K  ++E  K +    ++EE 
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEA 1482

Query: 474  SDSENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGANSAEKS 533
              ++   K  E       K++ E K   +  K A E +  +     DE K   A  A+K+
Sbjct: 1483 KKADEAKKKAE----EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAK--KAEEAKKA 1536

Query: 534  LEDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKAEVGDLNECESC 593
             E K  ++   + E          E  K+  +  +  E+  +     KAE     + E  
Sbjct: 1537 DEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMA--LRKAEE--AKKAEEA 1592

Query: 594  TDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQREIRDN 653
                 M  + +E  +    K  + +K  ++K K +  +K   +K+  +     + E +  
Sbjct: 1593 RIEEVMKLYEEEKKM----KAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKK 1648

Query: 654  SATLDKTEKES 664
            +  L K E+E+
Sbjct: 1649 AEELKKAEEEN 1659


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of AF4
            (Proto-oncogene AF4) and FMR2 (Fragile X E mental
            retardation syndrome) nuclear proteins. These proteins
            have been linked to human diseases such as acute
            lymphoblastic leukaemia and mental retardation. The
            family also contains a Drosophila AF4 protein homologue
            Lilliputian which contains an AT-hook domain. Lilliputian
            represents a novel pair-rule gene that acts in
            cytoskeleton regulation, segmentation and morphogenesis
            in Drosophila.
          Length = 1154

 Score = 40.3 bits (94), Expect = 0.017
 Identities = 53/270 (19%), Positives = 94/270 (34%), Gaps = 29/270 (10%)

Query: 1782 ENNLIVTLSEAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHS 1841
            E++L ++ SE  D E   E   S            ++ E  A +S S  +S   ES S S
Sbjct: 357  EDDLKLSSSEDSDEEQATEKPPSRNTPPSAP---SSNPEPAASSSGSSSSSSGSESSSGS 413

Query: 1842 VKSNESHFGNSRCQPKTAYQSSSAQSEDGTSYK--------SISWDAYSVIDEPESYDVS 1893
               +ES   +S  +     +++S + E  ++ K         ++    S  +   S    
Sbjct: 414  DSESESSSSDS--EENEPPRTASPEPEPPSTNKWQLDNWLNKVNPHKVSPAESVSSNPPI 471

Query: 1894 WRTCSSESAAREKNSGQRSRGSGDK---AARKSASLPASQIALRDGKGDFKSPRKASHSL 1950
             +    E   +   S             ++++      +Q     G+G  KSP   S + 
Sbjct: 472  KQPMEKEGKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGPESGRGKQKSP-AQSEAP 530

Query: 1951 KDGKRYGNAQSFEAARNSNPYEAPMPGAWQPVLEGTSVFDDAEKSSKSGPKSSKAG---- 2006
               +  G  Q  +  + S   E            GT  +  + ++    PK++  G    
Sbjct: 531  PQRRTVGKKQPKKPEKASAGDERTGLRPESEP--GTLPYGSSVQTPPDRPKAATKGSRKP 588

Query: 2007 -----PKSSKPGPKSSKPGPKSSKRYPSRS 2031
                 PKSS P P + K   KS  +   +S
Sbjct: 589  SPRKEPKSSVP-PAAEKRKYKSPSKIVPKS 617


>gnl|CDD|237497 PRK13767, PRK13767, ATP-dependent helicase; Provisional.
          Length = 876

 Score = 39.9 bits (94), Expect = 0.019
 Identities = 20/54 (37%), Positives = 32/54 (59%)

Query: 304 RLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAG 357
           RL   +  KRG+++++V++     GID   +DLV+ L  P   +  L R+GRAG
Sbjct: 329 RLEVEEKLKRGELKVVVSSTSLELGIDIGYIDLVVLLGSPKSVSRLLQRIGRAG 382


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 39.0 bits (91), Expect = 0.029
 Identities = 19/165 (11%), Positives = 58/165 (35%), Gaps = 4/165 (2%)

Query: 1729 SSSHSQSEFDPKRLHSKESSHKRNRKANDFNDTCGAVANETFDGEDSKGGSDDENNLIVT 1788
            ++S+  +           +S   N     +      +     +   +K   DD  +L   
Sbjct: 67   NTSNQDNNDKKFSTIDSSTSDSNNIIDFIYK----NLPQTNINQLLTKNKYDDNYSLTTL 122

Query: 1789 LSEAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNESH 1848
            +    +   D+     P + ++  N+S  + +S+ +N    ++S   ++D+    S+ + 
Sbjct: 123  IQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNT 182

Query: 1849 FGNSRCQPKTAYQSSSAQSEDGTSYKSISWDAYSVIDEPESYDVS 1893
              ++  +   + + +     +       + +  S   + +S   S
Sbjct: 183  KPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDS 227



 Score = 37.1 bits (86), Expect = 0.12
 Identities = 36/233 (15%), Positives = 81/233 (34%), Gaps = 5/233 (2%)

Query: 1779 SDDENNLIVTLSEAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESD 1838
             D         S+  + +   +  +S   +  D NN+ N   +  + S  D ++ D  + 
Sbjct: 32   KDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNI 91

Query: 1839 SHSVKSNESHFGNSRCQPKTAYQSSSAQSEDGTSYKSISWDAYSVIDEPESYDVSWRTCS 1898
               +  N      ++   K  Y  + + +    +  +++ D  S  ++P + + S    +
Sbjct: 92   IDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDI-SDYEQPRNSEKSTNDSN 150

Query: 1899 SESAAREKNSGQRSRGSGDKAARKSASLPASQIALRDGKGDFKSPRKASHSLKDGKRYGN 1958
              S +  KN         DKA  + A    +       K    +  K +   +   +  +
Sbjct: 151  KNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQP--NSPKPTQPNQSNSQPAS 208

Query: 1959 AQSFEAARNSNPYEAPMPGAWQPVLEGTSVFDDAEKSSKSGPKSSKAGPKSSK 2011
              +     +S   ++    A   +L+  S  +DA+K+ K     SK     + 
Sbjct: 209  DDTANQKSSSKDNQSMSDSALDSILDQYS--EDAKKTQKDYASQSKKDKTETS 259



 Score = 37.1 bits (86), Expect = 0.13
 Identities = 38/209 (18%), Positives = 69/209 (33%), Gaps = 16/209 (7%)

Query: 516 KASKDEQKIDGANSAEKSLEDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEV-----P 570
           K S D++     +S  K  +       +N      +    D+ T+  NN  D +      
Sbjct: 44  KKSNDDETSKDTSS--KDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQ 101

Query: 571 EENGLVGNPNKAEVGDLNECESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSK---KKK 627
                +   NK +    +      + F++N    + +   N+++S       S    K  
Sbjct: 102 TNINQLLTKNKYDDN-YSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKND 160

Query: 628 DRKRKGNGDKRDPQGNNANQREIRDNSATLDKTEKESVNNVESKDERPPSTANNPDGTE- 686
              +    DK D Q   ++       S     + K +  N  +       TAN    ++ 
Sbjct: 161 TDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKD 220

Query: 687 ---WEDIPSDTIPNDETSNDNKLTPVDFD 712
                D   D+I  D+ S D K T  D+ 
Sbjct: 221 NQSMSDSALDSIL-DQYSEDAKKTQKDYA 248


>gnl|CDD|234365 TIGR03817, DECH_helic, helicase/secretion neighborhood putative
           DEAH-box helicase.  A conserved gene neighborhood widely
           spread in the Actinobacteria contains this
           uncharacterized DEAH-box family helicase encoded
           convergently towards an operon of genes for protein
           homologous to type II secretion and pilus formation
           proteins. The context suggests that this helicase may
           play a role in conjugal transfer of DNA.
          Length = 742

 Score = 38.9 bits (91), Expect = 0.032
 Identities = 24/87 (27%), Positives = 42/87 (48%), Gaps = 5/87 (5%)

Query: 288 ANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAA 347
           A + A YL   +D++    +L     G++  + TT+    G+D + +D V+    P   A
Sbjct: 307 AAYRAGYL--PEDRRELERALRD---GELLGVATTNALELGVDISGLDAVVIAGFPGTRA 361

Query: 348 TYLHRMGRAGRYGTRGLVITIVSAESL 374
           +   + GRAGR G   LV+ +   + L
Sbjct: 362 SLWQQAGRAGRRGQGALVVLVARDDPL 388


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
           family.  Members of this protein family occur in
           Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
           related Mycoplasmas in small paralogous families that
           may also include truncated forms and/or pseudogenes.
           Members are predicted lipoproteins with a conserved
           signal peptidase II processing and lipid attachment
           site. Note that the name for certain characterized
           members, p72, reflects an anomalous apparent molecular
           weight, given a theoretical MW of about 61 kDa.
          Length = 541

 Score = 38.7 bits (90), Expect = 0.038
 Identities = 24/113 (21%), Positives = 46/113 (40%), Gaps = 6/113 (5%)

Query: 557 TETTKENNQTDEVPEENGLVGNPNKAEVGDLNECESCTDSFHMNEHSDENDLGNNTKESQ 616
           + T+  + Q ++ PE       P   +  D  E     ++   NE+ DE +  +   E +
Sbjct: 25  STTSSNSKQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKK 84

Query: 617 KQKGH-KSKKKKDRKRKGNGDKR----DPQGNNANQREI-RDNSATLDKTEKE 663
                 K + +K +      DK+     P   +A+Q    + + + LDK +KE
Sbjct: 85  PDPSKNKEEIEKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDFSDLDKLKKE 137


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 38.5 bits (90), Expect = 0.046
 Identities = 48/269 (17%), Positives = 94/269 (34%), Gaps = 35/269 (13%)

Query: 448 SKEVTKKGASKEGTKWNLTDGQSEEESDSENDGKPDENGSQRG-SKRSKEFKTQEKCVKG 506
            +E+ +K   K  ++ +  +    EE D ++D   D+  +     K+  + K  E   + 
Sbjct: 311 GEELRRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEEN 370

Query: 507 AG--------------EDENGD-----RKASKDEQKIDGANSAEKSLEDKHGKKITNSFE 547
           +G              ++EN       R+  + E++ D     E+  +   G++      
Sbjct: 371 SGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEE-ENEEPSKKNVGRRKFG--P 427

Query: 548 AVGEIVQNDTETTKENNQTDEVP----EENGLVGNPNKAEVGDLNECESCTDSFHMNEHS 603
             GE      +  KEN    +      EE  L            N+    ++     E  
Sbjct: 428 ENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEE 487

Query: 604 DENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQREIRDNSATLDKTEKE 663
           +E D  N   ++    G KS KK+D K+K +        + A  +  +       K +KE
Sbjct: 488 EELDEENPWLKTTSSVG-KSAKKQDSKKKSSSKL-----DKAANKISKAAVKVKKKKKKE 541

Query: 664 SVNNVESK--DERPPSTANNPDGTEWEDI 690
              +++    DE      +  D  + +D 
Sbjct: 542 KSIDLDDDLIDEEDSIKLDVDDEEDEDDE 570



 Score = 33.9 bits (78), Expect = 1.2
 Identities = 18/121 (14%), Positives = 39/121 (32%), Gaps = 4/121 (3%)

Query: 417 PLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDS 476
              + + + + ++  E   + E          K   K     E  +      + EEE D 
Sbjct: 437 KKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKE----EEEEELDE 492

Query: 477 ENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGANSAEKSLED 536
           EN      +   + +K+    K     +  A    +      K ++K + +   +  L D
Sbjct: 493 ENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLID 552

Query: 537 K 537
           +
Sbjct: 553 E 553


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 38.4 bits (89), Expect = 0.050
 Identities = 46/269 (17%), Positives = 99/269 (36%), Gaps = 21/269 (7%)

Query: 423 EREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDSENDGKP 482
           +  + E T E + +           + E +   A +EG      + +SE E  +E  G+ 
Sbjct: 634 DVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQ 693

Query: 483 DENGSQRGSKRSKEFKTQEKCVKGAGEDE--------------NGDRKASKDEQKIDGAN 528
           +  G     +   + +T+ + V+  GE E               G+    + E + +G +
Sbjct: 694 EGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKH 753

Query: 529 SAEKSLEDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKAEVGDLN 588
             E   + K  +   +  E   E  +++ E   +  +  E+  + G  G        +  
Sbjct: 754 EVETEGDRKETE---HEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETE-- 808

Query: 589 ECESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQR 648
             E           +D+ ++ + T E +    ++ + K+D K    G   D  G ++ + 
Sbjct: 809 AGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSD--GGDSEEE 866

Query: 649 EIRDNSATLDKTEKESVNNVESKDERPPS 677
           E  +     ++ E+E     E ++E P S
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEEPLS 895



 Score = 35.7 bits (82), Expect = 0.40
 Identities = 37/209 (17%), Positives = 74/209 (35%), Gaps = 8/209 (3%)

Query: 417 PLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTK-KGASKEGTKWNLTDGQSEEESD 475
           P ++  E+E   +   + +  +          +  T+ +G   EG      +G+ E E +
Sbjct: 686 PAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGE-EVEDE 744

Query: 476 SENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKD-EQKIDGANSAEKSL 534
            E + +        G ++  E + + +      EDE G+ +A +D E K D     +   
Sbjct: 745 GEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDE-GEIQAGEDGEMKGDEGAEGKVEH 803

Query: 535 EDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKA----EVGDLNEC 590
           E +      +  E   E   +DTE   E  + +   E  G      K        D  + 
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDS 863

Query: 591 ESCTDSFHMNEHSDENDLGNNTKESQKQK 619
           E   +     E  +E +     +E + ++
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892


>gnl|CDD|224123 COG1202, COG1202, Superfamily II helicase, archaea-specific
           [General function prediction only].
          Length = 830

 Score = 38.6 bits (90), Expect = 0.054
 Identities = 31/123 (25%), Positives = 53/123 (43%), Gaps = 8/123 (6%)

Query: 267 QCVIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTDLAA 326
           Q ++F+  + R   + + L      A         K R +   +F   ++  +VTT   A
Sbjct: 442 QTIVFTYSRRRCHELADALTGKGLKAAPYHAGLPYKERKSVERAFAAQELAAVVTTAALA 501

Query: 327 RGID-AANVDLVINLEIPHD---AATYLHRMGRAGR--YGTRGLVITIVSAESLVKFHSL 380
            G+D  A+  +  +L +  +      +   +GRAGR  Y  RG V  +V  E   K+H+ 
Sbjct: 502 AGVDFPASQVIFESLAMGIEWLSVREFQQMLGRAGRPDYHDRGKVYLLV--EPGKKYHAS 559

Query: 381 MGE 383
           M E
Sbjct: 560 MEE 562


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 37.2 bits (86), Expect = 0.13
 Identities = 37/222 (16%), Positives = 85/222 (38%), Gaps = 26/222 (11%)

Query: 409 RVQTLLAKPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDG 468
           R++ +++K   +  + E+ E+   +  + + T      PSK  +  G S           
Sbjct: 383 RLEEVISKYAMKQDDTEEEERRKRQERERQGTSSRSSDPSKASSTSGESP------SMAS 436

Query: 469 QSEEESDSENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGAN 528
           Q  EE +S  + + +E   +   + S+E + +++      E+E      +  E++++G++
Sbjct: 437 QESEEEESVEEEEEEEEEEEEEEQESEEEEGEDE------EEEEEVEADNGSEEEMEGSS 490

Query: 529 SAEKSLEDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKAEVGDLN 588
             +   E+        + E  G    ++ +  + ++   E P+E      P + E     
Sbjct: 491 EGDGDGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQE-----EPLQPES---- 541

Query: 589 ECESCTDSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRK 630
                 D+  + E SDE  L   +  S   +        + K
Sbjct: 542 -----MDAESVGEESDEELLAEESPLSSHTELEGVATPVETK 578



 Score = 33.3 bits (76), Expect = 1.7
 Identities = 21/136 (15%), Positives = 45/136 (33%), Gaps = 13/136 (9%)

Query: 1745 KESSHKRNRKANDFNDTCGAVANETFDGEDSKGGSDDENN--------LIVTLSEAGDSE 1796
            +E     +R ++    +  +  + +   ++S+     E               SE  + E
Sbjct: 409  RERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGE 468

Query: 1797 VDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNESHFGNSRCQP 1856
             + E E     E E  N S+ + E +++     E   +     +S  +  S     +   
Sbjct: 469  DEEEEE-----EVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQQPR 523

Query: 1857 KTAYQSSSAQSEDGTS 1872
             ++ Q  S Q E    
Sbjct: 524  GSSVQPESPQEEPLQP 539


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 35.6 bits (82), Expect = 0.38
 Identities = 22/94 (23%), Positives = 40/94 (42%), Gaps = 1/94 (1%)

Query: 893 DRTKRRSRPRKNKENTDILNTLGQDKTSAHGRTPSGGHRTPERRRSDGVRNDTRDERNGA 952
           DR++ R R R+++E +   ++  +D+     R+P     +  RR  D  R  +R  R+  
Sbjct: 30  DRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIE 89

Query: 953 YTGDKGGRNRGHSVNSRKDHSFAGETPRSKGGFD 986
               +  R+R  S   RKD            G++
Sbjct: 90  -QHRRRLRDRSPSNQWRKDDKKRSLWDIKPPGYE 122



 Score = 32.2 bits (73), Expect = 3.6
 Identities = 27/146 (18%), Positives = 46/146 (31%), Gaps = 28/146 (19%)

Query: 1043 EQPVDEKKSRRELSQKMDSDRYRPDSDGDPNRRPSSQESLHHNDSLDIKLRKKLKPGHRL 1102
            E+P  E++  R   +   S+R R         R  S+    H  S +   R+  +P  R 
Sbjct: 3    EEPDREREKSRGRDRDRSSERPR------RRSRDRSRFRDRHRRSRERSYREDSRPRDRR 56

Query: 1103 QRNSKKRHSKISTERSNSNSAPREKSSDAKQAANEEENITGGNDENKTPVASEVRNKNKK 1162
            + +S+   S   +    S   PR +S   +                          + + 
Sbjct: 57   RYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRR-------------------RLRD 97

Query: 1163 VKGVKEKNPSIVGDEKRRYWWRSPPG 1188
                 +       D+KR  W   PPG
Sbjct: 98   RSPSNQWRKD---DKKRSLWDIKPPG 120


>gnl|CDD|188395 TIGR03865, PQQ_CXXCW, PQQ-dependent catabolism-associated CXXCW
           motif protein.  Members of this protein family have a
           CXXXCW motif, consistent with a possible role in redox
           cofactor binding. This protein family shows strong
           relationships by phylogenetic profiling and conserved
           gene neighborhoods with a transport system for alcohols
           metabolized by PQQ-dependent enzymes.
          Length = 162

 Score = 33.9 bits (78), Expect = 0.40
 Identities = 19/62 (30%), Positives = 25/62 (40%), Gaps = 5/62 (8%)

Query: 674 RPPSTANNPDGTEWEDIPSDTIPND----ETSNDNKLTPVDFDYINDFVNQTPSETGDRP 729
           RPP   N  +GT W D P  ++P       T   N L P    Y+   + +       RP
Sbjct: 60  RPPKPKNLLEGTVWRDEPHLSLPGSLWLPNTGYGN-LAPAWQAYLRRGLARATGGDKGRP 118

Query: 730 LV 731
           LV
Sbjct: 119 LV 120


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 35.3 bits (81), Expect = 0.44
 Identities = 45/268 (16%), Positives = 92/268 (34%), Gaps = 35/268 (13%)

Query: 453 KKGASKEGTKWNLTDGQSEEESDSENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDEN 512
           +KG    G    + D + ++E D +   K  E+G +  SK+ K+   + K  K   +D+ 
Sbjct: 203 EKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNK--KKLDDDKK 260

Query: 513 GDRKASKDEQKIDGAN--------------SAEKSLEDKHGKKITNSFEAVGEIVQNDTE 558
           G R    D  + D  +              SA  +  ++   K++    A  EI Q++  
Sbjct: 261 GKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDS 320

Query: 559 TTKENNQTDEVPEENGLVGNPNKAEVGDL-----NECESCTDSFHMNEHSDENDLGNNTK 613
              E    +E  EE G +    K            + +        ++   + +   +  
Sbjct: 321 EESE----EEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLV 376

Query: 614 ESQKQKGHKSKKKKDR---KRKGNGDKRDP--QGNNANQREIRDNSAT-----LDKTEKE 663
            ++KQK  K ++  D        +G  R      +   ++   + S +       K + E
Sbjct: 377 TAKKQKEPKKEEPVDSNPSSPGNSGPARPSPESKDKGKRKAANEVSKSPASVPAKKLKTE 436

Query: 664 SVNNVESKDERPPSTANNPDGTEWEDIP 691
           +     S    P + + +   +   D  
Sbjct: 437 NAPKSSSGKSTPQTFSGSKSSSNAADGG 464



 Score = 32.2 bits (73), Expect = 4.3
 Identities = 19/103 (18%), Positives = 36/103 (34%), Gaps = 1/103 (0%)

Query: 416 KPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESD 475
           K   +   + ++EQ  +     E      G  SK+  KK    +G K  L    S+   D
Sbjct: 303 KLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKK-GKKLKKLKGKKNGLDKDDSDSGDD 361

Query: 476 SENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKAS 518
           S++     E+     + + ++   +E+ V            A 
Sbjct: 362 SDDSDIDGEDSVSLVTAKKQKEPKKEEPVDSNPSSPGNSGPAR 404


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 35.2 bits (81), Expect = 0.48
 Identities = 23/76 (30%), Positives = 33/76 (43%), Gaps = 3/76 (3%)

Query: 468 GQSEEESDSENDGKPDENGSQRGSKRSKEFKTQEK--CVKGAGEDENGDRKASKDEQKID 525
           G SE+   S N+  P  + S +GSK+ K   T  K    +   +DE    K  K  QK  
Sbjct: 398 GSSEKSLGS-NESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKG 456

Query: 526 GANSAEKSLEDKHGKK 541
              S++   + K G K
Sbjct: 457 RDKSSKVPSDSKAGGK 472


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
            conserved protein present in eukaryotes.
          Length = 680

 Score = 35.3 bits (81), Expect = 0.50
 Identities = 25/161 (15%), Positives = 49/161 (30%), Gaps = 2/161 (1%)

Query: 1094 KKLKPGHRLQRNSKKRHSKISTERSNSNSAPREKSSDAKQAANEEENITGGNDENKTPVA 1153
            K+L P      N     S  S       S   +    +    +    I   + ++     
Sbjct: 228  KELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIGINNHHSKHADSKL 287

Query: 1154 SEVRNKNKKVKGVKEKNPSIVGDEKRRYWWRSPPGGGPNAARH-KVCDDTDFQISWNDPH 1212
              +    +        + S     K      S    G   ++  K    ++   S + P 
Sbjct: 288  QTI-EVIENHSNKSRPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPK 346

Query: 1213 PPKLTHGTGKSRSKSSHSKQSKRSAENPSCRRMRGQDAFAP 1253
                 +G+  S S S +  + KR++++ S  R   +DA   
Sbjct: 347  SHSSANGSVPSSSVSDNESKQKRASKSSSGARDSKKDASGM 387


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
           region.  This highly conserved region is found towards
           the C-terminus of the transmembrane domain. The function
           is unclear.
          Length = 151

 Score = 33.4 bits (76), Expect = 0.56
 Identities = 28/97 (28%), Positives = 37/97 (38%), Gaps = 9/97 (9%)

Query: 599 MNEHSDENDLGNNTKESQKQKGHKSKKKKDRK---RKGNGDKRD--PQGNNANQREIRDN 653
           M E S E  L    KES K +  KS KKK  K   +KG   K++  P G        ++ 
Sbjct: 1   MKETSYEEALAKQRKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEKPNGKIPEHEPNQEV 60

Query: 654 SATLDKTEKESVNNVESKDER----PPSTANNPDGTE 686
           +      EKE V  V           P  A  P  ++
Sbjct: 61  TEVEVIIEKEPVPAVAVAPVPVAVVAPVVAPKPKKSQ 97


>gnl|CDD|236779 PRK10864, PRK10864, putative methyltransferase; Provisional.
          Length = 346

 Score = 34.8 bits (80), Expect = 0.57
 Identities = 24/78 (30%), Positives = 33/78 (42%), Gaps = 4/78 (5%)

Query: 915 GQDKTSAHGRTPSGGHRTPERRRSDGVRNDTRDERNGAYTGDKGGRNRGHSVNSRKDHSF 974
             DK + + RT  GG R   + R+DG R   RD+RN      K   +   +V SR     
Sbjct: 22  DSDKRTHNPRTGKGGGRPSGKSRADGGRRPARDDRNSQSRDRKWEDSPWRTV-SRAPGDE 80

Query: 975 AGETPRSKGGFD--SYLD 990
             E     GG    S++D
Sbjct: 81  TPEKA-DHGGISGKSFID 97


>gnl|CDD|236641 PRK10019, PRK10019, nickel/cobalt efflux protein RcnA; Provisional.
          Length = 279

 Score = 34.0 bits (78), Expect = 0.88
 Identities = 10/43 (23%), Positives = 16/43 (37%), Gaps = 6/43 (13%)

Query: 1537 QGWNEHSELQNGIDYGRLPIGHQHNHDHRHKHRFNHSSSGGNP 1579
            + W E+    +          H H+H+H H H  +H    G  
Sbjct: 114  RNWLENMHHHDH------DHDHDHDHEHHHDHGHHHHHEHGAT 150


>gnl|CDD|226583 COG4098, comFA, Superfamily II DNA/RNA helicase required for DNA
           uptake (late competence protein) [DNA replication,
           recombination, and repair].
          Length = 441

 Score = 34.0 bits (78), Expect = 0.92
 Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 5/73 (6%)

Query: 298 AQDQKARLASLDSFKRGKVRILVTTDLAARGIDAANVDL-VINLEIPHDAATYLHRM-GR 355
           ++DQ  R   +++F+ GK+ +L+TT +  RG+   NVD+ V+  E      + L ++ GR
Sbjct: 339 SEDQH-RKEKVEAFRDGKITLLITTTILERGVTFPNVDVFVLGAEHRVFTESALVQIAGR 397

Query: 356 AGRYGTR--GLVI 366
            GR   R  G V+
Sbjct: 398 VGRSLERPTGDVL 410


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 34.3 bits (78), Expect = 0.98
 Identities = 19/82 (23%), Positives = 30/82 (36%), Gaps = 3/82 (3%)

Query: 1951 KDGKRYGNAQSFEAARNSNPYEAPMPGAWQPVLEGTSVFDDAEKSSKSGPKSSKAG--PK 2008
            K  K+    +  ++ ++  P E P      P   G    ++ E          K G  P 
Sbjct: 491  KSKKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPG 550

Query: 2009 SSKPGPKSSKPGPKSSKRYPSR 2030
             +K G    KPGP    + PS+
Sbjct: 551  ETKEGEVGKKPGPAKEHK-PSK 571



 Score = 31.2 bits (70), Expect = 8.4
 Identities = 22/105 (20%), Positives = 35/105 (33%), Gaps = 3/105 (2%)

Query: 1923 SASLPASQIALRDGKGDFKSPRKASHSLKDGKRYGNAQSFEAARNSNPYEAPMPGAWQPV 1982
            ++ LP      ++G+       K S   K+G + G  +  E  +   P +   P    P 
Sbjct: 516  ASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKI-PT 574

Query: 1983 LEGTSVFDDAEKSSK--SGPKSSKAGPKSSKPGPKSSKPGPKSSK 2025
            L     F    K  K    PK  K    + +P    S   P+   
Sbjct: 575  LSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLD 619


>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2.  A Saccharomyces
           cerevisiae member of this family (PGA2) is an ER protein
           which has been implicated in protein trafficking.
          Length = 139

 Score = 32.1 bits (73), Expect = 1.3
 Identities = 21/103 (20%), Positives = 39/103 (37%), Gaps = 12/103 (11%)

Query: 423 EREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDSENDGKP 482
            +   ++  +E ++ E  +  +   S    + GA+          G  EE++D E D + 
Sbjct: 39  AKAQEKEHEKERAEREEAREKKAKISPNALRGGATA---------GHGEEDTDDEEDEED 89

Query: 483 DENGS---QRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQ 522
               S   Q G K  K  +   + +  A E    D+   +DE 
Sbjct: 90  FATPSAVPQWGKKARKRQRKVIRKLLEAEEQLREDQYDDEDED 132


>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN).  This family
            consists of several eukaryotic survival motor neuron
            (SMN) proteins. The Survival of Motor Neurons (SMN)
            protein, the product of the spinal muscular
            atrophy-determining gene, is part of a large
            macromolecular complex (SMN complex) that functions in
            the assembly of spliceosomal small nuclear
            ribonucleoproteins (snRNPs). The SMN complex functions as
            a specificity factor essential for the efficient assembly
            of Sm proteins on U snRNAs and likely protects cells from
            illicit, and potentially deleterious, non-specific
            binding of Sm proteins to RNAs.
          Length = 264

 Score = 33.4 bits (76), Expect = 1.3
 Identities = 35/128 (27%), Positives = 50/128 (39%), Gaps = 28/128 (21%)

Query: 1735 SEFDPKRLHSKESSHKRNRKANDFN----------DTCGAVANE-------TFDGEDSKG 1777
             E D +    K  ++K+NR     N          D+C AV +E       T    D K 
Sbjct: 38   QENDEQNPGKKRKNNKKNRSRKKCNAAPLKEWQVGDSCNAVWSEDGNLYTATITSIDQKR 97

Query: 1778 GSDDENNLIVTLSEAGDSE----VDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSH 1833
            G+      +V  +  G+ E     D+     P D DEDA    N  E+ +   +SD +SH
Sbjct: 98   GT-----CVVFYTGYGNEEEQNLADLLTP--PPDVDEDALKEANVNETESSTDESDRSSH 150

Query: 1834 DGESDSHS 1841
              E  S S
Sbjct: 151  SHEVRSKS 158


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 33.7 bits (77), Expect = 1.4
 Identities = 22/149 (14%), Positives = 52/149 (34%), Gaps = 6/149 (4%)

Query: 430 TAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDSENDGKPDENGSQR 489
                     TKP +   ++   ++   KE  K      + + + + + D KP E   ++
Sbjct: 82  KGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPK-DRKPKEEAKEK 140

Query: 490 GSKRSKEFKTQEKCVKGAGEDENGDR-----KASKDEQKIDGANSAEKSLEDKHGKKITN 544
              + KE + ++K  +    +E   R     K+   +       + +K   ++  ++   
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAA 200

Query: 545 SFEAVGEIVQNDTETTKENNQTDEVPEEN 573
                G+  + D    +E  + D    E 
Sbjct: 201 REAVKGKPEEPDVNEEREKEEDDGKDRET 229


>gnl|CDD|237253 PRK12898, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 656

 Score = 33.4 bits (77), Expect = 1.6
 Identities = 37/113 (32%), Positives = 48/113 (42%), Gaps = 22/113 (19%)

Query: 285 LANANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTDLAARGID-------AANVDL- 336
           L  A    + L+  QD  A  A++ +    + RI V T++A RG D       AA   L 
Sbjct: 493 LREAGLPHQVLNAKQD--AEEAAIVARAGQRGRITVATNMAGRGTDIKLEPGVAARGGLH 550

Query: 337 VINLEIPHDAATYLHRM-----GRAGRYGTRGLVITIVSAES--LVKFHSLMG 382
           VI  E  HD+A    R+     GR GR G  G    I+S E   L  F    G
Sbjct: 551 VILTE-RHDSA----RIDRQLAGRCGRQGDPGSYEAILSLEDDLLQSFLGSRG 598


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 33.3 bits (77), Expect = 1.7
 Identities = 29/235 (12%), Positives = 66/235 (28%), Gaps = 23/235 (9%)

Query: 415 AKPLDQAQEREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEES 474
            +    A      EQ A E++  +         +     + AS           +  E +
Sbjct: 78  RRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAA 137

Query: 475 DSENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRKASKDEQKIDGANSAEKSL 534
                 K  E G Q  ++   +   + +  +       GDR+  + E +       E+  
Sbjct: 138 RRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERG 197

Query: 535 EDKHGKKITNSFEAVGEIVQNDTETTKENNQTDEVPEENGLVGNPNKAEVGDLNECESCT 594
            D              +  + D     +  +     +     G   + +  D        
Sbjct: 198 RDG------------DDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDAR------ 239

Query: 595 DSFHMNEHSDENDLGNNTKESQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQRE 649
                 + + E+    +  + + + G + ++ +DR R+G              RE
Sbjct: 240 -----GDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPELRE 289



 Score = 32.2 bits (74), Expect = 3.9
 Identities = 16/107 (14%), Positives = 29/107 (27%), Gaps = 5/107 (4%)

Query: 880 NSSDYQSDSKPYGDRTKRRSRPRKNKENTDILNTLGQDKTSAHGRTPSGGHRTPERRRSD 939
            +   +   +   D  +RR      +   +       ++     R   G  R    RR  
Sbjct: 157 RADAAERTEEEERDERRRRGDREDRQAEAE-----RGERGRREERGRDGDDRDRRDRREQ 211

Query: 940 GVRNDTRDERNGAYTGDKGGRNRGHSVNSRKDHSFAGETPRSKGGFD 986
           G R + R  R+G     +  R          +    G+     G   
Sbjct: 212 GDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGR 258


>gnl|CDD|233032 TIGR00580, mfd, transcription-repair coupling factor (mfd).  All
           proteins in this family for which functions are known
           are DNA-dependent ATPases that function in the process
           of transcription-coupled DNA repair in which the repair
           of the transcribed strand of actively transcribed genes
           is repaired at a higher rate than the repair of
           non-transcribed regions of the genome and than the
           non-transcribed strand of the same gene. This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University). This family is
           closely related to the RecG and UvrB families [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 926

 Score = 33.5 bits (77), Expect = 1.7
 Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 9/59 (15%)

Query: 310 SFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATY----LHRM-GRAGRYGTRG 363
            F +G+ ++LV T +   GID  N     N  I   A  +    L+++ GR GR   + 
Sbjct: 707 EFYKGEFQVLVCTTIIETGIDIPNA----NTIIIERADKFGLAQLYQLRGRVGRSKKKA 761


>gnl|CDD|234506 TIGR04221, SecA2_Mycobac, accessory Sec system translocase SecA2,
           Actinobacterial type.  Members of this family are the
           SecA2 subunit of the Mycobacterial type of accessory
           secretory system. This family is quite different SecA2
           of the Staph/Strep type (TIGR03714).
          Length = 762

 Score = 33.6 bits (77), Expect = 1.7
 Identities = 31/110 (28%), Positives = 42/110 (38%), Gaps = 17/110 (15%)

Query: 317 RILVTTDLAARGID-------AANVDLVINLEIPHDAATYLHRM--------GRAGRYGT 361
            + V+T +A RG D        A+ D V  L   H   T  HR         GRAGR G 
Sbjct: 475 AVTVSTQMAGRGTDIRLGGSDEADHDRVAELGGLHVIGTGRHRTARLDNQLRGRAGRQGD 534

Query: 362 RGLVITIVSAE-SLVKFHSLMGEINLDHAFNVGLVPDNLTGDQINWTQRV 410
            G  +  VS E  +V        +    A + G +      D ++  QRV
Sbjct: 535 PGSSVFFVSLEDDVVAVGGAGETVPAQPAED-GRIESPRVQDFVDHAQRV 583


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 33.4 bits (76), Expect = 1.7
 Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 23/164 (14%)

Query: 414 LAKPLDQAQ-----EREDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDG 468
             +  D AQ     +R++V Q  +E+  +            +   +   +E  K  +   
Sbjct: 232 AQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIK 291

Query: 469 QSEEESDSENDGKPDENGSQRGSKRSKEFKTQEKCVKGAGEDENGDRK-ASKDEQKIDGA 527
           +++EE+    D K  +          +E K  EK  +    +    R+  ++D QK    
Sbjct: 292 KNDEEALKAKDHKAFD--------LKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQ 343

Query: 528 NSAEKSLEDK---------HGKKITNSFEAVGEIVQNDTETTKE 562
             A+ +  ++         +G K+ +    + E+V  D +T   
Sbjct: 344 VEAQPTSLNEDAIDSSNPVYGLKVVDPITNLSELVLIDLKTEVR 387


>gnl|CDD|239431 cd03315, MLE_like, Muconate lactonizing enzyme (MLE) like subgroup of
            the enolase superfamily. Enzymes of this subgroup share
            three conserved carboxylate ligands for the essential
            divalent metal ion (usually Mg2+), two aspartates and a
            glutamate, and residues that can function as general
            acid/base catalysts, a Lys-X-Lys motif and another
            conserved lysine. Despite these conserved residues, the
            members of the MLE subgroup, like muconate lactonizing
            enzyme, o-succinylbenzoate synthase (OSBS) and
            N-acylamino acid racemase (NAAAR), catalyze different
            reactions.
          Length = 265

 Score = 32.7 bits (75), Expect = 2.1
 Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 7/52 (13%)

Query: 1016 DLVVYDDAFDMFSP-----AANNLDEYEIDYVEQPV--DEKKSRRELSQKMD 1060
            D  +  DA   ++P     A   L++  +DYVEQP+  D+ + R  L++  D
Sbjct: 129  DAELRVDANRGWTPKQAIRALRALEDLGLDYVEQPLPADDLEGRAALARATD 180


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
            nuclear protein involved in bud-site selection. It plays
            a role in positioning the proximal bud pole signal. More
            recently it has been shown to be involved in ribosome
            biogenesis.
          Length = 424

 Score = 32.9 bits (75), Expect = 2.2
 Identities = 36/184 (19%), Positives = 65/184 (35%), Gaps = 26/184 (14%)

Query: 1740 KRLHSKESSHKRNRKANDFNDTCGAVANETFDGEDSKGGSDDENNLIVTLSEAGDSEVDV 1799
            K    K++   + ++A + +D       E  + ED     +   +         + E D 
Sbjct: 148  KGKAKKKTKKSKKKEAKESSDK---DDEEESESEDESKSEESAED-----DSDDEEEEDS 199

Query: 1800 EGE----------NSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNESHF 1849
            + E          +S  +E+ +   S N  E T+++   + +S   ES S S     S  
Sbjct: 200  DSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSEESS-- 257

Query: 1850 GNSRCQPKTAYQSSSAQSEDGTSYKSISWDAYSVIDEPESYDVSWRTCSSESAAREKNSG 1909
                 +PK    SS+        Y S S D     ++ +   V       +   R+   G
Sbjct: 258  -PPSKKPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDIDPDQV-----VKKPVKRKNRRG 311

Query: 1910 QRSR 1913
            QR+R
Sbjct: 312  QRAR 315


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 33.1 bits (75), Expect = 2.3
 Identities = 17/92 (18%), Positives = 33/92 (35%), Gaps = 2/92 (2%)

Query: 1791 EAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNESHFG 1850
            E G +       N+P D+D +A       +     + +D    D +        +E   G
Sbjct: 410  EEGRAAAAAAAANAPADQDVEAEAHDEFDQDPGAPAHADRADSDEDDMDEQESGDERADG 469

Query: 1851 NSRCQPKTAYQSSSAQSEDGTSYKSISWDAYS 1882
                    +Y ++S++ E  ++    SW   S
Sbjct: 470  EDDSDSSYSYSTTSSEDESDSADD--SWGDES 499


>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
            Provisional.
          Length = 337

 Score = 32.8 bits (75), Expect = 2.3
 Identities = 14/64 (21%), Positives = 28/64 (43%), Gaps = 5/64 (7%)

Query: 1789 LSEAGDSEVDVEGE--NSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSNE 1846
            L EA D + D   +  +   D  + ++++ +D  S    ++ D+      + +   KSNE
Sbjct: 276  LGEAIDEDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGA---KSNE 332

Query: 1847 SHFG 1850
             H  
Sbjct: 333  LHLT 336


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 31.7 bits (72), Expect = 2.9
 Identities = 20/64 (31%), Positives = 32/64 (50%)

Query: 425 EDVEQTAEESSKVENTKPLRGAPSKEVTKKGASKEGTKWNLTDGQSEEESDSENDGKPDE 484
           EDVE  ++++SK + T P +   +K+  KK   KE       +  S+ E + END K D 
Sbjct: 41  EDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDY 100

Query: 485 NGSQ 488
              +
Sbjct: 101 ELPK 104


>gnl|CDD|233483 TIGR01587, cas3_core, CRISPR-associated helicase Cas3.  This model
           represents the highly conserved core region of an
           alignment of Cas3, a protein found in association with
           CRISPR repeat elements in a broad range of bacteria and
           archaea. Cas3 appears to be a helicase, with regions
           found by pfam00270 (DEAD/DEAH box helicase) and
           pfam00271 (Helicase conserved C-terminal domain). Some
           but not all members have an N-terminal HD domain region
           (pfam01966) that is not included within this model
           [Mobile and extrachromosomal element functions, Other].
          Length = 358

 Score = 32.4 bits (74), Expect = 2.9
 Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 3/55 (5%)

Query: 308 LDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTR 362
           L+  K+ +  ++V T +    +D  + D++I    P D+     R+GR  RYG +
Sbjct: 271 LEEMKKNEKFVIVATQVIEASLD-ISADVMITELAPIDSLI--QRLGRLHRYGRK 322


>gnl|CDD|215597 PLN03137, PLN03137, ATP-dependent DNA helicase; Q4-like;
           Provisional.
          Length = 1195

 Score = 32.6 bits (74), Expect = 3.2
 Identities = 26/99 (26%), Positives = 44/99 (44%), Gaps = 1/99 (1%)

Query: 265 FNQC-VIFSNYQLRPEVICENLANANFGAEYLSGAQDQKARLASLDSFKRGKVRILVTTD 323
           F++C +I+   ++  E + E L      A +  G+ D   R      + + ++ I+  T 
Sbjct: 679 FDECGIIYCLSRMDCEKVAERLQEFGHKAAFYHGSMDPAQRAFVQKQWSKDEINIICATV 738

Query: 324 LAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTR 362
               GI+  +V  VI+  +P     Y    GRAGR G R
Sbjct: 739 AFGMGINKPDVRFVIHHSLPKSIEGYHQECGRAGRDGQR 777


>gnl|CDD|215814 pfam00242, DNA_pol_viral_N, DNA polymerase (viral) N-terminal domain.
            
          Length = 379

 Score = 32.1 bits (73), Expect = 3.7
 Identities = 15/82 (18%), Positives = 25/82 (30%), Gaps = 3/82 (3%)

Query: 1995 SSKSGPKSSKAGPKSSKPGPKSSKPGPKSSKRYPSRSGSSGLNN---RTEGNSVSHTWEN 2051
              +    S +    S+   P   +P         + S SS  +    R    S   T E 
Sbjct: 262  GQQGRSGSIRGRKHSTTRRPFGVEPSSSGVTTNRASSSSSCFHQSAVRETAYSSLSTSER 321

Query: 2052 HFATGTDRRSHQYPAYGSMSRH 2073
            H ++G        P     S++
Sbjct: 322  HSSSGHAVELRSIPGGSVSSQN 343


>gnl|CDD|114045 pfam05297, Herpes_LMP1, Herpesvirus latent membrane protein 1 (LMP1).
             This family consists of several latent membrane protein
            1 or LMP1s mostly from Epstein-Barr virus. LMP1 of EBV is
            a 62-65 kDa plasma membrane protein possessing six
            membrane spanning regions, a short cytoplasmic N-terminus
            and a long cytoplasmic carboxy tail of 200 amino acids.
            EBV latent membrane protein 1 (LMP1) is essential for
            EBV-mediated transformation and has been associated with
            several cases of malignancies. EBV-like viruses in
            Cynomolgus monkeys (Macaca fascicularis) have been
            associated with high lymphoma rates in immunosuppressed
            monkeys.
          Length = 382

 Score = 31.9 bits (72), Expect = 4.2
 Identities = 37/142 (26%), Positives = 51/142 (35%), Gaps = 16/142 (11%)

Query: 1725 KHHHSSS--HSQSEFDPKRLHSKESSHKRNRK------ANDFNDTCGAVANETFDGEDSK 1776
            +HHH  S  H Q   D    H  +S+    R       A D    C         G D+ 
Sbjct: 194  EHHHDDSLPHPQQATDDSG-HESDSNSNEGRHHLLVSGAGDGPPLCSQNLGAPGGGPDN- 251

Query: 1777 GGSDDENNLIVTLSEAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGE 1836
             G  D +N      + G  + D   +N P D D   +N   D ++TA N   D ++ D  
Sbjct: 252  -GPQDPDNT----DDNGPQDPDNTDDNGPQDPDNTDDNGPQDPDNTADNGPQDPDNTDDN 306

Query: 1837 SDSHSVKSNES-HFGNSRCQPK 1857
                 +  N S   GN    PK
Sbjct: 307  GPHDPLPHNPSDSAGNDGGPPK 328


>gnl|CDD|114337 pfam05609, LAP1C, Lamina-associated polypeptide 1C (LAP1C).  This
            family contains rat LAP1C proteins and several
            uncharacterized highly related sequences from both mice
            and humans. LAP1s (lamina-associated polypeptide 1s) are
            type 2 integral membrane proteins with a single
            membrane-spanning region of the inner nuclear membrane.
            LAP1s bind to both A- and B-type lamins and have a
            putative role in the membrane attachment and assembly of
            the nuclear lamina.
          Length = 465

 Score = 31.9 bits (72), Expect = 4.3
 Identities = 25/105 (23%), Positives = 37/105 (35%), Gaps = 16/105 (15%)

Query: 1898 SSESAAREKNSGQRSRGSGDKAARKSASLPASQIALRDGKGDFKSPRKASHSLKDGKRYG 1957
            SS +  +E    + S  S DK  R  A L +S  AL     DF +  +    L  G +  
Sbjct: 109  SSSNVTKEPLRSRDSHESSDKVGRADAHLGSSSWALPKSASDFTAHSQQPSVLTTGSQ-- 166

Query: 1958 NAQSFEAARNSNPYEAPMPGAWQPVLEGTSVFDDAEKSSKSGPKS 2002
                          +AP     Q   E T +       S++G +S
Sbjct: 167  --------------KAPQEWHSQAGRERTRMLRRRILKSEAGNQS 197


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1 protein
            homologues. SDA1 is a Saccharomyces cerevisiae protein
            which is involved in the control of the actin
            cytoskeleton. The protein is essential for cell viability
            and is localised in the nucleus.
          Length = 317

 Score = 31.5 bits (72), Expect = 4.7
 Identities = 14/78 (17%), Positives = 31/78 (39%), Gaps = 1/78 (1%)

Query: 1771 DGEDSKGGSDDENNLIVTLSEAGDSEVD-VEGENSPTDEDEDANNSQNDGESTAQNSQSD 1829
             G +S    D+E    V   E  D E + ++ E+    E  D+ + +   E+  +  +  
Sbjct: 90   QGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDS 149

Query: 1830 ENSHDGESDSHSVKSNES 1847
            +     E +  + +  E+
Sbjct: 150  DEELSEEDEEEAAEEEEA 167


>gnl|CDD|200575 cd10951, CE4_ClCDA_like, Catalytic NodB homology domain of
           Colletotrichum lindemuthianum chitin deacetylase and
           similar proteins.  This family is represented by the
           chitin deacetylase (endo-chitin de-N-acetylase, ClCDA,
           EC 3.5.1.41) from Colletotrichum lindemuthianum (also
           known as Glomerella lindemuthiana), which is a member of
           the carbohydrate esterase 4 (CE4) superfamily. ClCDA
           catalyzes the hydrolysis of N-acetamido groups of
           N-acetyl-D-glucosamine residues in chitin, converting it
           to chitosan in fungal cell walls. It consists of a
           single catalytic domain similar to the deformed
           (alpha/beta)8 barrel fold adopted by other CE4
           esterases, which encompasses a mononuclear metalloenzyme
           employing a conserved His-His-Asp zinc-binding triad
           closely associated with the conserved catalytic base
           (aspartic acid) and acid (histidine), to carry out
           acid/base catalysis. It possesses a highly conserved
           substrate-binding groove, with subtle alterations that
           influence substrate specificity and subsite affinity.
           Unlike its bacterial homologs, ClCDA contains two
           intramolecular disulfide bonds that may add stability to
           this secreted protein. The family also includes many
           uncharacterized deacetylases and hypothetical proteins
           mainly from eukaryotes, which show high sequence
           similarity to ClCDA.
          Length = 197

 Score = 31.1 bits (71), Expect = 4.9
 Identities = 14/38 (36%), Positives = 21/38 (55%)

Query: 196 LVVSATYSHDNLATLQKYMVDPLLIRPEDATRPLLGVK 233
            + S T+SH +L  L    +   + + EDA R +LGVK
Sbjct: 65  QIASHTWSHPDLTKLSAAQIRDEMTKLEDALRKILGVK 102


>gnl|CDD|130302 TIGR01235, pyruv_carbox, pyruvate carboxylase.  This enzyme plays a
           role in gluconeogensis but not glycolysis [Energy
           metabolism, Glycolysis/gluconeogenesis].
          Length = 1143

 Score = 32.1 bits (73), Expect = 5.3
 Identities = 17/41 (41%), Positives = 23/41 (56%), Gaps = 4/41 (9%)

Query: 753 KACLTNDSVRVQVLMEKLDHVDASVLADWILNKHNNVAALF 793
           KA   ND V V+ L+E+  H++  +L D    KH NV  LF
Sbjct: 189 KAAFGNDEVYVEKLIERPRHIEVQLLGD----KHGNVVHLF 225


>gnl|CDD|222258 pfam13604, AAA_30, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. There is a Walker A and
           Walker B.
          Length = 195

 Score = 31.0 bits (71), Expect = 5.5
 Identities = 22/80 (27%), Positives = 35/80 (43%), Gaps = 11/80 (13%)

Query: 45  SPIQMAAFPYVLNGYD--AIVQSKSGTGKSIVFVVAALNKLIKLPVQQPQTII-LAPTRE 101
           S  Q  A  +VL   D  A+VQ  +GTGK+    + A  +  +        +I LAPT +
Sbjct: 3   SAEQREAVRHVLTSGDRVAVVQGPAGTGKT--TSLKAAREAWE---AAGYRVIGLAPTGK 57

Query: 102 IAVQITDVVRSVGAHVKNLH 121
            A  + +    +G   + L 
Sbjct: 58  AAKVLGE---ELGIEARTLA 74


>gnl|CDD|114448 pfam05722, Ustilago_mating, Ustilago B locus mating-type protein.
            This family consists of several Ustilago mating-type
            proteins. The b locus of the phytopathogenic fungus
            Ustilago maydis encodes a multiallelic recognition
            function that controls the ability of the fungus to form
            a dikaryon and complete the sexual stage of the life
            cycle. The b locus has at least 25 alleles and any
            combination of two different alleles, brought together by
            mating between haploid cells, allows the fungus to cause
            disease and undergo sexual development within the plant.
          Length = 239

 Score = 31.3 bits (70), Expect = 5.7
 Identities = 25/109 (22%), Positives = 48/109 (44%), Gaps = 5/109 (4%)

Query: 1985 GTSVFDDAEKSSKSGPKSSKAGPKSS----KPGPKSS-KPGPKSSKRYPSRSGSSGLNNR 2039
            G  V+D    + KS PK   A   ++     P  K++    PK  K  P  S +  +++ 
Sbjct: 81   GDWVYDLCAANKKSPPKPGMARAVTTAAKRHPARKTAPAAKPKPKKAKPRASKTPSIDST 140

Query: 2040 TEGNSVSHTWENHFATGTDRRSHQYPAYGSMSRHDLYFDESQLLGNEAL 2088
             + +++  T E    +  D     + + GS+  HD + D + +L + +L
Sbjct: 141  LDSSALESTPELSACSTADNSFSSFCSNGSLGHHDPFQDGNDILQSPSL 189


>gnl|CDD|187770 cd09639, Cas3_I, CRISPR/Cas system-associated protein Cas3.  CRISPR
           (Clustered Regularly Interspaced Short Palindromic
           Repeats) and associated Cas proteins comprise a system
           for heritable host defense by prokaryotic cells against
           phage and other foreign DNA; DEAD/DEAH box helicase DNA
           helicase cas3'; Often but not always is fused to HD
           nuclease domain; signature gene for Type I.
          Length = 353

 Score = 31.6 bits (72), Expect = 5.8
 Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 308 LDSFKRGKVRILVTTDLAARGIDAANVDLVINLEIPHDAATYLHRMGRAGRYGTR 362
           L  FK+ +  ++V T +    +D  +VD++I    P D+     R+GR  RYG +
Sbjct: 267 LLEFKKSEKFVIVATQVIEASLD-ISVDVMITELAPIDSLI--QRLGRLHRYGEK 318


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
            Superfamily; Provisional.
          Length = 844

 Score = 31.8 bits (72), Expect = 6.1
 Identities = 16/84 (19%), Positives = 32/84 (38%), Gaps = 2/84 (2%)

Query: 1793 GDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVK--SNESHFG 1850
            G+S  D +      D DEDA+   N+ ++   +     NS   +    SV   S E    
Sbjct: 111  GESSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIE 170

Query: 1851 NSRCQPKTAYQSSSAQSEDGTSYK 1874
              + Q +  ++  +  ++   +  
Sbjct: 171  EKKGQTENTFKFYNFGNDGEEAAA 194


>gnl|CDD|218601 pfam05477, SURF2, Surfeit locus protein 2 (SURF2).  Surfeit locus
           protein 2 is part of a group of at least six sequence
           unrelated genes (Surf-1 to Surf-6). The six Surfeit
           genes have been classified as housekeeping genes, being
           expressed in all tissue types tested and not containing
           a TATA box in their promoter region. The exact function
           of SURF2 is unknown.
          Length = 244

 Score = 31.0 bits (70), Expect = 6.5
 Identities = 25/74 (33%), Positives = 36/74 (48%), Gaps = 5/74 (6%)

Query: 562 ENNQTDEVPEENGLVGNPNKAEVGDLNECESCTDSFHMNEHSDENDLGNNTKESQKQKGH 621
           E++ +D  P E   + NP K + GD ++     D   M   S E       K S+KQ G 
Sbjct: 168 EDSMSDLYPPELFTLKNPGKEQNGDEDDDFETDDEDEMEVESPELQ----QKRSKKQSGS 223

Query: 622 KSKK-KKDRKRKGN 634
            +KK KK+ K+KG 
Sbjct: 224 LTKKFKKNHKKKGP 237


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
            unknown].
          Length = 869

 Score = 31.6 bits (71), Expect = 6.8
 Identities = 39/168 (23%), Positives = 69/168 (41%), Gaps = 15/168 (8%)

Query: 1728 HSSSHSQSEFDPKRLHSKESSHKRNRKANDFNDTCGAVANETFDGEDSKGGSDDENNLIV 1787
            HS     + FD + L   +         N  +D     ++E FD ED K  +D   N   
Sbjct: 22   HSYEEESAGFDSEELEDNDEQGYSF-GVNSEDDEEID-SDEAFDEEDEKRFADWSFN--- 76

Query: 1788 TLSEAGDSEVDVEGENSP-----TDEDEDANNSQNDGE-STAQNSQSDENSHDGESDSHS 1841
              S++G S  D +  N+       D D+  N+ + + E S +   +++    D   D+  
Sbjct: 77   -ASKSGKSNKDHKNLNNTKEISLNDSDDSVNSDKLENEGSVSSIDENELVDLDTLLDNDQ 135

Query: 1842 VKSNESHFGNSRCQPKTAYQSSSAQSEDGTSYKSISWDAYSVIDEPES 1889
             + NES   +     +   +S ++ S D  S +S   D+ S I+  +S
Sbjct: 136  PEKNESGNNDHATDKENLLESDASSSNDSESEES---DSESEIESSDS 180



 Score = 31.2 bits (70), Expect = 8.7
 Identities = 27/124 (21%), Positives = 51/124 (41%), Gaps = 18/124 (14%)

Query: 1728 HSSSHSQSEFDPKRLHSKESSHKRNRKANDFNDTCGAVANETFDGEDSKGG--SDDENNL 1785
            ++S   +S  D K L++ +     +   +              D  +++G   S DEN L
Sbjct: 76   NASKSGKSNKDHKNLNNTKEISLNDSDDS-----------VNSDKLENEGSVSSIDENEL 124

Query: 1786 IVTLSEAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDENSHDGESDSHSVKSN 1845
             V L    D++   + E+   D   D  N      S++ +S+S+E+  + E +S    S+
Sbjct: 125  -VDLDTLLDNDQPEKNESGNNDHATDKENLLESDASSSNDSESEESDSESEIES----SD 179

Query: 1846 ESHF 1849
              H 
Sbjct: 180  SDHD 183


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
            proteins whose interaction is required for the maturation
            of the 18S rRNA and for 40S ribosome production.
          Length = 809

 Score = 31.5 bits (72), Expect = 7.0
 Identities = 13/62 (20%), Positives = 29/62 (46%)

Query: 1771 DGEDSKGGSDDENNLIVTLSEAGDSEVDVEGENSPTDEDEDANNSQNDGESTAQNSQSDE 1830
            DG D +   DD+++L     +   S+ + + E+  +D+++D    + + E   + S    
Sbjct: 328  DGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAEST 387

Query: 1831 NS 1832
             S
Sbjct: 388  RS 389


>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein.  This family consists of
           Ebola and Marburg virus nucleoproteins. These proteins
           are responsible for encapsidation of genomic RNA. It has
           been found that nucleoprotein DNA vaccines can offer
           protection from the virus.
          Length = 717

 Score = 31.1 bits (70), Expect = 8.1
 Identities = 14/78 (17%), Positives = 31/78 (39%), Gaps = 6/78 (7%)

Query: 615 SQKQKGHKSKKKKDRKRKGNGDKRDPQGNNANQRE----IRDNSATLDKT--EKESVNNV 668
           S +     S+   +  +K   +  +P G  + + +    +++     D+T  +  S+  +
Sbjct: 505 SHQPGNRASQDLNNNNQKQEDESTNPIGKTSLRYQELTPVQEEDEPEDQTDDDDSSLPPL 564

Query: 669 ESKDERPPSTANNPDGTE 686
           ES D+         D TE
Sbjct: 565 ESDDDPGSDNEQGVDLTE 582


>gnl|CDD|173959 cd08549, G1PDH_related, Glycerol-1-phosphate_dehydrogenase and
           related proteins.  Bacterial and archeal
           glycerol-1-phosphate dehydrogenase-like oxidoreductases.
           The proteins have similarity with glycerol-1-phosphate
           dehydrogenase (G1PDH). G1PDH plays a role in the
           synthesis of phosphoglycerolipids in gram-positive
           bacterial species. It catalyzes the reversibly reduction
           of dihydroxyacetone phosphate (DHAP) to
           glycerol-1-phosphate (G1P) in a NADH-dependent manner.
           Its activity requires Ni++ ion. It also contains
           archaeal Sn-glycerol-1-phosphate dehydrogenase (Gro1PDH)
           that plays an important role in the formation of the
           enantiomeric configuration of the glycerophosphate
           backbone (sn-glycerol-1-phosphate) of archaeal ether
           lipids.
          Length = 332

 Score = 30.6 bits (69), Expect = 9.7
 Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 3/89 (3%)

Query: 254 KKLVQLLSQTPFNQCVIFSNYQLRP--EVICENLANANFGAEYLSGAQDQKA-RLASLDS 310
           K++++ L    F + V+  +  L P    + E L   +   E+L G        L    S
Sbjct: 41  KEIIERLESNNFTKEVLERDSLLIPDEYELGEVLIKLDKDTEFLLGIGSGTIIDLVKFVS 100

Query: 311 FKRGKVRILVTTDLAARGIDAANVDLVIN 339
           FK GK  I V T  +  G  ++   L++N
Sbjct: 101 FKVGKPFISVPTAPSMDGYASSVASLIVN 129


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.128    0.373 

Gapped
Lambda     K      H
   0.267   0.0781    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 115,132,887
Number of extensions: 11144633
Number of successful extensions: 8087
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7802
Number of HSP's successfully gapped: 193
Length of query: 2355
Length of database: 10,937,602
Length adjustment: 113
Effective length of query: 2242
Effective length of database: 5,925,600
Effective search space: 13285195200
Effective search space used: 13285195200
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 67 (29.5 bits)