RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6230
         (699 letters)



>gnl|CDD|215487 PLN02900, PLN02900, alanyl-tRNA synthetase.
          Length = 936

 Score =  317 bits (814), Expect = 4e-96
 Identities = 139/369 (37%), Positives = 194/369 (52%), Gaps = 24/369 (6%)

Query: 328 ARRRPIMSNHTATHILNHALRFVLGPEADQKGSLVAPDRLRFDFTNKGAMTADQVKKAEA 387
            RRR I  NHTATH+LN AL+ VLG   DQKGSLVA ++LRFDF++   MT +++++ E+
Sbjct: 589 DRRRRIAPNHTATHLLNSALKEVLGDHVDQKGSLVAFEKLRFDFSHGKPMTPEELREVES 648

Query: 388 EVIRIVQTKKPVYAKLCSLADAKKIKGLRAMFDETYPDPVRVVCVGTPVEQLESEPESQA 447
            V   +    PV AK   LADAK+I GLRA+F E YPDPVRVV VG              
Sbjct: 649 LVNEWIGDALPVEAKEMPLADAKRINGLRAVFGEKYPDPVRVVSVGGVY----------- 697

Query: 448 GMETSVEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRIVALSGNEATKAIKKAQVIETEV 507
               S+E CGGTH++       F + SEE IAKGIRRI A++G  A +AI  A  +E E 
Sbjct: 698 ----SMELCGGTHVSNTAEAEAFKLLSEEGIAKGIRRITAVTGGAAVEAINAADSLERE- 752

Query: 508 LNVKNTLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQ 567
                 L +    +   L KK+  +   +  A +P  KKE++R  +  ++K L  A++  
Sbjct: 753 ------LDSALKVEGSDLEKKVASLKSRVDAAVIPAAKKEEIRARVSALQKELRAAQKEA 806

Query: 568 KAAHASKVLDRVKSLVEERKAD--KFIVEILDAGSNTKALDAALKQVKTSAPHMAAMFIS 625
            A  A   + +   L  +  +     +V  LD G +  AL  A ++V       AA+ +S
Sbjct: 807 AALRAKLAVAKATELASKALSAGKSVLVARLDVGVDAAALKEAAEKVIAKLGDPAAVVLS 866

Query: 626 VDEAAKKIFCLSSVPKEAVSAGLTAIDWVRYLTDIMGGKGGGKAESAQASGPNYENAPLV 685
            DE   K+  +++VP   V  GL A  W+  +  + GG GGGK   AQ  G + E     
Sbjct: 867 SDEEKGKVSLVAAVPPGVVKKGLKAGKWLGAIAKLCGGGGGGKPGFAQGQGRDAEKLDAA 926

Query: 686 LSKARDYAS 694
           L KAR +A 
Sbjct: 927 LEKARAFAV 935



 Score =  129 bits (327), Expect = 6e-31
 Identities = 64/194 (32%), Positives = 92/194 (47%), Gaps = 24/194 (12%)

Query: 88  IPGLRAMFDETYPDPVRVVCVGTPVEQLESEPESQAGMETSVEFCGGTHLAMAHHVGDFC 147
           I GLRA+F E YPDPVRVV VG                  S+E CGGTH++       F 
Sbjct: 673 INGLRAVFGEKYPDPVRVVSVGGVY---------------SMELCGGTHVSNTAEAEAFK 717

Query: 148 ISSEEAIAKGIRRIVALSGNEATKAIKKAQVIETEVLNVKNTLSNPNTADSKALSKKIID 207
           + SEE IAKGIRRI A++G  A +AI  A  +E E       L +    +   L KK+  
Sbjct: 718 LLSEEGIAKGIRRITAVTGGAAVEAINAADSLERE-------LDSALKVEGSDLEKKVAS 770

Query: 208 ITEEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLVEERKAD-- 265
           +   +  A +P  KKE++R  +  ++K L  A++   A  A   + +   L  +  +   
Sbjct: 771 LKSRVDAAVIPAAKKEEIRARVSALQKELRAAQKEAAALRAKLAVAKATELASKALSAGK 830

Query: 266 KFIVEILDAGSNTK 279
             +V  LD G +  
Sbjct: 831 SVLVARLDVGVDAA 844


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score =  248 bits (636), Expect = 2e-71
 Identities = 116/370 (31%), Positives = 182/370 (49%), Gaps = 48/370 (12%)

Query: 328 ARRRPIMSNHTATHILNHALRFVLGPEADQKGSLVAPDRLRFDFTNKGAMTADQVKKAEA 387
            RRR I  NH+ATH+L+ ALR VLG    Q GSLVAPDRLRFDF++  A+T ++++K E 
Sbjct: 544 ERRRAIARNHSATHLLHAALREVLGEHVTQAGSLVAPDRLRFDFSHFEALTPEELRKIED 603

Query: 388 EVIRIVQTKKPVYAKLCSLADAKKIKGLRAMFDETYPDPVRVVCVGTPVEQLESEPESQA 447
            V   ++   PV  +  S+ +AKK  G  A+F E Y D VRVV +G              
Sbjct: 604 LVNEKIRENLPVETEEMSIEEAKK-MGAMALFGEKYGDEVRVVSIG-------------- 648

Query: 448 GMETSVEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRIVALSGNEATKAIKKAQVIETEV 507
             + S+E CGGTH++    +G F I SE  +A G+RRI A++G  A + + +    E  +
Sbjct: 649 --DFSIELCGGTHVSNTGDIGLFKIVSESGVAAGVRRIEAVTGEAALEYLNEQ---EALL 703

Query: 508 LNVKNTLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQ 567
             +   L          L +++  + EE                 LK ++K L++ +   
Sbjct: 704 KELAALLKAK----PSELPERVEALLEE-----------------LKELEKELEQLKAKL 742

Query: 568 KAAHASKVLDRVKSLVEERKADKFIVEILDAGSNTKALDAALKQVKTSAPHMAAMFISVD 627
            AA A  +L + K    E    K +   ++ G + KAL      +K        +  +V+
Sbjct: 743 AAAAAGDLLAQAK----EVNGVKVLAAQVE-GVDAKALRTLADDLKDKLGSGVVVLATVE 797

Query: 628 EAAKKIFCLSSVPKEAVSAGLTAIDWVRYLTDIMGGKGGGKAESAQASGPNYENAPLVLS 687
           +   K+  + +V K+    GL A D V+ L  I+GGKGGG+ + AQA G +       L+
Sbjct: 798 D--GKVSLVVAVTKDLTDKGLKAGDLVKELAKIVGGKGGGRPDMAQAGGKDPAKLDEALA 855

Query: 688 KARDYASQQL 697
             +++ + +L
Sbjct: 856 AVKEWLAAKL 865



 Score = 83.6 bits (208), Expect = 1e-16
 Identities = 46/182 (25%), Positives = 76/182 (41%), Gaps = 44/182 (24%)

Query: 93  AMFDETYPDPVRVVCVGTPVEQLESEPESQAGMETSVEFCGGTHLAMAHHVGDFCISSEE 152
           A+F E Y D VRVV +G                + S+E CGGTH++    +G F I SE 
Sbjct: 632 ALFGEKYGDEVRVVSIG----------------DFSIELCGGTHVSNTGDIGLFKIVSES 675

Query: 153 AIAKGIRRIVALSGNEATKAIKKAQVIETEVLNVKNTLSNPNTADSKALSKKIIDITEEI 212
            +A G+RRI A++G  A + + +    E  +  +   L          L +++  + EE 
Sbjct: 676 GVAAGVRRIEAVTGEAALEYLNEQ---EALLKELAALLKAK----PSELPERVEALLEE- 727

Query: 213 AKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLVEERKADKFIVEIL 272
                           LK ++K L++ +    AA A  +L + K    E    K +   +
Sbjct: 728 ----------------LKELEKELEQLKAKLAAAAAGDLLAQAK----EVNGVKVLAAQV 767

Query: 273 DA 274
           + 
Sbjct: 768 EG 769


>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 879

 Score =  243 bits (623), Expect = 1e-69
 Identities = 117/370 (31%), Positives = 186/370 (50%), Gaps = 48/370 (12%)

Query: 328 ARRRPIMSNHTATHILNHALRFVLGPEADQKGSLVAPDRLRFDFTNKGAMTADQVKKAEA 387
            RRR +M NHTATH+L+ ALR VLG    Q GSLV P++LRFDF++  A+TA+++K+ E 
Sbjct: 558 ERRRRLMRNHTATHLLHAALRKVLGDHVWQAGSLVDPEKLRFDFSHYKALTAEELKEIER 617

Query: 388 EVIRIVQTKKPVYAKLCSLADAKKIKGLRAMFDETYPDPVRVVCVGTPVEQLESEPESQA 447
            V  I++   PV  +   L +AK  KG  A+F E Y D VRVV +G              
Sbjct: 618 LVNEIIRENLPVKTEEMDLDEAKA-KGAMALFGEKYGDEVRVVEIG-------------- 662

Query: 448 GMETSVEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRIVALSGNEATKAIKKAQVIETEV 507
             + SVE CGGTH++    +G F I SEE IA G+RRI A++G  A + +++ +    ++
Sbjct: 663 --DFSVELCGGTHVSNTGDIGLFKIISEEGIAAGVRRIEAVTGEAALEYLQEQE----DL 716

Query: 508 LNVKNTLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQ 567
           L            +  +L K   ++  ++          E L   LK ++K L+  ++  
Sbjct: 717 L-----------KEIASLLKVPPELLPKV----------ERLLEELKELEKELERLKKKL 755

Query: 568 KAAHASKVLDRVKSLVEERKADKFIVEILDAGSNTKALDAALKQVKTSAPHMAAMFISVD 627
            AA  + +L   K   EE    K + + +D G++ K L      +K        +  SV 
Sbjct: 756 AAAELADLLSNAK--AEEIGGVKVLAKEVD-GADMKELREIADDLKKKLGSAVIVLASVA 812

Query: 628 EAAKKIFCLSSVPKEAVSAGLTAIDWVRYLTDIMGGKGGGKAESAQASGPNYENAPLVLS 687
           +   K+  + +V K  ++  + A + V+ L  I+GGKGGG+ + AQ  G + E     L 
Sbjct: 813 DG--KVSLVVAVSK-DLTDKVKAGELVKELAAIVGGKGGGRPDMAQGGGKDPEKLDEALE 869

Query: 688 KARDYASQQL 697
              ++  ++L
Sbjct: 870 LVIEWVKEKL 879



 Score = 91.2 bits (227), Expect = 7e-19
 Identities = 51/188 (27%), Positives = 83/188 (44%), Gaps = 43/188 (22%)

Query: 90  GLRAMFDETYPDPVRVVCVGTPVEQLESEPESQAGMETSVEFCGGTHLAMAHHVGDFCIS 149
           G  A+F E Y D VRVV +G                + SVE CGGTH++    +G F I 
Sbjct: 643 GAMALFGEKYGDEVRVVEIG----------------DFSVELCGGTHVSNTGDIGLFKII 686

Query: 150 SEEAIAKGIRRIVALSGNEATKAIKKAQVIETEVLNVKNTLSNPNTADSKALSKKIIDIT 209
           SEE IA G+RRI A++G  A + +++ +    ++L            +  +L K   ++ 
Sbjct: 687 SEEGIAAGVRRIEAVTGEAALEYLQEQE----DLL-----------KEIASLLKVPPELL 731

Query: 210 EEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLVEERKADKFIV 269
            ++          E L   LK ++K L+  ++   AA  + +L   K   EE    K + 
Sbjct: 732 PKV----------ERLLEELKELEKELERLKKKLAAAELADLLSNAK--AEEIGGVKVLA 779

Query: 270 EILDAGSN 277
           + +D    
Sbjct: 780 KEVDGADM 787


>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase.  The model describes
           alanine--tRNA ligase. This enzyme catalyzes the reaction
           (tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
           pyrophosphate + AMP) [Protein synthesis, tRNA
           aminoacylation].
          Length = 851

 Score =  191 bits (488), Expect = 2e-51
 Identities = 112/353 (31%), Positives = 169/353 (47%), Gaps = 54/353 (15%)

Query: 327 SARRRPIMSNHTATHILNHALRFVLGPEADQKGSLVAPDRLRFDFTNKGAMTADQVKKAE 386
             RR  IM NH+ATH+L+ AL+ VLG    Q GSLV+  +LRFDF++  A+T ++++K E
Sbjct: 550 EKRRLRIMRNHSATHLLHAALQKVLGNHVWQAGSLVSVKKLRFDFSHYRALTREELEKIE 609

Query: 387 AEVIRIVQTKKPVYAKLCSLADAKKIKGLRAMFDETYPDPVRVVCVGTPVEQLESEPESQ 446
             V   +    P+      L +AK      A+F E Y + VRVV VG             
Sbjct: 610 DLVNEQILANIPIKVIFMDLDEAKAKGAF-ALFGEKYGEKVRVVSVG------------- 655

Query: 447 AGMETSVEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRIVALSGNEATKAIKKAQVIETE 506
              + SVE CGGTH+     +G F I  E  IA G+RRI A++GN    AI+     E +
Sbjct: 656 ---DVSVELCGGTHVRNTGEIGLFKIVKESGIAAGVRRIEAVTGNA---AIEYLNEQEDK 709

Query: 507 VLNVKNTLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKSLDEAERL 566
           +  + + L          L KKI    EE        WK   L+  L+++K  +      
Sbjct: 710 LKELADILKV----TPNELPKKIERFFEE--------WK--ALQKELESLKLKI------ 749

Query: 567 QKAAHASKVLDRVKSLVEERKADKFIVEILDAGSNTKALDAA---LKQVKTSAPHMAAMF 623
                A + +D +K   E     K +VE++DA  + K+L      LK    SA ++ A F
Sbjct: 750 -----ADQKIDELKDKAETINGVKVLVEVVDA-KDMKSLKTMADRLKSKLGSAIYVLASF 803

Query: 624 ISVDEAAKKIFCLSSVPKEAVSAGLTAIDWVRYLTDIMGGKGGGKAESAQASG 676
                A  K+  +  V  + ++ G+ A + +  +  ++GGKGGG+ + AQA G
Sbjct: 804 -----ANGKVKLVCGVSNDVLNKGIKAGELINQIAQVLGGKGGGRPDLAQAGG 851



 Score = 82.0 bits (203), Expect = 5e-16
 Identities = 49/177 (27%), Positives = 80/177 (45%), Gaps = 33/177 (18%)

Query: 93  AMFDETYPDPVRVVCVGTPVEQLESEPESQAGMETSVEFCGGTHLAMAHHVGDFCISSEE 152
           A+F E Y + VRVV VG                + SVE CGGTH+     +G F I  E 
Sbjct: 639 ALFGEKYGEKVRVVSVG----------------DVSVELCGGTHVRNTGEIGLFKIVKES 682

Query: 153 AIAKGIRRIVALSGNEATKAIKKAQVIETEVLN-VKNTLSN-PNTADS-----KALSKKI 205
            IA G+RRI A++GN A + + + +    E+ + +K T +  P   +      KAL K++
Sbjct: 683 GIAAGVRRIEAVTGNAAIEYLNEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKEL 742

Query: 206 IDITEEIAKATLPYWKK-----EDLRNLLKT-----MKKSLDEAERLQKAAHASKVL 252
             +  +IA   +   K        ++ L++      MK     A+RL+    ++  +
Sbjct: 743 ESLKLKIADQKIDELKDKAETINGVKVLVEVVDAKDMKSLKTMADRLKSKLGSAIYV 799


>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
          Length = 900

 Score =  116 bits (294), Expect = 6e-27
 Identities = 96/398 (24%), Positives = 152/398 (38%), Gaps = 89/398 (22%)

Query: 318 GGRSEGIPGSARRRPIMSNHTATHILNHALRFVLGPEADQKGSLVAPDRLRFDFTNKGAM 377
           G    G     RR  +M +HTATHI+  A R VLG    Q G+    D+ R D T+   +
Sbjct: 574 GDEVRGEIDWERRYRLMRHHTATHIILAAARKVLGDHVWQAGAQKTVDKARLDITHYKRI 633

Query: 378 TADQVKKAEAEVIRIVQTKKPVYAKLCSLADAKKIKGLRAMFDETY-----PDPV-RVVC 431
           T ++VK+ E     IV   +PV  +     +A++  G R      Y     P    RVV 
Sbjct: 634 TREEVKEIERLANEIVMENRPVKVEWMDRNEAEQKYGFR-----LYQGGVPPGKTIRVVE 688

Query: 432 VGTPVEQLESEPESQAGMETSVEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRIVALSGN 491
           +G                +  VE CGGTH++    +G   I   E I  G+ R+   +G 
Sbjct: 689 IG----------------DWDVEACGGTHVSNTGEIGPIKILKTERIQDGVERLEFAAG- 731

Query: 492 EATKAIKKAQVIETEVLNVKNTLSN-----PNTADS-----KALSKKIIDITEEIAKATL 541
               A++  Q +E  +      L       P T +      K   K+I  + +E+A+   
Sbjct: 732 --DAAVEYIQELEDLLKEAAEILGVPPEQLPKTVERFFEEWKEQKKEIEKLRKELAELLA 789

Query: 542 PYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLVEERKADKFIVEILDAGSN 601
                           + L +AE +                       K +VE +DA  +
Sbjct: 790 ---------------SELLSKAEEVGGV--------------------KVVVEEVDADED 814

Query: 602 T--KALDAALKQVKTSAPHMAAMFISVDEAAKKIFCLSSVPKEAVSAGLTAIDWVRYLTD 659
              K  +   K+ K       A+ +  +   +    +       V+AG    + VR L  
Sbjct: 815 ELRKIANELAKEGKV------ALVVLGNGGGQ--IVVGVGEDVGVNAG----EIVRELAK 862

Query: 660 IMGGKGGGKAESAQASGPNYENAPLVLSKARDYASQQL 697
           ++GGKGGGK + AQ  GP+ +     L  A++   + L
Sbjct: 863 VLGGKGGGKPDLAQGGGPDPKKKEEALEAAKEVLKEAL 900



 Score = 44.1 bits (105), Expect = 2e-04
 Identities = 33/153 (21%), Positives = 63/153 (41%), Gaps = 15/153 (9%)

Query: 129 VEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRIVALSGNEATKAIKKAQVIETEVLNVKN 188
           VE CGGTH++    +G   I   E I  G+ R+   +G     A++  Q +E  +     
Sbjct: 694 VEACGGTHVSNTGEIGPIKILKTERIQDGVERLEFAAG---DAAVEYIQELEDLLKEAAE 750

Query: 189 TLSN-----PNTADS-----KALSKKIIDITEEIAKATL--PYWKKEDLRNLLKTMKKSL 236
            L       P T +      K   K+I  + +E+A+        K E++  +   +++  
Sbjct: 751 ILGVPPEQLPKTVERFFEEWKEQKKEIEKLRKELAELLASELLSKAEEVGGVKVVVEEVD 810

Query: 237 DEAERLQKAAHASKVLDRVKSLVEERKADKFIV 269
            + + L+K A+      +V  +V      + +V
Sbjct: 811 ADEDELRKIANELAKEGKVALVVLGNGGGQIVV 843


>gnl|CDD|234309 TIGR03683, A-tRNA_syn_arch, alanyl-tRNA synthetase.  This family of
           alanyl-tRNA synthetases is limited to the archaea, and
           is a subset of those sequences identified by the model
           pfam07973 covering the second additional domain (SAD) of
           alanyl and threonyl tRNA synthetases.
          Length = 902

 Score =  115 bits (291), Expect = 1e-26
 Identities = 94/379 (24%), Positives = 154/379 (40%), Gaps = 56/379 (14%)

Query: 318 GGRSEGIPGSARRRPIMSNHTATHILNHALRFVLGPEADQKGSLVAPDRLRFDFTNKGAM 377
           G   +G     RR  +M +HTATH+L  A R VLG    Q G+    D+ R D T+   +
Sbjct: 578 GDEVKGEIDWERRLRLMRHHTATHVLLAAARRVLGRHVWQAGAQKDTDKARLDITHYKRI 637

Query: 378 TADQVKKAEAEVIRIVQTKKPVYAKLCSLADAKKIKGLRAMFDETYP--DPVRVVCVGTP 435
           + +++K+ E    RIV   +PV  K     +A++  G R ++    P    +RVV +   
Sbjct: 638 SEEEIKEIERLANRIVMENRPVSVKWMDRNEAEQKYGFR-LYQGGVPPGREIRVVEIE-- 694

Query: 436 VEQLESEPESQAGMETSVEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRIVALSGNEATK 495
                         +  V+ CGGTH      +G   I   E I  G+ RI   +G  A +
Sbjct: 695 --------------DWDVQACGGTHCRNTGEIGPIKILKTERIQDGVERIEFAAGEAALE 740

Query: 496 AIKKAQVIETEVLNVKNTLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKT 555
            I++ + +  E  ++                K   +   E  K     WK  + R  ++ 
Sbjct: 741 YIQELEDLLRESSDI---------------LKVPPEQLPETVKRFFEEWK--EQRKEIER 783

Query: 556 MKKSLDEAERLQKAAHASKVLDRVKSLVEERKADKFIVEILDAGSNTKALDAALKQVKTS 615
           +KK L E +  +  + A ++              + IVE++DA    +  + A K  +  
Sbjct: 784 LKKKLAELKIYELISEAERIGGV-----------EVIVELVDA-DMDELQEIAEKLAEKG 831

Query: 616 APHMAAMFISVDEAAKKIFCLSSVPKEAVSAGLTAIDWVRYLTDIMGGKGGGKAESAQAS 675
                 + +S      ++   SS  K  V AG    + VR +   +GG GGGK + AQ  
Sbjct: 832 L---VVVLVSGGGGNGRVVI-SSGEKAGVDAG----ELVREIAKELGGGGGGKPDLAQGG 883

Query: 676 GPNYENAPLVLSKARDYAS 694
           GP+  N    L K + +  
Sbjct: 884 GPDDLNVDEALEKIKSFLR 902



 Score = 40.0 bits (94), Expect = 0.005
 Identities = 33/137 (24%), Positives = 56/137 (40%), Gaps = 18/137 (13%)

Query: 129 VEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRIVALSGNEATKAIKKAQVIETEVLNVKN 188
           V+ CGGTH      +G   I   E I  G+ RI   +G  A + I++ + +  E  ++  
Sbjct: 698 VQACGGTHCRNTGEIGPIKILKTERIQDGVERIEFAAGEAALEYIQELEDLLRESSDI-- 755

Query: 189 TLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHA 248
                         K   +   E  K     WK  + R  ++ +KK L E +  +  + A
Sbjct: 756 -------------LKVPPEQLPETVKRFFEEWK--EQRKEIERLKKKLAELKIYELISEA 800

Query: 249 SKVLDRVKSLVEERKAD 265
            ++   V+ +VE   AD
Sbjct: 801 ERI-GGVEVIVELVDAD 816


>gnl|CDD|234962 PRK01584, PRK01584, alanyl-tRNA synthetase; Provisional.
          Length = 594

 Score =  111 bits (280), Expect = 1e-25
 Identities = 57/152 (37%), Positives = 82/152 (53%), Gaps = 17/152 (11%)

Query: 337 HTATHILNHALRFVLGPEADQKGSLVAPDRLRFDFTNKGAMTADQVKKAEAEVIRIVQTK 396
           HTATH+L+ ALR VLG    QKGS +  +RLRFDF++   MT D++KK E  V   ++  
Sbjct: 457 HTATHLLHKALRLVLGDHVRQKGSNITAERLRFDFSHPEKMTDDEIKKVEDIVNLQIKND 516

Query: 397 KPVYAKLCSLADAKKIKGLRAMFDETYPDPVRVVCVGTPVEQLESEPESQAGMETSVEFC 456
             V  ++ SL +A++ KG  A+F E Y D V+V  +                   S E C
Sbjct: 517 LSVKKEVMSLEEARE-KGAMALFGEKYEDIVKVYEIDG----------------FSKEVC 559

Query: 457 GGTHLAMAHHVGDFCISSEEAIAKGIRRIVAL 488
           GG H+     +G F I  E++ + G+RRI A+
Sbjct: 560 GGPHVENTGELGTFKIQKEQSSSSGVRRIRAI 591



 Score = 45.5 bits (108), Expect = 8e-05
 Identities = 23/75 (30%), Positives = 33/75 (44%), Gaps = 16/75 (21%)

Query: 90  GLRAMFDETYPDPVRVVCVGTPVEQLESEPESQAGMETSVEFCGGTHLAMAHHVGDFCIS 149
           G  A+F E Y D V+V  +                   S E CGG H+     +G F I 
Sbjct: 533 GAMALFGEKYEDIVKVYEIDG----------------FSKEVCGGPHVENTGELGTFKIQ 576

Query: 150 SEEAIAKGIRRIVAL 164
            E++ + G+RRI A+
Sbjct: 577 KEQSSSSGVRRIRAI 591


>gnl|CDD|225427 COG2872, COG2872, Predicted metal-dependent hydrolases related to
           alanyl-tRNA synthetase HxxxH domain [General function
           prediction only].
          Length = 241

 Score = 65.5 bits (160), Expect = 8e-12
 Identities = 47/161 (29%), Positives = 71/161 (44%), Gaps = 23/161 (14%)

Query: 329 RRRPIMSNHTATHILNHALRFVLGPEADQKGSLVAPDRLRFDFTNKGAMTADQVKKAEAE 388
           RR   M  HTA H+L+  L  V G  A   G  +  D  R DF   G  T D++++ EA 
Sbjct: 91  RRYRHMRMHTALHLLSAVLYKVYG--ALTTGFEIGEDYARIDF--DGEDTEDEIEEVEAL 146

Query: 389 VIRIVQTKKPVYAKLCSLADAKKIKGLRAMFDETYPDP---VRVVCVGTPVEQLESEPES 445
              +V+   PV        +A+K+ GL  + ++  PD    +R+V +G            
Sbjct: 147 ANELVKENLPVIIYFIPREEAEKLPGLVKLKNKVPPDVEGKIRIVEIG------------ 194

Query: 446 QAGMETSVEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRIV 486
               +  V+ CGGTH+     +G+  I   E   KG RR+ 
Sbjct: 195 ----DIDVQPCGGTHVKNTGEIGEIKILKTEKKGKGNRRVY 231



 Score = 30.4 bits (69), Expect = 2.8
 Identities = 21/78 (26%), Positives = 34/78 (43%), Gaps = 19/78 (24%)

Query: 88  IPGLRAMFDETYPDP---VRVVCVGTPVEQLESEPESQAGMETSVEFCGGTHLAMAHHVG 144
           +PGL  + ++  PD    +R+V +G                +  V+ CGGTH+     +G
Sbjct: 170 LPGLVKLKNKVPPDVEGKIRIVEIG----------------DIDVQPCGGTHVKNTGEIG 213

Query: 145 DFCISSEEAIAKGIRRIV 162
           +  I   E   KG RR+ 
Sbjct: 214 EIKILKTEKKGKGNRRVY 231


>gnl|CDD|216955 pfam02272, DHHA1, DHHA1 domain.  This domain is often found
           adjacent to the DHH domain pfam01368 and is called DHHA1
           for DHH associated domain. This domain is diagnostic of
           DHH subfamily 1 members. This domains is also found in
           alanyl tRNA synthetase , suggesting that this domain may
           have an RNA binding function. The domain is about 60
           residues long and contains a conserved GG motif.
          Length = 69

 Score = 49.9 bits (120), Expect = 6e-08
 Identities = 13/72 (18%), Positives = 27/72 (37%), Gaps = 3/72 (4%)

Query: 620 AAMFISVDEAAKKIFCLSSVPKEAVSAGLTAIDWVRYLTDIMGGKGGGKAESAQASGPNY 679
               +   E   K+   +   K     G    + ++ + + +GG GGG  ++A A G + 
Sbjct: 1   DKPVVLFAEEDGKVKVSARSSKGLDVKGG---ELLKEVAEKLGGGGGGHPDAAGAGGKDP 57

Query: 680 ENAPLVLSKARD 691
                 L   ++
Sbjct: 58  SKLEEALELLKE 69


>gnl|CDD|219676 pfam07973, tRNA_SAD, Threonyl and Alanyl tRNA synthetase second
           additional domain.  The catalytically active from of
           threonyl/alanyl tRNA synthetase is a dimer. Within the
           tRNA synthetase class II dimer, the bound tRNA interacts
           with both monomers making specific interactions with the
           catalytic domain, the C-terminal domain, and this domain
           (the second additional domain). The second additional
           domain is comprised of a pair of perpendicularly
           orientated antiparallel beta sheets, of four and three
           strands, respectively, that surround a central alpha
           helix that forms the core of the domain.
          Length = 44

 Score = 47.8 bits (115), Expect = 1e-07
 Identities = 18/60 (30%), Positives = 27/60 (45%), Gaps = 16/60 (26%)

Query: 103 VRVVCVGTPVEQLESEPESQAGMETSVEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRIV 162
           VRVV +G                +  V+ CGGTH+     +G F I   ++  KG++RI 
Sbjct: 1   VRVVRIG----------------DFDVDLCGGTHVPNTGEIGAFKILKGDSKNKGLQRIY 44



 Score = 47.8 bits (115), Expect = 1e-07
 Identities = 18/60 (30%), Positives = 27/60 (45%), Gaps = 16/60 (26%)

Query: 427 VRVVCVGTPVEQLESEPESQAGMETSVEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRIV 486
           VRVV +G                +  V+ CGGTH+     +G F I   ++  KG++RI 
Sbjct: 1   VRVVRIG----------------DFDVDLCGGTHVPNTGEIGAFKILKGDSKNKGLQRIY 44


>gnl|CDD|197931 smart00863, tRNA_SAD, Threonyl and Alanyl tRNA synthetase second
           additional domain.  The catalytically active form of
           threonyl/alanyl tRNA synthetase is a dimer. Within the
           tRNA synthetase class II dimer, the bound tRNA interacts
           with both monomers making specific interactions with the
           catalytic domain, the C-terminal domain, and this SAD
           domain (the second additional domain). The second
           additional domain is comprised of a pair of
           perpendicularly orientated antiparallel beta sheets, of
           four and three strands, respectively, that surround a
           central alpha helix that forms the core of the domain.
          Length = 43

 Score = 47.8 bits (115), Expect = 2e-07
 Identities = 20/59 (33%), Positives = 26/59 (44%), Gaps = 16/59 (27%)

Query: 103 VRVVCVGTPVEQLESEPESQAGMETSVEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRI 161
           VRVV +G                + SVE CGGTH+     +G F I S      G++RI
Sbjct: 1   VRVVSIG----------------DFSVELCGGTHVPNTGEIGAFKILSVSGAYWGLQRI 43



 Score = 47.8 bits (115), Expect = 2e-07
 Identities = 20/59 (33%), Positives = 26/59 (44%), Gaps = 16/59 (27%)

Query: 427 VRVVCVGTPVEQLESEPESQAGMETSVEFCGGTHLAMAHHVGDFCISSEEAIAKGIRRI 485
           VRVV +G                + SVE CGGTH+     +G F I S      G++RI
Sbjct: 1   VRVVSIG----------------DFSVELCGGTHVPNTGEIGAFKILSVSGAYWGLQRI 43


>gnl|CDD|150786 pfam10153, DUF2361, Uncharacterized conserved protein (DUF2361).
           This is a region of 120 amino acids that is conserved in
           a family of proteins found from plants to fungi. The
           function is not known.
          Length = 114

 Score = 37.2 bits (87), Expect = 0.004
 Identities = 25/95 (26%), Positives = 47/95 (49%), Gaps = 4/95 (4%)

Query: 524 ALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLV 583
            L K+I DI   + K  LP   + +    L+ +KK L++AER +K    +K    V+   
Sbjct: 1   KLKKQIRDIERLLKKKDLPADVRVEKERALEALKKELEDAERRKKRKKMAKKYHMVR-FF 59

Query: 584 EERKADKFIVEI---LDAGSNTKALDAALKQVKTS 615
           E +KA + + ++   L+  S+ K +    +Q+  +
Sbjct: 60  ERKKATRRLKQLEKKLEETSDEKEIKDLKEQLSKA 94



 Score = 33.7 bits (78), Expect = 0.071
 Identities = 21/69 (30%), Positives = 35/69 (50%), Gaps = 1/69 (1%)

Query: 200 ALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLV 259
            L K+I DI   + K  LP   + +    L+ +KK L++AER +K    +K    V+   
Sbjct: 1   KLKKQIRDIERLLKKKDLPADVRVEKERALEALKKELEDAERRKKRKKMAKKYHMVR-FF 59

Query: 260 EERKADKFI 268
           E +KA + +
Sbjct: 60  ERKKATRRL 68


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 34.8 bits (81), Expect = 0.16
 Identities = 22/109 (20%), Positives = 50/109 (45%), Gaps = 22/109 (20%)

Query: 170 TKAIKKAQ------VIETEVLNVKNTLSNPNTADSKALSKKI-------IDITEEIAKAT 216
           T  ++ A+      ++E + L    TL+       + L K++         + E IA+  
Sbjct: 312 TDFLEHAREQNKQLLLELDRLQQSYTLNEDELETVRELEKQLEELEAQYDQLVERIAEKK 371

Query: 217 LPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLV-EERKA 264
           + Y    +L+  L+ ++K L+E E+ Q+     ++ + ++ L  +E +A
Sbjct: 372 VAY---SELQEELEEIEKQLEEIEKEQE-----ELSESLQGLRKDELEA 412



 Score = 34.8 bits (81), Expect = 0.16
 Identities = 22/109 (20%), Positives = 50/109 (45%), Gaps = 22/109 (20%)

Query: 494 TKAIKKAQ------VIETEVLNVKNTLSNPNTADSKALSKKI-------IDITEEIAKAT 540
           T  ++ A+      ++E + L    TL+       + L K++         + E IA+  
Sbjct: 312 TDFLEHAREQNKQLLLELDRLQQSYTLNEDELETVRELEKQLEELEAQYDQLVERIAEKK 371

Query: 541 LPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLV-EERKA 588
           + Y    +L+  L+ ++K L+E E+ Q+     ++ + ++ L  +E +A
Sbjct: 372 VAY---SELQEELEEIEKQLEEIEKEQE-----ELSESLQGLRKDELEA 412


>gnl|CDD|218676 pfam05649, Peptidase_M13_N, Peptidase family M13.  M13 peptidases
           are well-studied proteases found in a wide range of
           organisms including mammals and bacteria. In mammals
           they participate in processes such as cardiovascular
           development, blood-pressure regulation, nervous control
           of respiration, and regulation of the function of
           neuropeptides in the central nervous system. In bacteria
           they may be used for digestion of milk.
          Length = 380

 Score = 34.2 bits (79), Expect = 0.21
 Identities = 18/82 (21%), Positives = 38/82 (46%), Gaps = 7/82 (8%)

Query: 189 TLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTM-KKSLDEAERLQKAAH 247
            L+  +  +++  ++ ++    ++AKA+   W +E+LR+  K     +L E   LQ  A 
Sbjct: 167 KLAGLDEDEAEKAAEDVVAFETKLAKAS---WSREELRDPAKLYNPMTLAE---LQALAP 220

Query: 248 ASKVLDRVKSLVEERKADKFIV 269
                  + +L   +  D+ IV
Sbjct: 221 GLDWKRYLSALGLPKDPDEVIV 242



 Score = 34.2 bits (79), Expect = 0.21
 Identities = 18/82 (21%), Positives = 38/82 (46%), Gaps = 7/82 (8%)

Query: 513 TLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTM-KKSLDEAERLQKAAH 571
            L+  +  +++  ++ ++    ++AKA+   W +E+LR+  K     +L E   LQ  A 
Sbjct: 167 KLAGLDEDEAEKAAEDVVAFETKLAKAS---WSREELRDPAKLYNPMTLAE---LQALAP 220

Query: 572 ASKVLDRVKSLVEERKADKFIV 593
                  + +L   +  D+ IV
Sbjct: 221 GLDWKRYLSALGLPKDPDEVIV 242


>gnl|CDD|189000 cd08662, M13, Peptidase family M13 includes neprilysin,
           endothelin-converting enzyme I.  M13 family of
           metallopeptidases includes neprilysin (neutral
           endopeptidase, NEP, enkephalinase, CD10, CALLA, EC
           3.4.24.11), endothelin-converting enzyme I (ECE-1, EC
           3.4.24.71), erythrocyte surface antigen KELL (ECE-3),
           phosphate-regulating gene on the X chromosome (PHEX),
           soluble secreted endopeptidase (SEP), and damage-induced
           neuronal endopeptidase (DINE)/X-converting enzyme (XCE).
           These proteins consist of a short N-terminal cytoplasmic
           domain, a single transmembrane helix, and a larger
           C-terminal extracellular domain containing the active
           site. Proteins in this family fulfill a broad range of
           physiological roles due to the greater variation in the
           S2' subsite allowing substrate specificity. NEP is
           expressed in a variety of tissues including kidney and
           brain, and is involved in many physiological and
           pathological processes, including blood pressure and
           inflammatory response. It degrades a wide array of
           substrates such as substance P, enkephalins,
           cholecystokinin, neurotensin and somatostatin.  It is an
           important enzyme in the regulation of amyloid-beta
           (Abeta) protein that forms amyloid plaques that are
           associated with Alzeimers disease (AD). ECE-1 catalyzes
           the final rate-limiting step in the biosynthesis of
           endothelins via post-translational conversion of the
           biologically inactive big endothelins. Like NEP, it also
           hydrolyses bradykinin, substance P, neurotensin and
           Abeta.  Endothelin-1 overproduction has been implicated
           in various diseases, including stroke, asthma,
           hypertension, and cardiac and renal failure. Kell is a
           homolog of NEP and constitutes a major antigen on human
           erythrocytes; it preferentially cleaves big endothelin-3
           to produce bioactive endothelin-3, but is also known to
           cleave substance P and neurokinin A. PHEX forms a
           complex interaction with fibroblast growth factor 23
           (FGF23) and matrix extracellular phosphoglycoprotein,
           causing bone mineralization. A loss-of-function mutation
           in PHEX disrupts this interaction leading to
           hypophosphatemic rickets; X-linked hypophosphatemic
           (XLH) rickets is the most common form of metabolic
           rickets. ECEL1 is a brain metalloprotease involved in
           the critical role in the nervous regulation of the
           respiratory system, while DINE (damage induced neuronal
           endopeptidase) is abundantly expressed in the
           hypothalamus and its expression responds to nerve injury
           as well. Thus, majority of these M13 proteases are prime
           therapeutic targets for selective inhibition.
          Length = 611

 Score = 34.2 bits (79), Expect = 0.25
 Identities = 22/82 (26%), Positives = 42/82 (51%), Gaps = 7/82 (8%)

Query: 189 TLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKK-SLDEAERLQKAAH 247
            L+  +  D++AL+++++    E+AK +   W +E+ R+  KT    +L E   LQK A 
Sbjct: 141 VLAGEDEEDAEALAEEVLAFETELAKIS---WSEEERRDPEKTYNPMTLAE---LQKLAP 194

Query: 248 ASKVLDRVKSLVEERKADKFIV 269
                  +++L    + +K IV
Sbjct: 195 GIDWKAYLEALGLPSEDEKVIV 216



 Score = 34.2 bits (79), Expect = 0.25
 Identities = 22/82 (26%), Positives = 42/82 (51%), Gaps = 7/82 (8%)

Query: 513 TLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKK-SLDEAERLQKAAH 571
            L+  +  D++AL+++++    E+AK +   W +E+ R+  KT    +L E   LQK A 
Sbjct: 141 VLAGEDEEDAEALAEEVLAFETELAKIS---WSEEERRDPEKTYNPMTLAE---LQKLAP 194

Query: 572 ASKVLDRVKSLVEERKADKFIV 593
                  +++L    + +K IV
Sbjct: 195 GIDWKAYLEALGLPSEDEKVIV 216


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 34.3 bits (78), Expect = 0.26
 Identities = 29/122 (23%), Positives = 60/122 (49%), Gaps = 13/122 (10%)

Query: 504  ETEVLNVKNTLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKSLDEA 563
            E E  N+ +     +   SK   + I DI E+  K    + ++ D+ ++ K ++K+L +A
Sbjct: 1275 EMETFNISHDDDKDHHIISKKHDENISDIREKSLKIIEDFSEESDINDIKKELQKNLLDA 1334

Query: 564  ER--------LQKAAHASKV--LDRVKSLVEERKADKFIVEILDAGSNTKA-LDAALKQV 612
            ++        L + A+   +  L+++K +++E K  ++  EI +   N K  LD + K +
Sbjct: 1335 QKHNSDINLYLNEIANIYNILKLNKIKKIIDEVK--EYTKEIEENNKNIKDELDKSEKLI 1392

Query: 613  KT 614
            K 
Sbjct: 1393 KK 1394



 Score = 33.1 bits (75), Expect = 0.75
 Identities = 21/94 (22%), Positives = 47/94 (50%), Gaps = 10/94 (10%)

Query: 180  ETEVLNVKNTLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKSLDEA 239
            E E  N+ +     +   SK   + I DI E+  K    + ++ D+ ++ K ++K+L +A
Sbjct: 1275 EMETFNISHDDDKDHHIISKKHDENISDIREKSLKIIEDFSEESDINDIKKELQKNLLDA 1334

Query: 240  ER--------LQKAAHASKV--LDRVKSLVEERK 263
            ++        L + A+   +  L+++K +++E K
Sbjct: 1335 QKHNSDINLYLNEIANIYNILKLNKIKKIIDEVK 1368


>gnl|CDD|218006 pfam04286, DUF445, Protein of unknown function (DUF445).  Predicted
           to be a membrane protein.
          Length = 366

 Score = 33.4 bits (77), Expect = 0.41
 Identities = 27/152 (17%), Positives = 56/152 (36%), Gaps = 20/152 (13%)

Query: 152 EAIAKGIRRIVA---LSGNEATKAIKKAQVIETEVLNVKNTLSNPNTADS------KALS 202
           + IA+ + R V    L+     + +++A   E     +   L+ P  A+       + L+
Sbjct: 38  DRIAENLGRFVEEHLLTPEVIARKLREADPAER----LGRWLAQPTNAERLARLVARLLA 93

Query: 203 KKIIDITEE-----IAKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKS 257
             +  + +E     + KA     ++ D   LL  + + L    R Q       +LDR++ 
Sbjct: 94  GILEALDDERLQRFLKKALRRRLEEIDPAPLLGKLLELLLADGRHQALL--DLLLDRLRD 151

Query: 258 LVEERKADKFIVEILDAGSNTKIPRLRSFEFG 289
            ++  +    I E++D       P        
Sbjct: 152 WLKSEEGKARIAEMIDEFLERWGPTWLLGGIA 183



 Score = 32.2 bits (74), Expect = 0.83
 Identities = 30/164 (18%), Positives = 61/164 (37%), Gaps = 27/164 (16%)

Query: 476 EAIAKGIRRIVA---LSGNEATKAIKKAQVIETEVLNVKNTLSNPNTADS------KALS 526
           + IA+ + R V    L+     + +++A   E     +   L+ P  A+       + L+
Sbjct: 38  DRIAENLGRFVEEHLLTPEVIARKLREADPAER----LGRWLAQPTNAERLARLVARLLA 93

Query: 527 KKIIDITEE-----IAKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKS 581
             +  + +E     + KA     ++ D   LL  + + L    R Q       +LDR++ 
Sbjct: 94  GILEALDDERLQRFLKKALRRRLEEIDPAPLLGKLLELLLADGRHQALL--DLLLDRLRD 151

Query: 582 LVEERKADKFIVEILDAGSNTKALDAALKQVKTSAPHMAAMFIS 625
            ++  +    I E++D           L         +A MF+ 
Sbjct: 152 WLKSEEGKARIAEMIDEFLERWGPTWLLG-------GIAEMFLR 188


>gnl|CDD|178524 PLN02936, PLN02936, epsilon-ring hydroxylase.
          Length = 489

 Score = 33.2 bits (76), Expect = 0.48
 Identities = 15/51 (29%), Positives = 27/51 (52%), Gaps = 9/51 (17%)

Query: 379 ADQVKKAEAEVIRIVQTKKPVYAKLCSLADAKKIKGLRAMFDET---YPDP 426
            + ++KA+ E+ R++Q + P Y       D K++K L    +E+   YP P
Sbjct: 309 PEALRKAQEELDRVLQGRPPTY------EDIKELKYLTRCINESMRLYPHP 353


>gnl|CDD|202551 pfam03142, Chitin_synth_2, Chitin synthase.  Members of this family
           are fungal chitin synthase EC:2.4.1.16 enzymes. They
           catalyze chitin synthesis as follows:
           UDP-N-acetyl-D-glucosamine +
           {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N) <=> UDP +
           {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N+1).
          Length = 527

 Score = 33.2 bits (76), Expect = 0.49
 Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 1/50 (2%)

Query: 33  IVFFSIYLLSVPSMYLLLILYSLINLNVVSWG-TREVAVKKTKKVDLTST 81
           + +  IYLL++P    +L  Y+  + +  SWG TR V  +K KK      
Sbjct: 459 VGWMIIYLLALPIWNFVLPSYAFWHFDDFSWGDTRVVEGEKKKKAHGEDE 508


>gnl|CDD|224596 COG1682, TagG, ABC-type polysaccharide/polyol phosphate export
           systems, permease component [Carbohydrate transport and
           metabolism / Cell envelope biogenesis, outer membrane].
          Length = 263

 Score = 32.2 bits (74), Expect = 0.74
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 26  YLIEFFCIVFFSIYLLSVPSMYLLLILYSLINLNVVSWG 64
           +LI    I+ F I L   PS + LL+L +L+ L + S G
Sbjct: 122 FLIHLIIILIFLIILGVEPSWHWLLLLPALLLLILFSVG 160


>gnl|CDD|226370 COG3852, NtrB, Signal transduction histidine kinase, nitrogen
           specific [Signal transduction mechanisms].
          Length = 363

 Score = 32.3 bits (74), Expect = 0.76
 Identities = 21/95 (22%), Positives = 41/95 (43%), Gaps = 23/95 (24%)

Query: 176 AQVIETEVLNVKNTLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKS 235
           AQ++E  +         P+ A  + L++ II+  +              LRNL+  ++  
Sbjct: 154 AQLLERAL---------PDEAL-RELTQLIIEEADR-------------LRNLVDRLEVL 190

Query: 236 LDEAERLQKAAHASKVLDRVKSLVEERKADKFIVE 270
             +    +   +  +VL+RV++LVE   AD   + 
Sbjct: 191 GPQRPGDRVPVNIHEVLERVRALVEAEFADNVRLI 225



 Score = 32.3 bits (74), Expect = 0.76
 Identities = 21/95 (22%), Positives = 41/95 (43%), Gaps = 23/95 (24%)

Query: 500 AQVIETEVLNVKNTLSNPNTADSKALSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKS 559
           AQ++E  +         P+ A  + L++ II+  +              LRNL+  ++  
Sbjct: 154 AQLLERAL---------PDEAL-RELTQLIIEEADR-------------LRNLVDRLEVL 190

Query: 560 LDEAERLQKAAHASKVLDRVKSLVEERKADKFIVE 594
             +    +   +  +VL+RV++LVE   AD   + 
Sbjct: 191 GPQRPGDRVPVNIHEVLERVRALVEAEFADNVRLI 225


>gnl|CDD|226805 COG4365, COG4365, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 537

 Score = 31.8 bits (72), Expect = 1.5
 Identities = 16/62 (25%), Positives = 26/62 (41%), Gaps = 6/62 (9%)

Query: 201 LSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLVE 260
           L K  +D+  ++   +     KE L   L T+ +   E E      H   +L+RVK  + 
Sbjct: 151 LHKLTLDVPPKVPTVSRVTLDKEWLAEWLATVFRFEAETE------HTDALLERVKRAIA 204

Query: 261 ER 262
           E 
Sbjct: 205 ES 206



 Score = 31.8 bits (72), Expect = 1.5
 Identities = 16/62 (25%), Positives = 26/62 (41%), Gaps = 6/62 (9%)

Query: 525 LSKKIIDITEEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLVE 584
           L K  +D+  ++   +     KE L   L T+ +   E E      H   +L+RVK  + 
Sbjct: 151 LHKLTLDVPPKVPTVSRVTLDKEWLAEWLATVFRFEAETE------HTDALLERVKRAIA 204

Query: 585 ER 586
           E 
Sbjct: 205 ES 206


>gnl|CDD|238544 cd01124, KaiC, KaiC is a circadian clock protein primarily found in
           cyanobacteria KaiC is a RecA-like ATPase, having both
           Walker A and Walker B motifs. A related protein is found
           in archaea.
          Length = 187

 Score = 30.4 bits (69), Expect = 2.0
 Identities = 18/78 (23%), Positives = 32/78 (41%), Gaps = 8/78 (10%)

Query: 224 DLRNLLKTMKKSLDEAERLQKAAHASKVL----DRVKSLVEERKADKFIV----EILDAG 275
           DL  L      ++ +A+  +     S +      R+K  +EE KA + ++     +L   
Sbjct: 52  DLERLEDEGLLAIVDADPDEIGPAESSLRLELIQRLKDAIEEFKAKRVVIDSVSGLLLME 111

Query: 276 SNTKIPRLRSFEFGLKRQ 293
            +T    +R   F LKR 
Sbjct: 112 QSTARLEIRRLLFALKRF 129


>gnl|CDD|220629 pfam10199, Adaptin_binding, Alpha and gamma adaptin binding protein
           p34.  p34 is a protein involved in membrane trafficking.
           It is known to interact with both alpha and gamma
           adaptin. It has been speculated that p34 may play a
           chaperone role such as preventing the soluble adaptors
           from co-assembling with soluble clathrin, or helping to
           remove the adaptors from the coated vesicle. Another
           possible function is in aiding the recruitment of
           soluble adaptors onto the membrane.
          Length = 101

 Score = 29.0 bits (65), Expect = 2.3
 Identities = 25/97 (25%), Positives = 37/97 (38%), Gaps = 11/97 (11%)

Query: 157 GIRRIV-ALSGNEATKAIKKAQVIETEVLNVKNTLSNPNTADSKALSKKIIDITEEIAKA 215
           GI RIV AL  ++ +    K           +   S+ +  DS   SK     ++E    
Sbjct: 4   GIERIVEALETHDWSNMELKDGDGIDLEEEKEGLKSSDDQGDSLEPSKS----SQEEDDD 59

Query: 216 TLPYWKKEDLRNLLKTMK------KSLDEAERLQKAA 246
                  ED  NLL  ++       SL + ER + AA
Sbjct: 60  GDEELDVEDFENLLTKLQQARDMTASLPDEERKKFAA 96



 Score = 29.0 bits (65), Expect = 2.3
 Identities = 25/97 (25%), Positives = 37/97 (38%), Gaps = 11/97 (11%)

Query: 481 GIRRIV-ALSGNEATKAIKKAQVIETEVLNVKNTLSNPNTADSKALSKKIIDITEEIAKA 539
           GI RIV AL  ++ +    K           +   S+ +  DS   SK     ++E    
Sbjct: 4   GIERIVEALETHDWSNMELKDGDGIDLEEEKEGLKSSDDQGDSLEPSKS----SQEEDDD 59

Query: 540 TLPYWKKEDLRNLLKTMK------KSLDEAERLQKAA 570
                  ED  NLL  ++       SL + ER + AA
Sbjct: 60  GDEELDVEDFENLLTKLQQARDMTASLPDEERKKFAA 96


>gnl|CDD|223095 COG0016, PheS, Phenylalanyl-tRNA synthetase alpha subunit
           [Translation, ribosomal structure and biogenesis].
          Length = 335

 Score = 30.7 bits (70), Expect = 2.6
 Identities = 20/100 (20%), Positives = 35/100 (35%), Gaps = 5/100 (5%)

Query: 545 KKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLVEERKADKFIVEILDAGSNTKA 604
           KK  L +LLK + K     ER +  A  +++   V+  + E    +     L      + 
Sbjct: 37  KKGVLTDLLKKLGKLSPLEERKEVGALINELKKEVEDAITELTP-ELEAAGLWERLAFEK 95

Query: 605 LDAAL--KQVKTSAPHMAAMFIS--VDEAAKKIFCLSSVP 640
           +D  L  +++   + H     I    D      F     P
Sbjct: 96  IDVTLPGRRIYPGSLHPLTQTIEEIEDIFLGMGFTEVEGP 135



 Score = 29.9 bits (68), Expect = 4.7
 Identities = 15/59 (25%), Positives = 25/59 (42%), Gaps = 5/59 (8%)

Query: 221 KKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLVEERKADKFIVEILDAGSNTK 279
           KK  L +LLK + K     ER +  A  +++   V+  + E        E+  AG   +
Sbjct: 37  KKGVLTDLLKKLGKLSPLEERKEVGALINELKKEVEDAITELTP-----ELEAAGLWER 90


>gnl|CDD|193205 pfam12729, 4HB_MCP_1, Four helix bundle sensory module for signal
           transduction.  This family is a four helix bundle that
           operates as a ubiquitous sensory module in prokaryotic
           signal-transduction. The 4HB_MCP is always found between
           two predicted transmembrane helices indicating that it
           detects only extracellular signals. In many cases the
           domain is associated with a cytoplasmic HAMP domain
           suggesting that most proteins carrying the bundle might
           share the mechanism of transmembrane signalling which is
           well-characterized in E coli chemoreceptors.
          Length = 181

 Score = 29.9 bits (68), Expect = 2.8
 Identities = 30/143 (20%), Positives = 55/143 (38%), Gaps = 28/143 (19%)

Query: 491 NEATKAIKKAQVIETEVL----NVKNTLSNPNTADSKALSKKIIDITEEIAKATLPYWKK 546
            +    IK    I + +     N+   +   + A+   L K I ++  EI K        
Sbjct: 42  EDRLLPIKWLGEIRSNLREIRANLLELILTTDPAERDELLKDIEELRAEIDK-------- 93

Query: 547 EDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLVEE-RKADKFIVEILDAGSNTKAL 605
                L K  K  L E E+        K+ +  K  ++  RK    ++++  AG N +A 
Sbjct: 94  ----LLKKYEKTILTEEEK--------KLFNEFKEQLKAYRKVRNKVLDLAKAGKNDEAY 141

Query: 606 DAALKQVKTSAPHMAAMFISVDE 628
              L +++   P   A+  ++DE
Sbjct: 142 ALYLTELE---PARDAVIEALDE 161


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
           Provisional.
          Length = 569

 Score = 30.6 bits (70), Expect = 2.9
 Identities = 23/109 (21%), Positives = 48/109 (44%), Gaps = 22/109 (20%)

Query: 170 TKAIKKAQ------VIETEVLNVKNTLSNPNTADSKALSKKI-------IDITEEIAKAT 216
              ++ A+        E + +    TL+       + L K++        +ITE IA+  
Sbjct: 316 PDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAEQE 375

Query: 217 LPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLV-EERKA 264
           + Y    +L+  L+ + K L+E E+ Q+     K+ + ++ L  +E +A
Sbjct: 376 IAY---SELQEELEEILKQLEEIEKEQE-----KLSEMLQGLRKDELEA 416



 Score = 30.6 bits (70), Expect = 2.9
 Identities = 23/109 (21%), Positives = 48/109 (44%), Gaps = 22/109 (20%)

Query: 494 TKAIKKAQ------VIETEVLNVKNTLSNPNTADSKALSKKI-------IDITEEIAKAT 540
              ++ A+        E + +    TL+       + L K++        +ITE IA+  
Sbjct: 316 PDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAEQE 375

Query: 541 LPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLV-EERKA 588
           + Y    +L+  L+ + K L+E E+ Q+     K+ + ++ L  +E +A
Sbjct: 376 IAY---SELQEELEEILKQLEEIEKEQE-----KLSEMLQGLRKDELEA 416


>gnl|CDD|236671 PRK10308, PRK10308, 3-methyl-adenine DNA glycosylase II;
           Provisional.
          Length = 283

 Score = 30.5 bits (69), Expect = 3.0
 Identities = 19/99 (19%), Positives = 41/99 (41%), Gaps = 2/99 (2%)

Query: 394 QTKKPVYAKLCSLADAKKIKG-LRAMFDETYPDPVRVVCVGTPVEQLESEPESQAGMETS 452
           Q  + +  +L S+A A K+   +  ++ E   D    VC  TP     ++P++   +   
Sbjct: 116 QGVRAILGQLVSVAMAAKLTAKVAQLYGERLDDFPEYVCFPTPERLAAADPQALKALGMP 175

Query: 453 VEFCGG-THLAMAHHVGDFCISSEEAIAKGIRRIVALSG 490
           ++      HLA A   G   ++    + + ++ +    G
Sbjct: 176 LKRAEALIHLANAALEGTLPLTIPGDVEQAMKTLQTFPG 214


>gnl|CDD|223814 COG0743, Dxr, 1-deoxy-D-xylulose 5-phosphate reductoisomerase
           [Lipid metabolism].
          Length = 385

 Score = 30.3 bits (69), Expect = 3.3
 Identities = 19/70 (27%), Positives = 28/70 (40%), Gaps = 7/70 (10%)

Query: 583 VEERKADKFIVEILDAGSNTKALDAALKQVKTSAPHMAAMFISVDEAAKKIFCLSSVPKE 642
           V  R  DKF V  L AG N   ++   +Q++   P    +    DE+A K      +P  
Sbjct: 20  VIRRNPDKFEVVALAAGKN---VELLAEQIREFKPKYVVI---ADESAAKELE-DLLPGT 72

Query: 643 AVSAGLTAID 652
            V  G   + 
Sbjct: 73  EVLVGEEGLC 82


>gnl|CDD|227612 COG5293, COG5293, Predicted ATPase [General function prediction
           only].
          Length = 591

 Score = 30.7 bits (69), Expect = 3.4
 Identities = 16/85 (18%), Positives = 33/85 (38%), Gaps = 7/85 (8%)

Query: 158 IRRIVALSGNEATKAIKKAQVIE-TEVLNVKNTLSNPNTADSKALSKKIID-ITEEIAKA 215
           I  + +L   E T+  K+ +V +    LN  +       A   A ++++++ + E IA+ 
Sbjct: 220 IGSLDSLEEIETTELRKQDEVNKKQATLNTFD-----FHAQDYAETEELVNTVDERIAEL 274

Query: 216 TLPYWKKEDLRNLLKTMKKSLDEAE 240
                  +     +KT  K      
Sbjct: 275 NNRRISMQSHWKRVKTSLKEQILFC 299



 Score = 30.7 bits (69), Expect = 3.4
 Identities = 16/85 (18%), Positives = 33/85 (38%), Gaps = 7/85 (8%)

Query: 482 IRRIVALSGNEATKAIKKAQVIE-TEVLNVKNTLSNPNTADSKALSKKIID-ITEEIAKA 539
           I  + +L   E T+  K+ +V +    LN  +       A   A ++++++ + E IA+ 
Sbjct: 220 IGSLDSLEEIETTELRKQDEVNKKQATLNTFD-----FHAQDYAETEELVNTVDERIAEL 274

Query: 540 TLPYWKKEDLRNLLKTMKKSLDEAE 564
                  +     +KT  K      
Sbjct: 275 NNRRISMQSHWKRVKTSLKEQILFC 299


>gnl|CDD|217556 pfam03435, Saccharop_dh, Saccharopine dehydrogenase.  This family
           comprised of three structural domains that can not be
           separated in the linear sequence. In some organisms this
           enzyme is found as a bifunctional polypeptide with
           lysine ketoglutarate reductase. The saccharopine
           dehydrogenase can also function as a saccharopine
           reductase.
          Length = 380

 Score = 30.0 bits (68), Expect = 4.1
 Identities = 13/41 (31%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 572 ASKVLDRVKSLVEERKADKFIVEILDAGSNTKALDAALKQV 612
           A + L++ ++L   +   +FI   +DA  N +AL A LK+ 
Sbjct: 28  ADRSLEKAQALAAPKLGLRFIAIAVDA-DNYEALVALLKEG 67


>gnl|CDD|239511 cd03419, GRX_GRXh_1_2_like, Glutaredoxin (GRX) family, GRX human
           class 1 and 2 (h_1_2)-like subfamily; composed of
           proteins similar to human GRXs, approximately 10 kDa in
           size, and proteins containing a GRX or GRX-like domain.
           GRX is a glutathione (GSH) dependent reductase,
           catalyzing the disulfide reduction of target proteins
           such as ribonucleotide reductase. It contains a redox
           active CXXC motif in a TRX fold and uses a similar
           dithiol mechanism employed by TRXs for intramolecular
           disulfide bond reduction of protein substrates. Unlike
           TRX, GRX has preference for mixed GSH disulfide
           substrates, in which it uses a monothiol mechanism where
           only the N-terminal cysteine is required. The flow of
           reducing equivalents in the GRX system goes from NADPH
           -> GSH reductase -> GSH -> GRX -> protein substrates. By
           altering the redox state of target proteins, GRX is
           involved in many cellular functions including DNA
           synthesis, signal transduction and the defense against
           oxidative stress. Different classes are known including
           human GRX1 and GRX2, which are members of this
           subfamily. Also included in this subfamily are the
           N-terminal GRX domains of proteins similar to human
           thioredoxin reductase 1 and 3.
          Length = 82

 Score = 27.9 bits (63), Expect = 4.2
 Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 2/48 (4%)

Query: 577 DRVKSLVEERKADKFIVEILDAGSNTKALDAALKQVKTSAPHMAAMFI 624
            R KSL++E      +VE LD   +   +   L++  T    +  +FI
Sbjct: 14  KRAKSLLKELGVKPAVVE-LDQHEDGSEIQDYLQE-LTGQRTVPNVFI 59


>gnl|CDD|222658 pfam14296, O-ag_pol_Wzy, O-antigen polysaccharide polymerase Wzy.
           This family includes O-antigen polysaccharide
           polymerases. These enzymes link O-units via a glycosidic
           linkage to form a long O-antigen. These enzymes vary in
           specificity and sequence.
          Length = 440

 Score = 30.0 bits (68), Expect = 4.6
 Identities = 13/43 (30%), Positives = 23/43 (53%), Gaps = 6/43 (13%)

Query: 22  PDQVYLIEFFCIVFFSIYLLSVPS------MYLLLILYSLINL 58
           P  + LI  F  + F +YL + PS      + ++ +LY L++L
Sbjct: 168 PYFIILIGSFFPIAFCLYLATKPSKKKILFILIIYLLYLLLSL 210


>gnl|CDD|232888 TIGR00233, trpS, tryptophanyl-tRNA synthetase.  This model
           represents tryptophanyl-tRNA synthetase. Some members of
           the family have a pfam00458 domain amino-terminal to the
           region described by this model [Protein synthesis, tRNA
           aminoacylation].
          Length = 327

 Score = 29.6 bits (67), Expect = 5.1
 Identities = 33/142 (23%), Positives = 55/142 (38%), Gaps = 30/142 (21%)

Query: 474 SEEAIAKGIRRIVALSGNEATKAIKKAQVIETEVLNVKNTLSNPNTADSKALSKKIIDIT 533
           + + I K IR+  A  G   T    + +     +L +   LS     D      K+ +I 
Sbjct: 206 TPKQIKKKIRKA-ATDGGRVTLFEHREKPGVPNLLVIYQYLSFFLIDDD-----KLKEIY 259

Query: 534 EEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLVEER--KADKF 591
           E      L Y +           KK+L   E LQ+          +K + E R   A++ 
Sbjct: 260 EAYKSGKLGYGE----------CKKAL--IEVLQEF---------LKEIQERRAEIAEEI 298

Query: 592 IVEILDAGSNTKALDAALKQVK 613
           + +IL+ G+  KA + A K + 
Sbjct: 299 LDKILEPGAK-KARETANKTLA 319



 Score = 28.8 bits (65), Expect = 8.7
 Identities = 30/131 (22%), Positives = 49/131 (37%), Gaps = 18/131 (13%)

Query: 150 SEEAIAKGIRRIVALSGNEATKAIKKAQVIETEVLNVKNTLSNPNTADSKALSKKIIDIT 209
           + + I K IR+  A  G   T    + +     +L +   LS     D      K+ +I 
Sbjct: 206 TPKQIKKKIRKA-ATDGGRVTLFEHREKPGVPNLLVIYQYLSFFLIDDD-----KLKEIY 259

Query: 210 EEIAKATLPY--WKK---EDLRNLLKTMKKSLDEAERLQKAAH-ASKVLDRVKSLVEERK 263
           E      L Y   KK   E L+  LK ++       R + A     K+L+       E  
Sbjct: 260 EAYKSGKLGYGECKKALIEVLQEFLKEIQ-----ERRAEIAEEILDKILEPGAKKARE-T 313

Query: 264 ADKFIVEILDA 274
           A+K + ++  A
Sbjct: 314 ANKTLADVYKA 324


>gnl|CDD|183530 PRK12444, PRK12444, threonyl-tRNA synthetase; Reviewed.
          Length = 639

 Score = 29.7 bits (67), Expect = 5.6
 Identities = 30/129 (23%), Positives = 54/129 (41%), Gaps = 19/129 (14%)

Query: 337 HTATHILNHALRFVLGPEADQKGSLVAPDRLRFDFTNKGAMTADQVKKAEAEVIRIVQTK 396
           H+A HIL  A++ + G      G  V  +   +D     ++  + ++K E E+ +I+   
Sbjct: 77  HSAAHILAQAVKRLYGDVNLGVGP-VIENGFYYDMDLPSSVNVEDLRKIEKEMKKIINEN 135

Query: 397 KPVYAKLCSLADAKKIKGLRAMFDETYPDPVRVVCVGTPVEQLESEPESQA-GMETSVEF 455
             +     S  +A K      +F E   D ++       +E LE+ P  ++  +    EF
Sbjct: 136 IKIERVEVSREEAAK------LFQEMN-DRLK-------LELLEAIPSGESITLYKQGEF 181

Query: 456 ---CGGTHL 461
              C G HL
Sbjct: 182 VDLCRGPHL 190


>gnl|CDD|180601 PRK06520, PRK06520,
           5-methyltetrahydropteroyltriglutamate--homocysteine
           methyltransferase; Provisional.
          Length = 368

 Score = 29.7 bits (67), Expect = 6.0
 Identities = 15/48 (31%), Positives = 21/48 (43%), Gaps = 8/48 (16%)

Query: 355 ADQKGSLVAPD-----RLRFDFTNKGAMTADQVKKAEAEVIRIVQTKK 397
           AD  GS + P      R +F     G + A  ++K E   IR V  K+
Sbjct: 10  ADVVGSFLRPAAIKQARQQFA---AGEIDAAALRKIEDMEIRKVVEKQ 54


>gnl|CDD|221955 pfam13173, AAA_14, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily.
          Length = 127

 Score = 28.3 bits (64), Expect = 6.5
 Identities = 15/56 (26%), Positives = 24/56 (42%), Gaps = 13/56 (23%)

Query: 560 LDEAERLQKAAHASKVLDRVKSLVEERKADKFIVEILDAGSNTKALDAALKQVKTS 615
           LDE    QK       L   K L ++ +  + I+     GS++  L    K++ TS
Sbjct: 66  LDEI---QKVPDWEDAL---KRLYDDGRNLRIIL----TGSSSLLL---SKEIATS 108


>gnl|CDD|220499 pfam09979, DUF2213, Uncharacterized protein conserved in bacteria
           (DUF2213).  Members of this family of bacterial proteins
           comprise various hypothetical and phage-related
           proteins. The exact function of these proteins has not,
           as yet, been determined.
          Length = 170

 Score = 28.8 bits (65), Expect = 6.7
 Identities = 13/39 (33%), Positives = 16/39 (41%), Gaps = 6/39 (15%)

Query: 301 SHGYSYIEPCVEICAWTGGRSEGIPGSARRRPIMSNHTA 339
           S GY            T G ++G P  A +R I  NH A
Sbjct: 121 SCGYD----ADYD--QTPGVTDGQPYDAVQRDIRGNHVA 153


>gnl|CDD|177157 MTH00094, ND4, NADH dehydrogenase subunit 4; Provisional.
          Length = 403

 Score = 29.5 bits (67), Expect = 6.9
 Identities = 10/35 (28%), Positives = 16/35 (45%), Gaps = 5/35 (14%)

Query: 30 FFCIVFFSIYLLSVPSMYLLLILYSLINLNVVSWG 64
             + FFS +L      +L L+ +S +  N  SW 
Sbjct: 1  LSLLFFFSPFL-----FFLFLMFFSFLMFNNYSWS 30


>gnl|CDD|172536 PRK14042, PRK14042, pyruvate carboxylase subunit B; Provisional.
          Length = 596

 Score = 29.3 bits (65), Expect = 7.4
 Identities = 42/155 (27%), Positives = 66/155 (42%), Gaps = 11/155 (7%)

Query: 174 KKAQVIETEV-LNVKNTLSNPNTADSKALSKKIIDITE--EIAKATLPYWKKEDLRNLLK 230
           ++ + I  EV L  +     P    S AL KK I  TE  E+    L   + + L+N + 
Sbjct: 356 ERYKTITNEVKLYCQGKYGTPPGKISSALRKKAIGRTEVIEVRPGDLLPNELDQLQNEIS 415

Query: 231 TMKKSLDEAERLQKAAHASKVLDRVKSLVEERKADKFIVEILDAGSNTKIPRLRSFEFGL 290
            +  +L + + L  A       +  +  +E+RK ++ I E L   S+     + S EF +
Sbjct: 416 DL--ALSDEDVLLYAMFP----EIGRQFLEQRKNNQLIPEPLLTQSSAPDNSVMS-EFDI 468

Query: 291 KRQTSPYHVKSHGYSYIEPCVEIC-AWTGGRSEGI 324
                 YHVK  GY  IE   + C  W  G  E +
Sbjct: 469 ILHGESYHVKVAGYGMIEHGQQSCFLWVDGVPEEV 503


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 29.4 bits (67), Expect = 8.0
 Identities = 28/154 (18%), Positives = 61/154 (39%), Gaps = 32/154 (20%)

Query: 480 KGIRRIVALSGNEATKAIKKAQVIETEVLNVKNTLSNPNTADSKALSKKIIDITEEIAKA 539
           +G++R+        +K   + +  E E+  + +        + +AL + I+   EE AK 
Sbjct: 93  EGLQRV------RISKLSDEEEFFEAEIELLPDEPI-DEEREIEALVRSILSEFEEYAK- 144

Query: 540 TLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDRVKSLVEERKADKFIVEILDAG 599
                  E+L++L      S+D+  +L          D + + +  +  +K   EIL+  
Sbjct: 145 LNKKIPPEELQSL-----NSIDDPGKL---------ADTIAAHLPLKLEEKQ--EILETL 188

Query: 600 SNTKALDAALKQVKTSAPHMAAMFISVDEAAKKI 633
              + L+  L  ++          I + +  K+I
Sbjct: 189 DVKERLEKLLDLLEKE--------IDLLQLEKRI 214


>gnl|CDD|225315 COG2517, COG2517, Predicted RNA-binding protein containing a
           C-terminal EMAP domain [General function prediction
           only].
          Length = 219

 Score = 28.7 bits (64), Expect = 8.6
 Identities = 30/144 (20%), Positives = 58/144 (40%), Gaps = 27/144 (18%)

Query: 150 SEEAIAKGIRRIVAL--SGNEATKAIKKAQVIETEVLNVKNTLSNPNTADSKALSKKIID 207
           +E+A+   IR I +    G +  +AI+  + IE+E+  ++ +   P         K + +
Sbjct: 13  AEKAVELFIRTIESGGFRGWDRKRAIEALKSIESELQALRYSYLEPEELVDTEQMKDLKE 72

Query: 208 ITEEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDR------------- 254
             E I +A         L        K L++A+ ++   +    LDR             
Sbjct: 73  KAEGIVEA---------LGGEKDWSTKFLEQADEVRFFLNTILNLDRGEINDPVIAVDIV 123

Query: 255 ---VKSLVEERKADKFIVEILDAG 275
              V S+ +   ADK +V +++ G
Sbjct: 124 VGEVMSVGKHPNADKLLVTVVNIG 147



 Score = 28.7 bits (64), Expect = 8.6
 Identities = 30/144 (20%), Positives = 58/144 (40%), Gaps = 27/144 (18%)

Query: 474 SEEAIAKGIRRIVAL--SGNEATKAIKKAQVIETEVLNVKNTLSNPNTADSKALSKKIID 531
           +E+A+   IR I +    G +  +AI+  + IE+E+  ++ +   P         K + +
Sbjct: 13  AEKAVELFIRTIESGGFRGWDRKRAIEALKSIESELQALRYSYLEPEELVDTEQMKDLKE 72

Query: 532 ITEEIAKATLPYWKKEDLRNLLKTMKKSLDEAERLQKAAHASKVLDR------------- 578
             E I +A         L        K L++A+ ++   +    LDR             
Sbjct: 73  KAEGIVEA---------LGGEKDWSTKFLEQADEVRFFLNTILNLDRGEINDPVIAVDIV 123

Query: 579 ---VKSLVEERKADKFIVEILDAG 599
              V S+ +   ADK +V +++ G
Sbjct: 124 VGEVMSVGKHPNADKLLVTVVNIG 147


>gnl|CDD|214437 MTH00152, ND6, NADH dehydrogenase subunit 6; Provisional.
          Length = 163

 Score = 28.3 bits (64), Expect = 9.5
 Identities = 10/43 (23%), Positives = 19/43 (44%)

Query: 20  LTPDQVYLIEFFCIVFFSIYLLSVPSMYLLLILYSLINLNVVS 62
           L+P+ ++   F  + F       +   +  L LYS   L++ S
Sbjct: 71  LSPNPLFGSNFKFVFFSLTLFGFLGLFFFFLYLYSSGGLSLSS 113


>gnl|CDD|234870 PRK00943, PRK00943, selenophosphate synthetase; Provisional.
          Length = 347

 Score = 29.0 bits (66), Expect = 9.6
 Identities = 23/96 (23%), Positives = 33/96 (34%), Gaps = 18/96 (18%)

Query: 605 LDAALKQVKTSAPHMAAMFISVDEAAKKIFCLSSVPKEAVSAGLTAIDWVRYLTDIMG-G 663
           L  A K+ K    H             +  C  + P     A    +  V  +TD+ G G
Sbjct: 185 LTTAEKKSKLKPEHYGLA--------IEAMCQLNRP----GADFAKLPGVHAMTDVTGFG 232

Query: 664 KGGGKAESAQASG----PNYENAPLVLSKARDYASQ 695
             G   E  Q +G     +Y   PL L    +Y +Q
Sbjct: 233 LLGHLLEMCQGAGLTARVDYAAVPL-LPGVEEYIAQ 267


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.131    0.368 

Gapped
Lambda     K      H
   0.267   0.0906    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,026,397
Number of extensions: 3465827
Number of successful extensions: 4760
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4707
Number of HSP's successfully gapped: 137
Length of query: 699
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 595
Effective length of database: 6,324,786
Effective search space: 3763247670
Effective search space used: 3763247670
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (27.3 bits)