RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9760
         (133 letters)



>gnl|CDD|109874 pfam00836, Stathmin, Stathmin family.  The Stathmin family of
           proteins play an important role in the regulation of the
           microtubule cytoskeleton. They regulate microtubule
           dynamics by promoting depolymerization of microtubules
           and/or preventing polymerisation of tubulin
           heterodimers.
          Length = 140

 Score = 39.2 bits (91), Expect = 9e-05
 Identities = 37/119 (31%), Positives = 65/119 (54%), Gaps = 1/119 (0%)

Query: 9   EIRCEEKSKGGIKYDVILAEPA-GTPPPLKTAADTPVILRSESIEEKLKQAEERRLSLEA 67
           E++   K   G  +++IL  P+  + P    ++     L  E I++KL+ AEERR S EA
Sbjct: 4   EVKQLNKRASGQAFELILKPPSFDSVPEFNLSSPKKKDLSLEEIQKKLEAAEERRKSQEA 63

Query: 68  EKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEISSEKREALINDKMEKLKD 126
           E + + A +R    +  +K  E+ N F +  +E L +++E   E REA +  K+E+L++
Sbjct: 64  EVLKQLAEKREHEREVLQKAIEENNNFSKMAEEKLTQKMESIKENREAQLAAKLERLQE 122


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 32.1 bits (74), Expect = 0.064
 Identities = 31/89 (34%), Positives = 46/89 (51%), Gaps = 13/89 (14%)

Query: 46  LRSESIEEKLKQAEERRLSL------EAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTK 99
           +R +  E K+K+AEE    +      EAE I K A   +K E+  K R+E E E  E+  
Sbjct: 24  VRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAK-EEIHKLRNEFEKELRERRN 82

Query: 100 E--TLEKRLEISSEKREALINDKMEKLKD 126
           E   LEKRL     ++E  ++ K+E L+ 
Sbjct: 83  ELQKLEKRLL----QKEENLDRKLELLEK 107



 Score = 29.0 bits (66), Expect = 0.76
 Identities = 23/74 (31%), Positives = 41/74 (55%), Gaps = 2/74 (2%)

Query: 52  EEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEISSE 111
           +E L +A+E    L  E   +   RR++++   K+  +KE E +++  E LEKR E   E
Sbjct: 56  KEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKE-ENLDRKLELLEKR-EEELE 113

Query: 112 KREALINDKMEKLK 125
           K+E  +  K ++L+
Sbjct: 114 KKEKELEQKQQELE 127


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 31.4 bits (72), Expect = 0.10
 Identities = 16/79 (20%), Positives = 29/79 (36%), Gaps = 12/79 (15%)

Query: 45  ILRSESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEK 104
           +    +  E +++ +E    +E E        R  + +  K+  +K  E +    E    
Sbjct: 207 LEEEGTPSELIREIKEELEEIEKE--------RESLLEELKELAKKYLEELLALYEY--- 255

Query: 105 RLEISSEKREALINDKMEK 123
            LEI  E+ EAL       
Sbjct: 256 -LEIELERAEALSKFLKTD 273



 Score = 26.8 bits (60), Expect = 4.0
 Identities = 15/73 (20%), Positives = 34/73 (46%), Gaps = 5/73 (6%)

Query: 53  EKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEISSEK 112
           +KL+        L   +  K       +E+  K  +E+  +  ++ KE  E+  E+ +E 
Sbjct: 60  DKLRSYLP---KLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEI 116

Query: 113 REALINDKMEKLK 125
           +E  +  ++E+L+
Sbjct: 117 KE--LEQEIERLE 127


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 31.2 bits (71), Expect = 0.11
 Identities = 16/79 (20%), Positives = 42/79 (53%), Gaps = 4/79 (5%)

Query: 49  ESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEI 108
           E ++EK+K  + +   L+  K+       S ++   K + E++NE ++   +  +K  + 
Sbjct: 280 EKLQEKIKALKYQLKRLK--KMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKK 337

Query: 109 SSEKREAL--INDKMEKLK 125
             +K++ +  + +++EKL+
Sbjct: 338 EEKKKKQIERLEERIEKLE 356


>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
           (DUF2317).  Members of this family of hypothetical
           bacterial proteins have no known function.
          Length = 541

 Score = 30.6 bits (70), Expect = 0.22
 Identities = 21/92 (22%), Positives = 41/92 (44%), Gaps = 11/92 (11%)

Query: 36  LKTAADTPVILRSESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFI 95
           L+  A++ + +  E  + +L+   +R L  EA KI  +      +E A     E     I
Sbjct: 415 LEEKANSEIDIDFEEAKAELEAQFKR-LREEAAKIDPS------LEGAL----EANEAKI 463

Query: 96  EQTKETLEKRLEISSEKREALINDKMEKLKDH 127
            +  E LEKRL  +  ++   +  + ++L+  
Sbjct: 464 LKQLEFLEKRLLKAQRRKHEELLRQFDRLQTA 495


>gnl|CDD|233499 TIGR01631, Trypano_RHS, trypanosome RHS (retrotransposon hot spot)
           family.  This model describes full-length and
           part-length members of the RHS (retrotransposon hot
           spot) family in Trypanosoma brucei and Trypanosoma
           cruzi. Members of this family are frequently interrupted
           by non-LTR retrotransposons inserted at exactly the same
           relative position.
          Length = 760

 Score = 30.2 bits (68), Expect = 0.30
 Identities = 18/49 (36%), Positives = 23/49 (46%), Gaps = 2/49 (4%)

Query: 14  EKSKGGIKYDVILAEPAGTPPPLKTAADTPVILRSESIEEKLKQAEERR 62
           EK KG I YD   A+    PPP    +   VIL S   E   K+  ++R
Sbjct: 356 EKVKGYIIYDF--AKDDDEPPPADPPSGWGVILLSSPNESNFKEWSKQR 402


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 29.5 bits (67), Expect = 0.38
 Identities = 15/54 (27%), Positives = 26/54 (48%), Gaps = 6/54 (11%)

Query: 46  LRSESIEEKLKQ-----AEERRLSLEAEKIAKNAARRSKI-EDAAKKRDEKENE 93
            R E  E+ LK       EE +   E +K  +  A+ +K+  +  +K +EKE +
Sbjct: 264 TREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERK 317


>gnl|CDD|236529 PRK09465, tolC, outer membrane channel protein; Reviewed.
          Length = 446

 Score = 29.5 bits (67), Expect = 0.51
 Identities = 11/30 (36%), Positives = 22/30 (73%)

Query: 47  RSESIEEKLKQAEERRLSLEAEKIAKNAAR 76
           + +S+   LK+AE+R LSL + +++++ AR
Sbjct: 227 KPQSVNALLKEAEKRNLSLLSARLSQDLAR 256


>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
           Endonuclease that cleaves the 5'-overhanging flap
           structure that is generated by displacement synthesis
           when DNA polymerase encounters the 5'-end of a
           downstream Okazaki fragment. Has 5'-endo-/exonuclease
           and 5'-pseudo-Y-endonuclease activities. Cleaves the
           junction between single and double-stranded regions of
           flap DNA.
          Length = 338

 Score = 29.1 bits (66), Expect = 0.66
 Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 1/57 (1%)

Query: 31  GTPPPLKTAADTPVILRSESIEEKLKQAEERRLSLEAEKIAKNAAR-RSKIEDAAKK 86
           G PP LK           E  EEK ++A E+    EA K A+ ++R  S+I +++KK
Sbjct: 81  GKPPELKAETLEERREIREEAEEKWEEALEKGDLEEARKYAQRSSRLTSEIVESSKK 137


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 28.4 bits (64), Expect = 0.84
 Identities = 22/93 (23%), Positives = 43/93 (46%), Gaps = 7/93 (7%)

Query: 47  RSESIEEKLKQAEERRLSLEA------EKIAKNAARRSKIEDAAKKRDEKE-NEFIEQTK 99
           R   I + L +AE  +   +A      +++ +   + S+I + AKK  E+   E   + +
Sbjct: 38  RQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAE 97

Query: 100 ETLEKRLEISSEKREALINDKMEKLKDHVSVTT 132
           E LE+  E +  + EA     +E+L+  V+   
Sbjct: 98  EELERIKEAAEAEIEAEKERALEELRAEVAELA 130


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 28.8 bits (65), Expect = 0.87
 Identities = 15/55 (27%), Positives = 31/55 (56%)

Query: 59  EERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEISSEKR 113
              R++   +K+ +      K++ A  + +E  +EF EQ+K+ LE + +IS+ K+
Sbjct: 297 GPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQ 351


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 29.0 bits (64), Expect = 0.87
 Identities = 24/76 (31%), Positives = 44/76 (57%), Gaps = 4/76 (5%)

Query: 52   EEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAK----KRDEKENEFIEQTKETLEKRLE 107
             E+ K+AEE + + E  KI    A++   ED  K    K+DE+E + I   K+  EK+ E
Sbjct: 1712 AEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAE 1771

Query: 108  ISSEKREALINDKMEK 123
               +++EA+I +++++
Sbjct: 1772 EIRKEKEAVIEEELDE 1787



 Score = 26.6 bits (58), Expect = 4.8
 Identities = 21/75 (28%), Positives = 34/75 (45%)

Query: 51   IEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEISS 110
             +E  K+AEE + + EA+K A+ A +++       +  +K  E  +   E      E + 
Sbjct: 1304 ADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAE 1363

Query: 111  EKREALINDKMEKLK 125
            EK EA    K E  K
Sbjct: 1364 EKAEAAEKKKEEAKK 1378



 Score = 26.3 bits (57), Expect = 7.3
 Identities = 20/74 (27%), Positives = 33/74 (44%)

Query: 52   EEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEISSE 111
             E+ K+AEE + + E  KI      +   ED  K  + K+ E  E+      K+    ++
Sbjct: 1643 AEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702

Query: 112  KREALINDKMEKLK 125
            K E L   + E+ K
Sbjct: 1703 KAEELKKKEAEEKK 1716


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 28.7 bits (65), Expect = 0.92
 Identities = 19/94 (20%), Positives = 44/94 (46%), Gaps = 7/94 (7%)

Query: 38  TAADT-PVILRSESIE--EKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEF 94
           +A D      +  +I     L   E  R+  +AE+    AA   K ++  + ++E E E+
Sbjct: 481 SAKDKGTGKEQKITITASSGLSDDEIERMVKDAEE---YAAEDKKRKERIEAKNEAE-EY 536

Query: 95  IEQTKETLEKRLEISSEKREALINDKMEKLKDHV 128
           +   +++L++  +   E  +  + + +E LK+ +
Sbjct: 537 VYSLEKSLKEEGDKLPEADKKKVEEAIEWLKEEL 570



 Score = 27.6 bits (62), Expect = 2.3
 Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 12/92 (13%)

Query: 46  LRSESIEEKLKQAEERRLSLEAEKI---AKNAAR------RSKIEDAAKKRDEKENEFIE 96
           L  + IE  +K AEE     +  K    AKN A          +++   K  E + + +E
Sbjct: 501 LSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVE 560

Query: 97  QTKETLEKRLEISSEKREALINDKMEKLKDHV 128
           +  E L++  E+  E +E  I  K E+L+  V
Sbjct: 561 EAIEWLKE--ELEGEDKEE-IEAKTEELQKVV 589


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 28.5 bits (64), Expect = 1.0
 Identities = 15/68 (22%), Positives = 29/68 (42%), Gaps = 7/68 (10%)

Query: 49  ESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEI 108
            +I E+ +QAE +   L+A+       +++ +E         +++F E   E L K   +
Sbjct: 674 LAIAERKQQAETQLRQLDAQLKQLLEQQQAFLE-------ALKDDFRELRTERLAKWQVV 726

Query: 109 SSEKREAL 116
             E    L
Sbjct: 727 EGELDNQL 734



 Score = 26.6 bits (59), Expect = 4.6
 Identities = 14/80 (17%), Positives = 34/80 (42%), Gaps = 4/80 (5%)

Query: 52  EEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEISSE 111
              L++ +  + SL+ +     A R+ + E   ++ D +  + +EQ +  LE    +  +
Sbjct: 655 RLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLE---ALKDD 711

Query: 112 KREALINDKMEKLKDHVSVT 131
            RE    +++ K +      
Sbjct: 712 FRELR-TERLAKWQVVEGEL 730


>gnl|CDD|189037 cd09867, PIN_FEN1, PIN domain of Flap Endonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Flap endonuclease-1 (FEN1) is
           involved in multiple DNA metabolic pathways, including
           DNA replication processes (5' flap DNA endonuclease
           activity and double stranded DNA 5'-exonuclease
           activity) and DNA repair processes (long-patch base
           excision repair) in eukaryotes and archaea. Interaction
           between FEN1 and PCNA (Proliferating cell nuclear
           antigen) is an essential prerequisite to FEN1's DNA
           replication functionality and stimulates FEN1 nuclease
           activity by 10-50 fold. FEN1 belongs to the
           FEN1-EXO1-like family of structure-specific, 5'
           nucleases. These nucleases contain a PIN (PilT N
           terminus) domain with a helical arch/clamp region (I
           domain) of variable length (approximately 45 residues in
           FEN1 PIN domains) and a H3TH (helix-3-turn-helix)
           domain, an atypical helix-hairpin-helix-2-like region.
           Both the H3TH domain (not included here) and the helical
           arch/clamp region are involved in DNA binding. Nucleases
           within this group also have a carboxylate-rich active
           site that is involved in binding essential divalent
           metal ion cofactors (Mg2+/Mn2+).  FEN1 has a C-terminal
           extension containing residues forming the consensus
           PIP-box - Qxx(M/L/I)xxF(Y/F) which serves to anchor FEN1
           to PCNA.
          Length = 261

 Score = 28.2 bits (64), Expect = 1.3
 Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 31  GTPPPLKTAADTPVILRSESIEEKLKQAEERRLSLEAEKIAKNAARRSK--IEDAAK 85
           G PP LK+        R E  EEKL++A+E   + EA K AK   R +K  +E+  K
Sbjct: 82  GKPPELKSGELEKRRERREEAEEKLEEAKEEGDAEEARKYAKRTVRVTKEIVEECKK 138


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 28.0 bits (63), Expect = 1.5
 Identities = 21/91 (23%), Positives = 42/91 (46%), Gaps = 15/91 (16%)

Query: 49  ESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEF-------------I 95
           + ++ ++ Q E  R +L+ EK   +  R   +E    + +E+  +              I
Sbjct: 408 DELDRRIIQLEIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGI 467

Query: 96  EQTKETLEK-RLEISSEKREALINDKMEKLK 125
           +Q KE +E+ RLE+   +RE  +  K  +L+
Sbjct: 468 QQIKEEIEQVRLELEQAEREGDLA-KAAELQ 497


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 28.1 bits (63), Expect = 1.5
 Identities = 15/71 (21%), Positives = 26/71 (36%)

Query: 53  EKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEISSEK 112
           EKL   E +    E    A+        +    +         EQ ++ L K L+   E+
Sbjct: 98  EKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEE 157

Query: 113 REALINDKMEK 123
            +A    K+E+
Sbjct: 158 EKAQRVKKIEE 168


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 28.1 bits (63), Expect = 1.5
 Identities = 17/81 (20%), Positives = 26/81 (32%)

Query: 45  ILRSESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEK 104
            L       +    E     L  E+         KI +  K+ +E E   IE+ KE   K
Sbjct: 291 KLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELNSK 350

Query: 105 RLEISSEKREALINDKMEKLK 125
             EI  +  +     +  K  
Sbjct: 351 LEEIQKKLEDLEKRLEKLKSN 371


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 28.3 bits (64), Expect = 1.6
 Identities = 16/79 (20%), Positives = 38/79 (48%), Gaps = 5/79 (6%)

Query: 49  ESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRL-E 107
           E  E  LK+AE+ +  LE +K          +E+A    +++  + I++ K+  ++ + E
Sbjct: 537 EEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEA----EKEAQQAIKEAKKEADEIIKE 592

Query: 108 ISSEKREALINDKMEKLKD 126
           +   ++    + K  +L +
Sbjct: 593 LRQLQKGGYASVKAHELIE 611


>gnl|CDD|218425 pfam05091, eIF-3_zeta, Eukaryotic translation initiation factor 3
           subunit 7 (eIF-3).  This family is made up of eukaryotic
           translation initiation factor 3 subunit 7 (eIF-3
           zeta/eIF3 p66/eIF3d). Eukaryotic initiation factor 3 is
           a multi-subunit complex that is required for binding of
           mRNA to 40 S ribosomal subunits, stabilisation of
           ternary complex binding to 40 S subunits, and
           dissociation of 40 and 60 S subunits. These functions
           and the complex nature of eIF3 suggest multiple
           interactions with many components of the translational
           machinery. The gene coding for the protein has been
           implicated in cancer in mammals.
          Length = 517

 Score = 27.7 bits (62), Expect = 1.8
 Identities = 20/60 (33%), Positives = 32/60 (53%), Gaps = 6/60 (10%)

Query: 38  TAADTPVILRSESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQ 97
           TAAD P+     SI           L+LEA KI +N A++  +E+  K + ++EN F ++
Sbjct: 288 TAADPPLDDDENSINS------PSSLALEATKINQNFAQQVLLENDEKYQFDEENPFYDE 341


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 27.6 bits (62), Expect = 1.8
 Identities = 18/64 (28%), Positives = 29/64 (45%)

Query: 62  RLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEISSEKREALINDKM 121
           R +L+AE+      +  K  +  K+  EK    +E   E +EKR E   +  E    D++
Sbjct: 113 RKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEI 172

Query: 122 EKLK 125
             LK
Sbjct: 173 AFLK 176


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 27.8 bits (62), Expect = 1.9
 Identities = 21/73 (28%), Positives = 37/73 (50%), Gaps = 2/73 (2%)

Query: 49  ESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENE--FIEQTKETLEKRL 106
             +EE  ++ EE+  +L  E      A R ++ +   +  E  NE   +++  E+LE+RL
Sbjct: 354 AELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERL 413

Query: 107 EISSEKREALIND 119
           E  SE+ E L  +
Sbjct: 414 ERLSERLEDLKEE 426



 Score = 27.4 bits (61), Expect = 2.8
 Identities = 19/69 (27%), Positives = 33/69 (47%), Gaps = 1/69 (1%)

Query: 47  RSESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRL 106
           R E +EE+L++ EE    L+   + +       +E+A  K  E+  E  E+ +   E+  
Sbjct: 738 RLEELEEELEELEEELEELQER-LEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796

Query: 107 EISSEKREA 115
           E+  E  EA
Sbjct: 797 ELEEELEEA 805


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 27.7 bits (62), Expect = 2.0
 Identities = 23/65 (35%), Positives = 31/65 (47%), Gaps = 1/65 (1%)

Query: 53  EKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRD-EKENEFIEQTKETLEKRLEISSE 111
           E+LKQ  E  L     +  K AA   K+E A KK    +    I Q KE +E++ E   E
Sbjct: 37  EELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEE 96

Query: 112 KREAL 116
            + AL
Sbjct: 97  LKRAL 101


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 27.7 bits (63), Expect = 2.0
 Identities = 20/72 (27%), Positives = 30/72 (41%), Gaps = 8/72 (11%)

Query: 53  EKLKQAEERRLSLEAEKIAKNAAR--RSKIEDAAKKRDEKENEFIEQTKE------TLEK 104
           ++L + +E R  L+ E     A R   SK    AK++ E     I + KE       LE 
Sbjct: 28  DELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALEA 87

Query: 105 RLEISSEKREAL 116
            L+    + E L
Sbjct: 88  ELDELEAELEEL 99


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 27.6 bits (62), Expect = 2.0
 Identities = 19/69 (27%), Positives = 37/69 (53%), Gaps = 3/69 (4%)

Query: 51  IEEKLKQAEERRLSLEAEKIAKNAARRSKIE-DAAKKRDE--KENEFIEQTKETLEKRLE 107
           IEE  K+AE  +     E   +    R+++E +  ++R+E  +    + Q +ETL++++E
Sbjct: 38  IEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKME 97

Query: 108 ISSEKREAL 116
              +K E L
Sbjct: 98  SLDKKEENL 106



 Score = 26.8 bits (60), Expect = 4.5
 Identities = 18/74 (24%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 49  ESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEI 108
              +E L +A+E    L AE   +   RR++++   ++  ++E E +++  E+L+K+ E 
Sbjct: 47  TLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQRE-ETLDRKMESLDKKEEN 105

Query: 109 SSEKREALINDKME 122
             +K + L N +  
Sbjct: 106 LEKKEKELSNKEKN 119


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 27.3 bits (61), Expect = 2.0
 Identities = 19/48 (39%), Positives = 25/48 (52%), Gaps = 2/48 (4%)

Query: 48  SESIEEKLKQAEERRLSLEAEKIAKNAARRS--KIEDAAKKRDEKENE 93
           S+S E KLK+AE++    E     K    RS  KIE   +KR  K +E
Sbjct: 147 SKSAERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSE 194


>gnl|CDD|206316 pfam14147, Spore_YhaL, Sporulation protein YhaL.  This family of
           proteins is involved in sporulation. In B. subtilis its
           expression is regulated by the early
           mother-cell-specific transcription factor sigma-E.
          Length = 52

 Score = 25.8 bits (57), Expect = 2.1
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query: 83  AAKKRDEKENEFIEQTKETLEKRLEISSEKR 113
            AK+  EKE EFIE+  E   +RLE   E+R
Sbjct: 22  TAKEEREKEQEFIEKEGEVYMERLEEERERR 52


>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences.
          Length = 379

 Score = 27.6 bits (62), Expect = 2.3
 Identities = 24/95 (25%), Positives = 40/95 (42%), Gaps = 5/95 (5%)

Query: 24  VILAEPAGTPPPLKTAADTPVILRSESIEEKL-KQAEERRLSLEAEKIAKNAARRSKIED 82
           +++AE        K+A D   +   E IEE+L ++ EE     EAEK    A ++   + 
Sbjct: 43  LVVAESGE----RKSAVDKLAMKPLEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKK 98

Query: 83  AAKKRDEKENEFIEQTKETLEKRLEISSEKREALI 117
           A K   + ++E     +    +  E     R  LI
Sbjct: 99  AKKAIKKGKDEEALAEELLELEAEEPEPPLRPRLI 133


>gnl|CDD|226641 COG4172, COG4172, ABC-type uncharacterized transport system,
           duplicated ATPase component [General function prediction
           only].
          Length = 534

 Score = 27.6 bits (62), Expect = 2.3
 Identities = 13/26 (50%), Positives = 16/26 (61%), Gaps = 2/26 (7%)

Query: 27  AEPAGTPPPLKTAADTPVILRSESIE 52
           AEP+G PPPL    D PV+L  E + 
Sbjct: 260 AEPSGDPPPL--PEDAPVLLEVEDLR 283


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 27.3 bits (61), Expect = 2.5
 Identities = 17/75 (22%), Positives = 32/75 (42%), Gaps = 2/75 (2%)

Query: 49  ESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEI 108
           E +E  LK+  E  LS   E++        +     + R E E E  E+ ++  E++L  
Sbjct: 188 EELERALKEKREELLSKLEEELLARLES-KEAALEKQLRLEFERE-KEELRKKYEEKLRQ 245

Query: 109 SSEKREALINDKMEK 123
             E++      K++ 
Sbjct: 246 ELERQAEAHEQKLKN 260


>gnl|CDD|220506 pfam09989, DUF2229, CoA enzyme activase uncharacterized domain
           (DUF2229).  Members of this family include various
           bacterial hypothetical proteins, as well as CoA enzyme
           activases. The exact function of this domain has not, as
           yet, been defined.
          Length = 218

 Score = 27.1 bits (61), Expect = 2.5
 Identities = 16/78 (20%), Positives = 31/78 (39%)

Query: 40  ADTPVILRSESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTK 99
            +  + L S  +  K K+A  +RL    +K+  +     K  + A +  E   + + +  
Sbjct: 109 LEKGIRLLSPFLNLKDKEALAKRLYELGKKLGISKEEIKKAVEKALEEQEAFKKDLRKKG 168

Query: 100 ETLEKRLEISSEKREALI 117
           E     LE   +K   L+
Sbjct: 169 EEALAYLEEEGKKGIVLL 186


>gnl|CDD|180941 PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated.
          Length = 174

 Score = 26.8 bits (60), Expect = 2.7
 Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 6/57 (10%)

Query: 47  RSESIEEKLKQAEER------RLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQ 97
           R E+I + LK+AEER       L+   +K+A+      +I   AK R E     IE+
Sbjct: 51  RREAILQALKEAEERLRQAAQALAEAQQKLAQAQQEAERIRADAKARAEAIRAEIEK 107


>gnl|CDD|221059 pfam11285, DUF3086, Protein of unknown function (DUF3086).  This
          family of proteins with unknown function appears to be
          restricted to Cyanobacteria.
          Length = 283

 Score = 26.9 bits (60), Expect = 3.2
 Identities = 15/31 (48%), Positives = 20/31 (64%), Gaps = 1/31 (3%)

Query: 51 IEEKLKQAEERRLSLEAEKIAKNAARRSKIE 81
          +EE LK  E+RR +LE E I K   R+ +IE
Sbjct: 2  VEEALKDLEQRRQALEIE-IEKLERRKEQIE 31


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 26.9 bits (60), Expect = 3.2
 Identities = 13/52 (25%), Positives = 26/52 (50%)

Query: 51  IEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETL 102
           +E+  ++ EE+R   E E++ + A  RS+     +K  E + E  E+ +   
Sbjct: 119 LEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHRA 170


>gnl|CDD|193527 cd05650, M20_ArgE_DapE_like1, M20 Peptidases with similarity to
           acetylornithine deacetylases and
           succinyl-diaminopimelate desuccinylases.  Peptidase M20
           family, Uncharacterized protein subfamily with
           similarity to acetylornithine
           deacetylase/succinyl-diaminopimelate desuccinylase
           (ArgE/DapE) subfamily. ArgE/DapE enzymes catalyze
           analogous reactions and share a common activator, the
           metal ion (usually Co2+ or Zn2+). ArgE catalyzes a broad
           range of substrates, including N-acetylornithine,
           alpha-N-acetylmethionine and alpha-N-formylmethionine,
           while DapE catalyzes the hydrolysis of
           N-succinyl-L,L-diaminopimelate (L,L-SDAP) to
           L,L-diaminopimelate and succinate. Proteins in this
           subfamily are mostly bacterial and archaeal, and have
           been inferred by homology as being related to both ArgE
           and DapE.
          Length = 393

 Score = 26.8 bits (60), Expect = 3.4
 Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 5/42 (11%)

Query: 13  EEKSKGGIKYDVILAEPAGTPPPLKTAADTPVILR-SESIEE 53
           EEK    I+ +V+  E A  PPP  T  D+ ++ R   +I+E
Sbjct: 296 EEKYGVKIEVEVVQREDA--PPP--TPPDSEIVRRLKRAIKE 333


>gnl|CDD|237256 PRK12901, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 1112

 Score = 26.9 bits (60), Expect = 3.6
 Identities = 20/83 (24%), Positives = 40/83 (48%), Gaps = 5/83 (6%)

Query: 51  IEEKLKQAEERRLSLEAEKI-AKNAARRSKIEDAAKKRDEKENEFIEQTKET--LEKRLE 107
           I EK+K       +L  +++ AK    +  I++A    D K  E   +  E+  +++R +
Sbjct: 23  IVEKIKAEYPELEALSNDELRAKTDEFKQYIKEAVADIDAKIEELKAEAIESLDIDERED 82

Query: 108 ISSE--KREALINDKMEKLKDHV 128
           I ++  K E    + +EK+ D +
Sbjct: 83  IYAQIDKLEKEAYEILEKVLDEI 105


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 26.8 bits (60), Expect = 3.9
 Identities = 19/79 (24%), Positives = 33/79 (41%), Gaps = 3/79 (3%)

Query: 48  SESIEEKLKQAEERRLSLEAEKIAKNAAR--RSKIEDAAKKRDEKENEFIEQTKETLEKR 105
                +KL + +E R  L  E     A R   SK    A KR E + E +    + L+++
Sbjct: 24  DALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEK 83

Query: 106 LEISSEKREALINDKMEKL 124
           L+   E     +  +++ L
Sbjct: 84  LK-ELEAALDELEAELDTL 101


>gnl|CDD|221117 pfam11460, DUF3007, Protein of unknown function (DUF3007).  This is
           a family of uncharacterized proteins found in bacteria
           and eukaryotes.
          Length = 104

 Score = 26.1 bits (58), Expect = 3.9
 Identities = 11/37 (29%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 84  AKKRDEKENEFIEQTKETLEKRLE-ISSEKREALIND 119
            ++R +    +   T E L+KR + +S E+ EAL  +
Sbjct: 65  MQQRRDYREAYDALTNEELQKRFDSLSPEELEALQAE 101


>gnl|CDD|220700 pfam10337, DUF2422, Protein of unknown function (DUF2422).  This is
           a family of proteins conserved in fungi. The function is
           not known. This family is the C-terminal half of some
           member proteins which contain the DUF2421 pfam10334
           domain at their N-terminus.
          Length = 438

 Score = 26.9 bits (60), Expect = 4.0
 Identities = 10/34 (29%), Positives = 18/34 (52%), Gaps = 1/34 (2%)

Query: 85  KKRDEKENEFIEQTKETLEKRLEI-SSEKREALI 117
            ++ ++    +++T E LEK L      KR AL+
Sbjct: 401 VEKQKEAAAELKETAEALEKALADFKDAKRLALL 434


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 26.6 bits (58), Expect = 4.2
 Identities = 18/72 (25%), Positives = 38/72 (52%), Gaps = 2/72 (2%)

Query: 44  VILRSESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLE 103
           +I+R E IE+  ++ EE    +E E+  +    R   E+  ++R+E+  E  EQ+ ++  
Sbjct: 121 LIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIE--EQSDDSEH 178

Query: 104 KRLEISSEKREA 115
           + +E    + E+
Sbjct: 179 EIIEQDESETES 190


>gnl|CDD|227702 COG5415, COG5415, Predicted integral membrane metal-binding protein
           [General function prediction only].
          Length = 251

 Score = 26.5 bits (58), Expect = 4.3
 Identities = 18/66 (27%), Positives = 26/66 (39%), Gaps = 5/66 (7%)

Query: 60  ERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEISSEKREALIND 119
           E R +    K+AK  A   K  +  K+           T   ++K  E  + K + L N 
Sbjct: 95  EFRNNRRLRKLAKLRAIHRKKLEKLKEETH-----YNATSSIIQKYSEELNAKYQELNNL 149

Query: 120 KMEKLK 125
           K EK K
Sbjct: 150 KTEKEK 155


>gnl|CDD|130046 TIGR00973, leuA_bact, 2-isopropylmalate synthase, bacterial type.
           This is the first enzyme of leucine biosynthesis. A
           larger family of homologous proteins includes
           homocitrate synthase, distinct lineages of
           2-isopropylmalate synthase, several distinct,
           uncharacterized, orthologous sets in the Archaea, and
           other related enzymes. This model describes a family of
           2-isopropylmalate synthases found primarily in Bacteria.
           The homologous families in the Archaea may represent
           isozymes and/or related enzymes [Amino acid
           biosynthesis, Pyruvate family].
          Length = 494

 Score = 26.6 bits (59), Expect = 4.4
 Identities = 14/51 (27%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 74  AARRSKIEDAAKKRDEKE-NEFIEQTKETLEKRLEISSEKREALINDKMEK 123
            A + ++E+   K D++E ++  E+ KE  +K+ E++ E  EAL+ ++  +
Sbjct: 332 HAFKDRLEELGFKLDDEELDKLFEKFKELADKKKEVTDEDLEALVFEEKRQ 382


>gnl|CDD|237917 PRK15134, PRK15134, microcin C ABC transporter ATP-binding protein
           YejF; Provisional.
          Length = 529

 Score = 26.6 bits (59), Expect = 4.4
 Identities = 8/26 (30%), Positives = 13/26 (50%), Gaps = 2/26 (7%)

Query: 27  AEPAGTPPPLKTAADTPVILRSESIE 52
           +EP+G P PL        +L  E ++
Sbjct: 259 SEPSGDPVPL--PEPASPLLDVEQLQ 282


>gnl|CDD|214369 CHL00118, atpG, ATP synthase CF0 B' subunit; Validated.
          Length = 156

 Score = 26.1 bits (58), Expect = 5.0
 Identities = 17/78 (21%), Positives = 40/78 (51%), Gaps = 4/78 (5%)

Query: 47  RSESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRL 106
           R E I + L +A E  +  +A ++ K      ++  A K+   +  +  ++ KE +E  L
Sbjct: 54  RKEYIRKNLTKASE--ILAKANELTKQ--YEQELSKARKEAQLEITQSQKEAKEIVENEL 109

Query: 107 EISSEKREALINDKMEKL 124
           + + +  ++L+N+  ++L
Sbjct: 110 KQAQKYIDSLLNEATKQL 127


>gnl|CDD|232986 TIGR00464, gltX_bact, glutamyl-tRNA synthetase, bacterial family.
           The glutamyl-tRNA synthetases of the eukaryotic cytosol
           and of the Archaea are more similar to glutaminyl-tRNA
           synthetases than to bacterial glutamyl-tRNA synthetases.
           This model models just the bacterial and mitochondrial
           forms of the enzyme. In many species, the charging of
           tRNA(gln) proceeds first through misacylation with Glu
           and then transamidation. For this reason, glutamyl-tRNA
           synthetases may act on both tRNA(gln) and tRNA(glu).
           This model is highly specific. Proteins with positive
           scores below the trusted cutoff may be fragments rather
           than full-length sequences [Protein synthesis, tRNA
           aminoacylation].
          Length = 470

 Score = 26.2 bits (58), Expect = 7.2
 Identities = 21/69 (30%), Positives = 32/69 (46%), Gaps = 9/69 (13%)

Query: 58  AEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKE-NEFIEQTKETLEKRLEISSEKREAL 116
           AE  RL  E +K           EDA KK  +K   E +E  K+ L+   E ++E+ ++ 
Sbjct: 364 AELIRLFFEDKKEVD--------EDAFKKHLKKNVKEVLEALKKKLQALEEWTAEEVKSA 415

Query: 117 INDKMEKLK 125
           I    E+L 
Sbjct: 416 IKQIAEELG 424


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 25.8 bits (56), Expect = 7.3
 Identities = 17/100 (17%), Positives = 36/100 (36%), Gaps = 3/100 (3%)

Query: 4   LFPATEIRCEEKSKGGIKYDVILAEPAGTPPPLKTAADTPVILRSESIEEKLKQAEERRL 63
           ++ A  + C +         +  AEP+    P  T    P+       E +  Q    RL
Sbjct: 115 VYYAKRVPCSQPVSILSPNTLKEAEPSAEVQP--TTMTLPIAEHPTITENQSFQPWPERL 172

Query: 64  SLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLE 103
               E++ + ++         K    K+ + + + +E L+
Sbjct: 173 HNNVEELLQ-SSLSLGGSVQVKAPKPKQEQLLSKLQEYLQ 211


>gnl|CDD|235686 PRK06061, PRK06061, amidase; Provisional.
          Length = 483

 Score = 25.8 bits (57), Expect = 7.8
 Identities = 9/20 (45%), Positives = 10/20 (50%)

Query: 22  YDVILAEPAGTPPPLKTAAD 41
            DV+LA     PPP   A D
Sbjct: 382 VDVVLAPTTAQPPPRVGAFD 401


>gnl|CDD|223953 COG1022, FAA1, Long-chain acyl-CoA synthetases (AMP-forming) [Lipid
           metabolism].
          Length = 613

 Score = 25.7 bits (57), Expect = 8.4
 Identities = 19/87 (21%), Positives = 33/87 (37%), Gaps = 4/87 (4%)

Query: 42  TPVILRSESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKET 101
             +I+      EK  ++  + +S   E++A++      I     K +++   F EQ K  
Sbjct: 512 VALIVPDFDALEKWAESLNKVISASREELARDPKLLKLILPRVNKGNKRLFGF-EQIK-- 568

Query: 102 LEKRLEISSEKREALINDKMEKLKDHV 128
            +  L       E        KLK HV
Sbjct: 569 -KFVLLPKEFTPENGELTPTLKLKRHV 594


>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
           [DNA replication, recombination, and repair].
          Length = 417

 Score = 25.8 bits (57), Expect = 8.6
 Identities = 16/72 (22%), Positives = 31/72 (43%), Gaps = 5/72 (6%)

Query: 47  RSESIEEKLKQAEERRLSLEAEKIAKNAARRSK-----IEDAAKKRDEKENEFIEQTKET 101
           + ++  E +K+  +R+    A+K +    R+         +  K   ++E   +E  K  
Sbjct: 346 KVKTAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLE 405

Query: 102 LEKRLEISSEKR 113
             KRLE   +KR
Sbjct: 406 ELKRLENGKQKR 417


>gnl|CDD|181465 PRK08557, PRK08557, hypothetical protein; Provisional.
          Length = 417

 Score = 25.9 bits (57), Expect = 8.6
 Identities = 17/53 (32%), Positives = 31/53 (58%), Gaps = 7/53 (13%)

Query: 77  RSKIEDAAKKRD---EKENEFIEQTKETLEKRLEISSEKREALINDKMEKLKD 126
           R KI+D + K++   EK  +++E+ KE    R+E   E   +++ D +EK K+
Sbjct: 131 RIKIKDLSLKKELDFEKIEDYLEKNKE----RIEKLEENSLSILKDYIEKYKN 179


>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
          Length = 300

 Score = 25.8 bits (57), Expect = 9.3
 Identities = 14/74 (18%), Positives = 32/74 (43%), Gaps = 8/74 (10%)

Query: 49  ESIEEKLKQAEERRLSLEAEKIAKNAARRSKIEDAAKKRDEKENEFIEQTKETLEKRLEI 108
           +S+E+  K+A           I  N ++  K +   KK   K+ +  + T +  + R++ 
Sbjct: 235 DSLEDMYKKAHA--------AIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKA 286

Query: 109 SSEKREALINDKME 122
              +R   +  K++
Sbjct: 287 KKAQRRERLQKKIK 300


>gnl|CDD|99875 cd06064, H2MP_F420-Reduc, Endopeptidases belonging to F420-reducing
           hydrogenases group. These hydrogenases from methanogens
           are encoded by the fru, frc, or frh genes. Sequence
           comparison indicates that fruD and frcD gene products
           from Methanococcus voltae are similar to HycI protease
           of Escherichia coli and are putatively involved in the
           C-terminal processing of large subunits (FruA and FrcA
           respectively). FrhD (F420 reducing hydrogenase delta
           subunit) enzyme belongs to the gene cluster of
           8-hydroxy-5-deazaflavin (F420) reducing hydrogenase
           (FRH) from the thermophilic methanogen Methanobacterium
           thermoautotrophicum delta H. FrhD subunit is putatively
           involved in the processing of the coenzyme F420
           hydrogenase-processing. It is similar to those frhD
           genes found in Methanomicrobia and Methanobacteria. It
           is different from the FrhD conserved domain found in
           methyl viologen-reducing hydrogenase and
           F420-non-reducing hydrogenase iron-sulfur subunit D.
          Length = 150

 Score = 25.3 bits (56), Expect = 9.5
 Identities = 11/31 (35%), Positives = 15/31 (48%)

Query: 5   FPATEIRCEEKSKGGIKYDVILAEPAGTPPP 35
           +P  +   E K K GI+  VI  +P   P P
Sbjct: 95  WPLADPLHELKDKYGIEIVVIGCQPKRVPEP 125


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 25.7 bits (56), Expect = 9.9
 Identities = 19/93 (20%), Positives = 39/93 (41%), Gaps = 14/93 (15%)

Query: 52  EEKLKQAEERRLSLEAEKIAKNAARR------------SKIEDAAKKRDEKENEFIEQT- 98
           E   K+ E +R  +E E+ A   A               +++   +   + ENE  E   
Sbjct: 76  ELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKAENEAAEAEE 135

Query: 99  -KETLEKRLEISSEKREALINDKMEKLKDHVSV 130
             + L+ +L+  S K +    D++++ K+ +S 
Sbjct: 136 EAKLLKDKLDAESLKLQNEKEDQLKEAKESISR 168


>gnl|CDD|223274 COG0196, RibF, FAD synthase [Coenzyme metabolism].
          Length = 304

 Score = 25.7 bits (57), Expect = 10.0
 Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 3/48 (6%)

Query: 60  ERRLSLEAEKIAKNAA---RRSKIEDAAKKRDEKENEFIEQTKETLEK 104
               SLE   +  N      R K+      RDEK+ + +++ KE +EK
Sbjct: 247 GSERSLEVHILDFNGDLYGERVKVRFLKFIRDEKKFDSLDELKEQIEK 294


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.127    0.327 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,687,479
Number of extensions: 619569
Number of successful extensions: 2176
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1876
Number of HSP's successfully gapped: 615
Length of query: 133
Length of database: 10,937,602
Length adjustment: 86
Effective length of query: 47
Effective length of database: 7,123,158
Effective search space: 334788426
Effective search space used: 334788426
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 53 (24.1 bits)