RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16002
         (229 letters)



>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 35.4 bits (82), Expect = 0.020
 Identities = 31/151 (20%), Positives = 50/151 (33%), Gaps = 11/151 (7%)

Query: 46  LESYHVAISEAPKVPKSSSEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLE 105
           L      +     V +   EA    +E  V    L      +A +   + +   + D + 
Sbjct: 152 LRGLKFLLVRLGLVRREKLEALVGVIEDEV---ALYG-ENVEASVVIVVAHGAEDLDKVS 207

Query: 106 MKLNSLACEELTQHAGDTDDKSVENKLRERLKEMSQTLERV-SRTYELRQMQSTQLTNEL 164
             LN L  E       D     + ++L E + E+   LE + S    L +     +  EL
Sbjct: 208 KILNELGFELYEVPEFDGGPSELISELEEVIAEIQDELESLRSELEALAEK----IAEEL 263

Query: 165 KMANNALMNSFEKGKKKSQSRLKRLENEIVI 195
                 L    EK      S+L R E  + I
Sbjct: 264 LAVREIL--EIEKALGDVLSKLARTEYTLAI 292


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 32.6 bits (75), Expect = 0.18
 Identities = 35/210 (16%), Positives = 75/210 (35%), Gaps = 31/210 (14%)

Query: 29  IDKLDKERASIGNTTVKLESY----HVAISEAPKVPKSSSE--ARKMDLEMAVLMQELLS 82
           + KL      +     KL SY    +    E  KV   S E   + ++ E+  + +E+  
Sbjct: 45  LRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKE 104

Query: 83  MREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEMSQT 142
           + E+ +EL   I  LE E + LE    +                 ++  L    K +S  
Sbjct: 105 LEEEISELENEIKELEQEIERLE-PWGNF---------------DLDLSLLLGFKYVSVF 148

Query: 143 LERVSRTYELRQMQSTQLTNELKMANN------ALMNSFEKGKKKSQSRLKRLENE-IVI 195
           +  V           + + N   ++ +       ++   E   +     LK+L  E + +
Sbjct: 149 VGTVPEDKLEELKLESDVENVEYISTDKGYVYVVVVVLKELSDEVE-EELKKLGFERLEL 207

Query: 196 MSERHATQV-QHLKQRIVSLEEHIARYNSK 224
             E   +++ + +K+ +  +E+       +
Sbjct: 208 EEEGTPSELIREIKEELEEIEKERESLLEE 237


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 32.5 bits (74), Expect = 0.20
 Identities = 20/69 (28%), Positives = 36/69 (52%), Gaps = 8/69 (11%)

Query: 29  IDKLDKERASIGNTTVKLESYHVAISE-----APKVPKSSSEARKMD--LEMAVLMQELL 81
           I+K+DK  A+      +L  Y++ I E      P +P S+S+   +D  LE  +L+ E+ 
Sbjct: 354 INKIDKANANTERIKQQLAKYNL-IPEKWGGDTPMIPISASQGTNIDKLLETILLLAEIE 412

Query: 82  SMREDKAEL 90
            ++ D  +L
Sbjct: 413 DLKADPTQL 421


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 31.4 bits (72), Expect = 0.40
 Identities = 31/125 (24%), Positives = 53/125 (42%), Gaps = 36/125 (28%)

Query: 78  QELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLK 137
           QE+L   + K  L   +  LE E D+L+                      +E ++R+++K
Sbjct: 182 QEILETLDVKERLEKLLDLLEKEIDLLQ----------------------LEKRIRKKVK 219

Query: 138 EMSQTLERVSRTYELR-QMQSTQ----LTNELKMANNALMNSFEKG------KKKSQSRL 186
           E    +E+  R Y LR Q+++ Q      ++ K     L    EK       K+K++  L
Sbjct: 220 E---QMEKSQREYYLREQLKAIQKELGEDDDDKDEVEELREKIEKLKLPKEAKEKAEKEL 276

Query: 187 KRLEN 191
           K+LE 
Sbjct: 277 KKLET 281


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 31.5 bits (72), Expect = 0.42
 Identities = 34/147 (23%), Positives = 65/147 (44%), Gaps = 9/147 (6%)

Query: 76  LMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDD--KSVENKLR 133
           L+ +  S  E   E    I + E E    ++ +     EEL       ++  +    KL 
Sbjct: 290 LIDKYESHIEKALEELESILDTEKENSEFKLDV-----EELKALLEALEEILEKNLQKLE 344

Query: 134 ERLKEMSQTLERVSRTYELRQMQSTQLT-NELKMANNALMNSFEKGKKKSQSRLKRLE-N 191
           E+LK+ S ++E  S T  +  +       NEL   +N  +++ +K K K++ +L      
Sbjct: 345 EKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAKKKLWLHLVA 404

Query: 192 EIVIMSERHATQVQHLKQRIVSLEEHI 218
           E+    + +  + + L++ I SLE+ I
Sbjct: 405 ELKEDIDAYQKEKKGLEKAINSLEKEI 431


>gnl|CDD|220664 pfam10267, Tmemb_cc2, Predicted transmembrane and coiled-coil 2
           protein.  This family of transmembrane coiled-coil
           containing proteins is conserved from worms to humans.
           Its function is unknown.
          Length = 387

 Score = 30.9 bits (70), Expect = 0.50
 Identities = 29/113 (25%), Positives = 53/113 (46%), Gaps = 12/113 (10%)

Query: 114 EELTQHAGDTDDKSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMN 173
                 A         +KL E L+E+ +   R+  +YE  +    QL  + +    +L  
Sbjct: 192 GSGPGSAVAQQQSLNLDKLLEELREIKEGQSRLEESYERLK---EQLQRDYQYITQSL-- 246

Query: 174 SFEKGKKKSQSRLKRLENEIVIMSERHATQVQHLKQRIVSLEEHIARYNSKER 226
                 ++ + R +RLE ++  ++E H  ++ +LKQ + S+EE +A Y S ER
Sbjct: 247 ------QEERYRYERLEEQLNDLTELHQNEIANLKQELASMEEKVA-YQSYER 292


>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
           exopolysaccharide biosynthesis [Cell envelope
           biogenesis, outer membrane].
          Length = 458

 Score = 30.9 bits (70), Expect = 0.50
 Identities = 21/160 (13%), Positives = 54/160 (33%), Gaps = 28/160 (17%)

Query: 70  DLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVE 129
           + +++ L  +L S R   A+  AR+ +L     +                  +  +    
Sbjct: 238 EQQLSALNTQLQSARARLAQAEARLASLLQLLPLGREAAALR----------EVLESPTI 287

Query: 130 NKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRL 189
             LR++  ++ Q +  +S     +  Q   L  +L                      +++
Sbjct: 288 QDLRQQYAQVRQQIADLSTELGAKHPQLVALEAQLAELR------------------QQI 329

Query: 190 ENEIVIMSERHATQVQHLKQRIVSLEEHIARYNSKERNTS 229
             E+  +      ++  L+Q+  +LE+ +A+   +     
Sbjct: 330 AAELRQILASLPNELALLEQQEAALEKELAQLKGRLSKLP 369


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 31.2 bits (71), Expect = 0.50
 Identities = 33/165 (20%), Positives = 75/165 (45%), Gaps = 11/165 (6%)

Query: 70  DLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQH-AGDTDDKSV 128
           +LE+A+L+  L  +RE+  EL+  +   E E + L  +L  L  E+L +     ++ +  
Sbjct: 224 ELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELE-EKLEELRLEVSELEEE 282

Query: 129 ENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKG------KKKS 182
             +L++ L  ++  + R+ +  ++ + +   L  +L+     L     K         + 
Sbjct: 283 IEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAEL 342

Query: 183 QSRLKRLENEIVIMSERHAT---QVQHLKQRIVSLEEHIARYNSK 224
           + +L+ L+ E+  +         +++ L+ R+  LEE +    SK
Sbjct: 343 EEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSK 387



 Score = 30.8 bits (70), Expect = 0.74
 Identities = 42/208 (20%), Positives = 93/208 (44%), Gaps = 21/208 (10%)

Query: 20  AEMQLLLNNIDKLDKERASIGNTTVKLESYHVAISEAPKVPKSSSEARKMDLEMAVLMQE 79
            E+ LL+  +++L +E   +     + E     ++         +E ++++ ++  L  E
Sbjct: 225 LELALLVLRLEELREELEELQEELKEAEEELEELT---------AELQELEEKLEELRLE 275

Query: 80  LLSMREDKAELRARIYN-------LEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKL 132
           +  + E+  EL+  +Y        LE +K +L  +L +L  +     A   + +S  ++L
Sbjct: 276 VSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDEL 335

Query: 133 RERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENE 192
            E L E+ + LE +    E  + +  +L  EL+   + L    E+ +    S++ +LE +
Sbjct: 336 AEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLR-SKVAQLELQ 394

Query: 193 IVIMSERHATQVQHLKQRIVSLEEHIAR 220
           I         +++ L+ R+  LE+   R
Sbjct: 395 I----ASLNNEIERLEARLERLEDRRER 418



 Score = 29.3 bits (66), Expect = 1.8
 Identities = 39/196 (19%), Positives = 68/196 (34%), Gaps = 37/196 (18%)

Query: 64  SEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDT 123
            E  ++  E+  L +++ ++R+D A L A +  LE     L  +L  L  E         
Sbjct: 712 EELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEER-- 769

Query: 124 DDKSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNAL------------ 171
                  +  E L E    +E +    E  + +   L   L      L            
Sbjct: 770 -----LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRE 824

Query: 172 -MNSFEKGKKKSQSRLKRLENEIVIMSER-----------------HATQVQHLKQRIVS 213
            + S E+    ++ RL+ LE +I  +SE                    ++++ L     S
Sbjct: 825 RLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERAS 884

Query: 214 LEEHIARYNSKERNTS 229
           LEE +A   S+    S
Sbjct: 885 LEEALALLRSELEELS 900


>gnl|CDD|176470 cd01598, PurB, PurB_like adenylosuccinases (adenylsuccinate lyase,
           ASL).  This subgroup contains EcASL, the product of the
           purB gene in Escherichia coli, and related proteins. It
           is a member of the Lyase class I family of the Lyase_I
           superfamily. Members of the Lyase class I family
           function as homotetramers to catalyze similar
           beta-elimination reactions in which a Calpha-N or
           Calpha-O bond is cleaved with the subsequent release of
           fumarate as one of the products. The four active sites
           of the homotetrameric enzyme are each formed by residues
           from three different subunits. ASL catalyzes two
           non-sequential steps in the de novo purine biosynthesis
           pathway: the conversion of
           5-aminoimidazole-(N-succinylocarboxamide) ribotide
           (SAICAR) into 5-aminoimidazole-4-carboxamide ribotide
           (AICAR) and; the conversion of adenylsuccinate (SAMP)
           into adenosine monophosphate (AMP).
          Length = 425

 Score = 30.7 bits (70), Expect = 0.66
 Identities = 18/55 (32%), Positives = 26/55 (47%), Gaps = 9/55 (16%)

Query: 85  EDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEM 139
           E+   LRA I     E+D L +K      E  T H    D K+VE  L+E+ + +
Sbjct: 40  EELKFLRA-IIENFSEEDALRIK----EIEATTNH----DVKAVEYFLKEKFETL 85


>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
           Activating Proteins.  F-BAR domains are dimerization
           modules that bind and bend membranes and are found in
           proteins involved in membrane dynamics and actin
           reorganization. Slit-Robo GTPase Activating Proteins
           (srGAPs) are Rho GAPs that interact with Robo1, the
           transmembrane receptor of Slit proteins. Slit proteins
           are secreted proteins that control axon guidance and the
           migration of neurons and leukocytes. Vertebrates contain
           three isoforms of srGAPs, all of which are expressed
           during embryonic and early development in the nervous
           system but with different localization and timing.
           srGAPs contain an N-terminal F-BAR domain, a Rho GAP
           domain, and a C-terminal SH3 domain. F-BAR domains form
           banana-shaped dimers with a positively-charged concave
           surface that binds to negatively-charged lipid
           membranes. They can induce membrane deformation in the
           form of long tubules.
          Length = 241

 Score = 30.0 bits (68), Expect = 0.80
 Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 2/70 (2%)

Query: 126 KSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALM--NSFEKGKKKSQ 183
           + + ++L + L  +   L+   +TY     +S     +LK A           + K +  
Sbjct: 116 REIGSQLHDELLRVLNELQTAMKTYHTYHAESKSAERKLKEAEKQEEKQEQSPEKKLERS 175

Query: 184 SRLKRLENEI 193
              K++E E+
Sbjct: 176 RSSKKIEKEV 185


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 30.1 bits (68), Expect = 1.0
 Identities = 23/96 (23%), Positives = 44/96 (45%), Gaps = 3/96 (3%)

Query: 135 RLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENEI- 193
           +LK++ + +  + +    +Q Q  +L  +LK      + S E    ++   LK+L  +I 
Sbjct: 39  QLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETE-IASLEAQLIETADDLKKLRKQIA 97

Query: 194 VIMSERHATQVQHLKQRIVSLEEHIARYNSKERNTS 229
            + +  +A +VQ  +QR   L E +A      RN  
Sbjct: 98  DLNARLNALEVQEREQRRR-LAEQLAALQRSGRNPP 132



 Score = 27.0 bits (60), Expect = 9.5
 Identities = 18/91 (19%), Positives = 31/91 (34%), Gaps = 8/91 (8%)

Query: 130 NKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRL 189
               ++L  +   +       EL  + S Q   + K+A   L+   +K   +  S L   
Sbjct: 167 KATLKQLAAVRAEIAAEQA--ELTTLLSEQRAQQAKLA--QLLEERKKTLAQLNSELSAD 222

Query: 190 ENEIVIMSERHATQVQHLKQRIVSLEEHIAR 220
           + +     E        LK  I S E   A+
Sbjct: 223 QKK----LEELRANESRLKNEIASAEAAAAK 249


>gnl|CDD|146095 pfam03288, Pox_D5, Poxvirus D5 protein-like.  This family includes
           D5 from Poxviruses which is necessary for viral DNA
           replication, and is a nucleic acid independent
           nucleoside triphosphatase. Members of this family are
           also found outside of poxviruses. This domain is a
           DNA-binding winged HTH domain.
          Length = 88

 Score = 28.1 bits (63), Expect = 1.1
 Identities = 11/41 (26%), Positives = 14/41 (34%)

Query: 155 MQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENEIVI 195
           +        L    N  + SFEK KKK  +        I I
Sbjct: 43  LSKRTFQQRLAKHLNERIESFEKKKKKPGNVPNEYLEYIFI 83


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 30.0 bits (68), Expect = 1.2
 Identities = 27/190 (14%), Positives = 63/190 (33%), Gaps = 28/190 (14%)

Query: 66  ARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACE------ELTQH 119
             K + E   L++E  ++   K  +  ++ +LE E + L  +++ L          L + 
Sbjct: 218 KEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEEL 277

Query: 120 AGDTDDKSVE--NKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMAN--------- 168
                D   E   +++E++ E+   +  + R+   ++ +       L             
Sbjct: 278 NKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAE 337

Query: 169 -----------NALMNSFEKGKKKSQSRLKRLENEIVIMSERHATQVQHLKQRIVSLEEH 217
                          +   +   + +  L+ L  E+  + +  A     LK     LE+ 
Sbjct: 338 IEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKL 397

Query: 218 IARYNSKERN 227
               N  +R 
Sbjct: 398 KREINELKRE 407



 Score = 28.5 bits (64), Expect = 3.9
 Identities = 42/235 (17%), Positives = 89/235 (37%), Gaps = 27/235 (11%)

Query: 16  PDDKAEMQLLLNNIDKLDKERASIGNTTVKLESYHVAISEAPKVPKSSSEARKMDLEMA- 74
             + AE+Q L   ++ L +E +S+ +   ++E+    +S+            + ++E   
Sbjct: 670 RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLE 729

Query: 75  ------------------VLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEEL 116
                              L QE+ +++ +  EL ARI  LE +   LE  LN L     
Sbjct: 730 QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLS 789

Query: 117 TQHAGDTDDKSVE-NKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNAL---M 172
                +   +  +  +   R++   + +E+      L +    +   EL+     L   +
Sbjct: 790 HSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQI 849

Query: 173 NSFEKGKKKSQSRLKRLENEIVIMSERHATQVQHLKQRIVSLEEHIARYNSKERN 227
            S EK  +    + + LE E+    E     ++ L+ R+  L++      ++ R 
Sbjct: 850 KSIEKEIENLNGKKEELEEEL----EELEAALRDLESRLGDLKKERDELEAQLRE 900



 Score = 28.1 bits (63), Expect = 4.9
 Identities = 33/167 (19%), Positives = 74/167 (44%), Gaps = 7/167 (4%)

Query: 12  LQDRPDDKAEMQLLLNNI-DKLDKER-ASIGNTTVKLESYHVAISEA-----PKVPKSSS 64
           +++  +D  +++  LN++  +L   R   I     KLE     I         K+ + + 
Sbjct: 767 IEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTL 826

Query: 65  EARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTD 124
           E   ++ E+  L ++ + ++E    +   I NL  +K+ LE +L  L        +   D
Sbjct: 827 EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGD 886

Query: 125 DKSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNAL 171
            K   ++L  +L+E+ + +E +    E ++ + ++L  +L+     L
Sbjct: 887 LKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEEL 933


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 29.7 bits (67), Expect = 1.4
 Identities = 41/170 (24%), Positives = 72/170 (42%), Gaps = 3/170 (1%)

Query: 25  LLNNIDKLDKERASIGNTTVKLESYHVAISEAPKVPKSSSEARKMDLE--MAVLMQELLS 82
           L   I+ L +E         +LE     + EA +  +    A   +LE     L +EL  
Sbjct: 328 LKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAE 387

Query: 83  MREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEMSQT 142
           +  + AE+R  +  L+ E + LE +L  L+           + ++   +L+  L+E+++ 
Sbjct: 388 LEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEE 447

Query: 143 LERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENE 192
           LE +    E  + +  +L  EL      L    EK     ++RL RLE E
Sbjct: 448 LEELEEQLEELRDRLKELERELAELQEELQR-LEKELSSLEARLDRLEAE 496



 Score = 29.7 bits (67), Expect = 1.4
 Identities = 39/229 (17%), Positives = 89/229 (38%), Gaps = 15/229 (6%)

Query: 3   ETYKVLLHTLQDRPDDKAEMQLLLNNIDKLDKERASIGNTTVKLESYHVAISEAPKVPKS 62
                L        + + +++ L   +++L +E A++     +L+S    + E       
Sbjct: 692 SLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEE------- 744

Query: 63  SSEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGD 122
             E  +++ E+  L + L  + E+   L   +  L+ E + LE K  +L  E        
Sbjct: 745 --ELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEEL 802

Query: 123 TDDKSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNAL-MNSFEKGKKK 181
            + +   + L   L+ + Q  ER+ +  E+ +++      E K+      +   EK  ++
Sbjct: 803 EEAERRLDALERELESLEQRRERLEQ--EIEELEEEIEELEEKLDELEEELEELEKELEE 860

Query: 182 SQSRLKRLENEIVIMSERHATQVQH---LKQRIVSLEEHIARYNSKERN 227
            +  L+ LE E   + +      +    L++ +  LE  +A    +   
Sbjct: 861 LKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEK 909


>gnl|CDD|233162 TIGR00871, zwf, glucose-6-phosphate 1-dehydrogenase.  This enzyme
           (EC 1.1.1.49) acts on glucose 6-phospate and reduces
           NADP(+). An alternate name appearing in the literature
           for the human enzyme, based on a slower activity with
           beta-D-glucose, is glucose 1-dehydrogenase (EC
           1.1.1.47), but that name more properly describes a
           subfamily of the short chain dehydrogenases/reductases
           family. This is a well-studied enzyme family, with
           sequences available from well over 50 species. The
           trusted cutoff is set above the score for the Drosophila
           melanogaster CG7140 gene product, a homolog of unknown
           function. G6PD homologs from the bacteria Aquifex
           aeolicus and Helicobacter pylori lack several motifs
           well conserved most other members, were omitted from the
           seed alignment, and score well below the trusted cutoff
           [Energy metabolism, Pentose phosphate pathway].
          Length = 482

 Score = 29.6 bits (67), Expect = 1.5
 Identities = 12/60 (20%), Positives = 30/60 (50%), Gaps = 3/60 (5%)

Query: 86  DKAELRARIYN--LEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEMSQTL 143
              + R ++    ++ E + ++ +L+    + L+  +GD DD    + L E L+++ +T 
Sbjct: 47  SVEDFRKQVREAIIKFETEEIDEQLDDF-AQRLSYVSGDYDDDESYDSLNEHLEQLDKTR 105


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 28.8 bits (65), Expect = 2.0
 Identities = 21/89 (23%), Positives = 38/89 (42%), Gaps = 5/89 (5%)

Query: 132 LRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQ--SRLKRL 189
            RERL E+ Q L  +       Q Q  +L  E +     L     + ++  +  +R+K+L
Sbjct: 64  ARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQL 123

Query: 190 ENEIVIMSERHA---TQVQHLKQRIVSLE 215
               + + E +     ++  LKQ   +LE
Sbjct: 124 SANAIELDEENRELREELAELKQENEALE 152



 Score = 26.5 bits (59), Expect = 9.0
 Identities = 24/107 (22%), Positives = 46/107 (42%), Gaps = 4/107 (3%)

Query: 60  PKSSSEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQH 119
           P +     ++  E+A L +EL  ++E  AEL+     L+ E   LE +L  L  E     
Sbjct: 62  PSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKE--LAR 119

Query: 120 AGDTDDKSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKM 166
                  ++E  L E  +E+ + L  + +  E  + ++ +L    + 
Sbjct: 120 IKQLSANAIE--LDEENRELREELAELKQENEALEAENERLQENEQR 164


>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
          Length = 874

 Score = 29.3 bits (67), Expect = 2.1
 Identities = 33/140 (23%), Positives = 61/140 (43%), Gaps = 20/140 (14%)

Query: 19  KAEMQ--------LLLNNIDKLDKERASIGNTTV----KLESYHVAISEAPKVPKSSSEA 66
           +AEM         LLL   D  D+ R       +    +LES  +  ++  + P+ ++ A
Sbjct: 734 RAEMNIPPSKKLPLLLKGADAEDRARLEANEAYIKRLARLESLEIL-ADDEEAPEGAASA 792

Query: 67  RKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDK 126
                E+ + ++ L+ +  + A L   +  LE E + +E KL++   E     A     +
Sbjct: 793 VVGGAELFLPLEGLIDVEAELARLEKELAKLEKEIERVEKKLSN---EGFVAKA----PE 845

Query: 127 SVENKLRERLKEMSQTLERV 146
            V  K RE+L E  + L ++
Sbjct: 846 EVVEKEREKLAEYEEKLAKL 865


>gnl|CDD|235599 PRK05769, PRK05769, 4-aminobutyrate aminotransferase; Provisional.
          Length = 441

 Score = 28.7 bits (65), Expect = 2.6
 Identities = 18/86 (20%), Positives = 34/86 (39%), Gaps = 21/86 (24%)

Query: 109 NSLAC-----------EELTQHAGDTDDKSVENKLRERLKEMSQTLERVSRTYELRQMQS 157
           N +A            E L ++A     + +   LR+ LKE+ +  E +     L  M  
Sbjct: 322 NPVAAAAALATLEELEEGLLENA-----QKLGEYLRKELKELKEKYEFIGDVRGLGLMIG 376

Query: 158 TQLTNELK-----MANNALMNSFEKG 178
            +L  + K     + +  L  +F++G
Sbjct: 377 VELVKDRKEPDPKLRDKVLYEAFKRG 402


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 835

 Score = 28.9 bits (65), Expect = 2.9
 Identities = 23/106 (21%), Positives = 37/106 (34%), Gaps = 3/106 (2%)

Query: 69  MDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAG-DTDDKS 127
           + L    ++QELL  R +   L A    LE    +L+  +NS   EE    A        
Sbjct: 1   LALAGNNVLQELLQSRRE--LLTAESAQLEAALQLLQEAVNSKRQEEAEPAAEEAELQAE 58

Query: 128 VENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMN 173
           +  +      ++SQ L + +           QL N L     +   
Sbjct: 59  LIQQELAINDQLSQALNQQTERLNALASDDRQLANLLLQLLQSSRT 104


>gnl|CDD|218417 pfam05075, DUF684, Protein of unknown function (DUF684).  This
           family contains several uncharacterized proteins from
           Caenorhabditis elegans. The GO annotation suggests that
           the protein is involved in nematode larval development
           and has a positive regulation on growth rate.
          Length = 346

 Score = 28.6 bits (64), Expect = 2.9
 Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 5/69 (7%)

Query: 116 LTQHAGDTDDKSVENKLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNA--LMN 173
           L ++  DT  K  + K +E  +E+ Q    +    ++  +   ++TN LKMA  A  L  
Sbjct: 45  LMKYMQDTV-KHPDKKTKEIFREVYQKHSPLELAEKMIILLEQEVTNPLKMAMKADPLRT 103

Query: 174 S--FEKGKK 180
              FEK +K
Sbjct: 104 KETFEKWRK 112


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 28.8 bits (64), Expect = 3.1
 Identities = 20/91 (21%), Positives = 40/91 (43%)

Query: 87  KAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEMSQTLERV 146
           +AE + RI  LE +   L  K              + + + ++ +L++RLK+M+      
Sbjct: 215 RAENQQRILELEQDIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDE 274

Query: 147 SRTYELRQMQSTQLTNELKMANNALMNSFEK 177
                  Q+++ QL  +L+     L +S +K
Sbjct: 275 ETERIDLQLENEQLHEDLRTLQERLESSQQK 305


>gnl|CDD|178106 PLN02488, PLN02488, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 509

 Score = 28.4 bits (63), Expect = 3.5
 Identities = 26/86 (30%), Positives = 37/86 (43%), Gaps = 13/86 (15%)

Query: 55  EAPKVPKSSSEARKMDLEMAVLMQE-------LLSMREDKAELRARIYNLEHEKDVLEMK 107
           E   V  S SE R  DL + VL          ++ + ED   L       E E D+L +K
Sbjct: 20  ELKTVSSSLSEMRCRDLLIIVLKNSVWRIDMAMIGVMEDTKLLE------EMENDMLGVK 73

Query: 108 LNSLACEELTQHAGDTDDKSVENKLR 133
            ++   EE+ + A D   +SVE  L 
Sbjct: 74  EDTNLFEEMMESAKDRMIRSVEELLG 99


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 28.4 bits (64), Expect = 3.5
 Identities = 27/109 (24%), Positives = 43/109 (39%), Gaps = 8/109 (7%)

Query: 81  LSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEMS 140
             +++  A LRA +   E E+  L+  L  LA        G   + + E      L    
Sbjct: 77  QDLQDSVANLRASLSAAEAERSRLQALLAELA-GAGAAAEGRAGELAQE------LDSEK 129

Query: 141 QTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRL 189
           Q   R     EL   Q   L  +L     AL ++ EK  ++SQ+++  L
Sbjct: 130 QVSARALAQVELLNQQIAALRRQLAALEAAL-DASEKRDRESQAKIADL 177


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 28.1 bits (63), Expect = 3.7
 Identities = 24/122 (19%), Positives = 41/122 (33%), Gaps = 15/122 (12%)

Query: 75  VLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACE--ELTQHAGDTDDKSVE--N 130
            L + L  M+ D+  L   +  +   K  L  KL +L  E   L Q A + +        
Sbjct: 151 GLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELE 210

Query: 131 KLRERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLE 190
           K R+ L+ +S  +               +   EL+     L  + E    K    L+ + 
Sbjct: 211 KARQELRSLSVKISE-----------KRKQLEELQQELQELTIAIEALTNKKSELLEEIA 259

Query: 191 NE 192
             
Sbjct: 260 EA 261


>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation,
           ribosomal structure and biogenesis].
          Length = 363

 Score = 27.9 bits (63), Expect = 4.4
 Identities = 23/103 (22%), Positives = 46/103 (44%), Gaps = 21/103 (20%)

Query: 107 KLNSLAC--EELTQHAGDTDDKSVENKLRERLKEMSQTLERVSRTY-----------ELR 153
           KL SL    EEL     D +  S  ++ R+  KE ++ LE +   Y           + +
Sbjct: 8   KLESLLERYEELEALLSDPEVISDPDEYRKLSKEYAE-LEPIVEKYREYKKAQEDLEDAK 66

Query: 154 QMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENEIVIM 196
           +M + +   E++          E+  K+ +++++ LE E+ I+
Sbjct: 67  EMLAEEKDPEMRE-------MAEEEIKELEAKIEELEEELKIL 102


>gnl|CDD|151799 pfam11358, DUF3158, Protein of unknown function (DUF3158).  Some
           members in this family of proteins are annotated as
           integrase regulator R however this cannot be confirmed.
           This family of proteins with unknown function appear to
           be restricted to Proteobacteria.
          Length = 160

 Score = 27.3 bits (61), Expect = 4.6
 Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 6/80 (7%)

Query: 67  RKMD-LEMAVLM-QELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTD 124
           R  D   M V + QEL++      ELR  +Y +E ++ VL M+++ L    L + A +  
Sbjct: 83  RSHDRSRMGVAVWQELVADPATPVELRDDLYAMEKQRIVLNMQISLL--HSLGRQAAECA 140

Query: 125 DK--SVENKLRERLKEMSQT 142
            K    E     RL+  S +
Sbjct: 141 SKLAQAEAVYLRRLQSPSNS 160


>gnl|CDD|189038 cd09868, PIN_XPG, PIN domain of Xeroderma pigmentosum
           complementation group G (XPG) nuclease, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  The Xeroderma pigmentosum
           complementation group G (XPG) nuclease plays a central
           role in nucleotide excision repair (NER) in cleaving DNA
           bubble structures or loops. XPG is a member of the
           structure-specific, 5' nuclease family that catalyzes
           hydrolysis of DNA duplex-containing nucleic acid
           structures during DNA replication, repair, and
           recombination. These nucleases contain a PIN (PilT N
           terminus) domain with a helical arch/clamp region (I
           domain). In XPG PIN domains, this arch region can be
           quite variable and extensive (400 - 800 residues) in
           length and is required for NER activity and for
           efficient processing of bubble substrates. Inserted
           within the PIN domain of these 5' nucleases is a H3TH
           (helix-3-turn-helix) domain, an atypical
           helix-hairpin-helix-2-like region. Both the H3TH domain
           (not included here) and the helical arch/clamp region
           are involved in DNA binding. Nucleases within this group
           also have a carboxylate-rich active site that is
           involved in binding essential divalent metal ion
           cofactors (Mg2+/Mn2+).
          Length = 249

 Score = 27.6 bits (62), Expect = 4.8
 Identities = 14/48 (29%), Positives = 21/48 (43%), Gaps = 2/48 (4%)

Query: 148 RTYELRQMQSTQLTNEL-KMANNALMNSFEKGKKKSQSR-LKRLENEI 193
           RT   R+ Q  +      K A   L +  +K   KSQ +  +R  +EI
Sbjct: 86  RTLAKRRQQREKAEINAKKTAEKLLASQLKKSAVKSQQQQQERDADEI 133


>gnl|CDD|220879 pfam10805, DUF2730, Protein of unknown function (DUF2730).  This
           family of proteins with unknown function appears to be
           restricted to Gammaproteobacteria.
          Length = 106

 Score = 26.6 bits (59), Expect = 4.8
 Identities = 22/70 (31%), Positives = 32/70 (45%), Gaps = 7/70 (10%)

Query: 84  REDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVE-NKLRERLKEMSQT 142
           RED  +L   I + +     +E KL SL        A D     +E  +LR  LKE+S  
Sbjct: 34  REDVEKLIDIIASNDKRLTEMETKLESLPT------AKDVHALKIEITELRGELKELSAQ 87

Query: 143 LERVSRTYEL 152
           ++ +S   EL
Sbjct: 88  IQSISHQLEL 97


>gnl|CDD|204102 pfam08942, DUF1919, Domain of unknown function (DUF1919).  This
           domain has no known function. It is found in various
           hypothetical and putative bacterial proteins.
          Length = 198

 Score = 27.3 bits (61), Expect = 5.1
 Identities = 11/29 (37%), Positives = 16/29 (55%), Gaps = 2/29 (6%)

Query: 173 NSFEKGKKKSQSRLKRL--ENEIVIMSER 199
            SFE+ K K   R KR+  +N  V  ++R
Sbjct: 100 ASFEEAKNKWDERKKRINWDNLFVKFTDR 128


>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
          Length = 438

 Score = 27.9 bits (63), Expect = 5.2
 Identities = 14/82 (17%), Positives = 35/82 (42%), Gaps = 1/82 (1%)

Query: 133 RERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENE 192
           ++RL  + Q L+R      LR  +        ++         E+ +++ +   +RL   
Sbjct: 311 QQRLDRLQQRLQRALERR-LRLAKQRLERLSQRLQQQNPQRRIERAQQRLEQLEQRLRRA 369

Query: 193 IVIMSERHATQVQHLKQRIVSL 214
           +    +R   +++ L Q++ +L
Sbjct: 370 MRRQLKRKRQRLEALAQQLEAL 391


>gnl|CDD|221296 pfam11894, DUF3414, Protein of unknown function (DUF3414).  This
           family of proteins are functionally uncharacterized.
           This protein is found in eukaryotes. Proteins in this
           family are typically between 764 to 2011 amino acids in
           length. This protein has a conserved LLG sequence motif.
          Length = 1612

 Score = 27.7 bits (62), Expect = 6.0
 Identities = 15/82 (18%), Positives = 33/82 (40%), Gaps = 5/82 (6%)

Query: 134 ERLKEMSQTLERVSRTYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENEI 193
           E  K++  +++ +S            L  EL+      +N  +   K  +SR ++LE+  
Sbjct: 5   EPFKDLYNSIQALSD---SDPDLLNDLEVELEKHKPDFLNLLDNPPKNDKSR-QQLESGK 60

Query: 194 VIMSERHATQV-QHLKQRIVSL 214
           + + +     + Q   Q  + L
Sbjct: 61  ITLGDGDEYSLNQEFIQEALIL 82


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 27.3 bits (61), Expect = 7.2
 Identities = 24/123 (19%), Positives = 47/123 (38%), Gaps = 16/123 (13%)

Query: 89  ELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKEMSQTLERVSR 148
           ELR  +  L  E + L+ K+                  + + KL   L    +   +  R
Sbjct: 24  ELRLDLARLLLENEELKQKVEEA----------LEGATNEDGKLAADLL-KLEVARKKER 72

Query: 149 TYELRQMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENEIVIMSERHATQVQHLK 208
             ++R  + +QL  E++          E+ K+    R   L +    + +R A+Q++ L+
Sbjct: 73  LNQIRA-RISQLKEEIE----QKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQ 127

Query: 209 QRI 211
             I
Sbjct: 128 DEI 130



 Score = 26.9 bits (60), Expect = 9.8
 Identities = 15/89 (16%), Positives = 34/89 (38%), Gaps = 6/89 (6%)

Query: 79  ELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSVENKLRERLKE 138
           E+   +E   ++RARI  L+ E +    ++  L      + A       + +   +  K 
Sbjct: 65  EVARKKERLNQIRARISQLKEEIEQKRERIEEL------KRALAQRRSDLSSASYQLEKR 118

Query: 139 MSQTLERVSRTYELRQMQSTQLTNELKMA 167
            +  LE++    +  + +   L + L   
Sbjct: 119 RASQLEKLQDEIKRTRSKLNALHSLLAEK 147


>gnl|CDD|226228 COG3705, HisZ, ATP phosphoribosyltransferase involved in histidine
           biosynthesis [Amino acid transport and metabolism].
          Length = 390

 Score = 27.3 bits (61), Expect = 7.9
 Identities = 11/55 (20%), Positives = 16/55 (29%), Gaps = 4/55 (7%)

Query: 51  VAISEAPKVPKSSSEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLE 105
           + +         S E R    E+      LL  RE     R  +  L  +   L 
Sbjct: 190 LGLELLVLAAPLSPELRGRLSELL----ALLGGREVLERARGLLDELMAQGIALN 240


>gnl|CDD|215966 pfam00517, GP41, Retroviral envelope protein.  This family includes
           envelope protein from a variety of retroviruses. It
           includes the GP41 subunit of the envelope protein
           complex from human and simian immunodeficiency viruses
           (HIV and SIV) which mediate membrane fusion during viral
           entry. The family also includes bovine immunodeficiency
           virus, feline immunodeficiency virus and Equine
           infectious anaemia (EIAV). The family also includes the
           Gp36 protein from mouse mammary tumour virus (MMTV) and
           human endogenous retroviruses (HERVs).
          Length = 204

 Score = 27.0 bits (60), Expect = 7.9
 Identities = 15/57 (26%), Positives = 29/57 (50%), Gaps = 4/57 (7%)

Query: 154 QMQSTQLTNELKMANNALMNSFEKGKKKSQSRLKRLENEIVIMSERHATQVQHLKQR 210
            +QS Q+   +      L+N+ E+  K  Q R+  L+N ++++  R    VQ L++ 
Sbjct: 9   SVQSAQVLAGIVQNQTNLLNAQEQIDKLLQLRIWGLKNVVLVLGAR----VQALEKY 61


>gnl|CDD|184150 PRK13569, PRK13569, anthranilate synthase component I; Provisional.
          Length = 506

 Score = 27.3 bits (61), Expect = 8.3
 Identities = 24/103 (23%), Positives = 39/103 (37%), Gaps = 3/103 (2%)

Query: 52  AISEAPKVPKSSSEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSL 111
           AIS   KVPK  S   +M        + L++   +  EL    Y   + ++  E K+   
Sbjct: 127 AISLIEKVPKHHSRDTEMPTCHFFFCETLIAYDHETKELHFIHYVRLNGQETEEEKIEKY 186

Query: 112 --ACEELTQHAGDTDDKSVENKL-RERLKEMSQTLERVSRTYE 151
             A  E+         +  E +L      E + + E V+  YE
Sbjct: 187 KEAQAEIETLIEKLARRKAEKELLLPADSERTVSFEGVTSNYE 229


>gnl|CDD|239947 cd04506, SGNH_hydrolase_YpmR_like, Members of the SGNH-hydrolase
           superfamily, a diverse family of lipases and esterases.
           The tertiary fold of the enzyme is substantially
           different from that of the alpha/beta hydrolase family
           and unique among all known hydrolases; its active site
           closely resembles the Ser-His-Asp(Glu) triad from other
           serine hydrolases, but may lack the carboxlic acid. This
           subfamily contains sequences similar to Bacillus YpmR.
          Length = 204

 Score = 26.8 bits (60), Expect = 8.5
 Identities = 25/107 (23%), Positives = 45/107 (42%), Gaps = 29/107 (27%)

Query: 116 LTQHAGDTDDKS----VENKLRERLKEMSQTLE----RVSRTYE-LRQMQSTQLTNELKM 166
           LT+  GD   K       +KL E       T++       R+ + L+++++ ++  ELK 
Sbjct: 9   LTEGVGDETGKGGYVGRLDKLIETKTVKKVTVQNFGVSGDRSDQLLKRLKTKKVQKELKK 68

Query: 167 AN--------NALMN------------SFEKGKKKSQSRLKRLENEI 193
           A+        N LM              F+K ++  Q+ LK++  EI
Sbjct: 69  ADVITITIGGNDLMQVLEKNFLSLDVEDFKKAEETYQNNLKKIFKEI 115


>gnl|CDD|204632 pfam11365, DUF3166, Protein of unknown function (DUF3166).  This
           eukaryotic family of proteins has no known function.
          Length = 97

 Score = 26.0 bits (57), Expect = 8.6
 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 5/67 (7%)

Query: 76  LMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSL-----ACEELTQHAGDTDDKSVEN 130
           L  +L  ++E+ A +R ++  LE EKD +E +L          +   +HA    D + E 
Sbjct: 6   LRCQLQFVKEEAALMRKKLAKLEEEKDKVEGELQKYKSKYGDLDSGAEHAAGGPDTTREA 65

Query: 131 KLRERLK 137
           +L+ RLK
Sbjct: 66  ELQLRLK 72


>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
           Members of this protein family are homologs of ClpB, an
           ATPase associated with chaperone-related functions.
           These ClpB homologs, designated ClpV1, are a key
           component of the bacterial pathogenicity-associated type
           VI secretion system [Protein fate, Protein and peptide
           secretion and trafficking, Cellular processes,
           Pathogenesis].
          Length = 852

 Score = 27.2 bits (61), Expect = 9.0
 Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 3/73 (4%)

Query: 69  MDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGDTDDKSV 128
            D  +A L  EL ++  + A L AR    + EK+++E  L   A  E    A   DD ++
Sbjct: 448 HDERLAELRAELAALEAELAALEAR---WQQEKELVEAILALRAELEADADAPADDDAAL 504

Query: 129 ENKLRERLKEMSQ 141
             +L E    ++ 
Sbjct: 505 RAQLAELEAALAS 517


>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein.  This family of proteins
            is found in bacteria. Proteins in this family are
            typically between 397 and 1543 amino acids in length.
            This family is the SidE protein in the Dot/Icm pathway of
            Legionella pneumophila bacteria. There is little
            literature describing the family.
          Length = 1443

 Score = 27.2 bits (60), Expect = 9.2
 Identities = 25/86 (29%), Positives = 43/86 (50%), Gaps = 8/86 (9%)

Query: 63   SSEARKMDLEMAVLMQELLSMREDKAELRARIYNLEHEKDVLEMKLNSLACEELTQHAGD 122
            S EAR+   +M  + Q    M E+K  L A    L   +D LE +  + A E L +   +
Sbjct: 1011 SGEARQAKRQMDAIKQ----MVENKVVLDA----LTQCQDALEKQNIAGALEALKKIPSE 1062

Query: 123  TDDKSVENKLRERLKEMSQTLERVSR 148
             +  ++  +LRE+++ + Q LE + R
Sbjct: 1063 KEMGTISRELREQIQSVRQELESLQR 1088


>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP).  This family consists of
           several PV-1 (PLVAP) proteins which seem to be specific
           to mammals. PV-1 is a novel protein component of the
           endothelial fenestral and stomatal diaphragms. The
           function of this family is unknown.
          Length = 442

 Score = 26.9 bits (59), Expect = 9.2
 Identities = 24/90 (26%), Positives = 43/90 (47%), Gaps = 1/90 (1%)

Query: 25  LLNNIDKLDKERASIGNTTVKLESYHVAISEAPKVPKSSSEARKMDLEMAVLMQELLSMR 84
           L  +I+++ +E A +    ++ E    A  EA    +  ++AR+  L+     Q  L++ 
Sbjct: 290 LRADIERVARENADLQRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQAECARQTQLAL- 348

Query: 85  EDKAELRARIYNLEHEKDVLEMKLNSLACE 114
           E+KA LRA   NL  E +  + +   L  E
Sbjct: 349 EEKAALRAERDNLAKELEAKKREAEQLRME 378


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.124    0.317 

Gapped
Lambda     K      H
   0.267   0.0704    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,968,738
Number of extensions: 1011854
Number of successful extensions: 1476
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1382
Number of HSP's successfully gapped: 292
Length of query: 229
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 135
Effective length of database: 6,768,326
Effective search space: 913724010
Effective search space used: 913724010
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 57 (25.8 bits)