RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7197
         (488 letters)



>gnl|CDD|217293 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand
           binding domain.  This family is the extracellular ligand
           binding domain of these ion channels. This domain forms
           a pentameric arrangement in the known structure.
          Length = 215

 Score =  232 bits (594), Expect = 5e-74
 Identities = 85/163 (52%), Positives = 110/163 (67%), Gaps = 3/163 (1%)

Query: 10  QSWYDYKLTWDPKEYGGVEMLHVPSDHIWRPDIVLYNNADGNFEVTLA-TKATLNYTGRV 68
           Q W D +L WDP++YGG+  L +PSD IW+PDIVLYN ADG  ++T   T   +   G V
Sbjct: 55  QQWTDERLAWDPEDYGGITSLRLPSDKIWKPDIVLYNKADGIHDITTPNTNVRVYPDGTV 114

Query: 69  EWKPPAIYKSSCEIDVEYFPFDEQTCVMKFGSWTYDGFQVDLRHLDEEKGTNVVDIGVDL 128
            W PPAIYKSSC +D+ YFPFD+Q C +KFGSWTY+G +VDLR  D+     ++D  +DL
Sbjct: 115 LWSPPAIYKSSCPMDLTYFPFDQQNCSLKFGSWTYNGEEVDLRWKDDTPP--ILDEEIDL 172

Query: 129 TAFYTSVEWDILEVPAVRNEKFYTCCDEPYLDITFNITMRRKT 171
           + F  + EWDI++VPA RNE  Y C  E Y D+TF   +RRK 
Sbjct: 173 SDFTENGEWDIVDVPAKRNEYPYGCYSELYPDVTFYFVLRRKP 215


>gnl|CDD|233155 TIGR00860, LIC, Cation transporter family protein.  The
           Ligand-gated Ion Channel (LIC) Family of
           Neurotransmitter Receptors TC 1.A.9)Members of the LIC
           family of ionotropic neurotransmitter receptors are
           found only in vertebrate and invertebrate animals. They
           exhibit receptor specificity for (1)acetylcholine, (2)
           serotonin, (3) glycine, (4) glutamate and (5)
           g-aminobutyric acid (GABA). All of these receptor
           channels are probably hetero- orhomopentameric. The best
           characterized are the nicotinic acetyl-choline receptors
           which are pentameric channels of a2bgd subunit
           composition. All subunits arehomologous. The three
           dimensional structures of the protein complex in both
           the open and closed configurations have been solved at
           0.9 nm resolution.The channel protein complexes of the
           LIC family preferentially transport cations or anions
           depending on the channel (e.g., the acetylcholine
           receptors are cationselective while glycine receptors
           are anion selective) [Transport and binding proteins,
           Cations and iron carrying compounds].
          Length = 459

 Score =  233 bits (596), Expect = 2e-71
 Identities = 104/452 (23%), Positives = 161/452 (35%), Gaps = 87/452 (19%)

Query: 10  QSWYDYKLTWDPKEYGGVEMLHVPSDHIWRPDIVLYNNADGNFEVTLATKATLNYT--GR 67
           Q W D +L W+P+EY GV ++  P D IW PDI  YN  D  F     T   +     G 
Sbjct: 84  QEWTDERLQWNPEEYPGVTLVRTPDDSIWVPDIFFYNEKDARFHGITMTNVLVRIHPNGS 143

Query: 68  VEWKPPAIYKSSCEIDVEYFPFDEQTCVMKFGSWTYDGFQVDLRHLDEEKGTNVVDIGVD 127
           V + P      +C +D+  FPFD Q C +KF SW Y     D++   +E+G   VD  + 
Sbjct: 144 VLYSPRITLTLACPMDLRNFPFDVQNCSLKFESWGYTT--NDIKLEWKEQGAVQVDDSL- 200

Query: 128 LTAFYTSVEWDILEVPAVRNEKFYTCCDEPYLDITFNITMRRKTLFYTVNLIIPCMGISF 187
              F +  E+++L V   R           Y  +TF+  +RR+ L+Y + L IP + I  
Sbjct: 201 ---FISLPEFELLGVYGTRYC-TSETNTGEYPCLTFSFVLRRRPLYYLLQLYIPSILIVI 256

Query: 188 LTILVFYLPSD-SGEKVSLSISILLSLTVFFLLLAEIIPPTSLVVSLSISILLSLTVFFL 246
           L+ + F+LP+D SG +VSL I+ LL++T F   + E +P  S                  
Sbjct: 257 LSWVSFWLPADASGARVSLGITTLLTMTTFSSGVRESLPAVS------------------ 298

Query: 247 LLAEIIPPTSLVVPLLGKFVLFTMILDTFSICVTVVVLNIHFRSPQTHVMSPWVKKVFIH 306
                       V  +  +    M     ++  T  V  +H + P           + + 
Sbjct: 299 -----------YVKAIDVYFAVCMAFVFLALLETAFVNYVHHKDPAQGKR-----HLLLE 342

Query: 307 LLPKLLVMRRPQYDRNKRDMDPSQHQSTCKVHGTPTRPPGDLPLDDIESEPLDPLPPPIP 366
                L  + P  D  +  +D +   S         R                  PPP  
Sbjct: 343 RCAWRLCKQEPGEDYRRPALDHASLSSVEMRAKDGGRGLESPTERQRLL----HSPPPAE 398

Query: 367 PLPPMKKVSHHWHLCPELHKAIAGVRFIADHTKTAVNDTKVKPLEVERSLCHGWDYNSPL 426
               +  +                                   LE  R        +   
Sbjct: 399 GDLDLAGI-----------------------------------LEEVRIAHRFRKRDESE 423

Query: 427 IIPSP----AMVLDRLFLWIFTLAVLIGSAGI 454
            +       A V+D+L    F LA L+ + G 
Sbjct: 424 EVVRDWKFRAKVIDKLSRMAFPLAFLLFNIGY 455


>gnl|CDD|202474 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel
           transmembrane region.  This family includes the four
           transmembrane helices that form the ion channel.
          Length = 228

 Score =  207 bits (528), Expect = 5e-64
 Identities = 98/278 (35%), Positives = 137/278 (49%), Gaps = 52/278 (18%)

Query: 178 LIIPCMGISFLTILVFYLPSDSG-EKVSLSISILLSLTVFFLLLAEIIPPTSLVVSLSIS 236
           LIIPC+ ISFL+ LVF+LP+D+G EKV+L I+ LL++TVF LL+ E +P TS VV     
Sbjct: 1   LIIPCVLISFLSWLVFWLPADAGPEKVTLGITTLLTMTVFLLLIRESLPKTSYVV----- 55

Query: 237 ILLSLTVFFLLLAEIIPPTSLVVPLLGKFVLFTMILDTFSICVTVVVLNIHFRSPQTHVM 296
                                  PL+GK+++FTM + T S+   VVVLN+H RSP TH M
Sbjct: 56  -----------------------PLIGKYLVFTMFVFTASVEYAVVVLNVHHRSPSTHKM 92

Query: 297 SPWVKKVFIHLLPKLLVMRRPQYDRNKRDMDPSQHQSTCKVHGTPTRPPGDLPLDDIESE 356
             WV+K+F+  LP+LL M+RP    +         + +    G+   P            
Sbjct: 93  PEWVRKLFLRKLPRLLFMKRP--PESLSPPAAPNLRRSSSSSGSVLEPASGKES----KG 146

Query: 357 PLDPLPPPIPPLPPMKKVSHHWHLCPELHKAIAGVRFIADHTKTAVNDTKVKPLEVERSL 416
              P      P      +   + L PEL KA+ GVRFIA+H ++     +VK        
Sbjct: 147 VGGPETSSGLPGGSGSGLGSPFRLSPELKKALEGVRFIAEHLRSRDEFDEVKED------ 200

Query: 417 CHGWDYNSPLIIPSPAMVLDRLFLWIFTLAVLIGSAGI 454
              W Y         AMV+DRL  WIF +A ++G+   
Sbjct: 201 ---WKYV--------AMVIDRLSRWIFPIAFVLGTLVY 227


>gnl|CDD|226965 COG4606, CeuB, ABC-type enterochelin transport system, permease
           component [Inorganic ion transport and metabolism].
          Length = 321

 Score = 35.7 bits (83), Expect = 0.048
 Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 11/74 (14%)

Query: 193 FYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPTSLVVSLSISILLSL--TVFFLLLAE 250
           F  PS +G   S  + ILLSL +F        P  SL++ + ++ + +L  T+ F+++  
Sbjct: 78  FVEPSTAGTTESAKLGILLSLLLF--------PSASLLIKMLVAFVFALAGTLLFMMILR 129

Query: 251 IIPPTS-LVVPLLG 263
            I     L VPL+G
Sbjct: 130 RIKLKDVLFVPLIG 143


>gnl|CDD|226999 COG4652, COG4652, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 657

 Score = 35.4 bits (82), Expect = 0.074
 Identities = 33/118 (27%), Positives = 49/118 (41%), Gaps = 15/118 (12%)

Query: 170 KTLFYTV--NLIIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPT 227
            TL + +  N I    G S LTI+   L    G  +   + +L    +FF     ++   
Sbjct: 193 FTLLWIIFRNAIYDIKGWSLLTIISRLL----GFSILWILGLLFLSLLFFRRFG-VLF-- 245

Query: 228 SLVVSLSISILLSLTVFFLLLAEIIPPTSLVVPLLGK-FVLFTMILDTFSICVTVVVL 284
           SL+  L ISILLSL    LL    +        + GK     +M L T    V V+++
Sbjct: 246 SLLFFLMISILLSLISLLLLSRLSLANL-----IKGKLPFKRSMFLLTVCKLVAVLIV 298


>gnl|CDD|214444 MTH00165, ND5, NADH dehydrogenase subunit 5; Provisional.
          Length = 573

 Score = 33.6 bits (78), Expect = 0.22
 Identities = 22/79 (27%), Positives = 39/79 (49%), Gaps = 3/79 (3%)

Query: 171 TLFYTVNLIIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFF-LLLAEIIPPTSL 229
           T+ Y+  LI   M   F  +      SD  + +  S+ +LL +++F   +L+ +I P   
Sbjct: 388 TVSYSFRLIYYSMSGDF-NLKSLNNLSDESKFMLKSMILLLFMSIFGGSMLSWLIFPEPY 446

Query: 230 VVSLSISILLSLTVFFLLL 248
           ++ L    L  LT+F +LL
Sbjct: 447 MICLPF-YLKLLTLFVILL 464


>gnl|CDD|133467 cd06912, GT_MraY_like, This subfamily is composed of
           uncharacterized bacterial glycosyltransferases in the
           MraY-like family. This family contains both eukaryotic
           and prokaryotic UDP-D-N-acetylhexosamine:polyprenol
           phosphate D-N-acetylhexosamine-1-phosphate transferases,
           which catalyze the transfer of a D-N-acetylhexosamine
           1-phosphate to a membrane-bound polyprenol phosphate.
           This is the initiation step of protein N-glycosylation
           in eukaryotes and peptidoglycan biosynthesis in
           bacteria. The three bacterial members MraY, WecA, and
           WbpL/WbcO, utilize undecaprenol phosphate as the
           acceptor substrate, but use different UDP-sugar donor
           substrates. MraY-type transferases are highly specific
           for UDP-N-acetylmuramate-pentapeptide, whereas WecA
           proteins are selective for UDP-N-acetylglucosamine
           (UDP-GlcNAc). The WbcO/WbpL substrate specificity has
           not yet been determined, but the structure of their
           biosynthetic end products implies that
           UDP-N-acetyl-D-fucosamine (UDP-FucNAc) and/or
           UDPN-acetyl-D-quinosamine (UDP-QuiNAc) are used. The
           prokaryotic enzyme-catalyzed reactions lead to the
           formation of polyprenol-linked oligosaccharides involved
           in bacterial cell wall and peptidoglycan assembly.
          Length = 193

 Score = 33.0 bits (76), Expect = 0.23
 Identities = 27/105 (25%), Positives = 41/105 (39%), Gaps = 2/105 (1%)

Query: 184 GISFLTILVFYLPSDSGEKVSLSI-SILLSLTVFFLLLAEIIPPTSLV-VSLSISILLSL 241
           G++    L+  L   S    SL +  +L +L  F   L E I       + L  + L +L
Sbjct: 16  GVAIFLGLLAGLLLLSLLSGSLLLLLLLAALPAFLAGLLEDITKRVSPRIRLLATFLSAL 75

Query: 242 TVFFLLLAEIIPPTSLVVPLLGKFVLFTMILDTFSICVTVVVLNI 286
              +LL A I       + LL  F  F +I   F++       NI
Sbjct: 76  LAVWLLGASITRLDLPGLDLLLSFPPFAIIFTIFAVAGVANAFNI 120


>gnl|CDD|225105 COG2194, COG2194, Predicted membrane-associated, metal-dependent
           hydrolase [General function prediction only].
          Length = 555

 Score = 32.7 bits (75), Expect = 0.44
 Identities = 22/80 (27%), Positives = 33/80 (41%), Gaps = 5/80 (6%)

Query: 193 FYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPTSLVVSLSISILLSLTVFFLLLAEII 252
              P  +   +SL ++    L + F    ++    SL +S S S LL+L   FLLL    
Sbjct: 5   LKRPKLTKLSLSLLLAWYFLLLLNFAFFLQVFLINSLDLSFSFSFLLALVFAFLLLL--- 61

Query: 253 PPTSLVVPLLGKFVLFTMIL 272
               L  P L K +   + L
Sbjct: 62  --LLLSFPRLLKPLAGVLSL 79


>gnl|CDD|217705 pfam03739, YjgP_YjgQ, Predicted permease YjgP/YjgQ family.  Members
           of this family are predicted integral membrane proteins
           of unknown function. They are about 350 amino acids long
           and contain about 6 transmembrane regions. They are
           predicted to be permeases although there is no
           verification of this.
          Length = 354

 Score = 32.6 bits (75), Expect = 0.47
 Identities = 24/109 (22%), Positives = 45/109 (41%), Gaps = 12/109 (11%)

Query: 174 YTVNLIIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPTSLVVSL 233
           Y +   +    +  L +L+ +L  D    +   +S  L L     LL  ++P   L + L
Sbjct: 2   YILREFLKPFLLVLLVLLLLFLLVDLLRLLDDILSKGLPLADILRLLLLLLPSI-LELIL 60

Query: 234 SISILLSLTVFFLLLA---EI-------IPPTSLVVPLLGKFVLFTMIL 272
            +++LL++ +    LA   E+       I    L+ P+L    L   +L
Sbjct: 61  PLALLLAVLLTLGRLARDSELTALRAAGISLKRLLRPVL-ILALLLSLL 108


>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter.  The Monovalent
           Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
           CPA1 family is a large family of proteins derived from
           Gram-positive and Gram-negative bacteria, blue green
           bacteria, yeast, plants and animals. Transporters from
           eukaryotes have been functionally characterized, and all
           of these catalyze Na+:H+ exchange. Their primary
           physiological functions may be in (1) cytoplasmic pH
           regulation, extruding the H+ generated during
           metabolism, and (2) salt tolerance (in plants), due to
           Na+ uptake into vacuoles. This model is specific for the
           fungal members of this family [Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 810

 Score = 32.2 bits (73), Expect = 0.75
 Identities = 26/116 (22%), Positives = 49/116 (42%), Gaps = 17/116 (14%)

Query: 235 ISILLSLTVFFLLLAEIIPPTSLVVPLLGKFVLFTMILDTFSICVTVVVLNIHFRSPQTH 294
           ++ +  +T F ++ + I+  +S+ V +LG++      L+T ++  T          P T 
Sbjct: 410 MACIWPITCFSIVTSVIVHGSSVAVIMLGRY------LNTVTLTAT----------PTTR 453

Query: 295 VMSPWVKKVFIHLLPKLLVMRRP-QYDRNKRDMDPSQHQSTCKVHGTPTRPPGDLP 349
             +   K  ++  LP L    RP    R  ++M  S  Q   +  G P RP   + 
Sbjct: 454 TTNTNTKNSWLQRLPSLDKSGRPFSLQRLDKEMSLSPGQIGVRTSGMPARPALGMR 509


>gnl|CDD|223769 COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT)
           superfamily [Carbohydrate transport and metabolism /
           Amino acid transport and metabolism / General function
           prediction only].
          Length = 292

 Score = 31.8 bits (72), Expect = 0.84
 Identities = 21/113 (18%), Positives = 39/113 (34%), Gaps = 7/113 (6%)

Query: 167 MRRKTLFYTVNLIIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPP 226
           M+R  L   + L++  +      I +          +  +    L   +  L L  + P 
Sbjct: 1   MKRALLLGLLALLLWGLLWGLSFIALKLAVESLDPFLFAAALRFLIAALLLLPLLLLEPR 60

Query: 227 TSL-------VVSLSISILLSLTVFFLLLAEIIPPTSLVVPLLGKFVLFTMIL 272
                     ++ L   + L+L    L LA      S+   ++G   LFT +L
Sbjct: 61  GLRPALRPWLLLLLLALLGLALPFLLLFLALKYTSASVASLIIGLLPLFTALL 113



 Score = 29.4 bits (66), Expect = 3.8
 Identities = 18/107 (16%), Positives = 43/107 (40%), Gaps = 10/107 (9%)

Query: 162 TFNITMRRKTLFY--TVNLIIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLL 219
                + ++          ++  + ++ L +L+F+L       +S +  +LL L VF   
Sbjct: 167 ALYTALVKRLSRLGPVTLALLLQLLLALLLLLLFFLSGFGAPILSRAWLLLLYLGVFSTG 226

Query: 220 LA--------EIIPPTSLVVSLSISILLSLTVFFLLLAEIIPPTSLV 258
           LA         ++  + + +   +  + +  +  LLL E + P  L+
Sbjct: 227 LAYLLWYYALRLLGASLVALLSLLEPVFAALLGVLLLGEPLSPAQLL 273


>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
           motility and secretion / Intracellular trafficking and
           secretion].
          Length = 696

 Score = 31.8 bits (73), Expect = 1.0
 Identities = 18/84 (21%), Positives = 36/84 (42%), Gaps = 3/84 (3%)

Query: 183 MGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPTSLVVSLSISILLSLT 242
           M ++        L    G+ +++ + I+L L +  L L   +    L  ++++S+L+ L 
Sbjct: 1   MNLASKLRFPGNLDGLQGKDLAVPLGIVLILAMLILPLPAFLLDFLLTFNIALSVLILLV 60

Query: 243 VFFL---LLAEIIPPTSLVVPLLG 263
             F+   L     P   L+  LL 
Sbjct: 61  AMFIQRPLDFSSFPTLLLITTLLR 84



 Score = 29.9 bits (68), Expect = 3.6
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 426 LIIPSPAMVLDRLFLWIFTLAVLI 449
           LI+P PA +LD L  +   L+VLI
Sbjct: 34  LILPLPAFLLDFLLTFNIALSVLI 57


>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
          Length = 697

 Score = 31.2 bits (72), Expect = 1.2
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 426 LIIPSPAMVLDRLFLWIFTLAVLI 449
           +I+P P  +LD L  +   L+VLI
Sbjct: 34  MILPLPPFLLDLLLTFNIALSVLI 57


>gnl|CDD|223857 COG0786, GltS, Na+/glutamate symporter [Amino acid transport and
           metabolism].
          Length = 404

 Score = 31.1 bits (71), Expect = 1.3
 Identities = 26/121 (21%), Positives = 54/121 (44%), Gaps = 12/121 (9%)

Query: 170 KTLFYTVNLIIPCMGISF-----LTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEII 224
           + L  T+ +I  C+ +       L  L   LP        L + ++L   +  L    + 
Sbjct: 218 EPLIETLAIIAICLAVGKIINQLLKSLGLALPLFVM---CLFVGVILRNILDLLKKYRVF 274

Query: 225 PPT-SLVVSLSISILLSLTVFFLLLAEIIPPTSLVVPLLGKFVLFTMILDTFSICVTVVV 283
                ++ ++S+S+ L++ +  L L E+     L +PLL    + T+++  F+I VT  +
Sbjct: 275 RRAVDVIGNVSLSLFLAMALMSLKLWELA---DLALPLLVILAVQTIVMALFAIFVTFRL 331

Query: 284 L 284
           +
Sbjct: 332 M 332


>gnl|CDD|226614 COG4129, COG4129, Predicted membrane protein [Function unknown].
          Length = 332

 Score = 30.8 bits (70), Expect = 1.8
 Identities = 18/86 (20%), Positives = 36/86 (41%), Gaps = 6/86 (6%)

Query: 187 FLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPTSLVVSLSISILLSLTVFF- 245
            L +L F L   +     + + I++ L V   L   ++P T  V+ + ++ ++ L + F 
Sbjct: 69  ILAVLFFLLFGQNPIAFGVVLLIIIPLLVLLKLENGVVPITVGVLHILVAAMIPLFLIFN 128

Query: 246 -LLLAEIIPPTSLVVPLLGKFVLFTM 270
             LL  +     + V  L   V+   
Sbjct: 129 RFLLVFV----GVGVAFLVNLVMPPP 150


>gnl|CDD|221739 pfam12730, ABC2_membrane_4, ABC-2 family transporter protein.  This
           family is related to the ABC-2 membrane transporter
           family pfam01061.
          Length = 230

 Score = 30.4 bits (69), Expect = 1.8
 Identities = 17/79 (21%), Positives = 36/79 (45%)

Query: 170 KTLFYTVNLIIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPTSL 229
           K L   +  ++  + +  L+++   L   SG  +SL +  LL L +  LL   +I     
Sbjct: 95  KLLVLLILSLLASLLLLLLSLIAGLLLGGSGFSLSLLLGALLLLLLLSLLALLLILLILF 154

Query: 230 VVSLSISILLSLTVFFLLL 248
           +  L  +  +++ +  LL+
Sbjct: 155 LSLLLRNSAVAIGIGLLLI 173


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 30.9 bits (70), Expect = 1.8
 Identities = 11/82 (13%), Positives = 17/82 (20%), Gaps = 1/82 (1%)

Query: 308 LPKLLVMRRPQYDRNKRDMDPSQHQSTCKVHGTPTRPPGDLPLDDIESEPLDPLPPPIPP 367
           L ++L                +            +     LPL     E       P   
Sbjct: 134 LSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLS-SPEETARAPSSPPAE 192

Query: 368 LPPMKKVSHHWHLCPELHKAIA 389
            PP    +      P     I+
Sbjct: 193 PPPSTPPAAASPRPPRRSSPIS 214



 Score = 29.4 bits (66), Expect = 5.3
 Identities = 14/83 (16%), Positives = 21/83 (25%), Gaps = 3/83 (3%)

Query: 289 RSPQTHVMSPWVKKVFIHLLPKLLVMRRPQYDRNKRDMDPSQHQSTCKVHGTPTRPPGDL 348
           R P      P  +              R +    +R   P + ++     G    P    
Sbjct: 837 RPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRAR---PGAAAPPKAAA 893

Query: 349 PLDDIESEPLDPLPPPIPPLPPM 371
                 +    P P P   L PM
Sbjct: 894 AAPPAGAPAPRPRPAPRVKLGPM 916


>gnl|CDD|222436 pfam13886, DUF4203, Domain of unknown function (DUF4203).  This is
           the N-terminal region of 7tm proteins. The function is
           not known.
          Length = 210

 Score = 30.3 bits (69), Expect = 1.9
 Identities = 19/102 (18%), Positives = 36/102 (35%), Gaps = 14/102 (13%)

Query: 186 SFLTILVFYLPSDSGEKVSLSIS---------ILLSLTVFFLLLAEIIPPTSLVVSLSIS 236
           S +  L+    +       L  S         IL  L   FL+++  +      V L + 
Sbjct: 35  SLIVFLLITKITLLVTNARLGASLVAGVVGGAILGLLWWVFLIVSVGL----FCVGLLLG 90

Query: 237 ILLSLTVFFLLLAEIIPPTSLVVPLLGKFVLFTMILDTFSIC 278
            LL+  + FL L  +    S  V L+   +   ++    ++ 
Sbjct: 91  FLLASALLFLPLGNLYIYHSDWVLLVT-LLCGAILFALLTLK 131



 Score = 28.4 bits (64), Expect = 7.7
 Identities = 17/83 (20%), Positives = 28/83 (33%), Gaps = 15/83 (18%)

Query: 184 GISFLTILVFYLPSDSGEKVSLSISI-------LLSLTVFFLLLAEIIPPTSLVVSLSIS 236
           G     +   +L       VS+ +         LL+  + FL L  +    S  V   + 
Sbjct: 65  GAILGLLWWVFLI------VSVGLFCVGLLLGFLLASALLFLPLGNLYIYHSDWV--LLV 116

Query: 237 ILLSLTVFFLLLAEIIPPTSLVV 259
            LL   + F LL        L++
Sbjct: 117 TLLCGAILFALLTLKWQKYLLIL 139


>gnl|CDD|216595 pfam01595, DUF21, Domain of unknown function DUF21.  This
           transmembrane region has no known function. Many of the
           sequences in this family are annotated as hemolysins,
           however this is due to a similarity to Treponema
           hyodysenteriae hemolysin C that does not contain this
           domain. This domain is found in the N-terminus of the
           proteins adjacent to two intracellular CBS domains
           pfam00571.
          Length = 182

 Score = 29.8 bits (68), Expect = 1.9
 Identities = 17/92 (18%), Positives = 38/92 (41%), Gaps = 9/92 (9%)

Query: 176 VNLIIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPTSLVVSLSI 235
            N+++  +       L+          + + ++ L+S T+  L+  EI+ P +L      
Sbjct: 66  ANILLGALATLAAAELLL-------GSLGVLLATLVS-TLLILVFGEIL-PKTLARRNPE 116

Query: 236 SILLSLTVFFLLLAEIIPPTSLVVPLLGKFVL 267
            I L L     +L +++ P   ++  L   +L
Sbjct: 117 RIALRLAPPLRVLMKLLYPLVWLLSKLSNLLL 148


>gnl|CDD|185718 cd08772, GH43_62_32_68, Glycosyl hydrolase families: GH43, GH62,
           GH32, GH68.  Members of the glycosyl hydrolase families
           32, 43, 62 and 68 (GH32, GH43, GH62, GH68) all possess
           5-bladed beta-propeller domains and comprise clans F and
           J, as classified by the carbohydrate-active enzymes
           database (CAZY).  Clan F consists of families GH43 and
           GH62. GH43 includes beta-xylosidases, beta-xylanases,
           alpha-L-arabinases, and alpha-L-arabinofuranosidases,
           using aryl-glycosides as substrates, while family GH62
           contains alpha-L-arabinofuranosidases (EC 3.2.1.55) that
           specifically cleave either alpha-1,2 or
           alpha-1,3-L-arabinofuranose sidechains from xylans.
           These are inverting enzymes (i.e. they invert the
           stereochemistry of the anomeric carbon atom of the
           substrate) that have an aspartate as the catalytic
           general base, a glutamate as the catalytic general acid
           and another aspartate that is responsible for pKa
           modulation and orienting the catalytic acid. Clan J
           consists of families GH32 and GH68. GH32 comprises
           sucrose-6-phosphate hydrolases, invertases, inulinases,
           levanases, eukaryotic fructosyltransferases, and
           bacterial fructanotransferases while GH68 consists of
           frucosyltransferases (FTFs) that include levansucrase
           (EC 2.4.1.10); beta-fructofuranosidase (EC 3.2.1.26);
           inulosucrase (EC 2.4.1.9), while GH68 consists of
           frucosyltransferases (FTFs) that include levansucrase
           (EC 2.4.1.10); beta-fructofuranosidase (EC 3.2.1.26);
           inulosucrase (EC 2.4.1.9), all of which use sucrose as
           their preferential donor substrate. Members of this clan
           are retaining enzymes (i.e. they retain the
           configuration at anomeric carbon atom of the substrate)
           that catalyze hydrolysis in two steps involving a
           covalent glycosyl enzyme intermediate: an aspartate
           located close to the N-terminus acts as the catalytic
           nucleophile and a glutamate acts as the general
           acid/base; a conserved aspartate residue in the
           Arg-Asp-Pro (RDP) motif stabilizes the transition state.
           Structures of all families in the two clans manifest a
           funnel-shaped active site that comprises two subsites
           with a single route for access by ligands.
          Length = 286

 Score = 30.5 bits (69), Expect = 1.9
 Identities = 19/91 (20%), Positives = 28/91 (30%), Gaps = 18/91 (19%)

Query: 34  SDHIWRPDIVLYNN----------ADGNFE--VTLATKATLNYTGRVEWKPPAIYKSSCE 81
           S  IW P IV   N          +    +  + +AT    N         P +  +   
Sbjct: 57  SGGIWAPSIVYIENGKFYLYYTDVSFTKNQQTIGVATAEDGNGPWTDYIGGPVLPDNPPA 116

Query: 82  IDVE------YFPFDEQTCVMKFGSWTYDGF 106
            DV        F  D+    + FGS  +  F
Sbjct: 117 ADVSNFRDPFVFEDDDGKWYLVFGSGDHHNF 147


>gnl|CDD|237054 PRK12316, PRK12316, peptide synthase; Provisional.
          Length = 5163

 Score = 30.7 bits (69), Expect = 2.1
 Identities = 15/45 (33%), Positives = 19/45 (42%), Gaps = 6/45 (13%)

Query: 335  CKVHGTPTRPPGDLPLDDIESEPLDPLPPP------IPPLPPMKK 373
            C         P D PL  ++   LD LP P      I PL PM++
Sbjct: 4066 CCDAERHGVTPSDFPLAGLDQARLDALPLPLGEIEDIYPLSPMQQ 4110



 Score = 29.2 bits (65), Expect = 7.5
 Identities = 13/35 (37%), Positives = 16/35 (45%), Gaps = 6/35 (17%)

Query: 345  PGDLPLDDIESEPLDPLPPPIP------PLPPMKK 373
            P D PL  +    LD LP P        PL PM++
Sbjct: 1530 PSDFPLAGLSQAQLDALPLPAGEIADIYPLSPMQQ 1564


>gnl|CDD|225126 COG2216, KdpB, High-affinity K+ transport system, ATPase chain B
           [Inorganic ion transport and metabolism].
          Length = 681

 Score = 30.3 bits (69), Expect = 2.4
 Identities = 14/42 (33%), Positives = 19/42 (45%), Gaps = 5/42 (11%)

Query: 186 SFLTILVFYLPSDSGEK-----VSLSISILLSLTVFFLLLAE 222
           S LT  +   P   G        +L+I+I+L  TV F   AE
Sbjct: 42  SILTTFLTIFPDLFGGTGGSRLFNLAITIILWFTVLFANFAE 83



 Score = 29.9 bits (68), Expect = 3.2
 Identities = 11/44 (25%), Positives = 20/44 (45%)

Query: 207 ISILLSLTVFFLLLAEIIPPTSLVVSLSISILLSLTVFFLLLAE 250
           +  +L+  +            S + +L+I+I+L  TV F   AE
Sbjct: 40  VGSILTTFLTIFPDLFGGTGGSRLFNLAITIILWFTVLFANFAE 83


>gnl|CDD|233565 TIGR01770, NDH_I_N, proton-translocating NADH-quinone
           oxidoreductase, chain N.  This model describes the 14th
           (based on E. coli) structural gene, N, of bacterial and
           chloroplast energy-transducing NADH (or NADPH)
           dehydrogenases. This model does not describe any subunit
           of the mitochondrial complex I (for which the subunit
           composition is very different), nor NADH dehydrogenases
           that are not coupled to ion transport. The Enzyme
           Commission designation 1.6.5.3, for NADH dehydrogenase
           (ubiquinone), is applied broadly, perhaps unfortunately,
           even if the quinone is menaquinone (Thermus,
           Mycobacterium) or plastoquinone (chloroplast). For
           chloroplast members, the name NADH-plastoquinone
           oxidoreductase is used for the complex and this protein
           is designated as subunit 2 or B. This model also
           includes a subunit of a related complex in the archaeal
           methanogen, Methanosarcina mazei, in which F420H2
           replaces NADH and 2-hydroxyphenazine replaces the
           quinone [Energy metabolism, Electron transport].
          Length = 468

 Score = 30.3 bits (69), Expect = 2.6
 Identities = 21/81 (25%), Positives = 33/81 (40%), Gaps = 13/81 (16%)

Query: 209 ILLSLTVFFLLLAEIIPPTS-----LVVSLSISILLSLTVFFLLLAEIIPPTSLVVPLLG 263
           I+L L  F LLL  +          L+ +L   +LLSL V   L  +             
Sbjct: 8   IILLLGAFILLLYGLFYSRKKRNPLLLKNLFYIVLLSLLVSLFLTLQYASYLF------- 60

Query: 264 KFVLFTMILDTFSICVTVVVL 284
            F     I+D FS+ + +++L
Sbjct: 61  -FGFSFFIVDNFSLIIKLILL 80


>gnl|CDD|225245 COG2370, HupE, Hydrogenase/urease accessory protein
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 201

 Score = 29.7 bits (67), Expect = 2.6
 Identities = 13/35 (37%), Positives = 15/35 (42%), Gaps = 1/35 (2%)

Query: 235 ISILLSLTVFFLLLA-EIIPPTSLVVPLLGKFVLF 268
             I  S+    LLLA     P   V  L+G F LF
Sbjct: 100 PGIAASVLGLGLLLALAARLPLPAVAALIGIFGLF 134


>gnl|CDD|225892 COG3356, COG3356, Predicted membrane protein [Function unknown].
          Length = 578

 Score = 30.1 bits (68), Expect = 2.7
 Identities = 28/108 (25%), Positives = 43/108 (39%), Gaps = 3/108 (2%)

Query: 169 RKTLFYTVNLIIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPTS 228
           R      +  +   +   FL +LV Y  +     V  S++ + S  + +L         S
Sbjct: 103 RPVDVSLILAVALVLSFRFLIVLVSY-KNSVPSSVLPSMNYVYSPLIVYLKGFVAQY--S 159

Query: 229 LVVSLSISILLSLTVFFLLLAEIIPPTSLVVPLLGKFVLFTMILDTFS 276
           L+V   IS LL L +  LL   ++     VV   G   L   IL+ FS
Sbjct: 160 LIVPFIISSLLILLLASLLYGRVLYSFVKVVNRPGVRSLGLKILEFFS 207


>gnl|CDD|226365 COG3846, TrbL, Type IV secretory pathway, TrbL components
           [Intracellular trafficking and secretion].
          Length = 452

 Score = 30.1 bits (68), Expect = 2.8
 Identities = 18/91 (19%), Positives = 33/91 (36%), Gaps = 4/91 (4%)

Query: 184 GISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPTSLVVSLSISILLSLTV 243
              FL           GEK+     I+   ++F +        +S+   +S++I L+  V
Sbjct: 206 ETLFLAERFLKTVISVGEKLMALALIIGIGSLFLIQQTAGFLNSSVTKIISLAIYLASLV 265

Query: 244 FFLLLAEIIPPTSLVV---PLLGKF-VLFTM 270
              L     P  + +V   P +G   V   +
Sbjct: 266 LLALFNYGPPIAAGLVIGGPQVGAGAVGAGV 296


>gnl|CDD|184448 PRK14010, PRK14010, potassium-transporting ATPase subunit B;
           Provisional.
          Length = 673

 Score = 30.1 bits (67), Expect = 2.9
 Identities = 19/65 (29%), Positives = 37/65 (56%), Gaps = 5/65 (7%)

Query: 184 GISFLTILVFYLPSDSGEKVSLSI---SILLSLTVFFLLLAEIIPP--TSLVVSLSISIL 238
           G SFL  ++  +   + +K    I   ++L++LT+ FL++   + P    L  +LSI++L
Sbjct: 193 GHSFLDKMIGLVEGATRKKTPNEIALFTLLMTLTIIFLVVILTMYPLAKFLNFNLSIAML 252

Query: 239 LSLTV 243
           ++L V
Sbjct: 253 IALAV 257


>gnl|CDD|235564 PRK05691, PRK05691, peptide synthase; Validated.
          Length = 4334

 Score = 30.1 bits (68), Expect = 3.1
 Identities = 15/46 (32%), Positives = 19/46 (41%), Gaps = 6/46 (13%)

Query: 334  TCKVHGTPTRPPGDLPLDDIESEPLDPLPPP------IPPLPPMKK 373
             C   G     P D PL  +    LD LP P      + PL PM++
Sbjct: 3220 HCLADGAGGLTPSDFPLAQLTQAQLDALPVPAAEIEDVYPLTPMQE 3265


>gnl|CDD|238538 cd01118, ArsB_permease, Anion permease ArsB.  These permeases have
           been shown to export arsenate and antimonite in
           eubacteria and archaea.  A typical ArsB permease
           contains 8-13 transmembrane helices and can function
           either independently as a chemiosmotic transporter or as
           a channel-forming subunit of an ATP-driven anion pump
           (ArsAB).  The ArsAB complex is similar in many ways to
           ATP-binding cassette transporters, which have two groups
           of six transmembrane-spanning helical segments and two
           nucleotide-binding domains. The ArsB proteins belong to
           the ArsB/NhaD superfamily of permeases that translocate
           sodium, arsenate, sulfate, and organic anions across
           biological membranes in all three kingdoms of life.
          Length = 416

 Score = 29.9 bits (68), Expect = 3.2
 Identities = 26/101 (25%), Positives = 41/101 (40%), Gaps = 25/101 (24%)

Query: 184 GISFLTIL-VFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPP---TSLVVSLSISI-- 237
           GI F     + +LPS         +SIL +  V +L     +P    T+ V   S++I  
Sbjct: 167 GIGFTEYAALMFLPS--------IVSILATYVVLYLFFRRALPKRLETASVKKPSLAIKD 218

Query: 238 ----LLSLTVFFLLLAEI-------IPPTSLVVPLLGKFVL 267
                L+ T+  LL+          IPP+ +   L   F+L
Sbjct: 219 PLLFGLAWTLLVLLVVSAFGVDILLIPPSFIAGVLALIFLL 259


>gnl|CDD|223939 COG1007, NuoN, NADH:ubiquinone oxidoreductase subunit 2 (chain N)
           [Energy production and conversion].
          Length = 475

 Score = 29.9 bits (68), Expect = 3.5
 Identities = 20/82 (24%), Positives = 33/82 (40%), Gaps = 13/82 (15%)

Query: 205 LSISILLSLTVFFLLLAE--IIPPTSLVVSLSISILLSLTVFFLLLAEIIPPTSLVVPLL 262
           L   I+L++T   LLL    I    SL+  L++ + L   +  LL        S      
Sbjct: 8   LLPEIILAITALVLLLVGLFIRRSRSLLAYLAV-LGLLAALASLLFIAYRG--SAFAGA- 63

Query: 263 GKFVLFTMILDTFSICVTVVVL 284
                   I+D  ++ + +VVL
Sbjct: 64  -------FIVDALALLMKIVVL 78


>gnl|CDD|234052 TIGR02900, spore_V_B, stage V sporulation protein B.  SpoVB is the
           stage V sporulation protein B of the bacterial endopore
           formation program in Bacillus subtilis and various other
           Firmcutes. It is nearly universal among
           endospore-formers. Paralogs with rather high sequence
           similarity to SpoVB exist, including YkvU in B. subtilis
           and a number of proteins in the genus Clostridium.
           Member sequences for the seed alignment were chosen to
           select those proteins, no more than one to a genome,
           closest to B. subtilis SpoVB in a neighbor joining tree
           [Cellular processes, Sporulation and germination].
          Length = 488

 Score = 29.6 bits (67), Expect = 3.9
 Identities = 12/67 (17%), Positives = 30/67 (44%), Gaps = 6/67 (8%)

Query: 159 LDITFNITMRRKTLFYTVNL------IIPCMGISFLTILVFYLPSDSGEKVSLSISILLS 212
           L    N+   +K +  T++L       +  + +     L+ Y+  +    ++L + I+L 
Sbjct: 422 LVTILNLAEIKKNIRITIDLSDFLIFALLGILVYLGLSLLKYIIFNPNILINLLLIIILG 481

Query: 213 LTVFFLL 219
            +++ LL
Sbjct: 482 FSLYILL 488


>gnl|CDD|218624 pfam05529, Bap31, B-cell receptor-associated protein 31-like.
           Bap31 is a polytopic integral protein of the endoplasmic
           reticulum membrane and a substrate of caspase-8. Bap31
           is cleaved within its cytosolic domain, generating
           pro-apoptotic p20 Bap31.
          Length = 149

 Score = 28.7 bits (65), Expect = 4.2
 Identities = 17/62 (27%), Positives = 27/62 (43%), Gaps = 10/62 (16%)

Query: 236 SILLSLTVFFLLLAEIIPPTSLVVPL-------LGKFVLFTMILDTFSIC--VTVVVLNI 286
           S+  +L VF LL AE+     LV+PL       + K V    +   F     + ++ L +
Sbjct: 2   SLYWTL-VFGLLYAEMAVFLLLVLPLPSPIRRKIFKSVSKLQLSQQFQTGFKILLIFLLV 60

Query: 287 HF 288
            F
Sbjct: 61  LF 62


>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
          Length = 1463

 Score = 29.8 bits (66), Expect = 4.5
 Identities = 15/44 (34%), Positives = 22/44 (50%), Gaps = 2/44 (4%)

Query: 332 QSTCKVHGTPTRPPGDLPLDDIESEPLDPLP--PPIPPLPPMKK 373
           + TC + G P   P  +P  D ++  L P P  P  PP PP ++
Sbjct: 816 KKTCGLQGLPDSLPPAVPETDRDNPLLPPCPITPEGPPCPPREE 859


>gnl|CDD|182589 PRK10614, PRK10614, multidrug efflux system subunit MdtC;
           Provisional.
          Length = 1025

 Score = 29.7 bits (67), Expect = 4.7
 Identities = 19/50 (38%), Positives = 28/50 (56%), Gaps = 6/50 (12%)

Query: 232 SLSISILLSLTVFFLLL----AEIIPPTSLVVPLLGKFVLFTMILDTFSI 277
           +L+IS+ L + V FL L    A +IP  ++ V L+G F    M L  FS+
Sbjct: 337 TLAISVALVILVVFLFLRSGRATLIPAVAVPVSLIGTFA--AMYLCGFSL 384


>gnl|CDD|224287 COG1368, MdoB, Phosphoglycerol transferase and related proteins,
           alkaline phosphatase superfamily [Cell envelope
           biogenesis, outer membrane].
          Length = 650

 Score = 29.3 bits (66), Expect = 4.9
 Identities = 24/97 (24%), Positives = 38/97 (39%), Gaps = 3/97 (3%)

Query: 163 FNITMRRKTLFYTVNLIIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAE 222
           F   +         +  +  +G S L++L    P D    V L + ILL +  + L    
Sbjct: 104 FIDFLTIPNALLIEDFNLGKLGFSALSLLY---PEDILFVVDLILLILLLVFYWRLAGLT 160

Query: 223 IIPPTSLVVSLSISILLSLTVFFLLLAEIIPPTSLVV 259
                  V  L +++LL L +  L L EI+ P  L  
Sbjct: 161 SKLIPLFVRLLVLALLLYLFLQLLFLGEILDPALLAA 197


>gnl|CDD|236315 PRK08633, PRK08633, 2-acyl-glycerophospho-ethanolamine
           acyltransferase; Validated.
          Length = 1146

 Score = 29.5 bits (67), Expect = 5.0
 Identities = 24/139 (17%), Positives = 56/139 (40%), Gaps = 39/139 (28%)

Query: 179 IIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLL------------------- 219
           ++P +G   L + +F LP+        + S+   L +FFL                    
Sbjct: 301 LVP-LGALGLALSLFLLPT--------APSLASVLVLFFLFGFSAGLFIVPLNALIQFRA 351

Query: 220 ----LAEIIPPTSLVVSLSISILLSLTVFFLLLAEIIPPTSL-----VVPLLGKFVLFTM 270
               L +++   + + ++ + + L+LT  F  L   + P  L     +V L+G      +
Sbjct: 352 PEKELGKVLAANNFLQNVGMLLFLALTTLFSGLG--LSPAGLFYLIALVTLIGTLYTLLL 409

Query: 271 ILDTFSICVTVVVLNIHFR 289
           + D+    + +++++  +R
Sbjct: 410 LPDSLLRFLLLLLMHTRYR 428


>gnl|CDD|237702 PRK14399, PRK14399, membrane protein; Provisional.
          Length = 258

 Score = 29.1 bits (65), Expect = 5.2
 Identities = 21/94 (22%), Positives = 39/94 (41%)

Query: 165 ITMRRKTLFYTVNLIIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEII 224
           ++M    LF   +  IPC+          Y     G+ VS  + +L  +   +L++  I+
Sbjct: 82  LSMIPNELFSQTSYFIPCIFALIGHCWPIYYKFKGGKAVSCFLGLLFVVNYLYLIIFLIV 141

Query: 225 PPTSLVVSLSISILLSLTVFFLLLAEIIPPTSLV 258
              S+ +S  +S+    +   +LL   IP    V
Sbjct: 142 WFISVAISRKVSVASIFSAATILLIMWIPYLYGV 175


>gnl|CDD|223843 COG0772, FtsW, Bacterial cell division membrane protein [Cell
           division and chromosome partitioning].
          Length = 381

 Score = 29.1 bits (66), Expect = 5.3
 Identities = 25/132 (18%), Positives = 47/132 (35%), Gaps = 33/132 (25%)

Query: 173 FYTVNLIIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPT----- 227
           F    L+   +G+    +++  LP     K S +I +L  + +  L+L      +     
Sbjct: 49  FLIRQLVWLLLGLILALLVILRLPLRFLRKRSFAI-LLYVIILLLLVLVLFFGSSVNGAK 107

Query: 228 ------------------SLVVSLSIS-------ILLSLTVFFLLLAEIIPPTSLVV--P 260
                             +L++ L+         I   L +F  L+  +  P  L++  P
Sbjct: 108 RWIALGGLSIQPSEFAKIALILYLAAYLSRKGDEIKSFLRLFLKLILLVALPALLILLQP 167

Query: 261 LLGKFVLFTMIL 272
            LG  +L   IL
Sbjct: 168 DLGTALLLFAIL 179


>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator.  In
           eukaryotes, this family of proteins induces
           mitochondrial fission.
          Length = 248

 Score = 28.9 bits (65), Expect = 5.6
 Identities = 12/37 (32%), Positives = 16/37 (43%)

Query: 339 GTPTRPPGDLPLDDIESEPLDPLPPPIPPLPPMKKVS 375
             P+      P+     EP+  +PPP PP PP    S
Sbjct: 161 SVPSSSTTSFPISPPTEEPVLEVPPPPPPPPPPPPPS 197



 Score = 28.6 bits (64), Expect = 7.6
 Identities = 10/34 (29%), Positives = 14/34 (41%)

Query: 338 HGTPTRPPGDLPLDDIESEPLDPLPPPIPPLPPM 371
             T + P      + +   P  P PPP PP P +
Sbjct: 165 SSTTSFPISPPTEEPVLEVPPPPPPPPPPPPPSL 198



 Score = 28.2 bits (63), Expect = 8.4
 Identities = 13/46 (28%), Positives = 18/46 (39%), Gaps = 4/46 (8%)

Query: 325 DMDPSQHQSTCKVHGTPTRPPGDLPLDDIESEPLDPLPPPIPPLPP 370
           D       +T      P  PP + P+ ++   P  P PPP P L  
Sbjct: 159 DESVPSSSTT----SFPISPPTEEPVLEVPPPPPPPPPPPPPSLQQ 200


>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
          Length = 540

 Score = 29.1 bits (65), Expect = 5.8
 Identities = 20/62 (32%), Positives = 25/62 (40%), Gaps = 6/62 (9%)

Query: 339 GTPTRPPGDLPLDDIESEPLDPLPPPIPPLPPMKKVSHHWHLCPELHKAIAGVRFIADHT 398
             PT P G  P DD E +       P PPL P    + H H   E    +A VR +    
Sbjct: 164 PVPTPPGGTPPPDDDEGDEAGAPATPAPPLHP--PPAPHPHPIAE----VAHVRGVTVSL 217

Query: 399 KT 400
           +T
Sbjct: 218 RT 219


>gnl|CDD|226512 COG4025, COG4025, Predicted membrane protein [Function unknown].
          Length = 284

 Score = 29.0 bits (65), Expect = 5.9
 Identities = 12/59 (20%), Positives = 25/59 (42%)

Query: 201 EKVSLSISILLSLTVFFLLLAEIIPPTSLVVSLSISILLSLTVFFLLLAEIIPPTSLVV 259
           E   L + +   L + F++L EI+  + L+      IL  +  F L+    +  +  + 
Sbjct: 94  ELPVLRLQLSFLLFLVFVILEEIVLISMLLWEPLALILAFVLEFLLIRYTRVSNSEDIH 152


>gnl|CDD|216110 pfam00771, FHIPEP, FHIPEP family. 
          Length = 658

 Score = 28.9 bits (66), Expect = 6.3
 Identities = 9/24 (37%), Positives = 15/24 (62%)

Query: 426 LIIPSPAMVLDRLFLWIFTLAVLI 449
           +I+P P  +LD L  +   L++LI
Sbjct: 6   MILPLPPFLLDLLLAFNIALSLLI 29


>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA.  This model
           describes flagellar biosynthesis protein FlhA, one of a
           large number of genes associated with the biosynthesis
           of functional bacterial flagella. Homologs of many such
           proteins, including FlhA, function in type III protein
           secretion systems. A separate model describes InvA
           (Salmonella enterica), LcrD (Yersinia enterocolitica),
           HrcV (Xanthomonas), etc., all of which score below the
           noise cutoff for this model [Cellular processes,
           Chemotaxis and motility].
          Length = 678

 Score = 29.2 bits (66), Expect = 6.3
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 426 LIIPSPAMVLDRLFLWIFTLAVLIGSAGIILQAPT 460
           +I+P PA +LD L      L++LI    + +Q P 
Sbjct: 17  MILPLPAFLLDILLALNIALSLLILLVTLFIQKPL 51


>gnl|CDD|222703 pfam14351, DUF4401, Domain of unknown function (DUF4401).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 357 and 735 amino
           acids in length. The family is found in association with
           pfam09925. There is a single completely conserved
           residue K that may be functionally important.
          Length = 321

 Score = 28.7 bits (65), Expect = 6.4
 Identities = 18/91 (19%), Positives = 33/91 (36%), Gaps = 3/91 (3%)

Query: 182 CMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPTSLVVSLSISILLSL 241
            + ++   +L F L     E V   ++ LL L +   LL  ++P   L    + + +L+L
Sbjct: 65  ALSLAGQLLLAFGLFDLLDESV---LAALLLLALILALLYFLMPDRLLRFLSAAAGVLAL 121

Query: 242 TVFFLLLAEIIPPTSLVVPLLGKFVLFTMIL 272
                 L        L+       V   + L
Sbjct: 122 LGLLAELLLSGLAYLLLPLAPLLLVALFVFL 152


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 29.3 bits (65), Expect = 6.8
 Identities = 16/49 (32%), Positives = 21/49 (42%), Gaps = 1/49 (2%)

Query: 328 PSQHQSTCKVHGTPTRPPGDLPLDDIESEPLDPLPPPIPPLPPMKKVSH 376
           P  H    +    P +PP + PL    S P    PPP  P+P +   SH
Sbjct: 315 PHSHTPPSQSALQPQQPPREQPLPPAPSMP-HIKPPPTTPIPQLPNQSH 362


>gnl|CDD|218364 pfam04987, PigN, Phosphatidylinositolglycan class N (PIG-N).
           Phosphatidylinositolglycan class N (PIG-N) is a
           mammalian homologue of the yeast protein MCD4P and is
           expressed in the endoplasmic reticulum. PIG-N is
           essential for glycosylphosphatidylinositol anchor
           synthesis. Glycosylphosphatidylinositol (GPI)-anchored
           proteins are cell surface-localised proteins that serve
           many important cellular functions.
          Length = 441

 Score = 28.8 bits (65), Expect = 6.9
 Identities = 22/106 (20%), Positives = 37/106 (34%), Gaps = 7/106 (6%)

Query: 174 YTVNLIIPCMGISFLTILVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPTSLVVSL 233
            ++  +I            F     +G    L + + L L+VF LL A       L+  L
Sbjct: 136 LSLLFLIFAFWPLLNGRRAF---RRNGLLSLLWLLLCLLLSVFTLLPASKGESLPLI--L 190

Query: 234 SISILLSLTVFFLLLAE--IIPPTSLVVPLLGKFVLFTMILDTFSI 277
              ILL +     LL E      +  +  +    +L   I+   S+
Sbjct: 191 LGGILLVVVGPLYLLFERSGSRHSRRLWGIQILLLLLAAIVTRSSV 236


>gnl|CDD|234014 TIGR02786, addB_alphas, double-strand break repair protein AddB,
           alphaproteobacterial type.  AddAB is a system well
           described in the Firmicutes as a replacement for RecBCD
           in many prokaryotes for the repair of double stranded
           break DNA damage. More recently, a distantly related
           gene pair conserved in many alphaproteobacteria was
           shown also to function in double-stranded break repair
           in Rhizobium etli. This family consists of AddB proteins
           of the alphaproteobacteial type [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1021

 Score = 28.9 bits (65), Expect = 7.1
 Identities = 11/30 (36%), Positives = 13/30 (43%), Gaps = 4/30 (13%)

Query: 343 RPPGDLPLDDIESEPL----DPLPPPIPPL 368
           RP GD+  D+            LPP I PL
Sbjct: 73  RPLGDVDEDEALFAAEGPAILDLPPAISPL 102


>gnl|CDD|113585 pfam04819, DUF716, Family of unknown function (DUF716).  This
           family is equally distributed in both metazoa and
           plants. Annotation associated with a member from
           Nicotiana tabacum suggest that it may be involved in
           response to viral attack in plants. However, no clear
           function has been assigned to this family.
          Length = 137

 Score = 27.7 bits (62), Expect = 8.0
 Identities = 18/58 (31%), Positives = 24/58 (41%), Gaps = 3/58 (5%)

Query: 187 FLTILVFYLPSDSGEKVSLSISILLSLTVF---FLLLAEIIPPTSLVVSLSISILLSL 241
              +L+FY  S     +      LL L VF      L E++ P S  + L  S LL L
Sbjct: 15  AQELLLFYFHSHDHSGLEGHYHSLLLLVVFLCALSTLLEVLLPRSFPLELLRSALLLL 72


>gnl|CDD|224173 COG1253, TlyC, Hemolysins and related proteins containing CBS
           domains [General function prediction only].
          Length = 429

 Score = 28.4 bits (64), Expect = 8.5
 Identities = 9/47 (19%), Positives = 19/47 (40%)

Query: 207 ISILLSLTVFFLLLAEIIPPTSLVVSLSISILLSLTVFFLLLAEIIP 253
           +S+LL       L A + P    +   +      +T   ++  E++P
Sbjct: 70  VSLLLGAVGEPALAALLEPLLEALGLSAALSFAIITFLHVVFGELVP 116


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.324    0.140    0.444 

Gapped
Lambda     K      H
   0.267   0.0647    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 25,717,453
Number of extensions: 2585730
Number of successful extensions: 3562
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3446
Number of HSP's successfully gapped: 245
Length of query: 488
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 387
Effective length of database: 6,457,848
Effective search space: 2499187176
Effective search space used: 2499187176
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (27.4 bits)