RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12306
         (1643 letters)



>gnl|CDD|215968 pfam00520, Ion_trans, Ion transport protein.  This family contains
            Sodium, Potassium, Calcium ion channels. This family is 6
            transmembrane helices in which the last two helices flank
            a loop which determines ion selectivity. In some
            sub-families (e.g. Na channels) the domain is repeated
            four times, whereas in others (e.g. K channels) the
            protein forms as a tetramer in the membrane. A bacterial
            structure of the protein is known for the last two
            helices but is not the Pfam family due to it lacking the
            first four helices.
          Length = 194

 Score =  160 bits (408), Expect = 7e-45
 Identities = 67/230 (29%), Positives = 112/230 (48%), Gaps = 42/230 (18%)

Query: 942  MNMFFTTVFALEFVFKLAAFRFK-NYFSDAWNVFDFVIVMGSFIDIFSDSPPHAAAKATI 1000
            ++  FT +F LE + K  A  FK  YF   WN+ DF++V+ S + +              
Sbjct: 2    LDYVFTVIFTLEMLLKFIALGFKRKYFRSPWNILDFLVVLPSLVSLIL------------ 49

Query: 1001 PGSHEQTSGRLSINFFRLFRVMRLIKLLSRGEGIRTLLWTFIKSFQALPYVALLIVMLFF 1060
                        +   RL R++RL++LL R  G+RTLL +  +S ++L  + LL+++L F
Sbjct: 50   ---FLLGEDSGLLRVLRLLRLLRLLRLLRRFPGLRTLLQSLGRSLKSLLNLLLLLLLLLF 106

Query: 1061 IYAVIGMQVFGKIALVFG----GGIHRNNNFQTFPQAVLVLFRSATGEAWQDIMLDCSNR 1116
            I+A+IG+Q+FG             I+ N+NF ++ +A+L LFR+ T E W D+M D    
Sbjct: 107  IFAIIGVQLFGGELDKCCDKNENPINGNSNFDSYGEALLWLFRTLTTEGWGDVMYDT--- 163

Query: 1117 PSEVKCDAASDGNPTNDDNVSCGSDIAYPYFISFYVLCSFLIINLFVAVI 1166
                               +  G+ +   +F+ F +L   L++NL + VI
Sbjct: 164  -------------------LVPGTVLGKIFFVIFIILGGVLLLNLLIGVI 194



 Score =  133 bits (337), Expect = 2e-35
 Identities = 66/269 (24%), Positives = 119/269 (44%), Gaps = 77/269 (28%)

Query: 587 YFDNFFTAVFTIEILLKIISYGFILHDGAFCRSSFNLLDLLVVIVSVVAIYWNSAAI--A 644
             D  FT +FT+E+LLK I+ GF      + RS +N+LD LVV+ S+V++          
Sbjct: 1   ILDYVFTVIFTLEMLLKFIALGFKRK---YFRSPWNILDFLVVLPSLVSLILFLLGEDSG 57

Query: 645 FVKILRVLRVLRPLRAINRAKGLKHVVQCVIVAVKTIGNIVLVTCLLQFMFAVIGVQLFK 704
            +++LR+LR+LR LR + R  GL+ ++Q +  ++K++ N+                    
Sbjct: 58  LLRVLRLLRLLRLLRLLRRFPGLRTLLQSLGRSLKSLLNL-------------------- 97

Query: 705 YVVKCVIVAIKTIGNIMLVTYLLSFMFAVMGVQLFKGKFFQCNDGSKMTKAECHGTYLVF 764
                           +L+  LL F+FA++GVQLF G+  +C D                
Sbjct: 98  ----------------LLLLLLLLFIFAIIGVQLFGGELDKCCD---------------- 125

Query: 765 SNGKIDEPSIEVREWSKHRFHFDDVAKAMLTLFTVSTFEGWPGLLYNSIDSNLENHGPIH 824
                     +         +FD   +A+L LF   T EGW  ++Y+++           
Sbjct: 126 ----------KNENPINGNSNFDSYGEALLWLFRTLTTEGWGDVMYDTLV---------- 165

Query: 825 NYRPIVATYYIIYIIIIAFFMVNIFVGFV 853
               +   +++I+II+    ++N+ +G +
Sbjct: 166 PGTVLGKIFFVIFIILGGVLLLNLLIGVI 194



 Score =  128 bits (324), Expect = 1e-33
 Identities = 57/150 (38%), Positives = 91/150 (60%), Gaps = 6/150 (4%)

Query: 319 VTNIFFVALFTMEMTLKMYSLGF-QGYFVSLFNRFDCFVVIGSIGEMILTRTNIMPPLGV 377
           + +  F  +FT+EM LK  +LGF + YF S +N  D  VV+ S+  +IL        L +
Sbjct: 1   ILDYVFTVIFTLEMLLKFIALGFKRKYFRSPWNILDFLVVLPSLVSLILFLLGEDSGL-L 59

Query: 378 SVLRCVRLLRVFKVTKYWRSLSNLVASLLNSIQSIASLLLLLFLFIVIFALLGMQVFGGK 437
            VLR +RLLR+ ++ + +  L  L+ SL  S++S+ +LLLLL L + IFA++G+Q+FGG+
Sbjct: 60  RVLRLLRLLRLLRLLRRFPGLRTLLQSLGRSLKSLLNLLLLLLLLLFIFAIIGVQLFGGE 119

Query: 438 FN----FNPTIAKPRSNFDSFWQSLLTVFQ 463
            +     N       SNFDS+ ++LL +F+
Sbjct: 120 LDKCCDKNENPINGNSNFDSYGEALLWLFR 149



 Score = 71.2 bits (175), Expect = 1e-13
 Identities = 28/69 (40%), Positives = 44/69 (63%), Gaps = 1/69 (1%)

Query: 85  WEGPNNGITNFDNFGLAMLTVFQCITLEGWTDVLYNIEDALGRTWQWTYFVSMVILGAFF 144
            E P NG +NFD++G A+L +F+ +T EGW DV+Y+     G      +FV  +ILG   
Sbjct: 127 NENPINGNSNFDSYGEALLWLFRTLTTEGWGDVMYDTLVP-GTVLGKIFFVIFIILGGVL 185

Query: 145 VMNLILGVL 153
           ++NL++GV+
Sbjct: 186 LLNLLIGVI 194



 Score = 51.5 bits (124), Expect = 5e-07
 Identities = 19/45 (42%), Positives = 30/45 (66%), Gaps = 3/45 (6%)

Query: 38 IEYIFLVIFTVECVMKILAYGFVAHPGAYLRNGWNLLDFTIVVIG 82
          ++Y+F VIFT+E ++K +A GF      Y R+ WN+LDF +V+  
Sbjct: 2  LDYVFTVIFTLEMLLKFIALGFKRK---YFRSPWNILDFLVVLPS 43


>gnl|CDD|149732 pfam08763, Ca_chan_IQ, Voltage gated calcium channel IQ domain.
            Voltage gated calcium channels control cellular calcium
            entry in response to changes in membrane potential. The
            isoleucine-glutamine (IQ) motif in the voltage gated
            calcium channel IQ domain interacts with hydrophobic
            pockets of Ca2+/calmodulin. The interaction regulates two
            self-regulatory calcium dependent feedback mechanism,
            calcium dependent inactivation (CDI), and
            calcium-dependent facilitation (CDF).
          Length = 32

 Score = 69.1 bits (170), Expect = 9e-15
 Identities = 26/31 (83%), Positives = 29/31 (93%)

Query: 1303 EEVTVGKFYATFLIQDYFRRFKKRKEQELKD 1333
            +EVTVGKFYATFLIQDYFR+FKKRKEQE + 
Sbjct: 1    DEVTVGKFYATFLIQDYFRQFKKRKEQEQQG 31



 Score = 69.1 bits (170), Expect = 9e-15
 Identities = 26/31 (83%), Positives = 29/31 (93%)

Query: 1483 EEVTVGKFYATFLIQDYFRRFKKRKEQELKD 1513
            +EVTVGKFYATFLIQDYFR+FKKRKEQE + 
Sbjct: 1    DEVTVGKFYATFLIQDYFRQFKKRKEQEQQG 31


>gnl|CDD|198130 smart01062, Ca_chan_IQ, Voltage gated calcium channel IQ domain.
            Voltage gated calcium channels control cellular calcium
            entry in response to changes in membrane potential. The
            isoleucine-glutamine (IQ) motif in the voltage gated
            calcium channel IQ domain interacts with hydrophobic
            pockets of Ca2+/calmodulin. The interaction regulates two
            self-regulatory calcium dependent feedback mechanism,
            calcium dependent inactivation (CDI), and
            calcium-dependent facilitation (CDF).
          Length = 31

 Score = 62.2 bits (152), Expect = 2e-12
 Identities = 22/30 (73%), Positives = 25/30 (83%)

Query: 1303 EEVTVGKFYATFLIQDYFRRFKKRKEQELK 1332
            +E+TVGK YA  LIQDYFR+FK RKEQEL 
Sbjct: 1    DELTVGKIYAAMLIQDYFRQFKSRKEQELL 30



 Score = 62.2 bits (152), Expect = 2e-12
 Identities = 22/30 (73%), Positives = 25/30 (83%)

Query: 1483 EEVTVGKFYATFLIQDYFRRFKKRKEQELK 1512
            +E+TVGK YA  LIQDYFR+FK RKEQEL 
Sbjct: 1    DELTVGKIYAAMLIQDYFRQFKSRKEQELL 30


>gnl|CDD|219699 pfam08016, PKD_channel, Polycystin cation channel.  This family
            contains the cation channel region of PKD1 and PKD2
            proteins.
          Length = 423

 Score = 45.8 bits (109), Expect = 2e-04
 Identities = 53/253 (20%), Positives = 88/253 (34%), Gaps = 70/253 (27%)

Query: 945  FFTTVFALEFVFKLA-AFRF--KNYFSDAWNVFDFVIVMGS----------------FID 985
                VF L FV       R     Y    WN+ D  IV  S                   
Sbjct: 213  IIFLVFVLYFVVAEITKIRREGPRYLRSVWNLLDLAIVALSVVVIVLHIYRDFLATQLWK 272

Query: 986  IFSDSPPHAAAKATIPGSHEQTSGRLSINFF----RLFRVMRLIK---LLSRGEGIRTLL 1038
                SP        +    +     L+   F    +LF+V+R  +   L ++        
Sbjct: 273  QVEASPVDFTDFDRVARLDQLYRIILAFLLFLTWLKLFKVLRFNRTMSLFTK-------- 324

Query: 1039 WTFIKSFQALPYVALLIVMLFFIYAVIGMQVFGKIALVFGGGIHRNNNFQTFPQAVLVLF 1098
             T  ++ + L   A++ V+LF  YA +G        L+FG  +   +NF  F +++L LF
Sbjct: 325  -TLSRALKELLGFAVMFVILFLAYAQLG-------YLLFGSQV---DNFSNFVKSILTLF 373

Query: 1099 RSATGEAWQDIMLDCSNRPSEVKCDAASDGNPTNDDNVSCGSDIAYPYFISFYVLCSFLI 1158
            R+                     C           +  S    +    F++F  L  F++
Sbjct: 374  RTL--------------------CGDFG-----YTEIFSGNRVLGPLLFLTFVFLVIFIL 408

Query: 1159 INLFVAVIMDNFD 1171
            +NLF+A+I D++ 
Sbjct: 409  LNLFLAIINDSYV 421



 Score = 34.2 bits (79), Expect = 0.63
 Identities = 40/201 (19%), Positives = 83/201 (41%), Gaps = 46/201 (22%)

Query: 290 SFQEVTNIFFVALFTMEMTLKMYSLGFQEVTNIFFVALFTMEMTLKMYSLGFQGYFVSLF 349
           S Q V  + FV   ++ + L        E+  + FV  F +    K+   G + Y  S++
Sbjct: 191 SVQSVKLLRFVTGLSLFLLL-------CEIIFLVFVLYFVVAEITKIRREGPR-YLRSVW 242

Query: 350 NRFDCFVVIGSIGEMIL----------------------------TRTNIMPPLGVSVLR 381
           N  D  +V  S+  ++L                             R + +  + ++ L 
Sbjct: 243 NLLDLAIVALSVVVIVLHIYRDFLATQLWKQVEASPVDFTDFDRVARLDQLYRIILAFLL 302

Query: 382 CVRLLRVFKVTKYWRSLSNLVASLLNSIQSIASLLLLLFLFIVIFALLGMQVFGGKFNFN 441
            +  L++FKV ++ R++S    +L  +++ +    ++  +  + +A LG  +FG +    
Sbjct: 303 FLTWLKLFKVLRFNRTMSLFTKTLSRALKELLGFAVMFVILFLAYAQLGYLLFGSQV--- 359

Query: 442 PTIAKPRSNFDSFWQSLLTVF 462
                   NF +F +S+LT+F
Sbjct: 360 -------DNFSNFVKSILTLF 373



 Score = 31.9 bits (73), Expect = 3.1
 Identities = 46/278 (16%), Positives = 97/278 (34%), Gaps = 72/278 (25%)

Query: 584 FLGYFDNF--FTAVFTIEILLKIISYGF-----ILHDGA-FCRSSFNLLDLLVVIVSVVA 635
            L +      F  +  I  L+ ++ +       I  +G  + RS +NLLDL +V +SVV 
Sbjct: 197 LLRFVTGLSLFLLLCEIIFLVFVLYFVVAEITKIRREGPRYLRSVWNLLDLAIVALSVVV 256

Query: 636 IYWNSAAIAFVKILRVLRVLRPLRAINRA-KGLKHVVQCVIVAVKTIGNIVLVTCL--LQ 692
           I      +   +     ++ + + A          V +   +    +  ++ +T L   +
Sbjct: 257 I-----VLHIYRDFLATQLWKQVEASPVDFTDFDRVARLDQLYRIILAFLLFLTWLKLFK 311

Query: 693 FMFAVIGVQLFKYVVKCVIVAIKTIGNIMLVTYLLSFMFAVMGVQLFKGKFFQCNDGSKM 752
            +     + LF   +     A+K +    ++  +L   +A +G                 
Sbjct: 312 VLRFNRTMSLFTKTLS---RALKELLGFAVMFVILFLAYAQLG----------------- 351

Query: 753 TKAECHGTYLVFSNGKIDEPSIEVREWSKHRFHFDDVAKAMLTLFTVSTFEGWPGLLYNS 812
                   YL+F +   +               F +  K++LTLF            Y  
Sbjct: 352 --------YLLFGSQVDN---------------FSNFVKSILTLFRTL----CGDFGYTE 384

Query: 813 IDSNLENHGPIHNYRPIVATYYIIYIIIIAFFMVNIFV 850
           I S     GP+          ++ ++ ++ F ++N+F+
Sbjct: 385 IFSGNRVLGPL---------LFLTFVFLVIFILLNLFL 413


>gnl|CDD|117542 pfam08976, DUF1880, Domain of unknown function (DUF1880).  This
            domain is found predominantly in DJ binding protein. It
            has no known function.
          Length = 122

 Score = 36.7 bits (84), Expect = 0.019
 Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 6/64 (9%)

Query: 1186 DEFIRLWSEYDPDAKGRIKHLDVVTLL---RKISPPLGFGKLCPHRVACKRLVSMNMPLN 1242
            ++F RLW+E   +AKGR+K+ D ++     R  SPP+  G      +A +     + P  
Sbjct: 7    EQFDRLWNELPVNAKGRLKYPDFLSKFSIERAASPPMAAGDSGESAMAQR---GSSAPDF 63

Query: 1243 SDGT 1246
            S+GT
Sbjct: 64   SEGT 67



 Score = 36.3 bits (83), Expect = 0.028
 Identities = 17/48 (35%), Positives = 27/48 (56%), Gaps = 3/48 (6%)

Query: 1436 DEFIRLWSEYDPDAKGRIKHLDVVTLL---RKISPPLGFGKLCPHRVA 1480
            ++F RLW+E   +AKGR+K+ D ++     R  SPP+  G      +A
Sbjct: 7    EQFDRLWNELPVNAKGRLKYPDFLSKFSIERAASPPMAAGDSGESAMA 54


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 36.4 bits (84), Expect = 0.042
 Identities = 19/72 (26%), Positives = 35/72 (48%), Gaps = 5/72 (6%)

Query: 463 QQEQKGQEESPPRSIMENEEEEEEGSIIEGEEIEDESVREEEDGVKVQLENCDSEFY--- 519
           Q++ + +EE     I E E+ E+E  I+E +E E+E   E+   +K   +   ++ +   
Sbjct: 48  QEDDEEEEEEDEEEIEEPEDIEDEEEIVE-DEEEEEEDEEDNVDLKDIEKKNINDIFNST 106

Query: 520 -DYSYQADIMNN 530
            D + Q  I  N
Sbjct: 107 QDDNAQNLISKN 118


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 37.0 bits (86), Expect = 0.062
 Identities = 25/93 (26%), Positives = 38/93 (40%), Gaps = 11/93 (11%)

Query: 174 REKQQIEEDLRG------YLDWITQ-AEDIEPEGEERANHHDQKNKSSSETESAD--RLE 224
           REK++ E +LR       Y +W  Q A+    +   +      ++ SSS + SA   R  
Sbjct: 116 REKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNV 175

Query: 225 GDEGENQ--ESWWRKKNKDFDRNNRRMRRACRK 255
             E   +  + W  KK K   +     RR  RK
Sbjct: 176 SQEEAKKRLQEWELKKLKQQQQKREEERRKQRK 208


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 33.3 bits (77), Expect = 0.10
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 482 EEEEEGSIIEGEEIEDESVREEED 505
           EEEEEG  +E E+ EDE   +++D
Sbjct: 49  EEEEEGEDLESEDEEDEEEDDDDD 72


>gnl|CDD|184535 PRK14144, PRK14144, heat shock protein GrpE; Provisional.
          Length = 199

 Score = 35.5 bits (81), Expect = 0.11
 Identities = 19/97 (19%), Positives = 38/97 (39%), Gaps = 12/97 (12%)

Query: 165 KARGDFHKLREKQQIEEDLRGYLDWITQAEDIEPEG-EERANHHDQKNKSSSETESADRL 223
           + + D+ K +++ + E              +I  E  +E + H +      S T   ++L
Sbjct: 4   QNKKDWKKFKDEHKEEH---------KVENEILEEETDEESQHQEPALGHPSYTALEEQL 54

Query: 224 EGDEGENQESWWR--KKNKDFDRNNRRMRRACRKAVK 258
              E +  E+W +  +   + +   RRM R    A K
Sbjct: 55  TLAEQKAHENWEKSVRALAELENVRRRMEREVANAHK 91


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 36.3 bits (83), Expect = 0.20
 Identities = 19/95 (20%), Positives = 43/95 (45%), Gaps = 1/95 (1%)

Query: 158  SKEREKAKARGDFHKLREKQQIEEDLRGYLDWITQAEDIEPEGEERANHHDQ-KNKSSSE 216
            ++E EK  A     +  E ++ EE  +   +   +AE+++   EE     ++ K ++  +
Sbjct: 1683 AEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEED 1742

Query: 217  TESADRLEGDEGENQESWWRKKNKDFDRNNRRMRR 251
             + A+  + DE E ++    KK ++      R  +
Sbjct: 1743 KKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEK 1777



 Score = 33.2 bits (75), Expect = 1.8
 Identities = 22/73 (30%), Positives = 33/73 (45%), Gaps = 6/73 (8%)

Query: 159  KEREKAKARGDFHKLREKQQIEEDLRGYLDWITQAEDIEPEGEERANHHDQKNKSSSETE 218
               E  KA  D  K  E ++ EED +        AE ++ E EE     + K K + E +
Sbjct: 1663 AAEEAKKAEEDKKKAEEAKKAEEDEK------KAAEALKKEAEEAKKAEELKKKEAEEKK 1716

Query: 219  SADRLEGDEGENQ 231
             A+ L+  E EN+
Sbjct: 1717 KAEELKKAEEENK 1729



 Score = 32.4 bits (73), Expect = 2.8
 Identities = 17/101 (16%), Positives = 44/101 (43%)

Query: 160  EREKAKARGDFHKLREKQQIEEDLRGYLDWITQAEDIEPEGEERANHHDQKNKSSSETES 219
            E  + KA     K  E ++  +  +   +   +A++ + + EE     D+  K+++  + 
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK 1419

Query: 220  ADRLEGDEGENQESWWRKKNKDFDRNNRRMRRACRKAVKSQ 260
            AD  +    E +++   KK  +  +     ++   +A K++
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAE 1460


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 35.7 bits (82), Expect = 0.24
 Identities = 20/92 (21%), Positives = 40/92 (43%), Gaps = 7/92 (7%)

Query: 456 QSLLTVFQQEQKGQEESPPR----SIMENEEEEEEGSIIEGEEIEDESVREEEDGVKVQL 511
           + +L    Q+      +       + +++ E  E+GS     + ED +  EEE+    + 
Sbjct: 68  KPILDDLNQKYVEFLINKEHIRVLAKLQDSESHEDGSDGSDMDSEDSADDEEEE---EED 124

Query: 512 ENCDSEFYDYSYQADIMNNEDARPRRLSEFNT 543
           E+ + E  D   +AD+ N  ++    LS+  T
Sbjct: 125 ESLEDEMIDDEDEADLFNESESSLEDLSDDET 156


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 33.9 bits (78), Expect = 0.39
 Identities = 17/88 (19%), Positives = 36/88 (40%), Gaps = 8/88 (9%)

Query: 469 QEESPPRSIMENEEEEEEGSIIEGEEIEDESVREEEDGVKV-----QLENCDSEFYDYSY 523
           ++E  P    ++E  +EE  +   ++ +DE    EE+ + V     +++      +D  +
Sbjct: 12  EDEELPEEDEDDESSDEEE-VDLPDDEQDEESDSEEEQIFVTRQEEEVDPEAEAEFDREF 70

Query: 524 QADIMNNEDARPRRLSEFNTATKKQPIP 551
           +   M  E    R+       T   P+P
Sbjct: 71  EK--MMAESLESRKFESRKKPTFDVPLP 96


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 34.6 bits (80), Expect = 0.61
 Identities = 24/83 (28%), Positives = 36/83 (43%), Gaps = 6/83 (7%)

Query: 479 ENEEEEEEGSIIEGEEIEDESVREEEDGVKVQLENCDSEFYDYSYQADIMNNEDARPRRL 538
           E+EEEEE+G   E EE +D+ + EEE+ V +  E  D E  D   + D    E+ + +  
Sbjct: 321 EDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK-- 378

Query: 539 SEFNTATKKQPIPPASSFFIFSA 561
                  K      +   F F  
Sbjct: 379 ----KKKKSAESTRSELPFTFPC 397


>gnl|CDD|240364 PTZ00332, PTZ00332, paraflagellar rod protein; Provisional.
          Length = 589

 Score = 33.8 bits (77), Expect = 0.84
 Identities = 24/78 (30%), Positives = 37/78 (47%), Gaps = 10/78 (12%)

Query: 172 KLREKQQIEEDLRGYLDWITQAEDIEPEGEERANHHDQKNKSSSETESADRLEGDEGENQ 231
           K R KQ+ E DL+   D I +A D+E + E    +   K KS       +R   +  + Q
Sbjct: 269 KRRLKQRCETDLKHIHDAIQKA-DLE-DAEAMKRYATNKEKS-------ERFIRENEDRQ 319

Query: 232 ESWWRKKNKDFDRNNRRM 249
           E  W K  +D +R  +R+
Sbjct: 320 EEAWNKI-QDLERQLQRL 336


>gnl|CDD|107035 PHA01511, PHA01511, coat protein.
          Length = 430

 Score = 33.2 bits (75), Expect = 1.1
 Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 8/93 (8%)

Query: 452 DSFWQSLLTVFQQEQKGQEESPPRSIMENEEE------EEEGSIIEGEEIEDESVREEED 505
           D   +++  +    QK ++ +PP + M+          E+E    EG ++ D++    E 
Sbjct: 14  DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLEL 73

Query: 506 GVKVQLENCDSEFYDYSYQADIMNNEDARPRRL 538
            V V +   D++F  +  +AD + +E A   R+
Sbjct: 74  NVAVNMGEPDNDF--FQLRADDLRDETAYRHRI 104


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 33.5 bits (76), Expect = 1.1
 Identities = 20/75 (26%), Positives = 28/75 (37%), Gaps = 5/75 (6%)

Query: 469 QEESPPRSIMENEEEEEEGSIIEGEEIEDESV-REEEDGVKVQLENCDSEFYDYSYQADI 527
           QE+S   +    EEE+EE      E  E E V + E+       E C  E      + + 
Sbjct: 101 QEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAEECQKE----EKEPEP 156

Query: 528 MNNEDARPRRLSEFN 542
              E  +   L E N
Sbjct: 157 EEEEKPKRGSLEENN 171


>gnl|CDD|224459 COG1542, COG1542, Uncharacterized conserved protein [Function
            unknown].
          Length = 593

 Score = 33.3 bits (76), Expect = 1.2
 Identities = 21/71 (29%), Positives = 30/71 (42%), Gaps = 5/71 (7%)

Query: 1315 LIQDYFRRFKKRKEQELKDGDK-ESHNTV-TLQAGLRTLHDAGPELKRAISGNLEELVDD 1372
            LI+D        +E+ +K   K ES   V  L      L +AG  +K AIS    + +  
Sbjct: 444  LIEDIQGHVGGDEEEVIKSLGKLESKGFVEELPNKGVKLTEAGELVKTAISMAKIQELLA 503

Query: 1373 NPEPSHRVSPT 1383
               P   V+PT
Sbjct: 504  LKFP---VTPT 511



 Score = 32.5 bits (74), Expect = 2.4
 Identities = 17/51 (33%), Positives = 23/51 (45%), Gaps = 2/51 (3%)

Query: 1495 LIQDYFRRFKKRKEQELKDGDK-ESHNTV-TLQAGLRTLHDAGPELKRAIS 1543
            LI+D        +E+ +K   K ES   V  L      L +AG  +K AIS
Sbjct: 444  LIEDIQGHVGGDEEEVIKSLGKLESKGFVEELPNKGVKLTEAGELVKTAIS 494


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 33.4 bits (76), Expect = 1.2
 Identities = 25/93 (26%), Positives = 42/93 (45%), Gaps = 10/93 (10%)

Query: 463 QQEQKGQEESPPRSIMENEEEEEEGSIIEGEEIEDESVREEED----GVKVQLENCDSEF 518
           ++E K   E P +  +E +E+ EE    + EE    S + ++     G K  L+  DS+ 
Sbjct: 299 EREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDS 358

Query: 519 YDYSYQADIMNNEDARPRRLSEFNTATKKQPIP 551
            D S  +DI + ED+         TA K++   
Sbjct: 359 GDDSDDSDI-DGEDSVS-----LVTAKKQKEPK 385


>gnl|CDD|224749 COG1836, COG1836, Predicted membrane protein [Function unknown].
          Length = 247

 Score = 32.7 bits (75), Expect = 1.4
 Identities = 15/73 (20%), Positives = 32/73 (43%), Gaps = 9/73 (12%)

Query: 261 GFYWLIIILVFLNTGVLATEHYQQPK-------WLDSFQEVTNIFFVALFTMEMTLKMYS 313
           GF W +++L+F   G LAT++  + K         D  + + N+    L  +   + +Y+
Sbjct: 53  GFKWFLLLLIFFALGSLATKYKYEEKEALGVAEGRDGARSLENVLGNGLAALLFAV-LYA 111

Query: 314 LGFQEVTNIFFVA 326
           + + +    F   
Sbjct: 112 I-YPDPQLWFLGG 123


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 33.0 bits (75), Expect = 1.6
 Identities = 12/41 (29%), Positives = 20/41 (48%), Gaps = 1/41 (2%)

Query: 465 EQKGQEESPPRSIMENEEEEEEGSIIEGEEIEDESVREEED 505
           E +GQE S P+     + E E G   E  + + + + +E D
Sbjct: 232 EGEGQEGSAPQESEATDRESESGE-EEMVQSDQDDLPDESD 271


>gnl|CDD|238535 cd01115, SLC13_permease, Permease SLC13 (solute carrier 13).  The
            sodium/dicarboxylate cotransporter NaDC-1 has been shown
            to translocate Krebs cycle intermediates such as
            succinate, citrate, and alpha-ketoglutarate across plasma
            membranes rabbit, human, and rat kidney. It is related to
            renal and intestinal Na+/sulfate cotransporters and a few
            putative bacterial permeases. The SLC13-type proteins
            belong to the ArsB/NhaD superfamily of permeases that
            translocate sodium and various anions across biological
            membranes in all three kingdoms of life. A typical
            ArsB/NhaD permease is composed of 8-13 transmembrane
            helices.
          Length = 382

 Score = 32.6 bits (75), Expect = 1.7
 Identities = 25/126 (19%), Positives = 52/126 (41%), Gaps = 15/126 (11%)

Query: 976  FVIVMGSFIDIFSDSPPHAAAKATIPGSHEQTSGRLSINFFRLFRVMRLIKLLSRGEGIR 1035
            F   +G  + +   +PP+  A   +     ++ G    +FF    +   + ++    G+ 
Sbjct: 134  FAASIGGMLTLIG-TPPNLVASGYL-----ESLGGQGFSFFEFTPIGLPLLII----GLL 183

Query: 1036 TLLWTFIKSFQALPYVALLIVMLFFIYAVIGMQVF---GKIALVFGGGIHRNNNFQTFPQ 1092
             L + F  +   +  V L    L  I  ++ + V    G I LVFGG +    ++++   
Sbjct: 184  YLWFIFRLAVLIITIVLL--AALAAITGLLPVSVAIAIGAIVLVFGGVLLTKEDYKSIDW 241

Query: 1093 AVLVLF 1098
             ++ LF
Sbjct: 242  GIIFLF 247


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 32.8 bits (75), Expect = 1.8
 Identities = 19/77 (24%), Positives = 33/77 (42%), Gaps = 1/77 (1%)

Query: 158 SKEREKAKARGDFHKLREKQQIEEDLRGYLDWITQ-AEDIEPEGEERANHHDQKNKSSSE 216
           SK R+K +AR D     E  + E++         +  E  + E E   +  D  ++SSS+
Sbjct: 352 SKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSD 411

Query: 217 TESADRLEGDEGENQES 233
             S    + D+    +S
Sbjct: 412 VGSDSESKADKESASDS 428


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 32.5 bits (74), Expect = 1.8
 Identities = 12/44 (27%), Positives = 20/44 (45%), Gaps = 2/44 (4%)

Query: 465 EQKGQEESPPRSIMENEEEEEEGSIIEGEEIEDE--SVREEEDG 506
            +  ++E  P+   ++++ EEE S       ED   S  E E G
Sbjct: 221 SEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESG 264


>gnl|CDD|221534 pfam12331, DUF3636, Protein of unknown function (DUF3636).  This
            domain family is found in eukaryotes, and is
            approximately 160 amino acids in length.
          Length = 155

 Score = 31.5 bits (72), Expect = 1.9
 Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 1/49 (2%)

Query: 1258 RTSLRIKTEGNIDDANAELRAVIKKIWKRTSNELLNQVVPPPGGEEEVT 1306
             TS+   + G I D+ ++ RA    I  R +N LL++   P  GE   T
Sbjct: 56   STSVLPNSFGPIQDSESKQRANENYIIDRVTN-LLSEPPQPDEGEAPYT 103


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 32.3 bits (74), Expect = 1.9
 Identities = 15/27 (55%), Positives = 18/27 (66%)

Query: 479 ENEEEEEEGSIIEGEEIEDESVREEED 505
           E EEEE++ S  E E+ EDE   EEED
Sbjct: 287 EEEEEEDDYSESEDEDEEDEDEEEEED 313


>gnl|CDD|110325 pfam01312, Bac_export_2, FlhB HrpN YscU SpaS Family.  This family
           includes the following members: FlhB, HrpN, YscU, SpaS,
           HrcU SsaU and YopU. All of these proteins export
           peptides using the type III secretion system. The
           peptides exported are quite diverse.
          Length = 343

 Score = 32.2 bits (74), Expect = 2.1
 Identities = 40/191 (20%), Positives = 74/191 (38%), Gaps = 17/191 (8%)

Query: 581 AILFLGYFDNFFTAVFTIEILLKIISYGFILHDGAFCRSSFNLLDLLVVIVSVVAIYWNS 640
           A L  G    F         L+ ++    IL    F  S  +L  L+V+++  V +  + 
Sbjct: 33  ASLLAGSAYLFLLGSVFARRLIALLLLTLILIPLPFSLSVTDLFSLVVLLLLTVLLLLSL 92

Query: 641 AAIAFVKI---------LRVLRVLRP-LRAINRAKGLKHV--VQCVIVAVKTIGNIVLVT 688
                V +         L     ++P    IN  KGLK +   + ++  +K+I  +V V 
Sbjct: 93  FVALLVGLASNLLQVGFLFSTEAIKPDFSKINPIKGLKRIFSARSLVELLKSILKVVAVG 152

Query: 689 CLLQFMF---AVIGVQLFKYVVKCVIVAIKTIGNIMLVTYLLSFMFAVMGVQLFKGKFFQ 745
            +  F         V+L       +IV + T+   +L+  LL     ++G+  +  + + 
Sbjct: 153 LIFYFFLKNDLGTIVRLPGCSPMGIIVIVMTLLKKLLLVVLLF--LLIVGLLDYLFQRYL 210

Query: 746 CNDGSKMTKAE 756
                KM+K E
Sbjct: 211 FIKDLKMSKQE 221


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 33.0 bits (75), Expect = 2.1
 Identities = 13/60 (21%), Positives = 28/60 (46%), Gaps = 8/60 (13%)

Query: 463  QQEQKGQEESPPRSIMENEEEEEEGSIIE-----GEEIEDESVREEEDGVKVQLENCDSE 517
            + ++K  E+    ++ ENEE  E+G   +     GE  ED+++       K+  ++  + 
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHP---KMDAKSTFAS 4113


>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
           [Translation, ribosomal structure and biogenesis].
          Length = 569

 Score = 32.7 bits (74), Expect = 2.1
 Identities = 11/61 (18%), Positives = 24/61 (39%), Gaps = 1/61 (1%)

Query: 474 PRSIMENEEEEEEGSIIEGEEIEDESVREEEDGVKV-QLENCDSEFYDYSYQADIMNNED 532
            + I E ++   +GS +     E+  + E E   K  Q+ + D    +      +  N++
Sbjct: 71  IQGIQELKDPSLDGSTLNSSSGEESELEEAESVFKEKQMLSADVSEIEEQSNDSLSENDE 130

Query: 533 A 533
            
Sbjct: 131 E 131


>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306).  This
           family of proteobacterial species proteins has no known
           function.
          Length = 115

 Score = 30.7 bits (70), Expect = 2.2
 Identities = 12/56 (21%), Positives = 17/56 (30%)

Query: 450 NFDSFWQSLLTVFQQEQKGQEESPPRSIMENEEEEEEGSIIEGEEIEDESVREEED 505
            F S W       + E+  +                     E  E+EDE + EE D
Sbjct: 1   GFLSRWSRRKLAVRAEEPAEPAETAEEEAAAAAPAPAPEEEEEAELEDEELLEELD 56


>gnl|CDD|220687 pfam10316, 7TM_GPCR_Srbc, Serpentine type 7TM GPCR chemoreceptor
           Srbc.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srbc is a solo family
           amongst the superfamilies of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'.
          Length = 272

 Score = 31.7 bits (73), Expect = 2.5
 Identities = 13/39 (33%), Positives = 20/39 (51%), Gaps = 3/39 (7%)

Query: 822 PI--HNYRPIVATYYIIYIIIIAFFMVNIFVGFVIVTFQ 858
           PI  HNYRP +  + II I+ I++ +   +V F      
Sbjct: 112 PIFYHNYRPKIPNF-IILILAISYGLFEQYVLFGFCDNV 149


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 32.1 bits (73), Expect = 2.6
 Identities = 12/42 (28%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 464 QEQKGQEESPPRSIMENEEEEEEGSIIEGEEIEDESVREEED 505
           +E +G +ES        + E EEG  ++  E  ++S  +E D
Sbjct: 252 EESEGSDESEEDEAEATDGEGEEGE-MDAAEASEDSESDESD 292


>gnl|CDD|235569 PRK05702, flhB, flagellar biosynthesis protein FlhB; Reviewed.
          Length = 359

 Score = 32.0 bits (74), Expect = 3.0
 Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 1/48 (2%)

Query: 400 NLVASLLNSIQSIASLLLLLFLFIVIFALLGMQVFGGKFNFNPTIAKP 447
           ++++  L  +      LL + L +V+ ALL   + GG   F+P   KP
Sbjct: 77  DVLSLFLLLLLEALLALLPILLLLVLAALLAPVLQGG-LLFSPKALKP 123


>gnl|CDD|181435 PRK08457, motB, flagellar motor protein MotB; Reviewed.
          Length = 257

 Score = 31.6 bits (72), Expect = 3.3
 Identities = 23/125 (18%), Positives = 46/125 (36%), Gaps = 27/125 (21%)

Query: 415 LLLLLFLFIVIFALLGMQV---------FGGKFNF--NPTIAKPRSNFDSFWQSLLTVFQ 463
           L LLL LFI ++A+  +           F   F++   P + +P            +   
Sbjct: 24  LSLLLALFIALYAISAVNKAKVEALKTEFIKIFDYTAKPEVKQPVKV-----IPPDSGDA 78

Query: 464 QEQKGQEESPPRSIMENEEEEE----EGSIIEGEEIEDESVREEEDGV-----KVQLENC 514
           +E+  + +S   S    +E  E    EG  +  ++I++    +    +       ++ N 
Sbjct: 79  EEEMDELKSLKASQNTQQETIERRKGEGGAL--DQIDNGIALKLPSKLLFENGSAEIINA 136

Query: 515 DSEFY 519
           D   Y
Sbjct: 137 DMMDY 141


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 30.8 bits (70), Expect = 4.1
 Identities = 17/65 (26%), Positives = 29/65 (44%), Gaps = 10/65 (15%)

Query: 131 WTYFVSMVILGAFFVM---NLI-------LGVLSGEFSKEREKAKARGDFHKLREKQQIE 180
               V+ VI G +  +    LI       L  +  EF KE  +A+  GD  KL++ Q+ +
Sbjct: 48  LVILVAAVITGLYITILQKLLIDQEKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQ 107

Query: 181 EDLRG 185
            ++  
Sbjct: 108 MEMMD 112


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 31.1 bits (70), Expect = 4.5
 Identities = 18/48 (37%), Positives = 24/48 (50%), Gaps = 5/48 (10%)

Query: 461 VFQQEQKGQEESPPRSIMENEEEEEEGSII--EGEEIEDESVREEEDG 506
           V  +E++  EES      E EE+EE GS    EG    +E   EE +G
Sbjct: 174 VDNEEEERLEESDG---REEEEDEEVGSDSYGEGNRELNEEEEEEAEG 218


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 31.5 bits (71), Expect = 5.2
 Identities = 17/34 (50%), Positives = 20/34 (58%)

Query: 479 ENEEEEEEGSIIEGEEIEDESVREEEDGVKVQLE 512
           E EEEEEE    E EE E+E   EEE+   + LE
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897


>gnl|CDD|235086 PRK02919, PRK02919, oxaloacetate decarboxylase subunit gamma;
           Provisional.
          Length = 82

 Score = 28.5 bits (64), Expect = 5.7
 Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 418 LLFLFIVIFALLGMQVFGGKFNFNPTIAKP 447
           L FLF++IFA+ GM     +F   P  A  
Sbjct: 22  LAFLFLLIFAIRGMSALINRFFPEPVAAPA 51


>gnl|CDD|131739 TIGR02692, tRNA_CCA_actino, tRNA adenylyltransferase.  The enzyme
            tRNA adenylyltransferase, also called
            tRNA-nucleotidyltransferase and CCA-adding enzyme, can
            add or repair the required CCA triplet at the 3'-end of
            tRNA molecules. Genes encoding tRNA include the CCA tail
            in some but not all bacteria, and this enzyme may be
            required for viability. Members of this family represent
            a distinct clade within the larger family pfam01743 (tRNA
            nucleotidyltransferase/poly(A) polymerase family
            protein). The example from Streptomyces coelicolor was
            shown to act as a CCA-adding enzyme and not as a poly(A)
            polymerase [Protein synthesis, tRNA and rRNA base
            modification].
          Length = 466

 Score = 30.8 bits (70), Expect = 6.3
 Identities = 16/84 (19%), Positives = 33/84 (39%), Gaps = 18/84 (21%)

Query: 1531 LHDAGPELKRA----ISGNLEEL-VDDNPEPSHRVLAEYGLGKFAYPELVKYTSKEMQEA 1585
            + +   +++R     +   L++L + D+P     ++ E GL     PE+          A
Sbjct: 195  MTEMADQIERISAERVRVELDKLLLGDHPRAGIDLMVETGLADRVLPEI---------PA 245

Query: 1586 LNMTQEE----MDRAAHKLIIQER 1605
            L +  +E     D   H L +  +
Sbjct: 246  LRLEIDEHHQHKDVYEHSLTVLRQ 269


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 31.2 bits (71), Expect = 6.4
 Identities = 34/125 (27%), Positives = 50/125 (40%), Gaps = 12/125 (9%)

Query: 379 VLRCVRLLRVFKVTKYWRSLSNLVASLLNSIQSIASLLLLLFLFIVIFALLGM---QVFG 435
           VL  ++     +     RS   LV+ L NS+  I S+LL    FI     LG     +F 
Sbjct: 184 VLVWIQQNNSIRSNVLIRSNKYLVSELRNSMARIFSILL----FITCVYYLGRIPSPIFT 239

Query: 436 GKFNFNPTIAKPRSNFDSFWQSLLTVF-----QQEQKGQEESPPRSIMENEEEEEEGSII 490
            K        +   + +     + T       +QEQ+G  E  P   + +EE E+   I 
Sbjct: 240 KKLKETSETEERVESEEETDVEIETASETKGTKQEQEGSTEEDPSPSLFSEEREDPDKID 299

Query: 491 EGEEI 495
           E EEI
Sbjct: 300 ETEEI 304


>gnl|CDD|227434 COG5103, CDC39, Cell division control protein, negative regulator
           of transcription [Cell division and chromosome
           partitioning / Transcription].
          Length = 2005

 Score = 31.1 bits (70), Expect = 6.5
 Identities = 38/184 (20%), Positives = 62/184 (33%), Gaps = 30/184 (16%)

Query: 791 KAMLTLFTVSTFEGWPGL-LYNSIDSNLENHGPIH--------NYRPIVATYYIIYIIII 841
           + + + F    F  W    LY  ID      G I          ++   A    I + I 
Sbjct: 255 RDVYSCFLDVNFSIWTLEPLYVIIDCWFYISGIITVPYEFFFKRWKNERAQILFIRLHIE 314

Query: 842 A-----FFMVNIFVGFVIVTFQNEGEQEYKNCELDKNQRNCIEF----ALKAKPVRRYIP 892
           +     F   N+F    IV+ ++     +KN    ++  NC+E      L + P+  YI 
Sbjct: 315 SDEKRTFDYSNVF-FTRIVSIEDSRGHRFKNYMGYESNLNCVELFQCIMLGSLPLIEYIA 373

Query: 893 KQRLQYKIWWFVTSQYFEYTIFCLIMINTLTLAMKFYRQPLFYADILDKMNMFFTTVFAL 952
           +  L     W      F    F  +  N           P+  + I +++ M F      
Sbjct: 374 RLFLNKSPEWCAAGLGFVQPKFSPLAWN-----------PVQESRIFEELLMGFANGAPN 422

Query: 953 EFVF 956
            FVF
Sbjct: 423 SFVF 426


>gnl|CDD|188151 TIGR01517, ATPase-IIB_Ca, plasma-membrane calcium-translocating
           P-type ATPase.  This model describes the P-type ATPase
           responsible for translocating calcium ions across the
           plasma membrane of eukaryotes , out of the cell. In some
           organisms, this type of pump may also be found in
           vacuolar membranes. In humans and mice, at least, there
           are multiple isoforms of the PMCA pump with overlapping
           but not redundant functions. Accordingly, there are no
           human diseases linked to PMCA defects, although
           alterations of PMCA function do elicit physiological
           effects. The calcium P-type ATPases have been
           characterized as Type IIB based on a phylogenetic
           analysis which distinguishes this group from the Type
           IIA SERCA calcium pump. A separate analysis divides Type
           IIA into sub-types (SERCA and PMR1) which are modelled
           by TIGR01116 and TIGR01522. This model is well separated
           from those.
          Length = 944

 Score = 30.9 bits (70), Expect = 7.0
 Identities = 28/144 (19%), Positives = 46/144 (31%), Gaps = 30/144 (20%)

Query: 244 RNNRRMRRACRKAVKSQGFYWLIIILVFLNTGVLATEHYQQPKWLDSFQEVTNIFFVALF 303
           RN   + R+  K +  Q  Y L+         V     +      D              
Sbjct: 801 RNAPLISRSMWKNILGQAGYQLV---------VTFILLFAGGSIFDVSGPDEI------- 844

Query: 304 TMEMTLKMYSLGFQEVTNIF-FVALFTMEMTLKMYSLG-FQGYFVSLFNRFDCFVVIGSI 361
           T     ++ ++ F    N F  + LF      K+Y    F+G F    NR    ++  + 
Sbjct: 845 TSHQQGELNTIVF----NTFVLLQLFNEINARKLYERNVFEGLFK---NRIFVTIMGFTF 897

Query: 362 GEMILTRTNIMPPLGVSVLRCVRL 385
           G  ++    I+   G S    V L
Sbjct: 898 GFQVI----IVEFGG-SFFSTVSL 916


>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
           [Transcription].
          Length = 175

 Score = 29.7 bits (67), Expect = 7.7
 Identities = 11/43 (25%), Positives = 21/43 (48%)

Query: 465 EQKGQEESPPRSIMENEEEEEEGSIIEGEEIEDESVREEEDGV 507
           E+  ++E       E EE++E  S+ +  + EDE   E  + +
Sbjct: 111 EEGDEDELDYDEDKEEEEDDEVDSLDDENDDEDEDDDEIVEIL 153


>gnl|CDD|225743 COG3202, COG3202, ATP/ADP translocase [Energy production and
            conversion].
          Length = 509

 Score = 30.7 bits (70), Expect = 8.3
 Identities = 14/55 (25%), Positives = 23/55 (41%), Gaps = 2/55 (3%)

Query: 1141 DIAYPYFISFYVLCSFLIINLFVAVIMDNFDYLTRDWSILGPHHLDEFIRLWSEY 1195
             I   +F+ F+ L +F+I       I+      +RD     P  L  FI +  E+
Sbjct: 96   YIILGFFLGFFALFAFVI--YPYKDILHPDPEFSRDLFADLPMFLKWFILIVGEW 148


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 30.7 bits (69), Expect = 8.8
 Identities = 17/87 (19%), Positives = 34/87 (39%), Gaps = 10/87 (11%)

Query: 455 WQSLLTVFQQEQKGQEESPPRS-IMENEEEEEEGSIIEGEEIED---ESVREEEDGVKVQ 510
           W++L      E+ G EE+  +  I+++     E    +    E    E+   EE   + +
Sbjct: 333 WRNL----NNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEE 388

Query: 511 LENCDSEFYDYSYQADIMNNEDARPRR 537
           +E+ + E        D  ++E     R
Sbjct: 389 VEDYEDENDHSKRICD--DDELENHFR 413


>gnl|CDD|238507 cd01036, ClC_euk, Chloride channel, ClC.  These domains are found
           in the eukaryotic halogen ion (Cl-, Br- and I-) channel
           proteins that perform a variety of functions including
           cell volume regulation, membrane potential
           stabilization, charge compensation necessary for the
           acidification of intracellular organelles, signal
           transduction and transepithelial transport.  They are
           also involved in many pathophysiological processes and
           are responsible for a number of human diseases.  These
           proteins belong to the ClC superfamily of chloride ion
           channels, which share the unique double-barreled
           architecture and voltage-dependent gating mechanism.
           The gating is conferred by the permeating anion itself,
           acting as the gating charge.  Some proteins possess long
           C-terminal cytoplasmic regions containing two CBS
           (cystathionine beta synthase) domains of putative
           regulatory function.
          Length = 416

 Score = 30.4 bits (69), Expect = 9.9
 Identities = 35/150 (23%), Positives = 60/150 (40%), Gaps = 25/150 (16%)

Query: 297 IFFVALFTMEM--TLKMYSLGFQEVTNI--FFVALFTMEMTLKMYSLGFQGYFVSLFNRF 352
           +FF AL +  +      ++ GF+ +      F++L   E+ + +                
Sbjct: 188 VFFAALVSAFVIQIYNSFNSGFELLDRSSAMFLSLTVFELHVPLN-------LYEFIP-- 238

Query: 353 DCFVVIGSIGEM---ILTRTNIMPPLGVSVLRCVRLLRVFKVTKYWRSLSNLVASLLNSI 409
              VVIG I  +   +  R +I+       LR  R L   K  +Y      L   + ++I
Sbjct: 239 --TVVIGVICGLLAALFVRLSII------FLRWRRRLLFRKTARYRVLEPVLFTLIYSTI 290

Query: 410 QSIASLLLLLFLFIVIFAL-LGMQVFGGKF 438
               +LLL L ++  + AL  G+ V GG F
Sbjct: 291 HYAPTLLLFLLIYFWMSALAFGIAVPGGTF 320


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.325    0.140    0.427 

Gapped
Lambda     K      H
   0.267   0.0775    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 86,369,535
Number of extensions: 8977507
Number of successful extensions: 15164
Number of sequences better than 10.0: 1
Number of HSP's gapped: 15016
Number of HSP's successfully gapped: 135
Length of query: 1643
Length of database: 10,937,602
Length adjustment: 110
Effective length of query: 1533
Effective length of database: 6,058,662
Effective search space: 9287928846
Effective search space used: 9287928846
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 66 (29.1 bits)