RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16876
         (1124 letters)



>gnl|CDD|219911 pfam08572, PRP3, pre-mRNA processing factor 3 (PRP3).  Pre-mRNA
           processing factor 3 (PRP3) is a U4/U6-associated
           splicing factor. The human PRP3 has been implicated in
           autosomal retinitis pigmentosa.
          Length = 222

 Score =  263 bits (673), Expect = 8e-81
 Identities = 110/222 (49%), Positives = 159/222 (71%), Gaps = 11/222 (4%)

Query: 183 ESKFFDPRIS-AKSSVRTKRALKFHEPGKFQQLAEQLRVKAQLAKLQADISKNARKTGIS 241
           E+ + DPR++ AK   R +R L+F E GKF Q AE+LR +A+L + +A IS+ A+  G+ 
Sbjct: 1   ENPYLDPRMAEAKRKKRRRRGLQFIEKGKFSQQAEELRREAKLEEEEARISEEAQNAGLK 60

Query: 242 SATKLALIAPKMEDDQDEMPEVEWWDAVIM-------VEETYEKENNIKTSAITNLVEHP 294
           SAT LA   PK E   D++P++EWWD  I+       +E+  +++ + + S+ITN +EHP
Sbjct: 61  SATDLAEKIPKKEL-PDDVPDIEWWDMPILDDPDYNDLEDESDEDIDEEMSSITNYIEHP 119

Query: 295 IQMKPPSDMAKPVYM--PVFLTDKERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRIS 352
           + ++PP +   P  +  P++LT KE+KKLRRQ R+E  KE+QE+IRLGLEPPP PK+++S
Sbjct: 120 VPIEPPWEAHYPPPIRPPLYLTKKEQKKLRRQRRKEKRKEKQERIRLGLEPPPPPKVKLS 179

Query: 353 NLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLT 394
           NLMRVLG+EAVQDPTK+EA VR Q+A+R++ HE+ N  RKLT
Sbjct: 180 NLMRVLGSEAVQDPTKVEAEVRRQVAERRQAHEDRNEERKLT 221


>gnl|CDD|162054 TIGR00815, sulP, high affinity sulphate transporter 1.  The SulP
           family is a large and ubiquitous family with over 30
           sequenced members derived from bacteria, fungi, plants
           and animals. Many organisms including Bacillus subtilis,
           Synechocystis sp, Saccharomyces cerevisiae, Arabidopsis
           thaliana and Caenorhabditis elegans possess multiple
           SulP family paralogues. Many of these proteins are
           functionally characterized, and all are sulfate uptake
           transporters. Some transport their substrate with high
           affinities, while others transport it with relatively
           low affinities. Most function by SO42- :H+symport, but
           SO42- :HCO3- antiport has been reported for the rat
           protein (spP45380). The bacterial proteins vary in size
           from 434 residues to 566 residues with one exception, a
           Mycobacterium tuberculosis protein with 784 residues.
           The eukaryotic proteins vary in size from 611 residues
           to 893 residues with one exception, a protein designated
           "early nodulin 70 protein" from Glycine max which is
           reported to be of 485 residues. Thus, the eukaryotic
           proteins are almost without exception larger than the
           prokaryotic proteins. These proteins exhibit 10-13
           putative transmembrane a-helical spanners (TMSs)
           depending on the protein. The phylogenetic tree for the
           SulP family reveals five principal branches. Three of
           these are bacterial specific as follows: one bears a
           single protein from M. tuberculosis; a second bears two
           proteins, one from M. tuberculosis, the other from
           Synechocystis sp, and the third bears all remaining
           prokaryotic proteins. The remaining two clusters bear
           only eukaryotic proteins with the animal proteins all
           localized to one branch and the plant and fungal
           proteins localized to the other. The generalized
           transport reactions catalyzed by SulP family proteins
           are: (1) SO42- (out) + nH+ (out) --> SO42- (in) + nH+
           (in). (2) SO42- (out) + nHCO3- (in) SO42- (in) + nHCO3-
           (out) [Transport and binding proteins, Anions].
          Length = 563

 Score =  257 bits (659), Expect = 1e-74
 Identities = 132/417 (31%), Positives = 220/417 (52%), Gaps = 17/417 (4%)

Query: 569 LGSLSVLMSDIMISGFTTGTAILVILSQIKHVFGI---KVKRHIGPFNVIYTLIDVAENI 625
           LG L   +S  +ISGF TG AI + LSQ+K + GI     +       V+ +      N 
Sbjct: 118 LGFLIEFLSHAVISGFMTGAAITIGLSQLKGLLGISIFNTRTDT--LGVVISTWAGLPNT 175

Query: 626 HKTNYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSILDVKHKY 685
           H  N+    V+ +++ + ++Y      + +K +  P    ++V++   L  +I   K K 
Sbjct: 176 HNWNWCTL-VIGLVLLLFLLYTKKLGKRNKKLLFAPAVAPLLVVILATLAVTIGLHK-KQ 233

Query: 686 NLSNVGKIPIGLPS--PQPPPFYLIPKLMLDGLFISIVAFSINISMASILAKKK-YKIDS 742
            +S +G IP GL    P    + L+P L  D + I+IV    +I++A   A+   YKID+
Sbjct: 234 GVSILGHIPSGLSFFPPITLDWELLPTLAPDAIAIAIVGLIESIAIARSFARMTGYKIDA 293

Query: 743 NQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASGVSCVLLVLILLYIGPF 802
           NQEL+A G++NI  SFFSC P   SLSR+ V  + G +TQL+  V+ ++++L+LL + P 
Sbjct: 294 NQELVAQGIANIVGSFFSCYPATGSLSRTAVNAKAGCRTQLSGVVTAIVVLLVLLVLTPL 353

Query: 803 FETLPYCVLTSIVIVAVKGMLNQVKDLKLAFKESYTEACVWCLTFLSVVLLDVDYGLGIG 862
           F  +P   L +I+I AV+G+++  K+L   +K    +  VW +TF  VV   ++ GL +G
Sbjct: 354 FYYIPQAALAAIIISAVRGLID-YKELYKLWKADKMDFVVWLVTFFGVVFTSIEIGLLVG 412

Query: 863 VLCSLIFVVVTGQKVMVYKLGRLIDSNIYVEEDFYESAIDVPGIVILRIIGGMNFINKDK 922
           V  S  F+++   +     LGR+  + +Y     Y +A   PGI++ R+ G + F N + 
Sbjct: 413 VALSAAFLLLRIARPRGAVLGRVPGTEVYRSIKQYPNARPPPGILVYRVDGPLYFANAED 472

Query: 923 VFHKISKL------SLSSEPYPKQIILDMMSLSSVDTSTVKSFLDLYKELMEQGISL 973
           +  ++ K            P  + +ILDM ++  +DTS + +  +L KEL  +GI L
Sbjct: 473 LKDRLLKRIEDETRRELERPPLQVVILDMSAVPHLDTSGIHALEELRKELKARGIQL 529



 Score = 93.5 bits (233), Expect = 1e-19
 Identities = 34/74 (45%), Positives = 47/74 (63%), Gaps = 1/74 (1%)

Query: 1051 PILEWLPNYNWKNDLSHDMVSGFTVAVLHIPQGMAYSSLGGVPPIVGIYMAVFPVIIYMC 1110
            P+L WLP+Y  K     D+++G TV +L IPQ MAY+ L G+ PI G+Y +  P  IY  
Sbjct: 1    PVLRWLPHYRLKK-FKGDLMAGLTVGILLIPQAMAYAILAGLSPIYGLYTSFVPPFIYAL 59

Query: 1111 MGTSRHISMGTFSV 1124
             GTSR I++G  +V
Sbjct: 60   FGTSRDIAIGPVAV 73


>gnl|CDD|216188 pfam00916, Sulfate_transp, Sulfate transporter family.  Mutations
           in human SLC26A2 lead to several human diseases.
          Length = 279

 Score =  212 bits (543), Expect = 4e-62
 Identities = 97/280 (34%), Positives = 156/280 (55%), Gaps = 12/280 (4%)

Query: 569 LGSLSVLMSDIMISGFTTGTAILVILSQIKHVFGIKVKRHIGPFNVIYTLIDVAENIHKT 628
           LG L   +S  +ISGF  G AIL++LSQ+K + G+   RH G  +V+  L D+ +N+   
Sbjct: 6   LGFLVEFLSRAVISGFMAGAAILILLSQLKGLLGLSNTRHSGIVSVLRALFDLVDNLWNW 65

Query: 629 NYVAFGVVVILVSVLI-IYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSILDVKHKYNL 687
             +  G+  ++  ++I +     K      I  P P  ++ ++   ++S I     +Y +
Sbjct: 66  PTLVIGLSFLIFLLIIKLLPKRLKKLF--WIPAPAP--LVAVIVATVISYIFL-ADRYGV 120

Query: 688 SNVGKIPIGLPSPQPPPFYLIPKLMLD----GLFISIVAFSINISMASILAKKK-YKIDS 742
           S +G+IP GLP P  P   L   L+LD     L ++IV    +I  A   AKKK YKIDS
Sbjct: 121 SIIGEIPSGLPPPSLPDVPLDWSLLLDLLPIALALAIVGLLESILTAKSFAKKKGYKIDS 180

Query: 743 NQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASGVSCVLLVLILLYIGPF 802
           N+EL+A G++NI +S F   P   S SRS V ++ G KTQL+  V  V+++L+LL++ P 
Sbjct: 181 NKELVAQGIANIVSSLFGGYPATGSFSRSAVNVKAGAKTQLSGIVMAVVVLLVLLFLTPL 240

Query: 803 FETLPYCVLTSIVIVAVKGMLNQVKDLKLAFKESYTEACV 842
              +P  VL +I+IVA+ G+++   +L   +K S ++  +
Sbjct: 241 LAYIPMAVLAAIIIVALIGLID-WSELIRLWKLSKSDFLI 279


>gnl|CDD|223732 COG0659, SUL1, Sulfate permease and related transporters (MFS
           superfamily) [Inorganic ion transport and metabolism].
          Length = 554

 Score =  220 bits (563), Expect = 1e-61
 Identities = 106/415 (25%), Positives = 192/415 (46%), Gaps = 28/415 (6%)

Query: 569 LGSLSVLMSDIMISGFTTGTAILVILSQIKHVFGIKVKRHIGPFNVIYTLIDVAENIHKT 628
           LG L   +   ++ GFT G AIL+IL+Q+  + G+  K           +  +   +   
Sbjct: 119 LGRLIRFIPRPVLIGFTAGIAILIILTQLPVLLGLASKVS----GFWAKVSALFTVLLTI 174

Query: 629 NYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSILDVKHKYNLS 688
           N     + ++ +++L+             ++  IP+ +I +V G L+  I  +    +  
Sbjct: 175 NLATLLLGLLTLAILLFL---------PRLTPRIPSPLIALVLGTLIVWIFPL----DSL 221

Query: 689 NVGKIPIGLPSPQPP------PFYLIPKLMLDGLFISIVAFSINISMASILAKK-KYKID 741
             G+IP  LPS  P          L+  L+   L ++++    ++  A         K D
Sbjct: 222 RYGEIPGSLPSGLPHFRLPNVSLSLLLALLPYALALALLGLLESLLTAVSFDGMTGTKHD 281

Query: 742 SNQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASGVSCVLLVLILLYIGP 801
           SN+EL+A G++NI +  F  IP   S+SRS + ++ G +T+L+  +   LL+L+LL++ P
Sbjct: 282 SNRELIAQGIANIASGLFGGIPATGSISRSAINIKSGARTRLSGIIHAALLLLLLLFLAP 341

Query: 802 FFETLPYCVLTSIVIVAVKGMLNQVKDLKLAFKESYTEACVWCLTFLSVVLLDVDYGLGI 861
               +P   L +++I+   G+L+      L  K    E  V   T L  V  D+  G+ +
Sbjct: 342 LVSYIPLAALAAVLILVGWGLLDWSLLKPLLRKLPRGELLVLLTTALLTVFFDLVIGVVV 401

Query: 862 GVLCSLIFVVVTGQKVMVYKLGRLIDSNIYVEEDFYESAIDVPGIVILRIIGGMNFINKD 921
           G+L + +  +    +  +  LGR+                  PG+++ R+ G + F N D
Sbjct: 402 GILLACLLFIRRISRPSIVVLGRVPGPAGSDNAL-KPLDEIGPGVLVYRLSGPLFFGNAD 460

Query: 922 KVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKSFLDLYKELMEQGISLHIV 976
           ++   +  L    E  P+++ILD+ S+  +D S  ++  DL KEL  +GI L IV
Sbjct: 461 RLERALLGL---IEERPERVILDLKSVPYIDASAAEALEDLIKELERRGIQLLIV 512



 Score = 73.5 bits (181), Expect = 3e-13
 Identities = 29/73 (39%), Positives = 42/73 (57%), Gaps = 1/73 (1%)

Query: 1048 SLFPILEWLPNYNWKNDLSHDMVSGFTVAVLHIPQGMAYSSLGGVPPIVGIYMAVFPVII 1107
            S  P L+WLP Y   + L  D+++G TVA + +P  MA++   GVPP  G+Y ++   II
Sbjct: 5    SEIPTLKWLPYYFR-SWLRGDLLAGLTVAAVALPLAMAFAIAAGVPPEAGLYASIVAGII 63

Query: 1108 YMCMGTSRHISMG 1120
            Y   G SR +  G
Sbjct: 64   YALFGGSRGLISG 76


>gnl|CDD|205965 pfam13792, Sulfate_tra_GLY, Sulfate transporter N-terminal domain
            with GLY motif.  This domain is found usually at the
            N-terminus of sulfate-transporter proteins. It carries a
            highly conserved GLY sequence motif, but the function of
            the domain is not known.
          Length = 83

 Score =  120 bits (305), Expect = 1e-32
 Identities = 35/75 (46%), Positives = 52/75 (69%), Gaps = 1/75 (1%)

Query: 1050 FPILEWLPNYNWKNDLSHDMVSGFTVAVLHIPQGMAYSSLGGVPPIVGIYMAVFPVIIYM 1109
             PIL+WLP Y+    L  D+++G TVA++ IPQGMAY+ L G+PP+ G+Y +  P +IY 
Sbjct: 1    LPILQWLPKYSRSW-LKGDLIAGLTVALVLIPQGMAYALLAGLPPVYGLYASFVPPLIYA 59

Query: 1110 CMGTSRHISMGTFSV 1124
              G+SRH+S+G  + 
Sbjct: 60   LFGSSRHLSVGPTAA 74


>gnl|CDD|219080 pfam06544, DUF1115, Protein of unknown function (DUF1115).  This
           family represents the C-terminus of hypothetical
           eukaryotic proteins of unknown function.
          Length = 127

 Score =  103 bits (259), Expect = 6e-26
 Identities = 35/88 (39%), Positives = 50/88 (56%), Gaps = 5/88 (5%)

Query: 486 AVYRVDDFSNPSKKFKVETNCNQLFMTGAVVMYKDCNVIVVEGGPKQQSKFKRLMMHRIK 545
            VY +   SNP K+FK+  N  +L +TG  ++     ++ VEGG K   ++KRLM+ RIK
Sbjct: 1   YVYHIKSLSNPKKRFKILENAKELGLTGFCLIGPKPGLVCVEGGEKAIKEYKRLMLKRIK 60

Query: 546 WEED---MIKSNEGKE--TPNKCVLVWE 568
           W E     I+  E +   + NKC LVWE
Sbjct: 61  WTEPKKISIRHREDEPVPSDNKCSLVWE 88


>gnl|CDD|183265 PRK11660, PRK11660, putative transporter; Provisional.
          Length = 568

 Score = 78.1 bits (193), Expect = 1e-14
 Identities = 92/429 (21%), Positives = 190/429 (44%), Gaps = 69/429 (16%)

Query: 583 GFTTGTAILVILSQIKHVFGIKVKRHIGPFNVIYTLIDVAENIHKTNYVAFGVVVILVSV 642
           GFT+G  I++   QIK  FG+++     P + +  +  + + +   N+    + ++ + V
Sbjct: 138 GFTSGIGIVIATLQIKDFFGLQMAH--VPEHYLEKVGALFQALPTINWGDALIGIVTLGV 195

Query: 643 LIIYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSIL-----DV-----KHKYNLSNVGK 692
           LI++    + KI       +P  +  ++AG  +  +L      V     +  Y L++ G 
Sbjct: 196 LILWP---RLKI------RLPGHLPALLAGTAVMGVLNLLGGHVATIGSRFHYVLAD-GS 245

Query: 693 IPIGLPSPQPPPF---YLIP-----KLMLD-GLFISIVAFSINISM---------ASIL- 733
              G+P P  P F   + +P        L   L  +++  + +++M         A +L 
Sbjct: 246 QGNGIP-PLLPQFVLPWNLPGADGQPFTLSWDLIRALLPAAFSMAMLGAIESLLCAVVLD 304

Query: 734 --AKKKYKIDSNQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASGVSCVL 791
                K+   +N EL+  G+ NI A FF  I   ++++RS   ++ G  + +++ +  +L
Sbjct: 305 GMTGTKH--SANSELVGQGLGNIVAPFFGGITATAAIARSAANVRAGATSPISAVIHALL 362

Query: 792 LVLILLYIGPFFETLPYCVLTSIVIVAVKGMLNQVKDLKLAFKESYTEACVWCLTFLSVV 851
           ++L LL + P    LP   + +++++    M    K + L       +  V  L     V
Sbjct: 363 VLLALLVLAPLLSYLPLSAMAALLLMVAWNMSEAHKVVDLLRHAPKDDIIVMLLCMSLTV 422

Query: 852 LLDVDYGLGIG-VLCSLIFVVVTGQKVMVYKLGRLIDSNIYVEEDFYESAIDVP-GIVIL 909
           L D+   + +G VL SL+F+     + +  ++ RL   ++           DVP  +++L
Sbjct: 423 LFDMVIAISVGIVLASLLFM-----RRIA-EMTRLAPISVQ----------DVPDDVLVL 466

Query: 910 RIIGGMNFINKDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKSFLDLYKELMEQ 969
           RI G + F   +++F   ++L   +E   + ++L   ++  +D   + +F    K  + +
Sbjct: 467 RINGPLFFAAAERLF---TELESRTEGK-RIVVLQWDAVPVLDAGGLDAFQRFVKR-LPE 521

Query: 970 GISLHIVKL 978
           G  L I  L
Sbjct: 522 GCELRICNL 530



 Score = 40.3 bits (95), Expect = 0.006
 Identities = 20/57 (35%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 1059 YNWKNDLSHDMVSGFTVAVLHIPQGMAYSSLGGVPPIVGIYMAVFPVIIYMCMGTSR 1115
            Y      + D+++G TV ++ IP  MA +   GVPP  G+Y A    I+    G SR
Sbjct: 24   YTAA-RFTRDLIAGITVGIIAIPLAMALAIASGVPPQYGLYTAAVAGIVIALTGGSR 79


>gnl|CDD|132913 cd07042, STAS_SulP_like_sulfate_transporter, Sulphate Transporter
           and Anti-Sigma factor antagonist domain of SulP-like
           sulfate transporters, plays a role in the function and
           regulation of the transport activity, proposed general
           NTP binding function.  The SulP family is a large and
           diverse family of anion transporters, with members from
           eubacteria, plants, fungi, and mammals. They contain 10
           to 14 transmembrane helices which form the catalytic
           core of the protein and a C-terminal extension, the STAS
           (Sulphate Transporter and AntiSigma factor antagonist)
           domain which plays a role in the function and regulation
           of the transport activity. The STAS domain is found in
           the C-terminal region of sulphate transporters and
           bacterial anti-sigma factor antagonists. It has been
           suggested that this domain may have a general NTP
           binding function.
          Length = 107

 Score = 69.6 bits (171), Expect = 3e-14
 Identities = 24/89 (26%), Positives = 54/89 (60%), Gaps = 1/89 (1%)

Query: 897 YESAIDVPGIVILRIIGGMNFINKDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTV 956
           Y  A + PG++I RI G + F N +    ++ +L +  +P  K +ILD+ +++ +D++  
Sbjct: 1   YPLAEEPPGVLIYRIDGPLFFGNAEYFKDRLLRL-VDEDPPLKVVILDLSAVNFIDSTAA 59

Query: 957 KSFLDLYKELMEQGISLHIVKLLEPVKQV 985
           ++  +L K+L ++G+ L++  L   V+++
Sbjct: 60  EALEELVKDLRKRGVELYLAGLNPQVREL 88


>gnl|CDD|201948 pfam01740, STAS, STAS domain.  The STAS (after Sulphate Transporter
           and AntiSigma factor antagonist) domain is found in the
           C terminal region of Sulphate transporters and bacterial
           antisigma factor antagonists. It has been suggested that
           this domain may have a general NTP binding function.
          Length = 106

 Score = 56.5 bits (137), Expect = 1e-09
 Identities = 25/83 (30%), Positives = 48/83 (57%), Gaps = 2/83 (2%)

Query: 903 VPGIVILRIIGGMNFINKDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKSFLDL 962
           +PGI+ILR+ G ++F N +    ++ +     E   + +ILD+  +  +D+S + + L+L
Sbjct: 7   IPGILILRLDGPLDFANAEAFRERLLRAL--EEGEIRHVILDLSGVPFIDSSGLGALLEL 64

Query: 963 YKELMEQGISLHIVKLLEPVKQV 985
           YKEL  +G+ L +V     V++ 
Sbjct: 65  YKELRRRGVELVLVGPSPEVRRT 87


>gnl|CDD|216158 pfam00860, Xan_ur_permease, Permease family.  This family includes
           permeases for diverse substrates such as xanthine,
           uracil, and vitamin C. However many members of this
           family are functionally uncharacterized and may
           transport other substrates. Members of this family have
           ten predicted transmembrane helices.
          Length = 389

 Score = 55.8 bits (135), Expect = 7e-08
 Identities = 62/288 (21%), Positives = 111/288 (38%), Gaps = 59/288 (20%)

Query: 571 SLSVLMSDIMISGFTTGTAILVILSQIKHVFGIKVKRHIGPFNVIY--------TLIDVA 622
           +L+ L   ++++G      +  ++S        ++ R   P  V          +L  +A
Sbjct: 88  ALAGLFGAVLVAGV-----LFTLISFTGL--RGRLARLFPP-VVTGPVVLLIGLSLAPIA 139

Query: 623 EN---IHK----------TNYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVI 669
                              + +   VVV+ V +L+                PI   +I I
Sbjct: 140 VKGAGGGWAIADGLTVGLLDLLGLAVVVLAVILLLS-----VFLKGFFRQGPI---LIGI 191

Query: 670 VAGALLSSILDVKHKYNLSNVGKIPIGLPSPQPP-PFYLI-----PKLMLDGLFISIVAF 723
           +AG LL+  +      N S      +  P  Q P PF        P L+L  L +++VA 
Sbjct: 192 IAGWLLALFMG---IVNFS---PEVMDAPWFQLPHPFPFGTPLFNPGLILTVLAVALVAI 245

Query: 724 SINISMASILAK-----KKYKIDSNQELLASGVSNIFASFFSCIPFAS-SLSRSLVQL-Q 776
             +      +AK      K K D  + LLA G++ + +  F   P  + + +  +V L +
Sbjct: 246 VESTGDIRAVAKVSGRDLKPKPDLRRGLLADGLATLLSGLFGAFPTTTYAENIGVVALTK 305

Query: 777 VG-GKTQLASGVSCVLLVLILLYIGPFFETLPYCVLTSIVIVAVKGML 823
           V   +  + +GV  +LL LI       F ++P  VL  +++V   GM+
Sbjct: 306 VYSRRVGVTAGVILILLGLI-PKFAALFSSIPSPVLGGVMLVMF-GMI 351


>gnl|CDD|129473 TIGR00377, ant_ant_sig, anti-anti-sigma factor.  This superfamily
           includes small (105-125 residue) proteins related to
           SpoIIAA of Bacillus subtilis, an anti-anti-sigma factor.
           SpoIIAA can bind to and inhibit the anti-sigma F factor
           SpoIIAB. Also, it can be phosphorylated by SpoIIAB on a
           Ser residue at position 59 of the seed alignment. A
           similar arrangement is inferred for RsbV, an
           anti-anti-sigma factor for sigma B. This Ser is fairly
           well conserved within a motif resembling
           MXS[STA]G[VIL]X[VIL][VILF] among homologous known or
           predicted anti-anti-sigma factors. Regions similar to
           SpoIIAA and apparently homologous, but differing
           considerably near the phosphorylated Ser of SpoIIAA,
           appear in a single copy in several longer proteins
           [Regulatory functions, Protein interactions].
          Length = 108

 Score = 40.3 bits (95), Expect = 5e-04
 Identities = 20/81 (24%), Positives = 42/81 (51%), Gaps = 2/81 (2%)

Query: 905 GIVILRIIGGMNFINKDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKSFLDLYK 964
           G+VI+R+ G ++      +  K++  +  + P  + I+LD+  L  +D+S +   L  YK
Sbjct: 12  GVVIVRLSGELDAHTAPLLREKVTPAAERTGP--RPIVLDLEDLEFMDSSGLGVLLGRYK 69

Query: 965 ELMEQGISLHIVKLLEPVKQV 985
           ++   G  L +V +   V ++
Sbjct: 70  QVRRVGGQLVLVSVSPRVARL 90


>gnl|CDD|225711 COG3170, FimV, Tfp pilus assembly protein FimV [Cell motility and
           secretion / Intracellular trafficking and secretion].
          Length = 755

 Score = 42.6 bits (100), Expect = 0.001
 Identities = 31/125 (24%), Positives = 44/125 (35%), Gaps = 13/125 (10%)

Query: 6   LRVP-EGKFCRSFPQNQIKNMMAHAQKMIEERKKALLALKGTAPVAPTPVVPAGVTPAII 64
           LR+P   +  R  PQ  +  + A       E  KA    +   PVA  P           
Sbjct: 243 LRIPSAAQILRESPQEALAEVKAQTAAFAGEPSKAD---RVGKPVAKAPAK----VAKER 295

Query: 65  GSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQSKLSTGVLKPSVPGAKPPGAP 124
             A++P  +        A+ + A +E A K A L A  Q+ L       + P A  P   
Sbjct: 296 ALAELPARVAE----LQAQLNKAQHELAQKAAPLAA-AQAALDAPAETATAPSAPAPQVS 350

Query: 125 ATDKP 129
           A   P
Sbjct: 351 AESSP 355


>gnl|CDD|234136 TIGR03173, pbuX, xanthine permease.  All the seed members of this
           model are observed adjacent to genes for either xanthine
           phosphoribosyltransferase (for the conversion of
           xanthine to guanine, GenProp0696, ) or genes for the
           conversion of xanthine to urate and its concomitant
           catabolism (GenProp0640, GenProp0688, GenProp0686 and
           GenProp0687). A number of sequences scoring higher than
           trusted to this model are found in different genomic
           contexts, and the possibility exist that these transport
           related compounds in addition to or instead of xanthine
           itself. The outgroup to this family are sequences which
           are characterized as uracil permeases or are adjacent to
           established uracil phosphoribosyltransferases.
          Length = 406

 Score = 42.1 bits (100), Expect = 0.001
 Identities = 49/216 (22%), Positives = 94/216 (43%), Gaps = 34/216 (15%)

Query: 631 VAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSILDVKHKYNLSNV 690
           +A      LV +L++    F     ++I+      ++ +V G ++++ L      + S V
Sbjct: 151 LALLT---LVIILLLN--RFGKGFLRSIAV-----LLGLVVGTIVAAALG---MVDFSGV 197

Query: 691 GKIP-IGLPSPQP---PPFYLIPKLMLDGLFISIVAF--SINISMA--SILAKKKYKIDS 742
            + P   LP+P     P F L+  L +  + + +V+   +    +A   I  +   + D 
Sbjct: 198 AEAPWFALPTPFHFGAPTFDLVAILTM--IIVYLVSMVETTGDFLALGEITGRPITEKDL 255

Query: 743 NQELLASGVSNIFASFFSCIPFAS-SLSRSLVQLQVGGKTQLASGVSCVLLVLILLY--I 799
              L A G+ +     F+  P+ S S +  LVQL  G K++     + V+LVL+ L+  +
Sbjct: 256 AGGLRADGLGSALGGLFNTFPYTSFSQNVGLVQL-TGVKSRYVVAAAGVILVLLGLFPKL 314

Query: 800 GPFFETLPYCVLTSIVI-----VAVKG--MLNQVKD 828
                ++P  VL    +     VA  G  +L++V  
Sbjct: 315 AALVASIPQPVLGGAGLVMFGMVAASGIRILSKVDF 350


>gnl|CDD|224285 COG1366, SpoIIAA, Anti-anti-sigma regulatory factor (antagonist of
           anti-sigma factor) [Signal transduction mechanisms].
          Length = 117

 Score = 39.3 bits (92), Expect = 0.001
 Identities = 13/81 (16%), Positives = 39/81 (48%), Gaps = 2/81 (2%)

Query: 905 GIVILRIIGGMNFINKDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKSFLDLYK 964
           GI++L +IG ++      +   + ++   +    + +++D+  +  +D++ +   + L K
Sbjct: 13  GILVLPLIGELDAARAPALKETLLEVI--AASGARGLVIDLSGVDFMDSAGLGVLVALLK 70

Query: 965 ELMEQGISLHIVKLLEPVKQV 985
               +G+ L +V +   V + 
Sbjct: 71  SARLRGVELVLVGIQPEVART 91


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 42.4 bits (99), Expect = 0.001
 Identities = 61/329 (18%), Positives = 125/329 (37%), Gaps = 25/329 (7%)

Query: 163  AKKREEFHEKLKEKPTEDLSESKFFDPRISAKSSVRTKRALKFHEPGKFQQLAEQLRVKA 222
            AKK  E  +   E   ++   ++        K     K+A    +  + ++ A++ + KA
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKA 1400

Query: 223  QLAKLQADISKNAR--KTGISSATKLALIAPKMEDDQDEMPEVEWWDAVIMVEETYEKEN 280
            +  K +AD  K A   K     A K A    K ++ + +  E +  D      E  +K  
Sbjct: 1401 EEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAE 1460

Query: 281  NIKTSAITNLVEHPIQMKPPSDMAKPVYMPVFLTDKERKKLRRQNRR--EAWKEEQEKIR 338
              K  A         +    +D AK         D+ +KK     ++  EA K  + K +
Sbjct: 1461 EAKKKA---------EEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511

Query: 339  LGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQK 398
                   E   +     +    +   +  K E   +A   K+ ++ ++A   +K   E+ 
Sbjct: 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKK--AEEA 1569

Query: 399  REKKIRKIKEDTSLEVHVALRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANA 458
            ++ +  K         ++ALR +   +      +++  K+    +   A+  K  EEA  
Sbjct: 1570 KKAEEDK---------NMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620

Query: 459  -SRKLTVEQKREKKIRKIKEDTSLEVHVA 486
             + +L   ++ +KK+ ++K+  + E   A
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649



 Score = 40.1 bits (93), Expect = 0.007
 Identities = 36/169 (21%), Positives = 70/169 (41%), Gaps = 9/169 (5%)

Query: 314  TDKERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHV 373
            T++   K     + E  K E+ +         E K +  +  +       +D  K E   
Sbjct: 1093 TEEAFGKAEEAKKTETGKAEEARKA------EEAKKKAEDARKAEEARKAEDARKAEEAR 1146

Query: 374  RAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLGTEAVQ 433
            +A+ AKR +   +A  +RK    +K E   +      + EV  A  +         EA +
Sbjct: 1147 KAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAAR 1206

Query: 434  DPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLE 482
               K E   +A+ A++ +D ++A A +K    +K  ++ +K +E+ + E
Sbjct: 1207 ---KAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNE 1252



 Score = 39.4 bits (91), Expect = 0.015
 Identities = 37/168 (22%), Positives = 71/168 (42%), Gaps = 21/168 (12%)

Query: 315  DKERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVR 374
            ++ RK        EA K+ ++  +       E   +     +    + V+   K E   +
Sbjct: 1112 EEARKA------EEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARK 1165

Query: 375  AQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLGTEAVQD 434
            A+ A++ +D ++A A+RK   E ++ +++RK ++    E   A +     +       +D
Sbjct: 1166 AEEARKAEDAKKAEAARK-AEEVRKAEELRKAEDARKAEA--ARKAEEERKAEEARKAED 1222

Query: 435  PTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLE 482
              K EA  +A+ AK  KD EEA          K+ ++ R  +E    E
Sbjct: 1223 AKKAEAVKKAEEAK--KDAEEA----------KKAEEERNNEEIRKFE 1258



 Score = 35.5 bits (81), Expect = 0.23
 Identities = 67/324 (20%), Positives = 124/324 (38%), Gaps = 27/324 (8%)

Query: 163  AKKREEFHEKLKEKPTEDLSESKFFDPRISAKSSVRTKRALKFHEPGKFQQLAEQLRVKA 222
            AKK +E  +K +E    + ++ K  + +   K+    K+A +  +  + ++ AE+ + KA
Sbjct: 1443 AKKADEAKKKAEEAKKAEEAKKKAEEAK---KADEAKKKAEEAKKADEAKKKAEEAKKKA 1499

Query: 223  QLAKLQADISKNARKTGISSATKLALIAPKME-----DDQDEMPEVEWWDAVIMVEETYE 277
              AK  A+  K A +   +   K A  A K E     D+  +  E +  D +   EE  +
Sbjct: 1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKK 1559

Query: 278  KENNIKTSAITNLVEHPIQMKPPSDMAKPV----YMPVFLTDKERKKLRRQNRREAWKEE 333
             E   K        E        ++ AK         V    +E KK++ +   EA K E
Sbjct: 1560 AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE---EAKKAE 1616

Query: 334  QEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKL 393
            + KI+         +L+ +   +    +  Q   K     +   A+  K  EE N  +  
Sbjct: 1617 EAKIK-------AEELKKAEEEK---KKVEQLKKKEAEEKKK--AEELKKAEEENKIKAA 1664

Query: 394  TVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDH 453
               +K E+  +K +E    E         L +        +  K +     + A+  K  
Sbjct: 1665 EEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKA 1724

Query: 454  EEANASRKLTVEQKREKKIRKIKE 477
            EE N  +    +++ E+  +K +E
Sbjct: 1725 EEENKIKAEEAKKEAEEDKKKAEE 1748



 Score = 31.3 bits (70), Expect = 4.0
 Identities = 22/110 (20%), Positives = 52/110 (47%), Gaps = 3/110 (2%)

Query: 368  KMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVL 427
            K E   +A+ AK+ +  ++A  ++K   E K+ ++ R  +E    E     R+++  R  
Sbjct: 1213 KAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFE---EARMAHFARRQ 1269

Query: 428  GTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKE 477
                 ++  K +   +A+  K+  + ++A   +K    +K+ ++ +K  E
Sbjct: 1270 AAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADE 1319



 Score = 30.9 bits (69), Expect = 5.5
 Identities = 31/164 (18%), Positives = 67/164 (40%), Gaps = 4/164 (2%)

Query: 316  KERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRA 375
            ++ KK     + E  K++ E+ +   E     ++R     R+      Q   K E   +A
Sbjct: 1221 EDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKA 1280

Query: 376  QMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLGTEAVQ-- 433
               K+ ++ ++A+ ++K   E+K++    K K + + +   A + +   +     A +  
Sbjct: 1281 DELKKAEEKKKADEAKK--AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKA 1338

Query: 434  DPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKE 477
            +  K  A      A+   D  EA   +    E+K+E+  +K   
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382


>gnl|CDD|132914 cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and
           Anti-Sigma factor antagonist) domain of anti-anti-sigma
           factors, key regulators of anti-sigma factors by
           phosphorylation.  Anti-anti-sigma factors play an
           important role in the regulation of several sigma
           factors and their corresponding anti-sigma factors. Upon
           dephosphorylation they bind the anti-sigma factor and
           induce the release of the sigma factor from the
           anti-sigma factor. In a feedback mechanism the
           anti-anti-sigma factor can be inactivated via
           phosphorylation by the anti-sigma factor. Well studied
           examples from Bacillus subtilis are SpoIIAA (regulating
           sigmaF and sigmaC which play an important role in
           sporulation) and RsbV (regulating sigmaB involved in the
           general stress response). The STAS domain is also found
           in the C- terminal region of sulphate transporters and
           stressosomes.
          Length = 99

 Score = 37.5 bits (88), Expect = 0.004
 Identities = 16/87 (18%), Positives = 43/87 (49%), Gaps = 11/87 (12%)

Query: 903 VPGIVILRIIGGMNFIN----KDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKS 958
             G++++R+ G ++       ++ +   +++        P++++LD+  ++ +D+S +  
Sbjct: 6   RGGVLVVRLSGELDAATAPELREALEELLAE-------GPRRLVLDLSGVTFIDSSGLGV 58

Query: 959 FLDLYKELMEQGISLHIVKLLEPVKQV 985
            L  YK     G  L +V +   V++V
Sbjct: 59  LLGAYKRARAAGGRLVLVNVSPAVRRV 85


>gnl|CDD|225142 COG2233, UraA, Xanthine/uracil permeases [Nucleotide transport and
           metabolism].
          Length = 451

 Score = 40.3 bits (95), Expect = 0.006
 Identities = 58/300 (19%), Positives = 108/300 (36%), Gaps = 68/300 (22%)

Query: 570 GSLSVLMSDIMISGFTTGTAILVILSQIKHVFGIKVKRHIGPF---NVI----YTLIDVA 622
             ++ L+  I+ +G         ++S I     I++ R   P     V+     +L  VA
Sbjct: 103 DGIAALLGGIIAAGLVY-----FLISPI---VKIRLARLFPPVVTGPVVLVIGLSLAPVA 154

Query: 623 -------------ENIHKTNYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVI 669
                         +      +   +V +L+ +LI     F     + I       +I +
Sbjct: 155 INMAGGGPGAAGNPDFGSLENLGLALVTLLIILLIN---RFGKGFLRRIPI-----LIGL 206

Query: 670 VAGALLSSILDVKHKYNLSNVGKIP-IGLPSPQPPPFYLIPKLMLDGLFISIVAFSINIS 728
           V G LL+  +      + S V + P   LP+P           +L  L ++IV    +  
Sbjct: 207 VVGYLLALFMG---MVDFSGVAEAPWFALPTPFYFGMAFDWGAILTMLPVAIVTIVEHTG 263

Query: 729 M---ASILAKKKYKIDSNQEL----LASGVSNIFASFFSCIPF-ASSLSRSLVQLQVGGK 780
                  +  +    D    L    LA G++ + A  F   P    + +  +V L     
Sbjct: 264 DITATGEITGRDL--DGKPRLRRGLLADGLATLIAGLFGGFPNTTFAQNIGVVAL----- 316

Query: 781 TQLAS-----GVSCVLLVLILL-YIGPFFETLPYCVL--TSIV---IVAVKG--MLNQVK 827
           T + S     G + +L++L L    G   +++P  VL    +V   ++A  G  +L + K
Sbjct: 317 TGVYSRYVIAGAAVILILLGLFPKFGALIQSIPSPVLGGAMLVLFGMIAASGIRILIRNK 376


>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
           family.  The proteins represented by this model contain
           three RNA recognition motifs (rrm: pfam00076) and have
           been characterized as poly-pyrimidine tract binding
           proteins associated with RNA splicing factors. In the
           case of PUF60 (GP|6176532), in complex with p54, and in
           the presence of U2AF, facilitates association of U2
           snRNP with pre-mRNA.
          Length = 612

 Score = 39.7 bits (92), Expect = 0.008
 Identities = 37/193 (19%), Positives = 65/193 (33%), Gaps = 18/193 (9%)

Query: 39  ALLALKGTAPVAPTPVVPAGV-TPAIIGSA-KIPPLMPPAPPSTTAEKDNATYEKAMKIA 96
           A+L  +  +P  P+  +P  +   A++ SA K    +PP P +  A       E    + 
Sbjct: 322 AVLGPRAQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAVVKPGPMEIPTPVP 381

Query: 97  QLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTPLILDSEGRTIDITGKQVQLTHVVPT 156
                I S               PPG  A  +  P  L S  + +      V    +  T
Sbjct: 382 PPGLAIPS------------LVAPPGLVAPTEINPSFLASPRKKMKREKLPVTFGALDDT 429

Query: 157 LKANIRAKKREEFHEKLKEKPTE-DLSESKFFDPRISAKSSVRTKRALKFHEPGKFQQLA 215
           L A     K ++  E  K      + + +   +P+   K     +   K     +   LA
Sbjct: 430 L-AWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMNSEDASLA 488

Query: 216 EQ--LRVKAQLAK 226
            Q  + ++   A+
Sbjct: 489 SQEGMSIRGNSAR 501


>gnl|CDD|225161 COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide
           transport and    metabolism].
          Length = 436

 Score = 39.5 bits (93), Expect = 0.009
 Identities = 56/229 (24%), Positives = 92/229 (40%), Gaps = 35/229 (15%)

Query: 666 MIVIVAGALLSSILDVKHKYNLSNVGKIPIGLPSPQPPPFYLIPKLMLDGLFISIVAFSI 725
           +I I+   +L  IL +   +    VG  P    SP      L   L L      I  F  
Sbjct: 199 LIGILVTTILGIILGIDVHF-GGLVGAPPS--LSPIFGQLDLSGNLSLAAFAPVIFTFFF 255

Query: 726 NI------------SMASILAKKKYKIDSNQELLASGVSNIFASFFSCIPFASSLSRSLV 773
                         S A +L K        + LLA  V+ +  + F      ++   S  
Sbjct: 256 VDLFDTLGTLIGVASKAGLLDKNGKMPRIGKALLADSVATVVGALFGTST-VTAYIESAA 314

Query: 774 QLQVGGKTQLASGVSCV---LLVLILLYIGPFFETLP-YCVLTSIVIVAVKGMLNQVKDL 829
            +  GG+T    G++ V   LL L+ L+  P    +P Y    +++IV    ML+ VK  
Sbjct: 315 GVAAGGRT----GLTAVVTGLLFLLSLFFSPLAALVPGYATAPALIIVGAL-MLSSVK-- 367

Query: 830 KLAFKESYTEACVWCLTFLSVVLLDVDY----GLGIGVLCSLIFVVVTG 874
           ++ + + +TEA      FL++V++ + Y    G+  G +  +I  V TG
Sbjct: 368 QIDWSD-FTEAVP---AFLTIVMMPLTYSIADGIAFGFISYVILKVFTG 412


>gnl|CDD|147845 pfam05914, RIB43A, RIB43A.  This family consists of several
           RIB43A-like eukaryotic proteins. Ciliary and flagellar
           microtubules contain a specialised set of
           protofilaments, termed ribbons, that are composed of
           tubulin and several associated proteins. RIB43A was
           first characterized in the unicellular biflagellate,
           Chlamydomonas reinhardtii although highly related
           sequences are present in several higher eukaryotes
           including humans. The function of this protein is
           unknown although the structure of RIB43A and its
           association with the specialised protofilament ribbons
           and with basal bodies is relevant to the proposed role
           of ribbons in forming and stabilising doublet and
           triplet microtubules and in organising their
           three-dimensional structure. Human RIB43A homologues
           could represent a structural requirement in centriole
           replication in dividing cells.
          Length = 379

 Score = 36.2 bits (84), Expect = 0.097
 Identities = 41/171 (23%), Positives = 72/171 (42%), Gaps = 20/171 (11%)

Query: 313 LTDKERKKL-RRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEA 371
           L + +RKK+ + Q R   W E+Q + R   +   +    + + +RV      Q+   +E 
Sbjct: 147 LNEADRKKMQQEQQR--EWLEQQVQERQQAKAAEKQADTLHDQLRVELDHRAQELASLEE 204

Query: 372 HVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRV-LGTE 430
             R  + +   D  +A A      E++RE++ R+ +E+    +     I N +   L TE
Sbjct: 205 ECRRAVEQATADFNKALA------EEQRERERREKQEEQEDNL---AEIYNQLTSDLLTE 255

Query: 431 ---AVQDPTKMEAH-VRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKE 477
                Q       H V     K     E+  A RK   +Q +EK+ R+ +E
Sbjct: 256 NPDVAQSS--FGPHRVIPDRWKGMSP-EQLAAIRKGQEQQLQEKERRREEE 303


>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
          Length = 393

 Score = 35.4 bits (82), Expect = 0.15
 Identities = 29/122 (23%), Positives = 59/122 (48%), Gaps = 20/122 (16%)

Query: 377 MAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISN----LMRVLGTEAV 432
           + KR++  EEA    +  +E+  +++I+K  + T   V V    +     L+R++G   +
Sbjct: 99  LEKRRERREEAEEELEKAIEEGDDEEIKKQSKRT---VRVTKEQNEDAKKLLRLMGIPVI 155

Query: 433 QDPTKMEAHVRAQMAKRQKDH----EEANA--------SRKLTVEQKREKKIRKIKEDTS 480
           + P + EA   A++ K+ K +    E+ +A         R L   + +++ I++I   T 
Sbjct: 156 EAPCEAEAQC-AELVKKGKVYAVATEDMDALTFGTPVLLRNLNFSEAKKRPIQEINLSTV 214

Query: 481 LE 482
           LE
Sbjct: 215 LE 216



 Score = 30.4 bits (69), Expect = 6.0
 Identities = 39/144 (27%), Positives = 63/144 (43%), Gaps = 44/144 (30%)

Query: 305 KPVYMPVF------LTDKERKKLRRQNRREAWKEEQEK-----------------IRLGL 341
           KPVY  VF      L   E +K RR+ R EA +EE EK                 +R+  
Sbjct: 82  KPVY--VFDGKPPELKSGELEK-RRERREEA-EEELEKAIEEGDDEEIKKQSKRTVRVTK 137

Query: 342 EPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDH----EEANA-------- 389
           E   + K     L+R++G   ++ P + EA   A++ K+ K +    E+ +A        
Sbjct: 138 EQNEDAK----KLLRLMGIPVIEAPCEAEAQC-AELVKKGKVYAVATEDMDALTFGTPVL 192

Query: 390 SRKLTVEQKREKKIRKIKEDTSLE 413
            R L   + +++ I++I   T LE
Sbjct: 193 LRNLNFSEAKKRPIQEINLSTVLE 216


>gnl|CDD|234022 TIGR02813, omega_3_PfaA, polyketide-type polyunsaturated fatty acid
            synthase PfaA.  Members of the seed for this alignment
            are involved in omega-3 polyunsaturated fatty acid
            biosynthesis, such as the protein PfaA from the
            eicosapentaenoic acid biosynthesis operon in
            Photobacterium profundum strain SS9. PfaA is encoded
            together with PfaB, PfaC, and PfaD, and the functions of
            the individual polypeptides have not yet been described.
            More distant homologs of PfaA, also included with the
            reach of this model, appear to be involved in
            polyketide-like biosynthetic mechanisms of
            polyunsaturated fatty acid biosynthesis, an alternative
            to the more familiar iterated mechanism of chain
            extension and desaturation, and in most cases are encoded
            near genes for homologs of PfaB, PfaC, and/or PfaD.
          Length = 2582

 Score = 35.8 bits (82), Expect = 0.17
 Identities = 23/119 (19%), Positives = 40/119 (33%), Gaps = 6/119 (5%)

Query: 17   FPQNQIKNMMAHAQKMIEERKKALLALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPA 76
            F Q+Q + +  H Q +  +    ++ L   A  AP         P +  +  + P  P  
Sbjct: 1093 FHQHQAETLKVHEQYLNLQTDSNIVKLSPLATQAPVIKSVVTQAPVVQVTISVAPAAPVL 1152

Query: 77   PPSTTAEKDNATYEKAMKIAQLQAQIQS------KLSTGVLKPSVPGAKPPGAPATDKP 129
            P   +    +A   +++  A   A +           +    PSV  A  P A   D  
Sbjct: 1153 PAVVSPPVVSAAPAQSVATAVAMAPVAEVPIAVPVQQSVDYMPSVAQAAAPQASVNDSA 1211


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 548

 Score = 35.6 bits (82), Expect = 0.18
 Identities = 34/185 (18%), Positives = 64/185 (34%), Gaps = 16/185 (8%)

Query: 294 PIQMKPPSDMAKPVYMPVFLTDKERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISN 353
            + +   +D +K    P    D   ++   Q  ++A   E E  +   E       R + 
Sbjct: 188 SLAINDINDTSKENQDPNNYLDALGRRRIAQVLQDAEIAENEAEKE-TEIAIAEANRDAK 246

Query: 354 LMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLE 413
           L+     E  Q P    A    ++ K      EA  +     E +RE +  +I  + +++
Sbjct: 247 LVE---LEVEQQPAGKTAEQTREV-KIILAETEAEVAAW-KAETRREAEQAEILAEQAIQ 301

Query: 414 VHVALRISNLMRVLGTEAVQDPTKMEAHV-RAQMAKRQKDHEEANASRKLTVEQKREKKI 472
              A            + VQ    +EA   R  + +RQK+ E     R   +   + +  
Sbjct: 302 EEKAQA---------EQEVQHAKALEAREMRVGLIERQKETELEPQERSYFINAAQRQAQ 352

Query: 473 RKIKE 477
            + K 
Sbjct: 353 EEAKA 357


>gnl|CDD|132911 cd06844, STAS, Sulphate Transporter and Anti-Sigma factor
           antagonist domain found in the C-terminal region of
           sulphate transporters as well as in bacterial and
           archaeal proteins involved in the regulation of sigma
           factors.  The STAS (Sulphate Transporter and Anti-Sigma
           factor antagonist) domain is found in the C-terminal
           region of sulphate transporters as well as in bacterial
           and archaeal proteins involved in the regulation of
           sigma factors, like anti-anti-sigma factors and
           "stressosome" components. The sigma factor regulators
           are involved in protein-protein interaction which is
           regulated by phosphorylation.
          Length = 100

 Score = 32.1 bits (73), Expect = 0.33
 Identities = 11/69 (15%), Positives = 30/69 (43%), Gaps = 2/69 (2%)

Query: 903 VPGIVILRIIGGMNFINKDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKSFLDL 962
           V    ++R+ G ++  + ++   ++     +     K I++D+ +L  +D+S     L+ 
Sbjct: 6   VDDYWVVRLEGELDHHSVEQFKEELLHNITN--VAGKTIVIDISALEFMDSSGTGVLLER 63

Query: 963 YKELMEQGI 971
            +     G 
Sbjct: 64  SRLAEAVGG 72


>gnl|CDD|224397 COG1480, COG1480, Predicted membrane-associated HD superfamily
           hydrolase [General function prediction only].
          Length = 700

 Score = 34.7 bits (80), Expect = 0.38
 Identities = 39/200 (19%), Positives = 75/200 (37%), Gaps = 31/200 (15%)

Query: 630 YVAFGVVVILVSVLI-IYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSILDVKHKYNLS 688
            +   ++VI + +L  +Y    KS ++   S      + + +A   LS +L +   +N S
Sbjct: 277 LLGLLILVIFLILLFALYERRTKSPLKLRNS---LLLLYLSLAILTLS-LLRIVGYFNYS 332

Query: 689 NVGKIPIGLPSPQPPPFYLIPKLMLDGLF--ISIVAFSINISMASILAKKKYKIDSNQEL 746
             G           PP    P L++  +F  I+I + S+       L    +    N E+
Sbjct: 333 ASG--------LLVPPAL-GPMLLILLVFLRIAIFSSSMIAIALLYL----FGGSYNSEI 379

Query: 747 LASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASGVSCVLLVLILLYIGPFFETL 806
               + + F++          +SR    L+ G    L         +L+LL +   F   
Sbjct: 380 ALIALLSSFSALV----LLRKMSRRSDILKSGLFLALM-------NMLLLLSLIFAFTLS 428

Query: 807 PYCVLTSIVIVAVKGMLNQV 826
            Y  L   +   + G+L+ +
Sbjct: 429 WYDALQDAIFAFLSGLLSGI 448


>gnl|CDD|234090 TIGR03021, pilP_fam, type IV pilus biogenesis protein PilP.
           Members of this protein family are found in type IV
           pilus biogenesis loci and include proteins designated
           PilP [Cell envelope, Surface structures].
          Length = 119

 Score = 32.0 bits (73), Expect = 0.46
 Identities = 15/70 (21%), Positives = 26/70 (37%), Gaps = 3/70 (4%)

Query: 85  DNATYEKAMKIAQLQAQIQSKLSTGVLKPSVPG---AKPPGAPATDKPTPLILDSEGRTI 141
           + A  E  +  A+ Q +++     G +           PP A     PT        R +
Sbjct: 13  ETALLEAQLARAKAQNELEEAERGGQVGGPGMPFTSGVPPMALTGANPTSAPAILLPRVV 72

Query: 142 DITGKQVQLT 151
           +I G+  +LT
Sbjct: 73  EIFGRGGRLT 82


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 32.8 bits (74), Expect = 0.57
 Identities = 26/91 (28%), Positives = 44/91 (48%), Gaps = 11/91 (12%)

Query: 322 RRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQ 381
           RRQ R +  +EEQE+     E   + +L    L R     A ++  + E   R Q  +R 
Sbjct: 32  RRQAREQREQEEQER----REQEEQDRLEREELKR----RAAEERLRREEEARRQEEERA 83

Query: 382 KDHEEA---NASRKLTVEQKREKKIRKIKED 409
           ++ EE     A  +   EQ+ +++I+K KE+
Sbjct: 84  REKEEKAKRKAEEEEKQEQEEQERIQKQKEE 114



 Score = 30.1 bits (67), Expect = 4.7
 Identities = 30/140 (21%), Positives = 61/140 (43%), Gaps = 6/140 (4%)

Query: 344 PPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVE-QKREKK 402
           P + +   +      GT   ++ T++ A  R Q A+ Q++ EE     +   +  +RE+ 
Sbjct: 2   PGKAENSAALGKPTAGTTDAEEATRLLAEKRRQ-AREQREQEEQERREQEEQDRLEREEL 60

Query: 403 IRKIKED-TSLEVHVALRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRK 461
            R+  E+    E     +     R    +A +   + E   + +  + QK  EEA A  +
Sbjct: 61  KRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAR 120

Query: 462 LTVEQ---KREKKIRKIKED 478
              E+   +REK  ++I+++
Sbjct: 121 EEAERMRLEREKHFQQIEQE 140


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 34.1 bits (78), Expect = 0.63
 Identities = 22/99 (22%), Positives = 27/99 (27%), Gaps = 7/99 (7%)

Query: 39   ALLALKGTAPVAPTPVVPAGVT-----PAIIG-SAKIPPLMPPAPPSTTAEKDNATYEKA 92
            AL A      V   P  P G       P   G  A  PP  P A P     +  A    +
Sbjct: 2734 ALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP-AVASLS 2792

Query: 93   MKIAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTP 131
                 L +          +        P  +PA   P P
Sbjct: 2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPP 2831



 Score = 32.6 bits (74), Expect = 1.9
 Identities = 21/93 (22%), Positives = 25/93 (26%), Gaps = 8/93 (8%)

Query: 47   APVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQSKL 106
            AP A     P    PA    A       PAPP        A               +   
Sbjct: 2711 APHALVSATPLPPGPAAARQASPALPAAPAPP--------AVPAGPATPGGPARPARPPT 2762

Query: 107  STGVLKPSVPGAKPPGAPATDKPTPLILDSEGR 139
            + G   P+ P A   G P       +   SE R
Sbjct: 2763 TAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795



 Score = 31.1 bits (70), Expect = 5.1
 Identities = 20/92 (21%), Positives = 30/92 (32%), Gaps = 12/92 (13%)

Query: 42  ALKGTAPVAPTP---------VVPAGV-TPAIIGS--AKIPPLMPPAPPSTTAEKDNATY 89
             +G     P P           P GV   A+ G+  A   P  PP P      ++    
Sbjct: 271 TARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAPAGDAEEEDDE 330

Query: 90  EKAMKIAQLQAQIQSKLSTGVLKPSVPGAKPP 121
           + AM++     + +     G  K   P   PP
Sbjct: 331 DGAMEVVSPLPRPRQHYPLGFPKRRRPTWTPP 362


>gnl|CDD|165458 PHA03188, PHA03188, UL14 tegument protein; Provisional.
          Length = 199

 Score = 32.7 bits (74), Expect = 0.64
 Identities = 21/66 (31%), Positives = 34/66 (51%), Gaps = 2/66 (3%)

Query: 348 KLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIK 407
           K R  +L+R  G  + QDP  + A   A+ A    ++   +A+R   VEQK      K++
Sbjct: 29  KERTLDLIR--GGASTQDPAFVHAFTAAKDACADLNNNIRSAARIAAVEQKIADIQEKVE 86

Query: 408 EDTSLE 413
           E TS++
Sbjct: 87  EQTSIQ 92



 Score = 32.0 bits (72), Expect = 1.3
 Identities = 20/64 (31%), Positives = 33/64 (51%), Gaps = 2/64 (3%)

Query: 419 RISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKED 478
           R  +L+R  G  + QDP  + A   A+ A    ++   +A+R   VEQK      K++E 
Sbjct: 31  RTLDLIR--GGASTQDPAFVHAFTAAKDACADLNNNIRSAARIAAVEQKIADIQEKVEEQ 88

Query: 479 TSLE 482
           TS++
Sbjct: 89  TSIQ 92


>gnl|CDD|183124 PRK11412, PRK11412, putative uracil/xanthine transporter;
           Provisional.
          Length = 433

 Score = 33.2 bits (76), Expect = 0.74
 Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 4/81 (4%)

Query: 746 LLASGVSNIFASFFSCIP---FASSLSRSLVQLQVGGKTQLASGVSCVLLVLILLYIGPF 802
            +A+G   +     + IP   F SS+   L Q     +     G    LLV ++  +   
Sbjct: 281 FVATGFMTLITVPLAVIPFSPFVSSIG-LLTQTGDYRRRSFIYGSVMCLLVALIPALTRL 339

Query: 803 FETLPYCVLTSIVIVAVKGML 823
           F ++P  V +++++V+   +L
Sbjct: 340 FCSIPLPVSSAVMLVSYLPLL 360


>gnl|CDD|241506 cd13352, PH-GRAM2_TBC1D8B, TBC1 domain family member 8B (TBC1D8B)
           Pleckstrin Homology-Glucosyltransferases, Rab-like
           GTPase activators and Myotubularins (PH-GRAM) domain,
           repeat 2.  TBC1D8B may act as a GTPase-activating
           protein for Rab family protein(s). TBC1D8B contains an
           N-terminal PH-GRAM domain and a C-terminal
           Rab-GTPase-TBC (Tre-2, BUB2p, and Cdc16p) domain. This
           cd contains the second repeat of the PH-GRAM domain. The
           GRAM domain is found in glucosyltransferases,
           myotubularins and other putative membrane-associated
           proteins. The GRAM domain is part of a larger motif with
           a pleckstrin homology (PH) domain fold.
          Length = 93

 Score = 30.5 bits (69), Expect = 1.0
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 196 SVRTKRALKFHEPGKFQQLAEQLRVK 221
           S++ K A +F E   F+QL  +LR+K
Sbjct: 67  SIKGKTAFRFSEVRDFEQLVAKLRLK 92


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 32.7 bits (75), Expect = 1.4
 Identities = 27/101 (26%), Positives = 46/101 (45%), Gaps = 7/101 (6%)

Query: 381 QKDHEEANASRKLTVEQKREKKIRKIKEDTS--LEVHVALRISNLMRVLGTEAVQDPTKM 438
            K  EE    R L  ++KRE+ + K++E+    LE   A     L      E  +   K 
Sbjct: 182 LKAEEEEELERAL--KEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKY 239

Query: 439 EAHVRAQMAKRQKDHEE--ANASRKLTVEQKREKKIRKIKE 477
           E  +R ++ ++ + HE+   N      +E +RE   ++IKE
Sbjct: 240 EEKLRQELERQAEAHEQKLKNELALQAIELQREFN-KEIKE 279


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 32.4 bits (74), Expect = 1.5
 Identities = 38/142 (26%), Positives = 57/142 (40%), Gaps = 28/142 (19%)

Query: 332 EEQEKIR--LGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANA 389
           EE  K+R  LGL+P P P  +  +      T+  +D    E          +K  EEA A
Sbjct: 6   EETNKLRAKLGLKPLPVPGAKPGS------TKESRDAAAYE--------NWKKRQEEAEA 51

Query: 390 SRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLG-TEAVQDPTKMEAHVRAQMAK 448
            RK    ++  +KI K +E          R S L  +    E   D    +A +  +  K
Sbjct: 52  KRK---REELREKIAKAREK-------RERNSKLGGIKTLGEDDDDDDDTKAWL-KKSKK 100

Query: 449 RQKDHEEANASRKLTVEQKREK 470
           RQK  E       L  E+++E+
Sbjct: 101 RQKKKEAERKKALLLDEKEKER 122


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 32.2 bits (74), Expect = 1.7
 Identities = 47/169 (27%), Positives = 80/169 (47%), Gaps = 33/169 (19%)

Query: 316 KERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLM----RVLGTEAVQDPTKMEA 371
           KE + L+++   EA KEE  K+R  LE   E K R + L     R+L  E   D  KME 
Sbjct: 43  KEAETLKKEALLEA-KEEVHKLRAELER--ELKERRNELQRLERRLLQREETLD-RKME- 97

Query: 372 HVRAQMAKRQKDHEEAN---ASRKLTVEQKREKKIRKIKE-DTSLEVHVALRISNLMRVL 427
                + K++++ E+     ++++  +++K E+    I E    LE     RIS L +  
Sbjct: 98  ----SLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELE-----RISGLTQ-- 146

Query: 428 GTEAVQD-PTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKI 475
             EA +    ++E   R + AK  K+ EE         +++ +KK ++I
Sbjct: 147 -EEAKEILLEEVEEEARHEAAKLIKEIEE-------EAKEEADKKAKEI 187


>gnl|CDD|227938 COG5651, COG5651, PPE-repeat proteins [Cell motility and
           secretion].
          Length = 490

 Score = 32.2 bits (73), Expect = 1.7
 Identities = 9/50 (18%), Positives = 13/50 (26%)

Query: 39  ALLALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNAT 88
             +     A      V   G   A  G+  +      A P   A  + A 
Sbjct: 311 GAIGQTSQATANAGSVNATGGAAAGSGNLGVANSGSAAAPFGIAGANQAA 360


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 32.5 bits (74), Expect = 1.8
 Identities = 18/63 (28%), Positives = 22/63 (34%), Gaps = 4/63 (6%)

Query: 46  TAPVA-PTPVVPAGVTPAIIGSAKIPPLM---PPAPPSTTAEKDNATYEKAMKIAQLQAQ 101
           TAP        P G  P     A  PP     P APP        A  E    +A L++ 
Sbjct: 904 TAPGGDAASAPPPGAGPPAPPQAVPPPRTTQPPAAPPRGPDVPPAAVAELRETLADLRSA 963

Query: 102 IQS 104
            +S
Sbjct: 964 QRS 966


>gnl|CDD|224698 COG1784, COG1784, Predicted membrane protein [Function unknown].
          Length = 395

 Score = 32.0 bits (73), Expect = 1.8
 Identities = 54/193 (27%), Positives = 85/193 (44%), Gaps = 26/193 (13%)

Query: 685 YNLSNVGKIPIGLPSPQPPPFYLIPKLMLDG-LFISIVAFSINIS--MASILAKKKYKID 741
             LSN   IP    S        + K    G L  ++V+F   +S   A++LA K     
Sbjct: 195 EALSNPSSIPKQEDSKFAISGKRLLKSSFAGSLAGALVSFLPGVSPAQAALLASK----- 249

Query: 742 SNQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASGVSCVLLVLILLYIGP 801
               L        F +  S I  A+++  SLV L   GK +  SGV+    V I   +G 
Sbjct: 250 ----LTGDDEDRRFITAVSGINTANAVF-SLVALIAIGKPR--SGVA----VAIQRSVGD 298

Query: 802 FFETLPYCVL----TSIVIVAVKGMLNQVKDLKLAFKESYTEACVWCLTFLSVVLLDVDY 857
              TLPY +      +++ VA+   L+ +  LK+ FK  Y++ C   L FL ++   +  
Sbjct: 299 I--TLPYLIFLVLSAALIAVALAPWLSILALLKVRFK-DYSKLCAGVLLFLVILGYLLMG 355

Query: 858 GLGIGVLCSLIFV 870
            +G+GVL + I +
Sbjct: 356 FIGLGVLGTAIAI 368


>gnl|CDD|221739 pfam12730, ABC2_membrane_4, ABC-2 family transporter protein.  This
           family is related to the ABC-2 membrane transporter
           family pfam01061.
          Length = 230

 Score = 31.6 bits (72), Expect = 2.0
 Identities = 32/193 (16%), Positives = 71/193 (36%), Gaps = 37/193 (19%)

Query: 629 NYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTE------MIVIVAGALLSSILDVK 682
           + +   +++ +++ L++  +     I+  +S PI         ++V++  +LL+S+L   
Sbjct: 53  SALFLPLLIAILASLLVSREFKNGTIKLLLSLPISRGKIFLAKLLVLLILSLLASLL--- 109

Query: 683 HKYNLSNV--GKIPIGLPSPQPPPFYLIPKLMLDGLFISIV--------------AFSIN 726
               L ++  G +  G           +  L+L  L   ++              A +I 
Sbjct: 110 --LLLLSLIAGLLLGGSGFSLSLLLGALLLLLLLSLLALLLILLILFLSLLLRNSAVAIG 167

Query: 727 ISMASILAKKKYKIDSNQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASG 786
           I +  IL            LL   +       +S +P    LS SL    +     +   
Sbjct: 168 IGLLLILL----------GLLLGSLPLWLLFPWSYLPLIVLLSLSLNGTLLSSGLSIIIA 217

Query: 787 VSCVLLVLILLYI 799
           +  +L++ I+  I
Sbjct: 218 LIVILILFIIFLI 230


>gnl|CDD|212083 cd11514, SLC6sbd_DAT1, Na(+)- and Cl(-)-dependent dopamine
           transporter 1; solute-binding domain.  DAT1 (also called
           DAT), is a plasma membrane transport protein that
           functions at the dopaminergic synapses to transport
           dopamine from the extracellular space back into the
           presynaptic nerve terminal. Human DAT1 is encoded by the
           SLC6A3 gene, and is expressed in the brain. DAT1 may
           play a role in diseases or disorders related to
           dopaminergic neurons, including attention-deficit
           hyperactivity disorder (ADHD), Tourette syndrome,
           Parkinson's disease, alcoholism, drug abuse,
           schizophrenia, extraversion, and risky behavior. This
           subgroup belongs to the solute carrier 6 (SLC6)
           transporter family.
          Length = 555

 Score = 31.8 bits (72), Expect = 2.3
 Identities = 34/144 (23%), Positives = 62/144 (43%), Gaps = 28/144 (19%)

Query: 741 DSNQELLASGVSNIFASFFSCIPFASSLSRSLVQL-QVGGKTQLASG----VSCVLLVLI 795
             +    +SG+++ + +     P A    R ++ L +  G   L        SC+++V++
Sbjct: 135 PGDSSGDSSGLNDTYKT----TPAAEYFERGVLHLHESHGIHDLGPPRWQLTSCLVVVIV 190

Query: 796 LLYIGPF------------FETLPYCVLTSIVI--VAVKGMLNQVKD-LKLAFKESYTEA 840
           LLY   +              T+PY VLT++++  V + G ++ +K  L + F     EA
Sbjct: 191 LLYFSLWKGVKTSGKVVWITATMPYVVLTALLLRGVTLPGAIDGIKAYLSVDFLRLC-EA 249

Query: 841 CVWCLTFLSVVLLDVDYGLGIGVL 864
            VW      +       G+G GVL
Sbjct: 250 SVWIDAATQICF---SLGVGFGVL 270


>gnl|CDD|221791 pfam12822, DUF3816, Protein of unknown function (DUF3816).  This
           family of proteins is functionally uncharacterized but
           are likely to be membrane transporters. This family of
           proteins is found in bacteria and archaea. Proteins in
           this family are typically between 177 and 208 amino
           acids in length. A subset of this family is associated
           with the TM1506 proteins. In this context, transport
           through the channel is predicted to be regulated by the
           TM1506 protein by either regulating redox potential or
           modification of substrates.
          Length = 168

 Score = 30.7 bits (70), Expect = 2.4
 Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 22/164 (13%)

Query: 568 ELGSLSVLMSDI--MISGFTTGTAILVILSQIKHVFGIKVKRHIGPFNVIYTLIDVAENI 625
               L +  S I  ++  F  G     ++  +  +    +    GPF ++  L       
Sbjct: 20  PFPPLKIDFSHIPVLLGAFLLGPVAGALIGLLTSLLSF-LLFGGGPFALVGPLA------ 72

Query: 626 HKTNYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSILDVKHKY 685
                    ++  L++ LI      K+K  K     I   ++  + G L++++L++    
Sbjct: 73  ----NFLPRILFGLIAGLIYKKLRKKTK--KRAVLAI---ILGTILGTLVATLLNLGLIL 123

Query: 686 NL-SNVGKIPIGLPSPQPPPFYLIPKLM---LDGLFISIVAFSI 725
            L +    +PI           L+P L    + G+  SI+ + +
Sbjct: 124 PLYAKFLGMPISAIVGALFAAVLLPVLPFNLIKGIIASIIVYLL 167


>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl
          carrier protein subunit; Validated.
          Length = 130

 Score = 30.2 bits (68), Expect = 2.4
 Identities = 10/40 (25%), Positives = 14/40 (35%), Gaps = 1/40 (2%)

Query: 47 APVAPTPVVPAGVTPAIIGSAKIPPLMPPAP-PSTTAEKD 85
          AP           TP  + +   P +   AP P+  A  D
Sbjct: 23 APAQAAAPAQPASTPVPVPTEASPQVEAQAPQPAAAAGAD 62


>gnl|CDD|225183 COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain
           an N-terminal double-glycine peptidase domain [Defense
           mechanisms].
          Length = 709

 Score = 31.8 bits (73), Expect = 2.4
 Identities = 6/87 (6%), Positives = 27/87 (31%), Gaps = 13/87 (14%)

Query: 581 ISGFTTGTAILVILSQIKHVFGIKVKRHIGPF--NVIYTLIDVAENIHKTNYVAFGVVVI 638
           +  +      +++ S +  +  +       P    ++   +    +      +A G+++ 
Sbjct: 149 LFKYRRLLFEVLLASLLLQLLAL-----ATPLFSQIVIDKVLPDASRSTLTVLAIGLLLA 203

Query: 639 LVSVLI------IYNDHFKSKIQKNIS 659
            +   +          H   ++   +S
Sbjct: 204 ALFEALLRLLRTYLIAHLGKRLDLELS 230


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 31.4 bits (72), Expect = 2.8
 Identities = 32/182 (17%), Positives = 57/182 (31%), Gaps = 45/182 (24%)

Query: 312 FLTDKERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEA 371
           +  +K  ++  R+  R   KEE+E+    L          +            D  + + 
Sbjct: 158 YQREKAEREEEREAERRERKEEKEREVARLR---------AQQEEAEDEREELDELRADL 208

Query: 372 HVRAQMAK-RQKDHEEANASRKLTVE-------QKREKKIRKIKEDTSLEVHVALRISNL 423
           +      K RQK+ EEA   R+   E       Q  EK+ R  +E               
Sbjct: 209 YQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEER-------------- 254

Query: 424 MRVLGTEAVQDPTKMEAHVRAQ--------MAKRQKDHEEANASRKLTVEQKREKKIRKI 475
                 E   +  +M                 +R K  E      +  +E+K E++  + 
Sbjct: 255 -----AEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQ-IEEKEERRAAER 308

Query: 476 KE 477
           +E
Sbjct: 309 EE 310



 Score = 30.3 bits (69), Expect = 6.1
 Identities = 24/182 (13%), Positives = 64/182 (35%), Gaps = 33/182 (18%)

Query: 313 LTDKERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAH 372
              +ER+++     R   ++E E            K      +R    E + +  +    
Sbjct: 93  ERLQEREQMDEIIERIQEEDEAEAQE---------KREKQKKLR----EEIDEFNEERIE 139

Query: 373 VRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLGTEAV 432
            + +  +R+++ E      +    ++ E++  + +E          R    +R    EA 
Sbjct: 140 RKEEEKEREREEELKILEYQREKAEREEEREAERRERK----EEKEREVARLRAQQEEAE 195

Query: 433 QDPTKMEAHV----------------RAQMAKRQKDHEEANASRKLTVEQKREKKIRKIK 476
            +  +++                   + +  KR++  +E   +R+  +E+K E+   +  
Sbjct: 196 DEREELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERA 255

Query: 477 ED 478
           E+
Sbjct: 256 EE 257


>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region.  The
           timeless (tim) gene is essential for circadian function
           in Drosophila. Putative homologues of Drosophila tim
           have been identified in both mice and humans (mTim and
           hTIM, respectively). Mammalian TIM is not the true
           orthologue of Drosophila TIM, but is the likely
           orthologue of a fly gene, timeout (also called tim-2).
           mTim has been shown to be essential for embryonic
           development, but does not have substantiated circadian
           function. Some family members contain a SANT domain in
           this region.
          Length = 507

 Score = 31.6 bits (71), Expect = 2.9
 Identities = 48/270 (17%), Positives = 101/270 (37%), Gaps = 18/270 (6%)

Query: 143 ITGKQVQLTHVVPTLKANIRAKKREEFHEKLKEKPTEDLSESKFFDPRISAKSSVRTKRA 202
           ITGK+ +   V   L+  + ++KRE    K + K  +    +         +     +  
Sbjct: 198 ITGKRSRAQVVDKLLQLGLVSEKRELRGGKKRRKKLKPKQPNGEESGEDDFQEDPEEEEQ 257

Query: 203 LKFHEPGKFQQLAEQLRVKAQLAKLQADISKNARKTGISSATKLALIAPKMEDDQDEMPE 262
           L   +P + ++     +V+        ++ +  ++   S    L  +   +    D+  E
Sbjct: 258 LPESKPEETEKRVSAFQVEGSTLISAENLRQQLKQEKTSWP--LLWLQSCLIRAADDREE 315

Query: 263 VEWWDAVIMVEETYEKENNIKTSAITNLVEHPIQMKPP-SDMAKPVYMPVFLTDKERKKL 321
            E   AV +V  T E E  ++      L++  + ++PP S       +P  L+  +   L
Sbjct: 316 DECDQAVPLVPLTEENEEAMENEQFQRLLKA-LGLRPPRSGQEGFWRIPAKLSSTQ---L 371

Query: 322 RRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQ 381
           RR+    + +EE+ +  L  +   E              E+  +   +  + R + A   
Sbjct: 372 RRRAASLSGEEEEPEDELKDDVDGEQ-----------ADESEHETLALRKNARQRKAGLA 420

Query: 382 KDHEEANASRKLTVEQKREKKIRKIKEDTS 411
              EEA    +     K+++  +K K+ T 
Sbjct: 421 SPEEEALGEEEQKAPPKKKQLNQKNKQQTG 450


>gnl|CDD|113413 pfam04642, DUF601, Protein of unknown function, DUF601.  This
           family represents a conserved region found in several
           uncharacterized plant proteins.
          Length = 311

 Score = 31.2 bits (70), Expect = 2.9
 Identities = 19/77 (24%), Positives = 30/77 (38%), Gaps = 4/77 (5%)

Query: 112 KPSVPGAKPPGAPATDKPT----PLILDSEGRTIDITGKQVQLTHVVPTLKANIRAKKRE 167
           KPS    K P AP ++  T        D   R  D   KQ +     P   +  R ++++
Sbjct: 30  KPSTLAGKNPDAPTSESRTPSKATSSKDPSKRYADKKRKQSEKDARSPPRSSRPRTEEKD 89

Query: 168 EFHEKLKEKPTEDLSES 184
               + KEK  +  S+ 
Sbjct: 90  AGPSQQKEKGKKGDSQD 106


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 31.4 bits (72), Expect = 2.9
 Identities = 23/109 (21%), Positives = 32/109 (29%), Gaps = 6/109 (5%)

Query: 25  MMA-HAQKMIEERKKALLALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAE 83
           M+A H    + E +    +    A    T    A V P    +   PP    AP    A 
Sbjct: 356 MLAFHPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPP--ASAPQQAPAV 413

Query: 84  KDNATYEKAMKIAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTPL 132
               T  + +     + Q+Q        K S P A     P       L
Sbjct: 414 PLPETTSQLL---AARQQLQRAQGATKAKKSEPAAASRARPVNSALERL 459


>gnl|CDD|237539 PRK13878, PRK13878, conjugal transfer relaxase TraI; Provisional.
          Length = 746

 Score = 31.6 bits (72), Expect = 3.0
 Identities = 18/70 (25%), Positives = 32/70 (45%), Gaps = 6/70 (8%)

Query: 338 RLG-LEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVE 396
           RLG  EP PE + +          +  +DP ++  +     AK + + +   A+R   +E
Sbjct: 248 RLGPFEPSPERQAQTKA-----KRQYRKDPIRLRVNTVELYAKYKAEQQTLTAARAQALE 302

Query: 397 QKREKKIRKI 406
           + R +K R I
Sbjct: 303 RARRRKDRLI 312


>gnl|CDD|227532 COG5207, UBP14, Isopeptidase T [Posttranslational modification,
           protein turnover, chaperones].
          Length = 749

 Score = 31.6 bits (71), Expect = 3.2
 Identities = 17/67 (25%), Positives = 30/67 (44%), Gaps = 1/67 (1%)

Query: 395 VEQKREKKIR-KIKEDTSLEVHVALRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDH 453
           VE+   KKI+ K  E +  +V    R+   +    T++    T +E   R ++A +  D 
Sbjct: 83  VEENNNKKIKLKNAELSEEDVEELRRVPRTVLSKSTDSDISSTALEKIERMEIAIKPGDE 142

Query: 454 EEANASR 460
           +E    R
Sbjct: 143 DERVLWR 149


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 31.2 bits (71), Expect = 3.2
 Identities = 18/98 (18%), Positives = 27/98 (27%), Gaps = 13/98 (13%)

Query: 46  TAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAP-----PSTTAEKDNATYEKAMKIAQLQA 100
            AP A      A    A       P    PA      P+  A       + A +   +  
Sbjct: 405 AAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPV 464

Query: 101 QIQSKLSTGVLKPSVPGAKPPGAPATDKPTPLILDSEG 138
           ++  + +      +     P  APA      L    EG
Sbjct: 465 RVAPEPA-----VASAAPAPAAAPA---AARLTPTEEG 494



 Score = 30.5 bits (69), Expect = 6.3
 Identities = 18/76 (23%), Positives = 21/76 (27%), Gaps = 1/76 (1%)

Query: 56  PAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQSKLSTGVLKPSV 115
           PA    A   + K  P  P A     A    A    A   A   A   +  +     P  
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAP-AAAASAPAAPPAAAPPA 424

Query: 116 PGAKPPGAPATDKPTP 131
           P A P  A     P  
Sbjct: 425 PVAAPAAAAPAAAPAA 440


>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
          Length = 328

 Score = 31.1 bits (70), Expect = 3.3
 Identities = 19/68 (27%), Positives = 24/68 (35%), Gaps = 1/68 (1%)

Query: 35  ERKKALLALKGTAPVAPTPVVPAGVTPAIIGSAKIP-PLMPPAPPSTTAEKDNATYEKAM 93
            RK+A++  K     A T   P   TP     A        PA  ST A K+ AT     
Sbjct: 157 ARKQAVIEPKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQ 216

Query: 94  KIAQLQAQ 101
             +  Q  
Sbjct: 217 TASPAQTT 224


>gnl|CDD|221313 pfam11917, DUF3435, Protein of unknown function (DUF3435).  This
           family of proteins are functionally uncharacterized.
           This protein is found in eukaryotes. Proteins in this
           family are typically between 435 to 791 amino acids in
           length. This family is related to pfam00589 suggesting
           it may be an integrase enzyme.
          Length = 418

 Score = 31.2 bits (71), Expect = 3.6
 Identities = 23/110 (20%), Positives = 39/110 (35%), Gaps = 13/110 (11%)

Query: 376 QMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLGTEAVQDP 435
           + A R     +    R LT EQK    + +  E   L          ++ + G   V   
Sbjct: 259 RAATRMSRTRDPRRPRDLTDEQKAS--VEEDPELQELIRKRDHLKKEIIALYGQ--VAKA 314

Query: 436 TKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHV 485
                + R +  +R    E  N  ++L     R +  +KI+E+   E  V
Sbjct: 315 KGTPLYERLEKRRR----EVRNERQRL-----RRELKKKIREEFDEEQPV 355


>gnl|CDD|146548 pfam03967, PRCH, Photosynthetic reaction centre, H-chain N-terminal
           region.  The family corresponds the N-terminal
           cytoplasmic domain.
          Length = 135

 Score = 29.9 bits (68), Expect = 3.7
 Identities = 11/32 (34%), Positives = 16/32 (50%), Gaps = 4/32 (12%)

Query: 321 LRRQNRREAWKEEQEK----IRLGLEPPPEPK 348
           LRR+++RE +  E +        G  P P PK
Sbjct: 30  LRREDKREGYPLESDGGGRLPNEGPPPLPSPK 61


>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
          Length = 400

 Score = 30.9 bits (70), Expect = 4.0
 Identities = 24/89 (26%), Positives = 31/89 (34%), Gaps = 10/89 (11%)

Query: 60  TPAIIGSAKIPPLMPPAPPSTTAEKDNA-----TYEKAMKIAQLQAQIQSKLSTGVLKPS 114
            P    +A +PP     P S  AE + A     T  KA        + Q    T   +P+
Sbjct: 232 APWQGDAAPVPPSEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPA 291

Query: 115 VPGAKPPGAPAT-----DKPTPLILDSEG 138
              A    A A      DKP PL +   G
Sbjct: 292 PKEAAKAAAAAAKPALEDKPRPLGIARPG 320


>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1.  The
           overall function of the full-length Med25 is efficiently
           to coordinate the transcriptional activation of RAR/RXR
           (retinoic acid receptor/retinoic X receptor) in higher
           eukaryotic cells. Human Med25 consists of several
           domains with different binding properties, the
           N-terminal, VWA, domain, this SD1 - synapsin 1 - domain
           from residues 229-381, a PTOV(B) or ACID domain from
           395-545, an SD2 domain from residues 564-645 and a
           C-terminal NR box-containing domain (646-650) from
           646-747. This The function of the SD domains is unclear.
          Length = 168

 Score = 30.2 bits (67), Expect = 4.0
 Identities = 25/102 (24%), Positives = 38/102 (37%), Gaps = 18/102 (17%)

Query: 48  PVAPTPVVPAGVTPAIIGSAKIPPLMPPAPP------STTAEKDNATYEKAMKIAQLQAQ 101
              P  + PA V P     A   PL P  PP      + +    +   +KA++ A+ Q Q
Sbjct: 14  SKQPVSLPPAAVLPPQSLPAPQNPLPPVTPPQMQVPQNVSLHAAHDAAQKAVEAAKNQKQ 73

Query: 102 -IQSKLSTGVLKPSVPGAK-----------PPGAPATDKPTP 131
            ++++ S        P              PPG P   KP+P
Sbjct: 74  GLKNRFSPITPLQQAPIVGPPFSQAPAPVLPPGPPGAPKPSP 115


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 31.0 bits (70), Expect = 4.1
 Identities = 23/89 (25%), Positives = 28/89 (31%), Gaps = 5/89 (5%)

Query: 46  TAPVA---PTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQI 102
            APVA   P    PA   PA       P   P A  +  A    A   ++     L A  
Sbjct: 379 AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAA-APARRSPAPEALAAAR 437

Query: 103 QSKLSTGVLKPSVPGAKPPGAPATDKPTP 131
           Q+        P+ P   P  APA      
Sbjct: 438 QASARGPGGAPA-PAPAPAAAPAAAARPA 465


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 31.1 bits (71), Expect = 4.1
 Identities = 12/85 (14%), Positives = 19/85 (22%)

Query: 47  APVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQSKL 106
           A  AP              +A  P   P   P+        +           +   +  
Sbjct: 402 AAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAA 461

Query: 107 STGVLKPSVPGAKPPGAPATDKPTP 131
            +    P+   A  P A     P  
Sbjct: 462 PSAQPAPAPAAAPEPTAAPAPAPPA 486



 Score = 31.1 bits (71), Expect = 4.2
 Identities = 21/87 (24%), Positives = 29/87 (33%), Gaps = 1/87 (1%)

Query: 45  GTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQS 104
           G AP AP P        A  G+A      P    +  A + +    +  + AQ  +    
Sbjct: 675 GAAPAAPPPAPAPAAPAAPAGAAP-AQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSP 733

Query: 105 KLSTGVLKPSVPGAKPPGAPATDKPTP 131
                V  P  P   P  A A  +P P
Sbjct: 734 AADDPVPLPPEPDDPPDPAGAPAQPPP 760


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 30.8 bits (70), Expect = 4.5
 Identities = 66/368 (17%), Positives = 133/368 (36%), Gaps = 73/368 (19%)

Query: 146 KQVQLTHVVPTLKANIRAKKREEFH-EKLKEKPTEDLSESKFFDPRISAKSSVRTKRALK 204
           K + +T V    +  +  ++ E      L EKP EDL    F       +++++ KR  K
Sbjct: 10  KNIDITDV----EQFLEEQRDERITGGVLAEKPDEDL----FVVDTTGDEATIKKKRRKK 61

Query: 205 FHEPGK-FQQLAEQLRVKAQLAKL-QADISKNARKTGISSATKLALIAPKMEDDQDEMPE 262
             +P K  + L  + +V A   K          +K+   S  +L  +  K+   +    +
Sbjct: 62  LKKPLKSDEILENRSKVPAPSVKKRHVKKKSKKKKSKQVSRKELLRL-KKLARGKKGG-K 119

Query: 263 VEWWDAVIMVEETYEKENNIKTSAITN--LVEHPI------QMK-PPSDMAKPV------ 307
            +  +A     + + +E    T       L EH +       +K  P  +A         
Sbjct: 120 AKLLNASKDNYDLWGEEEPPDTPKWPKKDLAEHTLEQVAKKTVKTAPISLAANGKPVPAV 179

Query: 308 --------YMPVF-----LTDKERKK-LRRQNRREAWKEEQEKIRLGLEPPPEPKLRISN 353
                   Y P F     L  +E +K ++ + +R+  +  +EK    + P       +S 
Sbjct: 180 EVPHAGASYNPSFEDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSE 239

Query: 354 LMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANAS-------RKLTVEQKREKKIRKI 406
            +     E   D  + E+      +  +    E           RK   ++ +EK+ +++
Sbjct: 240 GLL----EESDDDGEEES---DDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKEL 292

Query: 407 KEDTSLEVHVALRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQ 466
           + +   E  +  +++ L R+                  ++AK     E+A A +K   ++
Sbjct: 293 EREAKEEKQLKKKLAQLARL-----------------KEIAKEVAQKEKARARKKEQRKE 335

Query: 467 KREKKIRK 474
           + EKK  K
Sbjct: 336 RGEKKKLK 343


>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal.  Mediator is a
           large complex of up to 33 proteins that is conserved
           from plants to fungi to humans - the number and
           representation of individual subunits varying with
           species. It is arranged into four different sections, a
           core, a head, a tail and a kinase-activity part, and the
           number of subunits within each of these is what varies
           with species. Overall, Mediator regulates the
           transcriptional activity of RNA polymerase II but it
           would appear that each of the four different sections
           has a slightly different function. Mediator subunit
           Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
           transcriptional co-repressor.
          Length = 381

 Score = 30.8 bits (69), Expect = 4.6
 Identities = 24/100 (24%), Positives = 28/100 (28%), Gaps = 11/100 (11%)

Query: 42  ALKGTAPVAPTPVVPAGV--------TPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAM 93
           A    A  A  P    GV          A   +        P  P  T +   A   KA 
Sbjct: 153 AKVLKANAASAPNTTTGVGSAATTAAISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQ 212

Query: 94  KIAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTPLI 133
             AQ QAQ  +    G L                 PTPL+
Sbjct: 213 ASAQAQAQASAYNQMGSLGVPQNT---SMLAQIPNPTPLM 249


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 31.0 bits (70), Expect = 5.0
 Identities = 16/93 (17%), Positives = 27/93 (29%), Gaps = 3/93 (3%)

Query: 37  KKALLALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPP--STTAEKDNATYEKAMK 94
           + A        P        AG   A   +A        APP         +   + A  
Sbjct: 386 RAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADG 445

Query: 95  IAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATD 127
            A + A+  ++ S    +     A+PP    + 
Sbjct: 446 DAPVPAKANARASA-DSRCDERDAQPPADSGSA 477


>gnl|CDD|177816 PLN02156, PLN02156, gibberellin 2-beta-dioxygenase.
          Length = 335

 Score = 30.4 bits (68), Expect = 5.1
 Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 8/91 (8%)

Query: 274 ETYEKENNIKTSAITNLVEHPIQMKPPSDMAKPVYMPVFLTDKERKKLRRQNRREAWKEE 333
           E Y KE    +S +  +VE  ++++P   ++K       +  KE     R N     +E 
Sbjct: 138 EEYMKEMKRMSSKVLEMVEEELKIEPKEKLSK------LVKVKESDSCLRMNHYPEKEET 191

Query: 334 QEKIRLGLEPPPEPKLRISNLMRVLGTEAVQ 364
            EK+ +G     +P+L IS L+R   T  +Q
Sbjct: 192 PEKVEIGFGEHTDPQL-IS-LLRSNDTAGLQ 220


>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP).  This family
           contains bacterial fibronectin-attachment proteins
           (FAP). Family members are rich in alanine and proline,
           are approximately 300 long, and seem to be restricted to
           mycobacteria. These proteins contain a
           fibronectin-binding motif that allows mycobacteria to
           bind to fibronectin in the extracellular matrix.
          Length = 297

 Score = 30.2 bits (68), Expect = 5.4
 Identities = 27/105 (25%), Positives = 30/105 (28%), Gaps = 22/105 (20%)

Query: 39  ALLALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMP-PAPPSTTAEKDNATYEKAMKIAQ 97
             +AL  TA   P P  P   T A   +   PP  P PA P      D            
Sbjct: 29  VAIALPATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPP----- 83

Query: 98  LQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTPLILDSEGRTID 142
                          P+ P A PP  P  D   P     E   ID
Sbjct: 84  --------------PPADPNAPPP--PPVDPNAPPPPAPEPGRID 112


>gnl|CDD|150430 pfam09755, DUF2046, Uncharacterized conserved protein H4 (DUF2046).
            This is the conserved N-terminal 350 residues of a
           family of proteins of unknown function possibly
           containing a coiled-coil domain.
          Length = 308

 Score = 30.3 bits (68), Expect = 5.5
 Identities = 21/94 (22%), Positives = 36/94 (38%), Gaps = 19/94 (20%)

Query: 322 RRQNRREAWKEE-QEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRA----- 375
           +R ++ EA K   QEK+   +  PP P+   +         A      + +H+ +     
Sbjct: 185 KRMDKLEAEKRMLQEKLDQPVSEPPSPRDIFTE--------AEDTVGGIASHITSLRSEV 236

Query: 376 -----QMAKRQKDHEEANASRKLTVEQKREKKIR 404
                 +AK + D+ E          Q RE+ IR
Sbjct: 237 RRLRQNLAKSEADYTEKMEQYAKEERQTREENIR 270


>gnl|CDD|233199 TIGR00939, 2a57, Equilibrative Nucleoside Transporter (ENT).
           [Transport and binding proteins, Nucleosides, purines
           and pyrimidines].
          Length = 437

 Score = 30.5 bits (69), Expect = 5.6
 Identities = 14/69 (20%), Positives = 21/69 (30%), Gaps = 3/69 (4%)

Query: 634 GVVVILVSVLI--IYNDHFKSKIQKNISFPIPTEMIVIVAGALLSS-ILDVKHKYNLSNV 690
           GV+  L  +L+    ND    K      F  P  + +I     L    L     Y    +
Sbjct: 150 GVLTSLAMILVKASGNDSHGLKKSALGYFGTPCVVQLICIVCYLLLPKLPFARYYLQKKL 209

Query: 691 GKIPIGLPS 699
            K      +
Sbjct: 210 DKGAGEDET 218


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
           represents a conserved region approximately 60 residues
           long within the eukaryotic targeting protein for Xklp2
           (TPX2). Xklp2 is a kinesin-like protein localised on
           centrosomes throughout the cell cycle and on spindle
           pole microtubules during metaphase. In Xenopus, it has
           been shown that Xklp2 protein is required for centrosome
           separation and maintenance of spindle bi-polarity. TPX2
           is a microtubule-associated protein that mediates the
           binding of the C-terminal domain of Xklp2 to
           microtubules. It is phosphorylated during mitosis in a
           microtubule-dependent way.
          Length = 57

 Score = 27.3 bits (61), Expect = 5.9
 Identities = 11/20 (55%), Positives = 16/20 (80%)

Query: 157 LKANIRAKKREEFHEKLKEK 176
           L ++ RA+KR EF +KL+EK
Sbjct: 1   LHSDERAEKRAEFDKKLEEK 20


>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
          Length = 209

 Score = 29.8 bits (67), Expect = 6.5
 Identities = 23/120 (19%), Positives = 35/120 (29%), Gaps = 10/120 (8%)

Query: 18  PQNQIKNMMAHAQKMIEERKKALL-ALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPA 76
           P+  ++  M   ++   E K  L  A   + P           TP         P  P  
Sbjct: 74  PRTSLRQAMNPLRQAGNEIKSDLQKATSMSEPATENKP-AEVTTPVEPMGLPETP--PAV 130

Query: 77  PPSTTAEKDNATYE----KAMKIAQLQAQIQSKLSTGVLKP--SVPGAKPPGAPATDKPT 130
           P    A    A            A+ +A+     +    KP  ++   K     A  KPT
Sbjct: 131 PVPAPAPAVAAAAAQAAAAPKAPAKPRAKSPRPAAKAAPKPTETITAKKAKKTAAAPKPT 190


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 30.3 bits (67), Expect = 7.0
 Identities = 26/98 (26%), Positives = 32/98 (32%), Gaps = 2/98 (2%)

Query: 37  KKALLALKGTAPVAPTPVVPA--GVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMK 94
           K A    K  AP A     PA     PA   +       PPA  +    K  A   KA  
Sbjct: 252 KAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAA 311

Query: 95  IAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTPL 132
                A   +K +    K + P AK    PA     P+
Sbjct: 312 APAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAAAPV 349



 Score = 29.9 bits (66), Expect = 7.6
 Identities = 26/97 (26%), Positives = 31/97 (31%), Gaps = 2/97 (2%)

Query: 37  KKALLALKGTAPVAPTPVVPA--GVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMK 94
           K A    K  AP A     PA     PA   +       PPA  +    K  A   KA  
Sbjct: 224 KAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAA 283

Query: 95  IAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTP 131
                A   +K +    K +   AK   APA     P
Sbjct: 284 PPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPP 320


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 29.8 bits (66), Expect = 7.4
 Identities = 25/84 (29%), Positives = 34/84 (40%), Gaps = 3/84 (3%)

Query: 45  GTAPVAPTP--VVPAGVTPAIIGSAKIPPLMPPAP-PSTTAEKDNATYEKAMKIAQLQAQ 101
           G +P+AP+P    PA   PA   +A  P +  PAP P+            A+  A  +  
Sbjct: 78  GQSPLAPSPACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPA 137

Query: 102 IQSKLSTGVLKPSVPGAKPPGAPA 125
                ST    P+ P   P  APA
Sbjct: 138 PACPPSTRQCPPAPPLPTPKPAPA 161


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 814

 Score = 30.2 bits (68), Expect = 7.6
 Identities = 14/98 (14%), Positives = 31/98 (31%), Gaps = 13/98 (13%)

Query: 46  TAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQSK 105
                 T       TP    + +        P + T   +  +  +  + +   ++    
Sbjct: 32  VTTTPATSTEAEQTTPVESDATEEADNT-ETPVAATTAAEAPSSSETAETSDPTSEATD- 89

Query: 106 LSTGVLKPSVPGAKPPGAPATDKPTPLILDSEGRTIDI 143
            +T     +V         AT+   P+    EG+T+D+
Sbjct: 90  -TTTSEARTVT------PAATETSKPV----EGQTVDV 116


>gnl|CDD|180508 PRK06281, PRK06281, putative monovalent cation/H+ antiporter
           subunit B; Reviewed.
          Length = 154

 Score = 29.0 bits (65), Expect = 7.7
 Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 9/82 (10%)

Query: 569 LGSLSVLMSDIMISGFTTGTAIL---VILSQIKHVFGIKVKRHIGPFNVIYTLIDVAENI 625
           LG +++L   I   G   G A++    IL  +  V+G++      PFN  +  + + E+ 
Sbjct: 19  LGIMTILGGHITPGGGFQGGAMIAAGFILCIV--VYGLEKS----PFNFSHEFMSIMESF 72

Query: 626 HKTNYVAFGVVVILVSVLIIYN 647
               YV  G+  +  S   +YN
Sbjct: 73  GALGYVGLGLCGLFFSGSFLYN 94


>gnl|CDD|150291 pfam09577, Spore_YpjB, Sporulation protein YpjB (SpoYpjB).  These
           proteins are found in the endospore-forming bacteria
           which include Bacillus species. In Bacillus subtilis,
           ypjB was found to be part of the sigma-E regulon.
           Sigma-E is a sporulation sigma factor that regulates
           expression in the mother cell compartment. Null mutants
           of ypjB show a sporulation defect, but this gene is not,
           however, a part of the endospore formation minimal gene
           set.
          Length = 232

 Score = 29.6 bits (67), Expect = 8.4
 Identities = 22/89 (24%), Positives = 34/89 (38%), Gaps = 9/89 (10%)

Query: 149 QLTHVVPTLKANIRAKKREEFHEKLKEKPTEDLSESKFFDPRI----SAKSSVRTKRALK 204
            +      +K   + +  E F EKL E     LS      P +    S K   R    L 
Sbjct: 102 PVMDAFSAMKEAAQKEDSEAFQEKLNEF----LSLYDMIYPSLTIDVSEKQLQRVDSHLS 157

Query: 205 FHEPGKFQQLAEQLRVKAQLAKLQADISK 233
             E  +FQQL+++     +L  L+ D+  
Sbjct: 158 VLENFRFQQLSKK-SKLERLTLLEGDLKA 185


>gnl|CDD|223659 COG0586, DedA, Uncharacterized membrane-associated protein
           [Function unknown].
          Length = 208

 Score = 29.2 bits (66), Expect = 8.8
 Identities = 36/204 (17%), Positives = 73/204 (35%), Gaps = 56/204 (27%)

Query: 615 IYTLIDVAENIHKTNYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVIVAGAL 674
              L+ +        Y+   +++ L S L++               P+P E+++++AGA 
Sbjct: 2   DMELLILWIQEGSLGYLGVFLILFLESGLLVG-------------PPLPGEVLLLLAGA- 47

Query: 675 LSSILDVKHKYNLSNVGKIPIGLPSPQPPPFYLIPKLMLDGLFISIVAFSIN--ISMASI 732
                       L+  GK+ + L         +I    L  L   ++++ I        +
Sbjct: 48  ------------LAAQGKLNLWL---------VILVATLGALLGDLISYWIGRRFGRKLL 86

Query: 733 LAKKKYKIDSNQELLAS-------GVSNIFASFFSCIPFASSLSRSLVQLQVG------G 779
                Y++   ++L  +       G+  IF   F  IP      R+LV +  G       
Sbjct: 87  RKLWSYRLLKRKKLDKAELLFERHGLFAIFLGRF--IPGV----RTLVPIVAGMSKMPLR 140

Query: 780 KTQLASGVSCVLLVLILLYIGPFF 803
           +  L + +  +L  L+L  +G   
Sbjct: 141 RFLLYNILGALLWALVLTLLGYLL 164


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 28.6 bits (64), Expect = 9.2
 Identities = 18/56 (32%), Positives = 20/56 (35%), Gaps = 2/56 (3%)

Query: 28  HAQKMIEERKKALLALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAE 83
           H   M +E +K L  L   AP  P PV P             PP   PA P   A 
Sbjct: 67  HILSMEQEARKILAPLTPPAP--PEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAA 120


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 29.7 bits (67), Expect = 9.4
 Identities = 22/129 (17%), Positives = 31/129 (24%), Gaps = 14/129 (10%)

Query: 46  TAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQSK 105
           T P A      A         +      P AP S T              A +     S 
Sbjct: 384 TQPAAAPQPSAAAAASPSPSQSSAAA-QPSAPQSATQPAGTPPTVSVDPPAAVPVNPPST 442

Query: 106 LSTGVLKPSVPGAKPPGAPATDKPTPLILDSEGRTID--ITGKQVQLTHVVPTLKANIRA 163
                        +P       K     + S G +    I  K  Q T  +         
Sbjct: 443 AP--------QAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIKEA---PTG 491

Query: 164 KKREEFHEK 172
            ++E F E+
Sbjct: 492 TQKEIFTEE 500


>gnl|CDD|216329 pfam01148, CTP_transf_1, Cytidylyltransferase family.  The members
           of this family are integral membrane protein
           cytidylyltransferases. The family includes phosphatidate
           cytidylyltransferase EC:2.7.7.41 as well as Sec59 from
           yeast. Sec59 is a dolichol kinase EC:2.7.1.108.
          Length = 259

 Score = 29.3 bits (66), Expect = 9.6
 Identities = 18/130 (13%), Positives = 50/130 (38%), Gaps = 14/130 (10%)

Query: 780 KTQLASGVSCVLLVLILLYIGPFFETLPYCVLTSIVIVAVKGMLNQVKDLKLAFKESYTE 839
           K ++ + +  +L+ LILL++GP        +L    I+ +   L ++  ++L   +  + 
Sbjct: 1   KQRIITHIVLILIFLILLFLGPL-----IFLLLVAAIIIL--ALYEL--IRLLRLKFSSR 51

Query: 840 ACVWCLTFLSVVLLDVDYGLGIGVLCSLIFVVVTGQKVMVYKLGRLIDSNIYVEEDFYES 899
             +       +  L     +    L +L  +V     ++V  +  L+   +        +
Sbjct: 52  LLLLTRGLGLIFYLLGLLLVYGEGLLALFGLV-----LLVIPVLWLLVLPVLAYHRVSST 106

Query: 900 AIDVPGIVIL 909
            + +  + + 
Sbjct: 107 LLPLFYVGLF 116


>gnl|CDD|225179 COG2270, COG2270, Permeases of the major facilitator superfamily
           [General function prediction only].
          Length = 438

 Score = 29.6 bits (67), Expect = 9.6
 Identities = 15/73 (20%), Positives = 28/73 (38%), Gaps = 9/73 (12%)

Query: 727 ISMASILAKKKYKIDSNQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASG 786
           ++M  +       + S + LL     ++ A+  + I  A  L         G K  L   
Sbjct: 272 LAMGGVFGAADLGLSSTELLLIGIALSVVAALGAII--AGFLDERF-----GSKPVLM-- 322

Query: 787 VSCVLLVLILLYI 799
           +  V+L +  LY+
Sbjct: 323 IGLVILSIAALYL 335


>gnl|CDD|237541 PRK13881, PRK13881, conjugal transfer protein TrbI; Provisional.
          Length = 472

 Score = 29.7 bits (67), Expect = 9.7
 Identities = 22/117 (18%), Positives = 41/117 (35%), Gaps = 3/117 (2%)

Query: 18  PQNQIKNMMAHAQKMIEERKKALLALKG--TAPVAPTPVVPAGVTPAIIGSAKIPPLMPP 75
           P+ +  N    A+++  ++   L+        P  PT    A +  A   +   PP  P 
Sbjct: 63  PKEKAGNTSMFAKEIAGDQTGGLIEPASPLKVPEMPTGPASAPLPIARPDNPDAPPTPPA 122

Query: 76  APPSTTAEKDNATYEKAM-KIAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTP 131
            P +     D+      M K+   +  +++K +  V  P   G+ P G         
Sbjct: 123 NPGNPGQVNDDEAQRIRMAKLQMFEEAVKAKTTVRVDAPRSNGSAPGGPSTYTGTPQ 179


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 30.0 bits (67), Expect = 9.9
 Identities = 24/89 (26%), Positives = 30/89 (33%), Gaps = 10/89 (11%)

Query: 46  TAPVAPTPVVPAGVTPAIIGSAKIP-----PLMPPAPPSTTAEKDNATYEKAMKIAQLQA 100
             P APTP+ P    P   G A+ P        PPA     A    A   +A   A    
Sbjct: 696 PPPRAPTPMRPPAAPP---GRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPG 752

Query: 101 QIQSKLSTGVLKPSVPGAKPPGAPATDKP 129
           + +          + P A  PGAP    P
Sbjct: 753 RARP--PAAAPGRARPPAAAPGAPTPQPP 779


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.134    0.383 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 56,945,342
Number of extensions: 5767789
Number of successful extensions: 7706
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7506
Number of HSP's successfully gapped: 175
Length of query: 1124
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 1017
Effective length of database: 6,191,724
Effective search space: 6296983308
Effective search space used: 6296983308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.5 bits)