RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16876
(1124 letters)
>gnl|CDD|219911 pfam08572, PRP3, pre-mRNA processing factor 3 (PRP3). Pre-mRNA
processing factor 3 (PRP3) is a U4/U6-associated
splicing factor. The human PRP3 has been implicated in
autosomal retinitis pigmentosa.
Length = 222
Score = 263 bits (673), Expect = 8e-81
Identities = 110/222 (49%), Positives = 159/222 (71%), Gaps = 11/222 (4%)
Query: 183 ESKFFDPRIS-AKSSVRTKRALKFHEPGKFQQLAEQLRVKAQLAKLQADISKNARKTGIS 241
E+ + DPR++ AK R +R L+F E GKF Q AE+LR +A+L + +A IS+ A+ G+
Sbjct: 1 ENPYLDPRMAEAKRKKRRRRGLQFIEKGKFSQQAEELRREAKLEEEEARISEEAQNAGLK 60
Query: 242 SATKLALIAPKMEDDQDEMPEVEWWDAVIM-------VEETYEKENNIKTSAITNLVEHP 294
SAT LA PK E D++P++EWWD I+ +E+ +++ + + S+ITN +EHP
Sbjct: 61 SATDLAEKIPKKEL-PDDVPDIEWWDMPILDDPDYNDLEDESDEDIDEEMSSITNYIEHP 119
Query: 295 IQMKPPSDMAKPVYM--PVFLTDKERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRIS 352
+ ++PP + P + P++LT KE+KKLRRQ R+E KE+QE+IRLGLEPPP PK+++S
Sbjct: 120 VPIEPPWEAHYPPPIRPPLYLTKKEQKKLRRQRRKEKRKEKQERIRLGLEPPPPPKVKLS 179
Query: 353 NLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLT 394
NLMRVLG+EAVQDPTK+EA VR Q+A+R++ HE+ N RKLT
Sbjct: 180 NLMRVLGSEAVQDPTKVEAEVRRQVAERRQAHEDRNEERKLT 221
>gnl|CDD|162054 TIGR00815, sulP, high affinity sulphate transporter 1. The SulP
family is a large and ubiquitous family with over 30
sequenced members derived from bacteria, fungi, plants
and animals. Many organisms including Bacillus subtilis,
Synechocystis sp, Saccharomyces cerevisiae, Arabidopsis
thaliana and Caenorhabditis elegans possess multiple
SulP family paralogues. Many of these proteins are
functionally characterized, and all are sulfate uptake
transporters. Some transport their substrate with high
affinities, while others transport it with relatively
low affinities. Most function by SO42- :H+symport, but
SO42- :HCO3- antiport has been reported for the rat
protein (spP45380). The bacterial proteins vary in size
from 434 residues to 566 residues with one exception, a
Mycobacterium tuberculosis protein with 784 residues.
The eukaryotic proteins vary in size from 611 residues
to 893 residues with one exception, a protein designated
"early nodulin 70 protein" from Glycine max which is
reported to be of 485 residues. Thus, the eukaryotic
proteins are almost without exception larger than the
prokaryotic proteins. These proteins exhibit 10-13
putative transmembrane a-helical spanners (TMSs)
depending on the protein. The phylogenetic tree for the
SulP family reveals five principal branches. Three of
these are bacterial specific as follows: one bears a
single protein from M. tuberculosis; a second bears two
proteins, one from M. tuberculosis, the other from
Synechocystis sp, and the third bears all remaining
prokaryotic proteins. The remaining two clusters bear
only eukaryotic proteins with the animal proteins all
localized to one branch and the plant and fungal
proteins localized to the other. The generalized
transport reactions catalyzed by SulP family proteins
are: (1) SO42- (out) + nH+ (out) --> SO42- (in) + nH+
(in). (2) SO42- (out) + nHCO3- (in) SO42- (in) + nHCO3-
(out) [Transport and binding proteins, Anions].
Length = 563
Score = 257 bits (659), Expect = 1e-74
Identities = 132/417 (31%), Positives = 220/417 (52%), Gaps = 17/417 (4%)
Query: 569 LGSLSVLMSDIMISGFTTGTAILVILSQIKHVFGI---KVKRHIGPFNVIYTLIDVAENI 625
LG L +S +ISGF TG AI + LSQ+K + GI + V+ + N
Sbjct: 118 LGFLIEFLSHAVISGFMTGAAITIGLSQLKGLLGISIFNTRTDT--LGVVISTWAGLPNT 175
Query: 626 HKTNYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSILDVKHKY 685
H N+ V+ +++ + ++Y + +K + P ++V++ L +I K K
Sbjct: 176 HNWNWCTL-VIGLVLLLFLLYTKKLGKRNKKLLFAPAVAPLLVVILATLAVTIGLHK-KQ 233
Query: 686 NLSNVGKIPIGLPS--PQPPPFYLIPKLMLDGLFISIVAFSINISMASILAKKK-YKIDS 742
+S +G IP GL P + L+P L D + I+IV +I++A A+ YKID+
Sbjct: 234 GVSILGHIPSGLSFFPPITLDWELLPTLAPDAIAIAIVGLIESIAIARSFARMTGYKIDA 293
Query: 743 NQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASGVSCVLLVLILLYIGPF 802
NQEL+A G++NI SFFSC P SLSR+ V + G +TQL+ V+ ++++L+LL + P
Sbjct: 294 NQELVAQGIANIVGSFFSCYPATGSLSRTAVNAKAGCRTQLSGVVTAIVVLLVLLVLTPL 353
Query: 803 FETLPYCVLTSIVIVAVKGMLNQVKDLKLAFKESYTEACVWCLTFLSVVLLDVDYGLGIG 862
F +P L +I+I AV+G+++ K+L +K + VW +TF VV ++ GL +G
Sbjct: 354 FYYIPQAALAAIIISAVRGLID-YKELYKLWKADKMDFVVWLVTFFGVVFTSIEIGLLVG 412
Query: 863 VLCSLIFVVVTGQKVMVYKLGRLIDSNIYVEEDFYESAIDVPGIVILRIIGGMNFINKDK 922
V S F+++ + LGR+ + +Y Y +A PGI++ R+ G + F N +
Sbjct: 413 VALSAAFLLLRIARPRGAVLGRVPGTEVYRSIKQYPNARPPPGILVYRVDGPLYFANAED 472
Query: 923 VFHKISKL------SLSSEPYPKQIILDMMSLSSVDTSTVKSFLDLYKELMEQGISL 973
+ ++ K P + +ILDM ++ +DTS + + +L KEL +GI L
Sbjct: 473 LKDRLLKRIEDETRRELERPPLQVVILDMSAVPHLDTSGIHALEELRKELKARGIQL 529
Score = 93.5 bits (233), Expect = 1e-19
Identities = 34/74 (45%), Positives = 47/74 (63%), Gaps = 1/74 (1%)
Query: 1051 PILEWLPNYNWKNDLSHDMVSGFTVAVLHIPQGMAYSSLGGVPPIVGIYMAVFPVIIYMC 1110
P+L WLP+Y K D+++G TV +L IPQ MAY+ L G+ PI G+Y + P IY
Sbjct: 1 PVLRWLPHYRLKK-FKGDLMAGLTVGILLIPQAMAYAILAGLSPIYGLYTSFVPPFIYAL 59
Query: 1111 MGTSRHISMGTFSV 1124
GTSR I++G +V
Sbjct: 60 FGTSRDIAIGPVAV 73
>gnl|CDD|216188 pfam00916, Sulfate_transp, Sulfate transporter family. Mutations
in human SLC26A2 lead to several human diseases.
Length = 279
Score = 212 bits (543), Expect = 4e-62
Identities = 97/280 (34%), Positives = 156/280 (55%), Gaps = 12/280 (4%)
Query: 569 LGSLSVLMSDIMISGFTTGTAILVILSQIKHVFGIKVKRHIGPFNVIYTLIDVAENIHKT 628
LG L +S +ISGF G AIL++LSQ+K + G+ RH G +V+ L D+ +N+
Sbjct: 6 LGFLVEFLSRAVISGFMAGAAILILLSQLKGLLGLSNTRHSGIVSVLRALFDLVDNLWNW 65
Query: 629 NYVAFGVVVILVSVLI-IYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSILDVKHKYNL 687
+ G+ ++ ++I + K I P P ++ ++ ++S I +Y +
Sbjct: 66 PTLVIGLSFLIFLLIIKLLPKRLKKLF--WIPAPAP--LVAVIVATVISYIFL-ADRYGV 120
Query: 688 SNVGKIPIGLPSPQPPPFYLIPKLMLD----GLFISIVAFSINISMASILAKKK-YKIDS 742
S +G+IP GLP P P L L+LD L ++IV +I A AKKK YKIDS
Sbjct: 121 SIIGEIPSGLPPPSLPDVPLDWSLLLDLLPIALALAIVGLLESILTAKSFAKKKGYKIDS 180
Query: 743 NQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASGVSCVLLVLILLYIGPF 802
N+EL+A G++NI +S F P S SRS V ++ G KTQL+ V V+++L+LL++ P
Sbjct: 181 NKELVAQGIANIVSSLFGGYPATGSFSRSAVNVKAGAKTQLSGIVMAVVVLLVLLFLTPL 240
Query: 803 FETLPYCVLTSIVIVAVKGMLNQVKDLKLAFKESYTEACV 842
+P VL +I+IVA+ G+++ +L +K S ++ +
Sbjct: 241 LAYIPMAVLAAIIIVALIGLID-WSELIRLWKLSKSDFLI 279
>gnl|CDD|223732 COG0659, SUL1, Sulfate permease and related transporters (MFS
superfamily) [Inorganic ion transport and metabolism].
Length = 554
Score = 220 bits (563), Expect = 1e-61
Identities = 106/415 (25%), Positives = 192/415 (46%), Gaps = 28/415 (6%)
Query: 569 LGSLSVLMSDIMISGFTTGTAILVILSQIKHVFGIKVKRHIGPFNVIYTLIDVAENIHKT 628
LG L + ++ GFT G AIL+IL+Q+ + G+ K + + +
Sbjct: 119 LGRLIRFIPRPVLIGFTAGIAILIILTQLPVLLGLASKVS----GFWAKVSALFTVLLTI 174
Query: 629 NYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSILDVKHKYNLS 688
N + ++ +++L+ ++ IP+ +I +V G L+ I + +
Sbjct: 175 NLATLLLGLLTLAILLFL---------PRLTPRIPSPLIALVLGTLIVWIFPL----DSL 221
Query: 689 NVGKIPIGLPSPQPP------PFYLIPKLMLDGLFISIVAFSINISMASILAKK-KYKID 741
G+IP LPS P L+ L+ L ++++ ++ A K D
Sbjct: 222 RYGEIPGSLPSGLPHFRLPNVSLSLLLALLPYALALALLGLLESLLTAVSFDGMTGTKHD 281
Query: 742 SNQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASGVSCVLLVLILLYIGP 801
SN+EL+A G++NI + F IP S+SRS + ++ G +T+L+ + LL+L+LL++ P
Sbjct: 282 SNRELIAQGIANIASGLFGGIPATGSISRSAINIKSGARTRLSGIIHAALLLLLLLFLAP 341
Query: 802 FFETLPYCVLTSIVIVAVKGMLNQVKDLKLAFKESYTEACVWCLTFLSVVLLDVDYGLGI 861
+P L +++I+ G+L+ L K E V T L V D+ G+ +
Sbjct: 342 LVSYIPLAALAAVLILVGWGLLDWSLLKPLLRKLPRGELLVLLTTALLTVFFDLVIGVVV 401
Query: 862 GVLCSLIFVVVTGQKVMVYKLGRLIDSNIYVEEDFYESAIDVPGIVILRIIGGMNFINKD 921
G+L + + + + + LGR+ PG+++ R+ G + F N D
Sbjct: 402 GILLACLLFIRRISRPSIVVLGRVPGPAGSDNAL-KPLDEIGPGVLVYRLSGPLFFGNAD 460
Query: 922 KVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKSFLDLYKELMEQGISLHIV 976
++ + L E P+++ILD+ S+ +D S ++ DL KEL +GI L IV
Sbjct: 461 RLERALLGL---IEERPERVILDLKSVPYIDASAAEALEDLIKELERRGIQLLIV 512
Score = 73.5 bits (181), Expect = 3e-13
Identities = 29/73 (39%), Positives = 42/73 (57%), Gaps = 1/73 (1%)
Query: 1048 SLFPILEWLPNYNWKNDLSHDMVSGFTVAVLHIPQGMAYSSLGGVPPIVGIYMAVFPVII 1107
S P L+WLP Y + L D+++G TVA + +P MA++ GVPP G+Y ++ II
Sbjct: 5 SEIPTLKWLPYYFR-SWLRGDLLAGLTVAAVALPLAMAFAIAAGVPPEAGLYASIVAGII 63
Query: 1108 YMCMGTSRHISMG 1120
Y G SR + G
Sbjct: 64 YALFGGSRGLISG 76
>gnl|CDD|205965 pfam13792, Sulfate_tra_GLY, Sulfate transporter N-terminal domain
with GLY motif. This domain is found usually at the
N-terminus of sulfate-transporter proteins. It carries a
highly conserved GLY sequence motif, but the function of
the domain is not known.
Length = 83
Score = 120 bits (305), Expect = 1e-32
Identities = 35/75 (46%), Positives = 52/75 (69%), Gaps = 1/75 (1%)
Query: 1050 FPILEWLPNYNWKNDLSHDMVSGFTVAVLHIPQGMAYSSLGGVPPIVGIYMAVFPVIIYM 1109
PIL+WLP Y+ L D+++G TVA++ IPQGMAY+ L G+PP+ G+Y + P +IY
Sbjct: 1 LPILQWLPKYSRSW-LKGDLIAGLTVALVLIPQGMAYALLAGLPPVYGLYASFVPPLIYA 59
Query: 1110 CMGTSRHISMGTFSV 1124
G+SRH+S+G +
Sbjct: 60 LFGSSRHLSVGPTAA 74
>gnl|CDD|219080 pfam06544, DUF1115, Protein of unknown function (DUF1115). This
family represents the C-terminus of hypothetical
eukaryotic proteins of unknown function.
Length = 127
Score = 103 bits (259), Expect = 6e-26
Identities = 35/88 (39%), Positives = 50/88 (56%), Gaps = 5/88 (5%)
Query: 486 AVYRVDDFSNPSKKFKVETNCNQLFMTGAVVMYKDCNVIVVEGGPKQQSKFKRLMMHRIK 545
VY + SNP K+FK+ N +L +TG ++ ++ VEGG K ++KRLM+ RIK
Sbjct: 1 YVYHIKSLSNPKKRFKILENAKELGLTGFCLIGPKPGLVCVEGGEKAIKEYKRLMLKRIK 60
Query: 546 WEED---MIKSNEGKE--TPNKCVLVWE 568
W E I+ E + + NKC LVWE
Sbjct: 61 WTEPKKISIRHREDEPVPSDNKCSLVWE 88
>gnl|CDD|183265 PRK11660, PRK11660, putative transporter; Provisional.
Length = 568
Score = 78.1 bits (193), Expect = 1e-14
Identities = 92/429 (21%), Positives = 190/429 (44%), Gaps = 69/429 (16%)
Query: 583 GFTTGTAILVILSQIKHVFGIKVKRHIGPFNVIYTLIDVAENIHKTNYVAFGVVVILVSV 642
GFT+G I++ QIK FG+++ P + + + + + + N+ + ++ + V
Sbjct: 138 GFTSGIGIVIATLQIKDFFGLQMAH--VPEHYLEKVGALFQALPTINWGDALIGIVTLGV 195
Query: 643 LIIYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSIL-----DV-----KHKYNLSNVGK 692
LI++ + KI +P + ++AG + +L V + Y L++ G
Sbjct: 196 LILWP---RLKI------RLPGHLPALLAGTAVMGVLNLLGGHVATIGSRFHYVLAD-GS 245
Query: 693 IPIGLPSPQPPPF---YLIP-----KLMLD-GLFISIVAFSINISM---------ASIL- 733
G+P P P F + +P L L +++ + +++M A +L
Sbjct: 246 QGNGIP-PLLPQFVLPWNLPGADGQPFTLSWDLIRALLPAAFSMAMLGAIESLLCAVVLD 304
Query: 734 --AKKKYKIDSNQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASGVSCVL 791
K+ +N EL+ G+ NI A FF I ++++RS ++ G + +++ + +L
Sbjct: 305 GMTGTKH--SANSELVGQGLGNIVAPFFGGITATAAIARSAANVRAGATSPISAVIHALL 362
Query: 792 LVLILLYIGPFFETLPYCVLTSIVIVAVKGMLNQVKDLKLAFKESYTEACVWCLTFLSVV 851
++L LL + P LP + +++++ M K + L + V L V
Sbjct: 363 VLLALLVLAPLLSYLPLSAMAALLLMVAWNMSEAHKVVDLLRHAPKDDIIVMLLCMSLTV 422
Query: 852 LLDVDYGLGIG-VLCSLIFVVVTGQKVMVYKLGRLIDSNIYVEEDFYESAIDVP-GIVIL 909
L D+ + +G VL SL+F+ + + ++ RL ++ DVP +++L
Sbjct: 423 LFDMVIAISVGIVLASLLFM-----RRIA-EMTRLAPISVQ----------DVPDDVLVL 466
Query: 910 RIIGGMNFINKDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKSFLDLYKELMEQ 969
RI G + F +++F ++L +E + ++L ++ +D + +F K + +
Sbjct: 467 RINGPLFFAAAERLF---TELESRTEGK-RIVVLQWDAVPVLDAGGLDAFQRFVKR-LPE 521
Query: 970 GISLHIVKL 978
G L I L
Sbjct: 522 GCELRICNL 530
Score = 40.3 bits (95), Expect = 0.006
Identities = 20/57 (35%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
Query: 1059 YNWKNDLSHDMVSGFTVAVLHIPQGMAYSSLGGVPPIVGIYMAVFPVIIYMCMGTSR 1115
Y + D+++G TV ++ IP MA + GVPP G+Y A I+ G SR
Sbjct: 24 YTAA-RFTRDLIAGITVGIIAIPLAMALAIASGVPPQYGLYTAAVAGIVIALTGGSR 79
>gnl|CDD|132913 cd07042, STAS_SulP_like_sulfate_transporter, Sulphate Transporter
and Anti-Sigma factor antagonist domain of SulP-like
sulfate transporters, plays a role in the function and
regulation of the transport activity, proposed general
NTP binding function. The SulP family is a large and
diverse family of anion transporters, with members from
eubacteria, plants, fungi, and mammals. They contain 10
to 14 transmembrane helices which form the catalytic
core of the protein and a C-terminal extension, the STAS
(Sulphate Transporter and AntiSigma factor antagonist)
domain which plays a role in the function and regulation
of the transport activity. The STAS domain is found in
the C-terminal region of sulphate transporters and
bacterial anti-sigma factor antagonists. It has been
suggested that this domain may have a general NTP
binding function.
Length = 107
Score = 69.6 bits (171), Expect = 3e-14
Identities = 24/89 (26%), Positives = 54/89 (60%), Gaps = 1/89 (1%)
Query: 897 YESAIDVPGIVILRIIGGMNFINKDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTV 956
Y A + PG++I RI G + F N + ++ +L + +P K +ILD+ +++ +D++
Sbjct: 1 YPLAEEPPGVLIYRIDGPLFFGNAEYFKDRLLRL-VDEDPPLKVVILDLSAVNFIDSTAA 59
Query: 957 KSFLDLYKELMEQGISLHIVKLLEPVKQV 985
++ +L K+L ++G+ L++ L V+++
Sbjct: 60 EALEELVKDLRKRGVELYLAGLNPQVREL 88
>gnl|CDD|201948 pfam01740, STAS, STAS domain. The STAS (after Sulphate Transporter
and AntiSigma factor antagonist) domain is found in the
C terminal region of Sulphate transporters and bacterial
antisigma factor antagonists. It has been suggested that
this domain may have a general NTP binding function.
Length = 106
Score = 56.5 bits (137), Expect = 1e-09
Identities = 25/83 (30%), Positives = 48/83 (57%), Gaps = 2/83 (2%)
Query: 903 VPGIVILRIIGGMNFINKDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKSFLDL 962
+PGI+ILR+ G ++F N + ++ + E + +ILD+ + +D+S + + L+L
Sbjct: 7 IPGILILRLDGPLDFANAEAFRERLLRAL--EEGEIRHVILDLSGVPFIDSSGLGALLEL 64
Query: 963 YKELMEQGISLHIVKLLEPVKQV 985
YKEL +G+ L +V V++
Sbjct: 65 YKELRRRGVELVLVGPSPEVRRT 87
>gnl|CDD|216158 pfam00860, Xan_ur_permease, Permease family. This family includes
permeases for diverse substrates such as xanthine,
uracil, and vitamin C. However many members of this
family are functionally uncharacterized and may
transport other substrates. Members of this family have
ten predicted transmembrane helices.
Length = 389
Score = 55.8 bits (135), Expect = 7e-08
Identities = 62/288 (21%), Positives = 111/288 (38%), Gaps = 59/288 (20%)
Query: 571 SLSVLMSDIMISGFTTGTAILVILSQIKHVFGIKVKRHIGPFNVIY--------TLIDVA 622
+L+ L ++++G + ++S ++ R P V +L +A
Sbjct: 88 ALAGLFGAVLVAGV-----LFTLISFTGL--RGRLARLFPP-VVTGPVVLLIGLSLAPIA 139
Query: 623 EN---IHK----------TNYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVI 669
+ + VVV+ V +L+ PI +I I
Sbjct: 140 VKGAGGGWAIADGLTVGLLDLLGLAVVVLAVILLLS-----VFLKGFFRQGPI---LIGI 191
Query: 670 VAGALLSSILDVKHKYNLSNVGKIPIGLPSPQPP-PFYLI-----PKLMLDGLFISIVAF 723
+AG LL+ + N S + P Q P PF P L+L L +++VA
Sbjct: 192 IAGWLLALFMG---IVNFS---PEVMDAPWFQLPHPFPFGTPLFNPGLILTVLAVALVAI 245
Query: 724 SINISMASILAK-----KKYKIDSNQELLASGVSNIFASFFSCIPFAS-SLSRSLVQL-Q 776
+ +AK K K D + LLA G++ + + F P + + + +V L +
Sbjct: 246 VESTGDIRAVAKVSGRDLKPKPDLRRGLLADGLATLLSGLFGAFPTTTYAENIGVVALTK 305
Query: 777 VG-GKTQLASGVSCVLLVLILLYIGPFFETLPYCVLTSIVIVAVKGML 823
V + + +GV +LL LI F ++P VL +++V GM+
Sbjct: 306 VYSRRVGVTAGVILILLGLI-PKFAALFSSIPSPVLGGVMLVMF-GMI 351
>gnl|CDD|129473 TIGR00377, ant_ant_sig, anti-anti-sigma factor. This superfamily
includes small (105-125 residue) proteins related to
SpoIIAA of Bacillus subtilis, an anti-anti-sigma factor.
SpoIIAA can bind to and inhibit the anti-sigma F factor
SpoIIAB. Also, it can be phosphorylated by SpoIIAB on a
Ser residue at position 59 of the seed alignment. A
similar arrangement is inferred for RsbV, an
anti-anti-sigma factor for sigma B. This Ser is fairly
well conserved within a motif resembling
MXS[STA]G[VIL]X[VIL][VILF] among homologous known or
predicted anti-anti-sigma factors. Regions similar to
SpoIIAA and apparently homologous, but differing
considerably near the phosphorylated Ser of SpoIIAA,
appear in a single copy in several longer proteins
[Regulatory functions, Protein interactions].
Length = 108
Score = 40.3 bits (95), Expect = 5e-04
Identities = 20/81 (24%), Positives = 42/81 (51%), Gaps = 2/81 (2%)
Query: 905 GIVILRIIGGMNFINKDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKSFLDLYK 964
G+VI+R+ G ++ + K++ + + P + I+LD+ L +D+S + L YK
Sbjct: 12 GVVIVRLSGELDAHTAPLLREKVTPAAERTGP--RPIVLDLEDLEFMDSSGLGVLLGRYK 69
Query: 965 ELMEQGISLHIVKLLEPVKQV 985
++ G L +V + V ++
Sbjct: 70 QVRRVGGQLVLVSVSPRVARL 90
>gnl|CDD|225711 COG3170, FimV, Tfp pilus assembly protein FimV [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 755
Score = 42.6 bits (100), Expect = 0.001
Identities = 31/125 (24%), Positives = 44/125 (35%), Gaps = 13/125 (10%)
Query: 6 LRVP-EGKFCRSFPQNQIKNMMAHAQKMIEERKKALLALKGTAPVAPTPVVPAGVTPAII 64
LR+P + R PQ + + A E KA + PVA P
Sbjct: 243 LRIPSAAQILRESPQEALAEVKAQTAAFAGEPSKAD---RVGKPVAKAPAK----VAKER 295
Query: 65 GSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQSKLSTGVLKPSVPGAKPPGAP 124
A++P + A+ + A +E A K A L A Q+ L + P A P
Sbjct: 296 ALAELPARVAE----LQAQLNKAQHELAQKAAPLAA-AQAALDAPAETATAPSAPAPQVS 350
Query: 125 ATDKP 129
A P
Sbjct: 351 AESSP 355
>gnl|CDD|234136 TIGR03173, pbuX, xanthine permease. All the seed members of this
model are observed adjacent to genes for either xanthine
phosphoribosyltransferase (for the conversion of
xanthine to guanine, GenProp0696, ) or genes for the
conversion of xanthine to urate and its concomitant
catabolism (GenProp0640, GenProp0688, GenProp0686 and
GenProp0687). A number of sequences scoring higher than
trusted to this model are found in different genomic
contexts, and the possibility exist that these transport
related compounds in addition to or instead of xanthine
itself. The outgroup to this family are sequences which
are characterized as uracil permeases or are adjacent to
established uracil phosphoribosyltransferases.
Length = 406
Score = 42.1 bits (100), Expect = 0.001
Identities = 49/216 (22%), Positives = 94/216 (43%), Gaps = 34/216 (15%)
Query: 631 VAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSILDVKHKYNLSNV 690
+A LV +L++ F ++I+ ++ +V G ++++ L + S V
Sbjct: 151 LALLT---LVIILLLN--RFGKGFLRSIAV-----LLGLVVGTIVAAALG---MVDFSGV 197
Query: 691 GKIP-IGLPSPQP---PPFYLIPKLMLDGLFISIVAF--SINISMA--SILAKKKYKIDS 742
+ P LP+P P F L+ L + + + +V+ + +A I + + D
Sbjct: 198 AEAPWFALPTPFHFGAPTFDLVAILTM--IIVYLVSMVETTGDFLALGEITGRPITEKDL 255
Query: 743 NQELLASGVSNIFASFFSCIPFAS-SLSRSLVQLQVGGKTQLASGVSCVLLVLILLY--I 799
L A G+ + F+ P+ S S + LVQL G K++ + V+LVL+ L+ +
Sbjct: 256 AGGLRADGLGSALGGLFNTFPYTSFSQNVGLVQL-TGVKSRYVVAAAGVILVLLGLFPKL 314
Query: 800 GPFFETLPYCVLTSIVI-----VAVKG--MLNQVKD 828
++P VL + VA G +L++V
Sbjct: 315 AALVASIPQPVLGGAGLVMFGMVAASGIRILSKVDF 350
>gnl|CDD|224285 COG1366, SpoIIAA, Anti-anti-sigma regulatory factor (antagonist of
anti-sigma factor) [Signal transduction mechanisms].
Length = 117
Score = 39.3 bits (92), Expect = 0.001
Identities = 13/81 (16%), Positives = 39/81 (48%), Gaps = 2/81 (2%)
Query: 905 GIVILRIIGGMNFINKDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKSFLDLYK 964
GI++L +IG ++ + + ++ + + +++D+ + +D++ + + L K
Sbjct: 13 GILVLPLIGELDAARAPALKETLLEVI--AASGARGLVIDLSGVDFMDSAGLGVLVALLK 70
Query: 965 ELMEQGISLHIVKLLEPVKQV 985
+G+ L +V + V +
Sbjct: 71 SARLRGVELVLVGIQPEVART 91
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 42.4 bits (99), Expect = 0.001
Identities = 61/329 (18%), Positives = 125/329 (37%), Gaps = 25/329 (7%)
Query: 163 AKKREEFHEKLKEKPTEDLSESKFFDPRISAKSSVRTKRALKFHEPGKFQQLAEQLRVKA 222
AKK E + E ++ ++ K K+A + + ++ A++ + KA
Sbjct: 1341 AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKA 1400
Query: 223 QLAKLQADISKNAR--KTGISSATKLALIAPKMEDDQDEMPEVEWWDAVIMVEETYEKEN 280
+ K +AD K A K A K A K ++ + + E + D E +K
Sbjct: 1401 EEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAE 1460
Query: 281 NIKTSAITNLVEHPIQMKPPSDMAKPVYMPVFLTDKERKKLRRQNRR--EAWKEEQEKIR 338
K A + +D AK D+ +KK ++ EA K + K +
Sbjct: 1461 EAKKKA---------EEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
Query: 339 LGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQK 398
E + + + + K E +A K+ ++ ++A +K E+
Sbjct: 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKK--AEEA 1569
Query: 399 REKKIRKIKEDTSLEVHVALRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANA 458
++ + K ++ALR + + +++ K+ + A+ K EEA
Sbjct: 1570 KKAEEDK---------NMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620
Query: 459 -SRKLTVEQKREKKIRKIKEDTSLEVHVA 486
+ +L ++ +KK+ ++K+ + E A
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649
Score = 40.1 bits (93), Expect = 0.007
Identities = 36/169 (21%), Positives = 70/169 (41%), Gaps = 9/169 (5%)
Query: 314 TDKERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHV 373
T++ K + E K E+ + E K + + + +D K E
Sbjct: 1093 TEEAFGKAEEAKKTETGKAEEARKA------EEAKKKAEDARKAEEARKAEDARKAEEAR 1146
Query: 374 RAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLGTEAVQ 433
+A+ AKR + +A +RK +K E + + EV A + EA +
Sbjct: 1147 KAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAAR 1206
Query: 434 DPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLE 482
K E +A+ A++ +D ++A A +K +K ++ +K +E+ + E
Sbjct: 1207 ---KAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNE 1252
Score = 39.4 bits (91), Expect = 0.015
Identities = 37/168 (22%), Positives = 71/168 (42%), Gaps = 21/168 (12%)
Query: 315 DKERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVR 374
++ RK EA K+ ++ + E + + + V+ K E +
Sbjct: 1112 EEARKA------EEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARK 1165
Query: 375 AQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLGTEAVQD 434
A+ A++ +D ++A A+RK E ++ +++RK ++ E A + + +D
Sbjct: 1166 AEEARKAEDAKKAEAARK-AEEVRKAEELRKAEDARKAEA--ARKAEEERKAEEARKAED 1222
Query: 435 PTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLE 482
K EA +A+ AK KD EEA K+ ++ R +E E
Sbjct: 1223 AKKAEAVKKAEEAK--KDAEEA----------KKAEEERNNEEIRKFE 1258
Score = 35.5 bits (81), Expect = 0.23
Identities = 67/324 (20%), Positives = 124/324 (38%), Gaps = 27/324 (8%)
Query: 163 AKKREEFHEKLKEKPTEDLSESKFFDPRISAKSSVRTKRALKFHEPGKFQQLAEQLRVKA 222
AKK +E +K +E + ++ K + + K+ K+A + + + ++ AE+ + KA
Sbjct: 1443 AKKADEAKKKAEEAKKAEEAKKKAEEAK---KADEAKKKAEEAKKADEAKKKAEEAKKKA 1499
Query: 223 QLAKLQADISKNARKTGISSATKLALIAPKME-----DDQDEMPEVEWWDAVIMVEETYE 277
AK A+ K A + + K A A K E D+ + E + D + EE +
Sbjct: 1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKK 1559
Query: 278 KENNIKTSAITNLVEHPIQMKPPSDMAKPV----YMPVFLTDKERKKLRRQNRREAWKEE 333
E K E ++ AK V +E KK++ + EA K E
Sbjct: 1560 AEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE---EAKKAE 1616
Query: 334 QEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKL 393
+ KI+ +L+ + + + Q K + A+ K EE N +
Sbjct: 1617 EAKIK-------AEELKKAEEEK---KKVEQLKKKEAEEKKK--AEELKKAEEENKIKAA 1664
Query: 394 TVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDH 453
+K E+ +K +E E L + + K + + A+ K
Sbjct: 1665 EEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKA 1724
Query: 454 EEANASRKLTVEQKREKKIRKIKE 477
EE N + +++ E+ +K +E
Sbjct: 1725 EEENKIKAEEAKKEAEEDKKKAEE 1748
Score = 31.3 bits (70), Expect = 4.0
Identities = 22/110 (20%), Positives = 52/110 (47%), Gaps = 3/110 (2%)
Query: 368 KMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVL 427
K E +A+ AK+ + ++A ++K E K+ ++ R +E E R+++ R
Sbjct: 1213 KAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFE---EARMAHFARRQ 1269
Query: 428 GTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKE 477
++ K + +A+ K+ + ++A +K +K+ ++ +K E
Sbjct: 1270 AAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADE 1319
Score = 30.9 bits (69), Expect = 5.5
Identities = 31/164 (18%), Positives = 67/164 (40%), Gaps = 4/164 (2%)
Query: 316 KERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRA 375
++ KK + E K++ E+ + E ++R R+ Q K E +A
Sbjct: 1221 EDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKA 1280
Query: 376 QMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLGTEAVQ-- 433
K+ ++ ++A+ ++K E+K++ K K + + + A + + + A +
Sbjct: 1281 DELKKAEEKKKADEAKK--AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKA 1338
Query: 434 DPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKE 477
+ K A A+ D EA + E+K+E+ +K
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382
>gnl|CDD|132914 cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and
Anti-Sigma factor antagonist) domain of anti-anti-sigma
factors, key regulators of anti-sigma factors by
phosphorylation. Anti-anti-sigma factors play an
important role in the regulation of several sigma
factors and their corresponding anti-sigma factors. Upon
dephosphorylation they bind the anti-sigma factor and
induce the release of the sigma factor from the
anti-sigma factor. In a feedback mechanism the
anti-anti-sigma factor can be inactivated via
phosphorylation by the anti-sigma factor. Well studied
examples from Bacillus subtilis are SpoIIAA (regulating
sigmaF and sigmaC which play an important role in
sporulation) and RsbV (regulating sigmaB involved in the
general stress response). The STAS domain is also found
in the C- terminal region of sulphate transporters and
stressosomes.
Length = 99
Score = 37.5 bits (88), Expect = 0.004
Identities = 16/87 (18%), Positives = 43/87 (49%), Gaps = 11/87 (12%)
Query: 903 VPGIVILRIIGGMNFIN----KDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKS 958
G++++R+ G ++ ++ + +++ P++++LD+ ++ +D+S +
Sbjct: 6 RGGVLVVRLSGELDAATAPELREALEELLAE-------GPRRLVLDLSGVTFIDSSGLGV 58
Query: 959 FLDLYKELMEQGISLHIVKLLEPVKQV 985
L YK G L +V + V++V
Sbjct: 59 LLGAYKRARAAGGRLVLVNVSPAVRRV 85
>gnl|CDD|225142 COG2233, UraA, Xanthine/uracil permeases [Nucleotide transport and
metabolism].
Length = 451
Score = 40.3 bits (95), Expect = 0.006
Identities = 58/300 (19%), Positives = 108/300 (36%), Gaps = 68/300 (22%)
Query: 570 GSLSVLMSDIMISGFTTGTAILVILSQIKHVFGIKVKRHIGPF---NVI----YTLIDVA 622
++ L+ I+ +G ++S I I++ R P V+ +L VA
Sbjct: 103 DGIAALLGGIIAAGLVY-----FLISPI---VKIRLARLFPPVVTGPVVLVIGLSLAPVA 154
Query: 623 -------------ENIHKTNYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVI 669
+ + +V +L+ +LI F + I +I +
Sbjct: 155 INMAGGGPGAAGNPDFGSLENLGLALVTLLIILLIN---RFGKGFLRRIPI-----LIGL 206
Query: 670 VAGALLSSILDVKHKYNLSNVGKIP-IGLPSPQPPPFYLIPKLMLDGLFISIVAFSINIS 728
V G LL+ + + S V + P LP+P +L L ++IV +
Sbjct: 207 VVGYLLALFMG---MVDFSGVAEAPWFALPTPFYFGMAFDWGAILTMLPVAIVTIVEHTG 263
Query: 729 M---ASILAKKKYKIDSNQEL----LASGVSNIFASFFSCIPF-ASSLSRSLVQLQVGGK 780
+ + D L LA G++ + A F P + + +V L
Sbjct: 264 DITATGEITGRDL--DGKPRLRRGLLADGLATLIAGLFGGFPNTTFAQNIGVVAL----- 316
Query: 781 TQLAS-----GVSCVLLVLILL-YIGPFFETLPYCVL--TSIV---IVAVKG--MLNQVK 827
T + S G + +L++L L G +++P VL +V ++A G +L + K
Sbjct: 317 TGVYSRYVIAGAAVILILLGLFPKFGALIQSIPSPVLGGAMLVLFGMIAASGIRILIRNK 376
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 39.7 bits (92), Expect = 0.008
Identities = 37/193 (19%), Positives = 65/193 (33%), Gaps = 18/193 (9%)
Query: 39 ALLALKGTAPVAPTPVVPAGV-TPAIIGSA-KIPPLMPPAPPSTTAEKDNATYEKAMKIA 96
A+L + +P P+ +P + A++ SA K +PP P + A E +
Sbjct: 322 AVLGPRAQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAVVKPGPMEIPTPVP 381
Query: 97 QLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTPLILDSEGRTIDITGKQVQLTHVVPT 156
I S PPG A + P L S + + V + T
Sbjct: 382 PPGLAIPS------------LVAPPGLVAPTEINPSFLASPRKKMKREKLPVTFGALDDT 429
Query: 157 LKANIRAKKREEFHEKLKEKPTE-DLSESKFFDPRISAKSSVRTKRALKFHEPGKFQQLA 215
L A K ++ E K + + + +P+ K + K + LA
Sbjct: 430 L-AWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMNSEDASLA 488
Query: 216 EQ--LRVKAQLAK 226
Q + ++ A+
Sbjct: 489 SQEGMSIRGNSAR 501
>gnl|CDD|225161 COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide
transport and metabolism].
Length = 436
Score = 39.5 bits (93), Expect = 0.009
Identities = 56/229 (24%), Positives = 92/229 (40%), Gaps = 35/229 (15%)
Query: 666 MIVIVAGALLSSILDVKHKYNLSNVGKIPIGLPSPQPPPFYLIPKLMLDGLFISIVAFSI 725
+I I+ +L IL + + VG P SP L L L I F
Sbjct: 199 LIGILVTTILGIILGIDVHF-GGLVGAPPS--LSPIFGQLDLSGNLSLAAFAPVIFTFFF 255
Query: 726 NI------------SMASILAKKKYKIDSNQELLASGVSNIFASFFSCIPFASSLSRSLV 773
S A +L K + LLA V+ + + F ++ S
Sbjct: 256 VDLFDTLGTLIGVASKAGLLDKNGKMPRIGKALLADSVATVVGALFGTST-VTAYIESAA 314
Query: 774 QLQVGGKTQLASGVSCV---LLVLILLYIGPFFETLP-YCVLTSIVIVAVKGMLNQVKDL 829
+ GG+T G++ V LL L+ L+ P +P Y +++IV ML+ VK
Sbjct: 315 GVAAGGRT----GLTAVVTGLLFLLSLFFSPLAALVPGYATAPALIIVGAL-MLSSVK-- 367
Query: 830 KLAFKESYTEACVWCLTFLSVVLLDVDY----GLGIGVLCSLIFVVVTG 874
++ + + +TEA FL++V++ + Y G+ G + +I V TG
Sbjct: 368 QIDWSD-FTEAVP---AFLTIVMMPLTYSIADGIAFGFISYVILKVFTG 412
>gnl|CDD|147845 pfam05914, RIB43A, RIB43A. This family consists of several
RIB43A-like eukaryotic proteins. Ciliary and flagellar
microtubules contain a specialised set of
protofilaments, termed ribbons, that are composed of
tubulin and several associated proteins. RIB43A was
first characterized in the unicellular biflagellate,
Chlamydomonas reinhardtii although highly related
sequences are present in several higher eukaryotes
including humans. The function of this protein is
unknown although the structure of RIB43A and its
association with the specialised protofilament ribbons
and with basal bodies is relevant to the proposed role
of ribbons in forming and stabilising doublet and
triplet microtubules and in organising their
three-dimensional structure. Human RIB43A homologues
could represent a structural requirement in centriole
replication in dividing cells.
Length = 379
Score = 36.2 bits (84), Expect = 0.097
Identities = 41/171 (23%), Positives = 72/171 (42%), Gaps = 20/171 (11%)
Query: 313 LTDKERKKL-RRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEA 371
L + +RKK+ + Q R W E+Q + R + + + + +RV Q+ +E
Sbjct: 147 LNEADRKKMQQEQQR--EWLEQQVQERQQAKAAEKQADTLHDQLRVELDHRAQELASLEE 204
Query: 372 HVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRV-LGTE 430
R + + D +A A E++RE++ R+ +E+ + I N + L TE
Sbjct: 205 ECRRAVEQATADFNKALA------EEQRERERREKQEEQEDNL---AEIYNQLTSDLLTE 255
Query: 431 ---AVQDPTKMEAH-VRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKE 477
Q H V K E+ A RK +Q +EK+ R+ +E
Sbjct: 256 NPDVAQSS--FGPHRVIPDRWKGMSP-EQLAAIRKGQEQQLQEKERRREEE 303
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 35.4 bits (82), Expect = 0.15
Identities = 29/122 (23%), Positives = 59/122 (48%), Gaps = 20/122 (16%)
Query: 377 MAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISN----LMRVLGTEAV 432
+ KR++ EEA + +E+ +++I+K + T V V + L+R++G +
Sbjct: 99 LEKRRERREEAEEELEKAIEEGDDEEIKKQSKRT---VRVTKEQNEDAKKLLRLMGIPVI 155
Query: 433 QDPTKMEAHVRAQMAKRQKDH----EEANA--------SRKLTVEQKREKKIRKIKEDTS 480
+ P + EA A++ K+ K + E+ +A R L + +++ I++I T
Sbjct: 156 EAPCEAEAQC-AELVKKGKVYAVATEDMDALTFGTPVLLRNLNFSEAKKRPIQEINLSTV 214
Query: 481 LE 482
LE
Sbjct: 215 LE 216
Score = 30.4 bits (69), Expect = 6.0
Identities = 39/144 (27%), Positives = 63/144 (43%), Gaps = 44/144 (30%)
Query: 305 KPVYMPVF------LTDKERKKLRRQNRREAWKEEQEK-----------------IRLGL 341
KPVY VF L E +K RR+ R EA +EE EK +R+
Sbjct: 82 KPVY--VFDGKPPELKSGELEK-RRERREEA-EEELEKAIEEGDDEEIKKQSKRTVRVTK 137
Query: 342 EPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDH----EEANA-------- 389
E + K L+R++G ++ P + EA A++ K+ K + E+ +A
Sbjct: 138 EQNEDAK----KLLRLMGIPVIEAPCEAEAQC-AELVKKGKVYAVATEDMDALTFGTPVL 192
Query: 390 SRKLTVEQKREKKIRKIKEDTSLE 413
R L + +++ I++I T LE
Sbjct: 193 LRNLNFSEAKKRPIQEINLSTVLE 216
>gnl|CDD|234022 TIGR02813, omega_3_PfaA, polyketide-type polyunsaturated fatty acid
synthase PfaA. Members of the seed for this alignment
are involved in omega-3 polyunsaturated fatty acid
biosynthesis, such as the protein PfaA from the
eicosapentaenoic acid biosynthesis operon in
Photobacterium profundum strain SS9. PfaA is encoded
together with PfaB, PfaC, and PfaD, and the functions of
the individual polypeptides have not yet been described.
More distant homologs of PfaA, also included with the
reach of this model, appear to be involved in
polyketide-like biosynthetic mechanisms of
polyunsaturated fatty acid biosynthesis, an alternative
to the more familiar iterated mechanism of chain
extension and desaturation, and in most cases are encoded
near genes for homologs of PfaB, PfaC, and/or PfaD.
Length = 2582
Score = 35.8 bits (82), Expect = 0.17
Identities = 23/119 (19%), Positives = 40/119 (33%), Gaps = 6/119 (5%)
Query: 17 FPQNQIKNMMAHAQKMIEERKKALLALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPA 76
F Q+Q + + H Q + + ++ L A AP P + + + P P
Sbjct: 1093 FHQHQAETLKVHEQYLNLQTDSNIVKLSPLATQAPVIKSVVTQAPVVQVTISVAPAAPVL 1152
Query: 77 PPSTTAEKDNATYEKAMKIAQLQAQIQS------KLSTGVLKPSVPGAKPPGAPATDKP 129
P + +A +++ A A + + PSV A P A D
Sbjct: 1153 PAVVSPPVVSAAPAQSVATAVAMAPVAEVPIAVPVQQSVDYMPSVAQAAAPQASVNDSA 1211
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 35.6 bits (82), Expect = 0.18
Identities = 34/185 (18%), Positives = 64/185 (34%), Gaps = 16/185 (8%)
Query: 294 PIQMKPPSDMAKPVYMPVFLTDKERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISN 353
+ + +D +K P D ++ Q ++A E E + E R +
Sbjct: 188 SLAINDINDTSKENQDPNNYLDALGRRRIAQVLQDAEIAENEAEKE-TEIAIAEANRDAK 246
Query: 354 LMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLE 413
L+ E Q P A ++ K EA + E +RE + +I + +++
Sbjct: 247 LVE---LEVEQQPAGKTAEQTREV-KIILAETEAEVAAW-KAETRREAEQAEILAEQAIQ 301
Query: 414 VHVALRISNLMRVLGTEAVQDPTKMEAHV-RAQMAKRQKDHEEANASRKLTVEQKREKKI 472
A + VQ +EA R + +RQK+ E R + + +
Sbjct: 302 EEKAQA---------EQEVQHAKALEAREMRVGLIERQKETELEPQERSYFINAAQRQAQ 352
Query: 473 RKIKE 477
+ K
Sbjct: 353 EEAKA 357
>gnl|CDD|132911 cd06844, STAS, Sulphate Transporter and Anti-Sigma factor
antagonist domain found in the C-terminal region of
sulphate transporters as well as in bacterial and
archaeal proteins involved in the regulation of sigma
factors. The STAS (Sulphate Transporter and Anti-Sigma
factor antagonist) domain is found in the C-terminal
region of sulphate transporters as well as in bacterial
and archaeal proteins involved in the regulation of
sigma factors, like anti-anti-sigma factors and
"stressosome" components. The sigma factor regulators
are involved in protein-protein interaction which is
regulated by phosphorylation.
Length = 100
Score = 32.1 bits (73), Expect = 0.33
Identities = 11/69 (15%), Positives = 30/69 (43%), Gaps = 2/69 (2%)
Query: 903 VPGIVILRIIGGMNFINKDKVFHKISKLSLSSEPYPKQIILDMMSLSSVDTSTVKSFLDL 962
V ++R+ G ++ + ++ ++ + K I++D+ +L +D+S L+
Sbjct: 6 VDDYWVVRLEGELDHHSVEQFKEELLHNITN--VAGKTIVIDISALEFMDSSGTGVLLER 63
Query: 963 YKELMEQGI 971
+ G
Sbjct: 64 SRLAEAVGG 72
>gnl|CDD|224397 COG1480, COG1480, Predicted membrane-associated HD superfamily
hydrolase [General function prediction only].
Length = 700
Score = 34.7 bits (80), Expect = 0.38
Identities = 39/200 (19%), Positives = 75/200 (37%), Gaps = 31/200 (15%)
Query: 630 YVAFGVVVILVSVLI-IYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSILDVKHKYNLS 688
+ ++VI + +L +Y KS ++ S + + +A LS +L + +N S
Sbjct: 277 LLGLLILVIFLILLFALYERRTKSPLKLRNS---LLLLYLSLAILTLS-LLRIVGYFNYS 332
Query: 689 NVGKIPIGLPSPQPPPFYLIPKLMLDGLF--ISIVAFSINISMASILAKKKYKIDSNQEL 746
G PP P L++ +F I+I + S+ L + N E+
Sbjct: 333 ASG--------LLVPPAL-GPMLLILLVFLRIAIFSSSMIAIALLYL----FGGSYNSEI 379
Query: 747 LASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASGVSCVLLVLILLYIGPFFETL 806
+ + F++ +SR L+ G L +L+LL + F
Sbjct: 380 ALIALLSSFSALV----LLRKMSRRSDILKSGLFLALM-------NMLLLLSLIFAFTLS 428
Query: 807 PYCVLTSIVIVAVKGMLNQV 826
Y L + + G+L+ +
Sbjct: 429 WYDALQDAIFAFLSGLLSGI 448
>gnl|CDD|234090 TIGR03021, pilP_fam, type IV pilus biogenesis protein PilP.
Members of this protein family are found in type IV
pilus biogenesis loci and include proteins designated
PilP [Cell envelope, Surface structures].
Length = 119
Score = 32.0 bits (73), Expect = 0.46
Identities = 15/70 (21%), Positives = 26/70 (37%), Gaps = 3/70 (4%)
Query: 85 DNATYEKAMKIAQLQAQIQSKLSTGVLKPSVPG---AKPPGAPATDKPTPLILDSEGRTI 141
+ A E + A+ Q +++ G + PP A PT R +
Sbjct: 13 ETALLEAQLARAKAQNELEEAERGGQVGGPGMPFTSGVPPMALTGANPTSAPAILLPRVV 72
Query: 142 DITGKQVQLT 151
+I G+ +LT
Sbjct: 73 EIFGRGGRLT 82
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 32.8 bits (74), Expect = 0.57
Identities = 26/91 (28%), Positives = 44/91 (48%), Gaps = 11/91 (12%)
Query: 322 RRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQ 381
RRQ R + +EEQE+ E + +L L R A ++ + E R Q +R
Sbjct: 32 RRQAREQREQEEQER----REQEEQDRLEREELKR----RAAEERLRREEEARRQEEERA 83
Query: 382 KDHEEA---NASRKLTVEQKREKKIRKIKED 409
++ EE A + EQ+ +++I+K KE+
Sbjct: 84 REKEEKAKRKAEEEEKQEQEEQERIQKQKEE 114
Score = 30.1 bits (67), Expect = 4.7
Identities = 30/140 (21%), Positives = 61/140 (43%), Gaps = 6/140 (4%)
Query: 344 PPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVE-QKREKK 402
P + + + GT ++ T++ A R Q A+ Q++ EE + + +RE+
Sbjct: 2 PGKAENSAALGKPTAGTTDAEEATRLLAEKRRQ-AREQREQEEQERREQEEQDRLEREEL 60
Query: 403 IRKIKED-TSLEVHVALRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRK 461
R+ E+ E + R +A + + E + + + QK EEA A +
Sbjct: 61 KRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQEQEEQERIQKQKEEAEARAR 120
Query: 462 LTVEQ---KREKKIRKIKED 478
E+ +REK ++I+++
Sbjct: 121 EEAERMRLEREKHFQQIEQE 140
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 34.1 bits (78), Expect = 0.63
Identities = 22/99 (22%), Positives = 27/99 (27%), Gaps = 7/99 (7%)
Query: 39 ALLALKGTAPVAPTPVVPAGVT-----PAIIG-SAKIPPLMPPAPPSTTAEKDNATYEKA 92
AL A V P P G P G A PP P A P + A +
Sbjct: 2734 ALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP-AVASLS 2792
Query: 93 MKIAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTP 131
L + + P +PA P P
Sbjct: 2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPP 2831
Score = 32.6 bits (74), Expect = 1.9
Identities = 21/93 (22%), Positives = 25/93 (26%), Gaps = 8/93 (8%)
Query: 47 APVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQSKL 106
AP A P PA A PAPP A +
Sbjct: 2711 APHALVSATPLPPGPAAARQASPALPAAPAPP--------AVPAGPATPGGPARPARPPT 2762
Query: 107 STGVLKPSVPGAKPPGAPATDKPTPLILDSEGR 139
+ G P+ P A G P + SE R
Sbjct: 2763 TAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
Score = 31.1 bits (70), Expect = 5.1
Identities = 20/92 (21%), Positives = 30/92 (32%), Gaps = 12/92 (13%)
Query: 42 ALKGTAPVAPTP---------VVPAGV-TPAIIGS--AKIPPLMPPAPPSTTAEKDNATY 89
+G P P P GV A+ G+ A P PP P ++
Sbjct: 271 TARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAPAGDAEEEDDE 330
Query: 90 EKAMKIAQLQAQIQSKLSTGVLKPSVPGAKPP 121
+ AM++ + + G K P PP
Sbjct: 331 DGAMEVVSPLPRPRQHYPLGFPKRRRPTWTPP 362
>gnl|CDD|165458 PHA03188, PHA03188, UL14 tegument protein; Provisional.
Length = 199
Score = 32.7 bits (74), Expect = 0.64
Identities = 21/66 (31%), Positives = 34/66 (51%), Gaps = 2/66 (3%)
Query: 348 KLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIK 407
K R +L+R G + QDP + A A+ A ++ +A+R VEQK K++
Sbjct: 29 KERTLDLIR--GGASTQDPAFVHAFTAAKDACADLNNNIRSAARIAAVEQKIADIQEKVE 86
Query: 408 EDTSLE 413
E TS++
Sbjct: 87 EQTSIQ 92
Score = 32.0 bits (72), Expect = 1.3
Identities = 20/64 (31%), Positives = 33/64 (51%), Gaps = 2/64 (3%)
Query: 419 RISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKED 478
R +L+R G + QDP + A A+ A ++ +A+R VEQK K++E
Sbjct: 31 RTLDLIR--GGASTQDPAFVHAFTAAKDACADLNNNIRSAARIAAVEQKIADIQEKVEEQ 88
Query: 479 TSLE 482
TS++
Sbjct: 89 TSIQ 92
>gnl|CDD|183124 PRK11412, PRK11412, putative uracil/xanthine transporter;
Provisional.
Length = 433
Score = 33.2 bits (76), Expect = 0.74
Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 4/81 (4%)
Query: 746 LLASGVSNIFASFFSCIP---FASSLSRSLVQLQVGGKTQLASGVSCVLLVLILLYIGPF 802
+A+G + + IP F SS+ L Q + G LLV ++ +
Sbjct: 281 FVATGFMTLITVPLAVIPFSPFVSSIG-LLTQTGDYRRRSFIYGSVMCLLVALIPALTRL 339
Query: 803 FETLPYCVLTSIVIVAVKGML 823
F ++P V +++++V+ +L
Sbjct: 340 FCSIPLPVSSAVMLVSYLPLL 360
>gnl|CDD|241506 cd13352, PH-GRAM2_TBC1D8B, TBC1 domain family member 8B (TBC1D8B)
Pleckstrin Homology-Glucosyltransferases, Rab-like
GTPase activators and Myotubularins (PH-GRAM) domain,
repeat 2. TBC1D8B may act as a GTPase-activating
protein for Rab family protein(s). TBC1D8B contains an
N-terminal PH-GRAM domain and a C-terminal
Rab-GTPase-TBC (Tre-2, BUB2p, and Cdc16p) domain. This
cd contains the second repeat of the PH-GRAM domain. The
GRAM domain is found in glucosyltransferases,
myotubularins and other putative membrane-associated
proteins. The GRAM domain is part of a larger motif with
a pleckstrin homology (PH) domain fold.
Length = 93
Score = 30.5 bits (69), Expect = 1.0
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 196 SVRTKRALKFHEPGKFQQLAEQLRVK 221
S++ K A +F E F+QL +LR+K
Sbjct: 67 SIKGKTAFRFSEVRDFEQLVAKLRLK 92
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 32.7 bits (75), Expect = 1.4
Identities = 27/101 (26%), Positives = 46/101 (45%), Gaps = 7/101 (6%)
Query: 381 QKDHEEANASRKLTVEQKREKKIRKIKEDTS--LEVHVALRISNLMRVLGTEAVQDPTKM 438
K EE R L ++KRE+ + K++E+ LE A L E + K
Sbjct: 182 LKAEEEEELERAL--KEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKY 239
Query: 439 EAHVRAQMAKRQKDHEE--ANASRKLTVEQKREKKIRKIKE 477
E +R ++ ++ + HE+ N +E +RE ++IKE
Sbjct: 240 EEKLRQELERQAEAHEQKLKNELALQAIELQREFN-KEIKE 279
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 32.4 bits (74), Expect = 1.5
Identities = 38/142 (26%), Positives = 57/142 (40%), Gaps = 28/142 (19%)
Query: 332 EEQEKIR--LGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANA 389
EE K+R LGL+P P P + + T+ +D E +K EEA A
Sbjct: 6 EETNKLRAKLGLKPLPVPGAKPGS------TKESRDAAAYE--------NWKKRQEEAEA 51
Query: 390 SRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLG-TEAVQDPTKMEAHVRAQMAK 448
RK ++ +KI K +E R S L + E D +A + + K
Sbjct: 52 KRK---REELREKIAKAREK-------RERNSKLGGIKTLGEDDDDDDDTKAWL-KKSKK 100
Query: 449 RQKDHEEANASRKLTVEQKREK 470
RQK E L E+++E+
Sbjct: 101 RQKKKEAERKKALLLDEKEKER 122
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 32.2 bits (74), Expect = 1.7
Identities = 47/169 (27%), Positives = 80/169 (47%), Gaps = 33/169 (19%)
Query: 316 KERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLM----RVLGTEAVQDPTKMEA 371
KE + L+++ EA KEE K+R LE E K R + L R+L E D KME
Sbjct: 43 KEAETLKKEALLEA-KEEVHKLRAELER--ELKERRNELQRLERRLLQREETLD-RKME- 97
Query: 372 HVRAQMAKRQKDHEEAN---ASRKLTVEQKREKKIRKIKE-DTSLEVHVALRISNLMRVL 427
+ K++++ E+ ++++ +++K E+ I E LE RIS L +
Sbjct: 98 ----SLDKKEENLEKKEKELSNKEKNLDEKEEELEELIAEQREELE-----RISGLTQ-- 146
Query: 428 GTEAVQD-PTKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKI 475
EA + ++E R + AK K+ EE +++ +KK ++I
Sbjct: 147 -EEAKEILLEEVEEEARHEAAKLIKEIEE-------EAKEEADKKAKEI 187
>gnl|CDD|227938 COG5651, COG5651, PPE-repeat proteins [Cell motility and
secretion].
Length = 490
Score = 32.2 bits (73), Expect = 1.7
Identities = 9/50 (18%), Positives = 13/50 (26%)
Query: 39 ALLALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNAT 88
+ A V G A G+ + A P A + A
Sbjct: 311 GAIGQTSQATANAGSVNATGGAAAGSGNLGVANSGSAAAPFGIAGANQAA 360
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 32.5 bits (74), Expect = 1.8
Identities = 18/63 (28%), Positives = 22/63 (34%), Gaps = 4/63 (6%)
Query: 46 TAPVA-PTPVVPAGVTPAIIGSAKIPPLM---PPAPPSTTAEKDNATYEKAMKIAQLQAQ 101
TAP P G P A PP P APP A E +A L++
Sbjct: 904 TAPGGDAASAPPPGAGPPAPPQAVPPPRTTQPPAAPPRGPDVPPAAVAELRETLADLRSA 963
Query: 102 IQS 104
+S
Sbjct: 964 QRS 966
>gnl|CDD|224698 COG1784, COG1784, Predicted membrane protein [Function unknown].
Length = 395
Score = 32.0 bits (73), Expect = 1.8
Identities = 54/193 (27%), Positives = 85/193 (44%), Gaps = 26/193 (13%)
Query: 685 YNLSNVGKIPIGLPSPQPPPFYLIPKLMLDG-LFISIVAFSINIS--MASILAKKKYKID 741
LSN IP S + K G L ++V+F +S A++LA K
Sbjct: 195 EALSNPSSIPKQEDSKFAISGKRLLKSSFAGSLAGALVSFLPGVSPAQAALLASK----- 249
Query: 742 SNQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASGVSCVLLVLILLYIGP 801
L F + S I A+++ SLV L GK + SGV+ V I +G
Sbjct: 250 ----LTGDDEDRRFITAVSGINTANAVF-SLVALIAIGKPR--SGVA----VAIQRSVGD 298
Query: 802 FFETLPYCVL----TSIVIVAVKGMLNQVKDLKLAFKESYTEACVWCLTFLSVVLLDVDY 857
TLPY + +++ VA+ L+ + LK+ FK Y++ C L FL ++ +
Sbjct: 299 I--TLPYLIFLVLSAALIAVALAPWLSILALLKVRFK-DYSKLCAGVLLFLVILGYLLMG 355
Query: 858 GLGIGVLCSLIFV 870
+G+GVL + I +
Sbjct: 356 FIGLGVLGTAIAI 368
>gnl|CDD|221739 pfam12730, ABC2_membrane_4, ABC-2 family transporter protein. This
family is related to the ABC-2 membrane transporter
family pfam01061.
Length = 230
Score = 31.6 bits (72), Expect = 2.0
Identities = 32/193 (16%), Positives = 71/193 (36%), Gaps = 37/193 (19%)
Query: 629 NYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTE------MIVIVAGALLSSILDVK 682
+ + +++ +++ L++ + I+ +S PI ++V++ +LL+S+L
Sbjct: 53 SALFLPLLIAILASLLVSREFKNGTIKLLLSLPISRGKIFLAKLLVLLILSLLASLL--- 109
Query: 683 HKYNLSNV--GKIPIGLPSPQPPPFYLIPKLMLDGLFISIV--------------AFSIN 726
L ++ G + G + L+L L ++ A +I
Sbjct: 110 --LLLLSLIAGLLLGGSGFSLSLLLGALLLLLLLSLLALLLILLILFLSLLLRNSAVAIG 167
Query: 727 ISMASILAKKKYKIDSNQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASG 786
I + IL LL + +S +P LS SL + +
Sbjct: 168 IGLLLILL----------GLLLGSLPLWLLFPWSYLPLIVLLSLSLNGTLLSSGLSIIIA 217
Query: 787 VSCVLLVLILLYI 799
+ +L++ I+ I
Sbjct: 218 LIVILILFIIFLI 230
>gnl|CDD|212083 cd11514, SLC6sbd_DAT1, Na(+)- and Cl(-)-dependent dopamine
transporter 1; solute-binding domain. DAT1 (also called
DAT), is a plasma membrane transport protein that
functions at the dopaminergic synapses to transport
dopamine from the extracellular space back into the
presynaptic nerve terminal. Human DAT1 is encoded by the
SLC6A3 gene, and is expressed in the brain. DAT1 may
play a role in diseases or disorders related to
dopaminergic neurons, including attention-deficit
hyperactivity disorder (ADHD), Tourette syndrome,
Parkinson's disease, alcoholism, drug abuse,
schizophrenia, extraversion, and risky behavior. This
subgroup belongs to the solute carrier 6 (SLC6)
transporter family.
Length = 555
Score = 31.8 bits (72), Expect = 2.3
Identities = 34/144 (23%), Positives = 62/144 (43%), Gaps = 28/144 (19%)
Query: 741 DSNQELLASGVSNIFASFFSCIPFASSLSRSLVQL-QVGGKTQLASG----VSCVLLVLI 795
+ +SG+++ + + P A R ++ L + G L SC+++V++
Sbjct: 135 PGDSSGDSSGLNDTYKT----TPAAEYFERGVLHLHESHGIHDLGPPRWQLTSCLVVVIV 190
Query: 796 LLYIGPF------------FETLPYCVLTSIVI--VAVKGMLNQVKD-LKLAFKESYTEA 840
LLY + T+PY VLT++++ V + G ++ +K L + F EA
Sbjct: 191 LLYFSLWKGVKTSGKVVWITATMPYVVLTALLLRGVTLPGAIDGIKAYLSVDFLRLC-EA 249
Query: 841 CVWCLTFLSVVLLDVDYGLGIGVL 864
VW + G+G GVL
Sbjct: 250 SVWIDAATQICF---SLGVGFGVL 270
>gnl|CDD|221791 pfam12822, DUF3816, Protein of unknown function (DUF3816). This
family of proteins is functionally uncharacterized but
are likely to be membrane transporters. This family of
proteins is found in bacteria and archaea. Proteins in
this family are typically between 177 and 208 amino
acids in length. A subset of this family is associated
with the TM1506 proteins. In this context, transport
through the channel is predicted to be regulated by the
TM1506 protein by either regulating redox potential or
modification of substrates.
Length = 168
Score = 30.7 bits (70), Expect = 2.4
Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 22/164 (13%)
Query: 568 ELGSLSVLMSDI--MISGFTTGTAILVILSQIKHVFGIKVKRHIGPFNVIYTLIDVAENI 625
L + S I ++ F G ++ + + + GPF ++ L
Sbjct: 20 PFPPLKIDFSHIPVLLGAFLLGPVAGALIGLLTSLLSF-LLFGGGPFALVGPLA------ 72
Query: 626 HKTNYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVIVAGALLSSILDVKHKY 685
++ L++ LI K+K K I ++ + G L++++L++
Sbjct: 73 ----NFLPRILFGLIAGLIYKKLRKKTK--KRAVLAI---ILGTILGTLVATLLNLGLIL 123
Query: 686 NL-SNVGKIPIGLPSPQPPPFYLIPKLM---LDGLFISIVAFSI 725
L + +PI L+P L + G+ SI+ + +
Sbjct: 124 PLYAKFLGMPISAIVGALFAAVLLPVLPFNLIKGIIASIIVYLL 167
>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 130
Score = 30.2 bits (68), Expect = 2.4
Identities = 10/40 (25%), Positives = 14/40 (35%), Gaps = 1/40 (2%)
Query: 47 APVAPTPVVPAGVTPAIIGSAKIPPLMPPAP-PSTTAEKD 85
AP TP + + P + AP P+ A D
Sbjct: 23 APAQAAAPAQPASTPVPVPTEASPQVEAQAPQPAAAAGAD 62
>gnl|CDD|225183 COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain
an N-terminal double-glycine peptidase domain [Defense
mechanisms].
Length = 709
Score = 31.8 bits (73), Expect = 2.4
Identities = 6/87 (6%), Positives = 27/87 (31%), Gaps = 13/87 (14%)
Query: 581 ISGFTTGTAILVILSQIKHVFGIKVKRHIGPF--NVIYTLIDVAENIHKTNYVAFGVVVI 638
+ + +++ S + + + P ++ + + +A G+++
Sbjct: 149 LFKYRRLLFEVLLASLLLQLLAL-----ATPLFSQIVIDKVLPDASRSTLTVLAIGLLLA 203
Query: 639 LVSVLI------IYNDHFKSKIQKNIS 659
+ + H ++ +S
Sbjct: 204 ALFEALLRLLRTYLIAHLGKRLDLELS 230
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 31.4 bits (72), Expect = 2.8
Identities = 32/182 (17%), Positives = 57/182 (31%), Gaps = 45/182 (24%)
Query: 312 FLTDKERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEA 371
+ +K ++ R+ R KEE+E+ L + D + +
Sbjct: 158 YQREKAEREEEREAERRERKEEKEREVARLR---------AQQEEAEDEREELDELRADL 208
Query: 372 HVRAQMAK-RQKDHEEANASRKLTVE-------QKREKKIRKIKEDTSLEVHVALRISNL 423
+ K RQK+ EEA R+ E Q EK+ R +E
Sbjct: 209 YQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEER-------------- 254
Query: 424 MRVLGTEAVQDPTKMEAHVRAQ--------MAKRQKDHEEANASRKLTVEQKREKKIRKI 475
E + +M +R K E + +E+K E++ +
Sbjct: 255 -----AEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQ-IEEKEERRAAER 308
Query: 476 KE 477
+E
Sbjct: 309 EE 310
Score = 30.3 bits (69), Expect = 6.1
Identities = 24/182 (13%), Positives = 64/182 (35%), Gaps = 33/182 (18%)
Query: 313 LTDKERKKLRRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAH 372
+ER+++ R ++E E K +R E + + +
Sbjct: 93 ERLQEREQMDEIIERIQEEDEAEAQE---------KREKQKKLR----EEIDEFNEERIE 139
Query: 373 VRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLGTEAV 432
+ + +R+++ E + ++ E++ + +E R +R EA
Sbjct: 140 RKEEEKEREREEELKILEYQREKAEREEEREAERRERK----EEKEREVARLRAQQEEAE 195
Query: 433 QDPTKMEAHV----------------RAQMAKRQKDHEEANASRKLTVEQKREKKIRKIK 476
+ +++ + + KR++ +E +R+ +E+K E+ +
Sbjct: 196 DEREELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERA 255
Query: 477 ED 478
E+
Sbjct: 256 EE 257
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim
have been identified in both mice and humans (mTim and
hTIM, respectively). Mammalian TIM is not the true
orthologue of Drosophila TIM, but is the likely
orthologue of a fly gene, timeout (also called tim-2).
mTim has been shown to be essential for embryonic
development, but does not have substantiated circadian
function. Some family members contain a SANT domain in
this region.
Length = 507
Score = 31.6 bits (71), Expect = 2.9
Identities = 48/270 (17%), Positives = 101/270 (37%), Gaps = 18/270 (6%)
Query: 143 ITGKQVQLTHVVPTLKANIRAKKREEFHEKLKEKPTEDLSESKFFDPRISAKSSVRTKRA 202
ITGK+ + V L+ + ++KRE K + K + + + +
Sbjct: 198 ITGKRSRAQVVDKLLQLGLVSEKRELRGGKKRRKKLKPKQPNGEESGEDDFQEDPEEEEQ 257
Query: 203 LKFHEPGKFQQLAEQLRVKAQLAKLQADISKNARKTGISSATKLALIAPKMEDDQDEMPE 262
L +P + ++ +V+ ++ + ++ S L + + D+ E
Sbjct: 258 LPESKPEETEKRVSAFQVEGSTLISAENLRQQLKQEKTSWP--LLWLQSCLIRAADDREE 315
Query: 263 VEWWDAVIMVEETYEKENNIKTSAITNLVEHPIQMKPP-SDMAKPVYMPVFLTDKERKKL 321
E AV +V T E E ++ L++ + ++PP S +P L+ + L
Sbjct: 316 DECDQAVPLVPLTEENEEAMENEQFQRLLKA-LGLRPPRSGQEGFWRIPAKLSSTQ---L 371
Query: 322 RRQNRREAWKEEQEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQ 381
RR+ + +EE+ + L + E E+ + + + R + A
Sbjct: 372 RRRAASLSGEEEEPEDELKDDVDGEQ-----------ADESEHETLALRKNARQRKAGLA 420
Query: 382 KDHEEANASRKLTVEQKREKKIRKIKEDTS 411
EEA + K+++ +K K+ T
Sbjct: 421 SPEEEALGEEEQKAPPKKKQLNQKNKQQTG 450
>gnl|CDD|113413 pfam04642, DUF601, Protein of unknown function, DUF601. This
family represents a conserved region found in several
uncharacterized plant proteins.
Length = 311
Score = 31.2 bits (70), Expect = 2.9
Identities = 19/77 (24%), Positives = 30/77 (38%), Gaps = 4/77 (5%)
Query: 112 KPSVPGAKPPGAPATDKPT----PLILDSEGRTIDITGKQVQLTHVVPTLKANIRAKKRE 167
KPS K P AP ++ T D R D KQ + P + R ++++
Sbjct: 30 KPSTLAGKNPDAPTSESRTPSKATSSKDPSKRYADKKRKQSEKDARSPPRSSRPRTEEKD 89
Query: 168 EFHEKLKEKPTEDLSES 184
+ KEK + S+
Sbjct: 90 AGPSQQKEKGKKGDSQD 106
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 31.4 bits (72), Expect = 2.9
Identities = 23/109 (21%), Positives = 32/109 (29%), Gaps = 6/109 (5%)
Query: 25 MMA-HAQKMIEERKKALLALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAE 83
M+A H + E + + A T A V P + PP AP A
Sbjct: 356 MLAFHPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPP--ASAPQQAPAV 413
Query: 84 KDNATYEKAMKIAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTPL 132
T + + + Q+Q K S P A P L
Sbjct: 414 PLPETTSQLL---AARQQLQRAQGATKAKKSEPAAASRARPVNSALERL 459
>gnl|CDD|237539 PRK13878, PRK13878, conjugal transfer relaxase TraI; Provisional.
Length = 746
Score = 31.6 bits (72), Expect = 3.0
Identities = 18/70 (25%), Positives = 32/70 (45%), Gaps = 6/70 (8%)
Query: 338 RLG-LEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVE 396
RLG EP PE + + + +DP ++ + AK + + + A+R +E
Sbjct: 248 RLGPFEPSPERQAQTKA-----KRQYRKDPIRLRVNTVELYAKYKAEQQTLTAARAQALE 302
Query: 397 QKREKKIRKI 406
+ R +K R I
Sbjct: 303 RARRRKDRLI 312
>gnl|CDD|227532 COG5207, UBP14, Isopeptidase T [Posttranslational modification,
protein turnover, chaperones].
Length = 749
Score = 31.6 bits (71), Expect = 3.2
Identities = 17/67 (25%), Positives = 30/67 (44%), Gaps = 1/67 (1%)
Query: 395 VEQKREKKIR-KIKEDTSLEVHVALRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDH 453
VE+ KKI+ K E + +V R+ + T++ T +E R ++A + D
Sbjct: 83 VEENNNKKIKLKNAELSEEDVEELRRVPRTVLSKSTDSDISSTALEKIERMEIAIKPGDE 142
Query: 454 EEANASR 460
+E R
Sbjct: 143 DERVLWR 149
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 31.2 bits (71), Expect = 3.2
Identities = 18/98 (18%), Positives = 27/98 (27%), Gaps = 13/98 (13%)
Query: 46 TAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAP-----PSTTAEKDNATYEKAMKIAQLQA 100
AP A A A P PA P+ A + A + +
Sbjct: 405 AAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPV 464
Query: 101 QIQSKLSTGVLKPSVPGAKPPGAPATDKPTPLILDSEG 138
++ + + + P APA L EG
Sbjct: 465 RVAPEPA-----VASAAPAPAAAPA---AARLTPTEEG 494
Score = 30.5 bits (69), Expect = 6.3
Identities = 18/76 (23%), Positives = 21/76 (27%), Gaps = 1/76 (1%)
Query: 56 PAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQSKLSTGVLKPSV 115
PA A + K P P A A A A A A + + P
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAP-AAAASAPAAPPAAAPPA 424
Query: 116 PGAKPPGAPATDKPTP 131
P A P A P
Sbjct: 425 PVAAPAAAAPAAAPAA 440
>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
Length = 328
Score = 31.1 bits (70), Expect = 3.3
Identities = 19/68 (27%), Positives = 24/68 (35%), Gaps = 1/68 (1%)
Query: 35 ERKKALLALKGTAPVAPTPVVPAGVTPAIIGSAKIP-PLMPPAPPSTTAEKDNATYEKAM 93
RK+A++ K A T P TP A PA ST A K+ AT
Sbjct: 157 ARKQAVIEPKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQ 216
Query: 94 KIAQLQAQ 101
+ Q
Sbjct: 217 TASPAQTT 224
>gnl|CDD|221313 pfam11917, DUF3435, Protein of unknown function (DUF3435). This
family of proteins are functionally uncharacterized.
This protein is found in eukaryotes. Proteins in this
family are typically between 435 to 791 amino acids in
length. This family is related to pfam00589 suggesting
it may be an integrase enzyme.
Length = 418
Score = 31.2 bits (71), Expect = 3.6
Identities = 23/110 (20%), Positives = 39/110 (35%), Gaps = 13/110 (11%)
Query: 376 QMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHVALRISNLMRVLGTEAVQDP 435
+ A R + R LT EQK + + E L ++ + G V
Sbjct: 259 RAATRMSRTRDPRRPRDLTDEQKAS--VEEDPELQELIRKRDHLKKEIIALYGQ--VAKA 314
Query: 436 TKMEAHVRAQMAKRQKDHEEANASRKLTVEQKREKKIRKIKEDTSLEVHV 485
+ R + +R E N ++L R + +KI+E+ E V
Sbjct: 315 KGTPLYERLEKRRR----EVRNERQRL-----RRELKKKIREEFDEEQPV 355
>gnl|CDD|146548 pfam03967, PRCH, Photosynthetic reaction centre, H-chain N-terminal
region. The family corresponds the N-terminal
cytoplasmic domain.
Length = 135
Score = 29.9 bits (68), Expect = 3.7
Identities = 11/32 (34%), Positives = 16/32 (50%), Gaps = 4/32 (12%)
Query: 321 LRRQNRREAWKEEQEK----IRLGLEPPPEPK 348
LRR+++RE + E + G P P PK
Sbjct: 30 LRREDKREGYPLESDGGGRLPNEGPPPLPSPK 61
>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
Length = 400
Score = 30.9 bits (70), Expect = 4.0
Identities = 24/89 (26%), Positives = 31/89 (34%), Gaps = 10/89 (11%)
Query: 60 TPAIIGSAKIPPLMPPAPPSTTAEKDNA-----TYEKAMKIAQLQAQIQSKLSTGVLKPS 114
P +A +PP P S AE + A T KA + Q T +P+
Sbjct: 232 APWQGDAAPVPPSEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPA 291
Query: 115 VPGAKPPGAPAT-----DKPTPLILDSEG 138
A A A DKP PL + G
Sbjct: 292 PKEAAKAAAAAAKPALEDKPRPLGIARPG 320
>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1. The
overall function of the full-length Med25 is efficiently
to coordinate the transcriptional activation of RAR/RXR
(retinoic acid receptor/retinoic X receptor) in higher
eukaryotic cells. Human Med25 consists of several
domains with different binding properties, the
N-terminal, VWA, domain, this SD1 - synapsin 1 - domain
from residues 229-381, a PTOV(B) or ACID domain from
395-545, an SD2 domain from residues 564-645 and a
C-terminal NR box-containing domain (646-650) from
646-747. This The function of the SD domains is unclear.
Length = 168
Score = 30.2 bits (67), Expect = 4.0
Identities = 25/102 (24%), Positives = 38/102 (37%), Gaps = 18/102 (17%)
Query: 48 PVAPTPVVPAGVTPAIIGSAKIPPLMPPAPP------STTAEKDNATYEKAMKIAQLQAQ 101
P + PA V P A PL P PP + + + +KA++ A+ Q Q
Sbjct: 14 SKQPVSLPPAAVLPPQSLPAPQNPLPPVTPPQMQVPQNVSLHAAHDAAQKAVEAAKNQKQ 73
Query: 102 -IQSKLSTGVLKPSVPGAK-----------PPGAPATDKPTP 131
++++ S P PPG P KP+P
Sbjct: 74 GLKNRFSPITPLQQAPIVGPPFSQAPAPVLPPGPPGAPKPSP 115
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 31.0 bits (70), Expect = 4.1
Identities = 23/89 (25%), Positives = 28/89 (31%), Gaps = 5/89 (5%)
Query: 46 TAPVA---PTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQI 102
APVA P PA PA P P A + A A ++ L A
Sbjct: 379 AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAA-APARRSPAPEALAAAR 437
Query: 103 QSKLSTGVLKPSVPGAKPPGAPATDKPTP 131
Q+ P+ P P APA
Sbjct: 438 QASARGPGGAPA-PAPAPAAAPAAAARPA 465
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 31.1 bits (71), Expect = 4.1
Identities = 12/85 (14%), Positives = 19/85 (22%)
Query: 47 APVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQSKL 106
A AP +A P P P+ + + +
Sbjct: 402 AAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAA 461
Query: 107 STGVLKPSVPGAKPPGAPATDKPTP 131
+ P+ A P A P
Sbjct: 462 PSAQPAPAPAAAPEPTAAPAPAPPA 486
Score = 31.1 bits (71), Expect = 4.2
Identities = 21/87 (24%), Positives = 29/87 (33%), Gaps = 1/87 (1%)
Query: 45 GTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQS 104
G AP AP P A G+A P + A + + + + AQ +
Sbjct: 675 GAAPAAPPPAPAPAAPAAPAGAAP-AQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSP 733
Query: 105 KLSTGVLKPSVPGAKPPGAPATDKPTP 131
V P P P A A +P P
Sbjct: 734 AADDPVPLPPEPDDPPDPAGAPAQPPP 760
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 30.8 bits (70), Expect = 4.5
Identities = 66/368 (17%), Positives = 133/368 (36%), Gaps = 73/368 (19%)
Query: 146 KQVQLTHVVPTLKANIRAKKREEFH-EKLKEKPTEDLSESKFFDPRISAKSSVRTKRALK 204
K + +T V + + ++ E L EKP EDL F +++++ KR K
Sbjct: 10 KNIDITDV----EQFLEEQRDERITGGVLAEKPDEDL----FVVDTTGDEATIKKKRRKK 61
Query: 205 FHEPGK-FQQLAEQLRVKAQLAKL-QADISKNARKTGISSATKLALIAPKMEDDQDEMPE 262
+P K + L + +V A K +K+ S +L + K+ + +
Sbjct: 62 LKKPLKSDEILENRSKVPAPSVKKRHVKKKSKKKKSKQVSRKELLRL-KKLARGKKGG-K 119
Query: 263 VEWWDAVIMVEETYEKENNIKTSAITN--LVEHPI------QMK-PPSDMAKPV------ 307
+ +A + + +E T L EH + +K P +A
Sbjct: 120 AKLLNASKDNYDLWGEEEPPDTPKWPKKDLAEHTLEQVAKKTVKTAPISLAANGKPVPAV 179
Query: 308 --------YMPVF-----LTDKERKK-LRRQNRREAWKEEQEKIRLGLEPPPEPKLRISN 353
Y P F L +E +K ++ + +R+ + +EK + P +S
Sbjct: 180 EVPHAGASYNPSFEDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSE 239
Query: 354 LMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANAS-------RKLTVEQKREKKIRKI 406
+ E D + E+ + + E RK ++ +EK+ +++
Sbjct: 240 GLL----EESDDDGEEES---DDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKEL 292
Query: 407 KEDTSLEVHVALRISNLMRVLGTEAVQDPTKMEAHVRAQMAKRQKDHEEANASRKLTVEQ 466
+ + E + +++ L R+ ++AK E+A A +K ++
Sbjct: 293 EREAKEEKQLKKKLAQLARL-----------------KEIAKEVAQKEKARARKKEQRKE 335
Query: 467 KREKKIRK 474
+ EKK K
Sbjct: 336 RGEKKKLK 343
>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal. Mediator is a
large complex of up to 33 proteins that is conserved
from plants to fungi to humans - the number and
representation of individual subunits varying with
species. It is arranged into four different sections, a
core, a head, a tail and a kinase-activity part, and the
number of subunits within each of these is what varies
with species. Overall, Mediator regulates the
transcriptional activity of RNA polymerase II but it
would appear that each of the four different sections
has a slightly different function. Mediator subunit
Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
transcriptional co-repressor.
Length = 381
Score = 30.8 bits (69), Expect = 4.6
Identities = 24/100 (24%), Positives = 28/100 (28%), Gaps = 11/100 (11%)
Query: 42 ALKGTAPVAPTPVVPAGV--------TPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAM 93
A A A P GV A + P P T + A KA
Sbjct: 153 AKVLKANAASAPNTTTGVGSAATTAAISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQ 212
Query: 94 KIAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTPLI 133
AQ QAQ + G L PTPL+
Sbjct: 213 ASAQAQAQASAYNQMGSLGVPQNT---SMLAQIPNPTPLM 249
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 31.0 bits (70), Expect = 5.0
Identities = 16/93 (17%), Positives = 27/93 (29%), Gaps = 3/93 (3%)
Query: 37 KKALLALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPP--STTAEKDNATYEKAMK 94
+ A P AG A +A APP + + A
Sbjct: 386 RAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADG 445
Query: 95 IAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATD 127
A + A+ ++ S + A+PP +
Sbjct: 446 DAPVPAKANARASA-DSRCDERDAQPPADSGSA 477
>gnl|CDD|177816 PLN02156, PLN02156, gibberellin 2-beta-dioxygenase.
Length = 335
Score = 30.4 bits (68), Expect = 5.1
Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 8/91 (8%)
Query: 274 ETYEKENNIKTSAITNLVEHPIQMKPPSDMAKPVYMPVFLTDKERKKLRRQNRREAWKEE 333
E Y KE +S + +VE ++++P ++K + KE R N +E
Sbjct: 138 EEYMKEMKRMSSKVLEMVEEELKIEPKEKLSK------LVKVKESDSCLRMNHYPEKEET 191
Query: 334 QEKIRLGLEPPPEPKLRISNLMRVLGTEAVQ 364
EK+ +G +P+L IS L+R T +Q
Sbjct: 192 PEKVEIGFGEHTDPQL-IS-LLRSNDTAGLQ 220
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 30.2 bits (68), Expect = 5.4
Identities = 27/105 (25%), Positives = 30/105 (28%), Gaps = 22/105 (20%)
Query: 39 ALLALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMP-PAPPSTTAEKDNATYEKAMKIAQ 97
+AL TA P P P T A + PP P PA P D
Sbjct: 29 VAIALPATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPP----- 83
Query: 98 LQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTPLILDSEGRTID 142
P+ P A PP P D P E ID
Sbjct: 84 --------------PPADPNAPPP--PPVDPNAPPPPAPEPGRID 112
>gnl|CDD|150430 pfam09755, DUF2046, Uncharacterized conserved protein H4 (DUF2046).
This is the conserved N-terminal 350 residues of a
family of proteins of unknown function possibly
containing a coiled-coil domain.
Length = 308
Score = 30.3 bits (68), Expect = 5.5
Identities = 21/94 (22%), Positives = 36/94 (38%), Gaps = 19/94 (20%)
Query: 322 RRQNRREAWKEE-QEKIRLGLEPPPEPKLRISNLMRVLGTEAVQDPTKMEAHVRA----- 375
+R ++ EA K QEK+ + PP P+ + A + +H+ +
Sbjct: 185 KRMDKLEAEKRMLQEKLDQPVSEPPSPRDIFTE--------AEDTVGGIASHITSLRSEV 236
Query: 376 -----QMAKRQKDHEEANASRKLTVEQKREKKIR 404
+AK + D+ E Q RE+ IR
Sbjct: 237 RRLRQNLAKSEADYTEKMEQYAKEERQTREENIR 270
>gnl|CDD|233199 TIGR00939, 2a57, Equilibrative Nucleoside Transporter (ENT).
[Transport and binding proteins, Nucleosides, purines
and pyrimidines].
Length = 437
Score = 30.5 bits (69), Expect = 5.6
Identities = 14/69 (20%), Positives = 21/69 (30%), Gaps = 3/69 (4%)
Query: 634 GVVVILVSVLI--IYNDHFKSKIQKNISFPIPTEMIVIVAGALLSS-ILDVKHKYNLSNV 690
GV+ L +L+ ND K F P + +I L L Y +
Sbjct: 150 GVLTSLAMILVKASGNDSHGLKKSALGYFGTPCVVQLICIVCYLLLPKLPFARYYLQKKL 209
Query: 691 GKIPIGLPS 699
K +
Sbjct: 210 DKGAGEDET 218
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for centrosome
separation and maintenance of spindle bi-polarity. TPX2
is a microtubule-associated protein that mediates the
binding of the C-terminal domain of Xklp2 to
microtubules. It is phosphorylated during mitosis in a
microtubule-dependent way.
Length = 57
Score = 27.3 bits (61), Expect = 5.9
Identities = 11/20 (55%), Positives = 16/20 (80%)
Query: 157 LKANIRAKKREEFHEKLKEK 176
L ++ RA+KR EF +KL+EK
Sbjct: 1 LHSDERAEKRAEFDKKLEEK 20
>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
Length = 209
Score = 29.8 bits (67), Expect = 6.5
Identities = 23/120 (19%), Positives = 35/120 (29%), Gaps = 10/120 (8%)
Query: 18 PQNQIKNMMAHAQKMIEERKKALL-ALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPA 76
P+ ++ M ++ E K L A + P TP P P
Sbjct: 74 PRTSLRQAMNPLRQAGNEIKSDLQKATSMSEPATENKP-AEVTTPVEPMGLPETP--PAV 130
Query: 77 PPSTTAEKDNATYE----KAMKIAQLQAQIQSKLSTGVLKP--SVPGAKPPGAPATDKPT 130
P A A A+ +A+ + KP ++ K A KPT
Sbjct: 131 PVPAPAPAVAAAAAQAAAAPKAPAKPRAKSPRPAAKAAPKPTETITAKKAKKTAAAPKPT 190
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 30.3 bits (67), Expect = 7.0
Identities = 26/98 (26%), Positives = 32/98 (32%), Gaps = 2/98 (2%)
Query: 37 KKALLALKGTAPVAPTPVVPA--GVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMK 94
K A K AP A PA PA + PPA + K A KA
Sbjct: 252 KAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAA 311
Query: 95 IAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTPL 132
A +K + K + P AK PA P+
Sbjct: 312 APAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAAAPV 349
Score = 29.9 bits (66), Expect = 7.6
Identities = 26/97 (26%), Positives = 31/97 (31%), Gaps = 2/97 (2%)
Query: 37 KKALLALKGTAPVAPTPVVPA--GVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMK 94
K A K AP A PA PA + PPA + K A KA
Sbjct: 224 KAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAA 283
Query: 95 IAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTP 131
A +K + K + AK APA P
Sbjct: 284 PPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPP 320
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 29.8 bits (66), Expect = 7.4
Identities = 25/84 (29%), Positives = 34/84 (40%), Gaps = 3/84 (3%)
Query: 45 GTAPVAPTP--VVPAGVTPAIIGSAKIPPLMPPAP-PSTTAEKDNATYEKAMKIAQLQAQ 101
G +P+AP+P PA PA +A P + PAP P+ A+ A +
Sbjct: 78 GQSPLAPSPACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPA 137
Query: 102 IQSKLSTGVLKPSVPGAKPPGAPA 125
ST P+ P P APA
Sbjct: 138 PACPPSTRQCPPAPPLPTPKPAPA 161
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 30.2 bits (68), Expect = 7.6
Identities = 14/98 (14%), Positives = 31/98 (31%), Gaps = 13/98 (13%)
Query: 46 TAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQSK 105
T TP + + P + T + + + + + ++
Sbjct: 32 VTTTPATSTEAEQTTPVESDATEEADNT-ETPVAATTAAEAPSSSETAETSDPTSEATD- 89
Query: 106 LSTGVLKPSVPGAKPPGAPATDKPTPLILDSEGRTIDI 143
+T +V AT+ P+ EG+T+D+
Sbjct: 90 -TTTSEARTVT------PAATETSKPV----EGQTVDV 116
>gnl|CDD|180508 PRK06281, PRK06281, putative monovalent cation/H+ antiporter
subunit B; Reviewed.
Length = 154
Score = 29.0 bits (65), Expect = 7.7
Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 9/82 (10%)
Query: 569 LGSLSVLMSDIMISGFTTGTAIL---VILSQIKHVFGIKVKRHIGPFNVIYTLIDVAENI 625
LG +++L I G G A++ IL + V+G++ PFN + + + E+
Sbjct: 19 LGIMTILGGHITPGGGFQGGAMIAAGFILCIV--VYGLEKS----PFNFSHEFMSIMESF 72
Query: 626 HKTNYVAFGVVVILVSVLIIYN 647
YV G+ + S +YN
Sbjct: 73 GALGYVGLGLCGLFFSGSFLYN 94
>gnl|CDD|150291 pfam09577, Spore_YpjB, Sporulation protein YpjB (SpoYpjB). These
proteins are found in the endospore-forming bacteria
which include Bacillus species. In Bacillus subtilis,
ypjB was found to be part of the sigma-E regulon.
Sigma-E is a sporulation sigma factor that regulates
expression in the mother cell compartment. Null mutants
of ypjB show a sporulation defect, but this gene is not,
however, a part of the endospore formation minimal gene
set.
Length = 232
Score = 29.6 bits (67), Expect = 8.4
Identities = 22/89 (24%), Positives = 34/89 (38%), Gaps = 9/89 (10%)
Query: 149 QLTHVVPTLKANIRAKKREEFHEKLKEKPTEDLSESKFFDPRI----SAKSSVRTKRALK 204
+ +K + + E F EKL E LS P + S K R L
Sbjct: 102 PVMDAFSAMKEAAQKEDSEAFQEKLNEF----LSLYDMIYPSLTIDVSEKQLQRVDSHLS 157
Query: 205 FHEPGKFQQLAEQLRVKAQLAKLQADISK 233
E +FQQL+++ +L L+ D+
Sbjct: 158 VLENFRFQQLSKK-SKLERLTLLEGDLKA 185
>gnl|CDD|223659 COG0586, DedA, Uncharacterized membrane-associated protein
[Function unknown].
Length = 208
Score = 29.2 bits (66), Expect = 8.8
Identities = 36/204 (17%), Positives = 73/204 (35%), Gaps = 56/204 (27%)
Query: 615 IYTLIDVAENIHKTNYVAFGVVVILVSVLIIYNDHFKSKIQKNISFPIPTEMIVIVAGAL 674
L+ + Y+ +++ L S L++ P+P E+++++AGA
Sbjct: 2 DMELLILWIQEGSLGYLGVFLILFLESGLLVG-------------PPLPGEVLLLLAGA- 47
Query: 675 LSSILDVKHKYNLSNVGKIPIGLPSPQPPPFYLIPKLMLDGLFISIVAFSIN--ISMASI 732
L+ GK+ + L +I L L ++++ I +
Sbjct: 48 ------------LAAQGKLNLWL---------VILVATLGALLGDLISYWIGRRFGRKLL 86
Query: 733 LAKKKYKIDSNQELLAS-------GVSNIFASFFSCIPFASSLSRSLVQLQVG------G 779
Y++ ++L + G+ IF F IP R+LV + G
Sbjct: 87 RKLWSYRLLKRKKLDKAELLFERHGLFAIFLGRF--IPGV----RTLVPIVAGMSKMPLR 140
Query: 780 KTQLASGVSCVLLVLILLYIGPFF 803
+ L + + +L L+L +G
Sbjct: 141 RFLLYNILGALLWALVLTLLGYLL 164
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 28.6 bits (64), Expect = 9.2
Identities = 18/56 (32%), Positives = 20/56 (35%), Gaps = 2/56 (3%)
Query: 28 HAQKMIEERKKALLALKGTAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAE 83
H M +E +K L L AP P PV P PP PA P A
Sbjct: 67 HILSMEQEARKILAPLTPPAP--PEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAA 120
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 29.7 bits (67), Expect = 9.4
Identities = 22/129 (17%), Positives = 31/129 (24%), Gaps = 14/129 (10%)
Query: 46 TAPVAPTPVVPAGVTPAIIGSAKIPPLMPPAPPSTTAEKDNATYEKAMKIAQLQAQIQSK 105
T P A A + P AP S T A + S
Sbjct: 384 TQPAAAPQPSAAAAASPSPSQSSAAA-QPSAPQSATQPAGTPPTVSVDPPAAVPVNPPST 442
Query: 106 LSTGVLKPSVPGAKPPGAPATDKPTPLILDSEGRTID--ITGKQVQLTHVVPTLKANIRA 163
+P K + S G + I K Q T +
Sbjct: 443 AP--------QAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIKEA---PTG 491
Query: 164 KKREEFHEK 172
++E F E+
Sbjct: 492 TQKEIFTEE 500
>gnl|CDD|216329 pfam01148, CTP_transf_1, Cytidylyltransferase family. The members
of this family are integral membrane protein
cytidylyltransferases. The family includes phosphatidate
cytidylyltransferase EC:2.7.7.41 as well as Sec59 from
yeast. Sec59 is a dolichol kinase EC:2.7.1.108.
Length = 259
Score = 29.3 bits (66), Expect = 9.6
Identities = 18/130 (13%), Positives = 50/130 (38%), Gaps = 14/130 (10%)
Query: 780 KTQLASGVSCVLLVLILLYIGPFFETLPYCVLTSIVIVAVKGMLNQVKDLKLAFKESYTE 839
K ++ + + +L+ LILL++GP +L I+ + L ++ ++L + +
Sbjct: 1 KQRIITHIVLILIFLILLFLGPL-----IFLLLVAAIIIL--ALYEL--IRLLRLKFSSR 51
Query: 840 ACVWCLTFLSVVLLDVDYGLGIGVLCSLIFVVVTGQKVMVYKLGRLIDSNIYVEEDFYES 899
+ + L + L +L +V ++V + L+ + +
Sbjct: 52 LLLLTRGLGLIFYLLGLLLVYGEGLLALFGLV-----LLVIPVLWLLVLPVLAYHRVSST 106
Query: 900 AIDVPGIVIL 909
+ + + +
Sbjct: 107 LLPLFYVGLF 116
>gnl|CDD|225179 COG2270, COG2270, Permeases of the major facilitator superfamily
[General function prediction only].
Length = 438
Score = 29.6 bits (67), Expect = 9.6
Identities = 15/73 (20%), Positives = 28/73 (38%), Gaps = 9/73 (12%)
Query: 727 ISMASILAKKKYKIDSNQELLASGVSNIFASFFSCIPFASSLSRSLVQLQVGGKTQLASG 786
++M + + S + LL ++ A+ + I A L G K L
Sbjct: 272 LAMGGVFGAADLGLSSTELLLIGIALSVVAALGAII--AGFLDERF-----GSKPVLM-- 322
Query: 787 VSCVLLVLILLYI 799
+ V+L + LY+
Sbjct: 323 IGLVILSIAALYL 335
>gnl|CDD|237541 PRK13881, PRK13881, conjugal transfer protein TrbI; Provisional.
Length = 472
Score = 29.7 bits (67), Expect = 9.7
Identities = 22/117 (18%), Positives = 41/117 (35%), Gaps = 3/117 (2%)
Query: 18 PQNQIKNMMAHAQKMIEERKKALLALKG--TAPVAPTPVVPAGVTPAIIGSAKIPPLMPP 75
P+ + N A+++ ++ L+ P PT A + A + PP P
Sbjct: 63 PKEKAGNTSMFAKEIAGDQTGGLIEPASPLKVPEMPTGPASAPLPIARPDNPDAPPTPPA 122
Query: 76 APPSTTAEKDNATYEKAM-KIAQLQAQIQSKLSTGVLKPSVPGAKPPGAPATDKPTP 131
P + D+ M K+ + +++K + V P G+ P G
Sbjct: 123 NPGNPGQVNDDEAQRIRMAKLQMFEEAVKAKTTVRVDAPRSNGSAPGGPSTYTGTPQ 179
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 30.0 bits (67), Expect = 9.9
Identities = 24/89 (26%), Positives = 30/89 (33%), Gaps = 10/89 (11%)
Query: 46 TAPVAPTPVVPAGVTPAIIGSAKIP-----PLMPPAPPSTTAEKDNATYEKAMKIAQLQA 100
P APTP+ P P G A+ P PPA A A +A A
Sbjct: 696 PPPRAPTPMRPPAAPP---GRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPG 752
Query: 101 QIQSKLSTGVLKPSVPGAKPPGAPATDKP 129
+ + + P A PGAP P
Sbjct: 753 RARP--PAAAPGRARPPAAAPGAPTPQPP 779
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.134 0.383
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 56,945,342
Number of extensions: 5767789
Number of successful extensions: 7706
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7506
Number of HSP's successfully gapped: 175
Length of query: 1124
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 1017
Effective length of database: 6,191,724
Effective search space: 6296983308
Effective search space used: 6296983308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.5 bits)