Citrus Sinensis ID: 019667


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------
MELDHLGGTFGGNRKPTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRLTGTDIDIYRYLEPLYNDYRKLRQKSGDGRFILTHVDEVIDELLTKDYSCDIALPRIKKRWNLETVGALEPRKSVLEDDFEEEEEKEENDQLDDGLEDGAYERDYHHGRSPTRERDRDRRRDSYRYRDRDYDRDYDRDRGRGRERDRGRDRDRDRDRDRDRERDRDRNRYRLRDDREYGRERDRDREREGRDRERRDRDRGRRRSRSRSRSRSRDRKRHARSSMSPRRDRDEPEDVGTRAEPKKKKEKKEKKDDGTDHPDPEIAEANRIRASLGLKPLKL
cccEEEcccccccccccHHHHHHHHHHcccccHHHHHHHHHcccccHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHcccccccEEEEEHHHHHHHHHHHHccccccccccccccHHHHccccccccccccccHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHcccccccccccccccHHcccccccccccccccccccccccccHHHHHccccccccccccccccccccccccHHHHHHHcccccccccccHHHHHHHHHHHHcccccccc
ccEEEEEccccccccccHHHHHHHHHHHccccHHHHHHHHcccHHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHHHcccccEEEEEHHHHHHHHHccccEEccEccccccHHHHHHcccccccccccHHHHHHHccHHHHcccccccccccccccccccccccccccccccccccccccccccccccHHccccccccccccccccccccccccccccccccccccccccccccccccccccccHccccccccccccccccccccccccccccccccccccccccccccccccccHHHccccccccccccccHHHHHHHHHHHHHccccccc
meldhlggtfggnrkptpFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRltgtdidiyrYLEPLYNDYRKlrqksgdgrfILTHVDEVIDELLtkdyscdialprikkrwnletvgaleprksvleddfeeeeekeendqlddgledgayerdyhhgrsptrerdrdrrrdsyryrdrdydrdydrdrgrgrerdrgrdrdrdrdrdrdrerdrdrnryrlrddreygrerdrdreregrdrerrdrdrgrrrsrsrsrsrsrdrkrharssmsprrdrdepedvgtraepkkkkekkekkddgtdhpdpeiAEANRIRAslglkplkl
meldhlggtfggnrkpTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLrltgtdidiyrYLEPLYNDYRKLRQKSGDGRFILTHVDEVIDELLTkdyscdialprikkrwnletvgaleprksvleddfeeeeekeendqlddgledgayerdyhhgrsptrerdrdrrrdsyryrdrdydrdydrdrgrgrerdrgrdrdrdrdrdrdrerdrdrnryrlrddreygrerdrdreregrdrerrdrdrgrrrsrsrsrsrsrdrkrharssmsprrdrdepedvgtraepkkkkekkekkddgtdhpdpeiaeanriraslglkplkl
MELDHLGGTFGGNRKPTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRLTGTDIDIYRYLEPLYNDYRKLRQKSGDGRFILTHVDEVIDELLTKDYSCDIALPRIKKRWNLETVGALEPRKSVLeddfeeeeekeendqlddGLEDGAYERDYHHGRSPTrerdrdrrrdsyryrdrdydrdydrdrgrgrerdrgrdrdrdrdrdrdrerdrdrnryrlrddreygrerdrdreregrdrerrdrdrgrrrsrsrsrsrsrdrKRHARSSMSPRRDRDEPEDVGTRAEPkkkkekkekkDDGTDHPDPEIAEANRIRASLGLKPLKL
***************PTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRLTGTDIDIYRYLEPLYNDYRKLRQKSGDGRFILTHVDEVIDELLTKDYSCDIALPRIKKRWNLETVGAL*******************************************************************************************************************************************************************************************************************
*ELDHLGGTFGGNRKPTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRLTGTDIDIYRYLEPLYNDYRKLRQKSGDGRFILTHVDEVIDELLTKDYSCDIALPRIKKRWNLETVGALEPRKSV******************************************************************************************************************************************************************************************************GLKPL**
MELDHLGGTFGGNRKPTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRLTGTDIDIYRYLEPLYNDYRKLRQKSGDGRFILTHVDEVIDELLTKDYSCDIALPRIKKRWNLETVGALEPRKSVLE***************DDGLEDGAYERDY***************RDSYRYRDRDYDRDYDR***************************RDRNRYRLRDDRE*********************************************************************************PDPEIAEANRIRASLGLKPLKL
MELDHLGGTFGGNRKPTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRLTGTDIDIYRYLEPLYNDYRKLRQKSGDGRFILTHVDEVIDELLTKDYSCDIALPRIKKRWNLETVGALEPRKSVL*DD*EEEE*****************************************************************************************************************************************************************************************R*SLGL*****
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MELDHLGGTFGGNRKPTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRLTGTDIDIYRYLEPLYNDYRKLRQKSGDGRFILTHVDEVIDELLTKDYSCDIALPRIKKRWNLETVGALEPRKSVLEDDFEEEEEKEENDQLDDGLEDGAYERDYHHGRSPTRERDRDRRRDSYRYRDRDYDRDYDRDRGRGRERDRGRDRDRDRDRDRDRERDRDRNRYRLRDDREYGRERDRDREREGRDRERRDRDRGRRRSRSRSRSRSRDRKRHARSSMSPRRDRDEPEDVGTRAEPKKKKEKKEKKDDGTDHPDPEIAEANRIRASLGLKPLKL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query337 2.2.26 [Sep-21-2011]
Q4FK66312 Pre-mRNA-splicing factor yes no 0.406 0.439 0.649 1e-50
Q5RDD2312 Pre-mRNA-splicing factor yes no 0.406 0.439 0.656 2e-50
Q8HXH6312 Pre-mRNA-splicing factor N/A no 0.406 0.439 0.656 2e-50
Q8NAV1312 Pre-mRNA-splicing factor yes no 0.406 0.439 0.656 2e-50
Q0P5I6312 Pre-mRNA-splicing factor yes no 0.406 0.439 0.656 2e-50
Q6DHU4313 Pre-mRNA-splicing factor yes no 0.406 0.437 0.642 2e-49
Q4FZQ6312 Pre-mRNA-splicing factor N/A no 0.406 0.439 0.635 9e-49
Q28H87312 Pre-mRNA-splicing factor yes no 0.406 0.439 0.627 1e-48
Q9UUD2210 Pre-mRNA-splicing factor yes no 0.335 0.538 0.460 2e-25
F1QBY1 2876 Nipped-B-like protein B O no no 0.305 0.035 0.379 8e-09
>sp|Q4FK66|PR38A_MOUSE Pre-mRNA-splicing factor 38A OS=Mus musculus GN=Prpf38a PE=1 SV=1 Back     alignment and function desciption
 Score =  200 bits (509), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 89/137 (64%), Positives = 110/137 (80%)

Query: 1   MELDHLGGTFGGNRKPTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRLTG 60
           MEL  +GG +GGN KPTPF+CL +KMLQIQPEKDI+VEFIKN+D+KYVR+LGA Y+RLTG
Sbjct: 52  MELKFVGGVYGGNIKPTPFLCLTLKMLQIQPEKDIIVEFIKNEDFKYVRMLGALYMRLTG 111

Query: 61  TDIDIYRYLEPLYNDYRKLRQKSGDGRFILTHVDEVIDELLTKDYSCDIALPRIKKRWNL 120
           T ID Y+YLEPLYNDYRK++ ++ +G F+L HVDE I ELL  +  CDI LPR++KR+ L
Sbjct: 112 TAIDCYKYLEPLYNDYRKIKSQNRNGEFVLMHVDEFIYELLHSERVCDIILPRLQKRYVL 171

Query: 121 ETVGALEPRKSVLEDDF 137
           E    LEPR S LE+D 
Sbjct: 172 EEAEQLEPRVSALEEDM 188




May be required for pre-mRNA splicing.
Mus musculus (taxid: 10090)
>sp|Q5RDD2|PR38A_PONAB Pre-mRNA-splicing factor 38A OS=Pongo abelii GN=PRPF38A PE=2 SV=1 Back     alignment and function description
>sp|Q8HXH6|PR38A_MACFA Pre-mRNA-splicing factor 38A OS=Macaca fascicularis GN=PRPF38A PE=2 SV=1 Back     alignment and function description
>sp|Q8NAV1|PR38A_HUMAN Pre-mRNA-splicing factor 38A OS=Homo sapiens GN=PRPF38A PE=1 SV=1 Back     alignment and function description
>sp|Q0P5I6|PR38A_BOVIN Pre-mRNA-splicing factor 38A OS=Bos taurus GN=PRPF38A PE=2 SV=1 Back     alignment and function description
>sp|Q6DHU4|PR38A_DANRE Pre-mRNA-splicing factor 38A OS=Danio rerio GN=prpf38a PE=2 SV=1 Back     alignment and function description
>sp|Q4FZQ6|PR38A_XENLA Pre-mRNA-splicing factor 38A OS=Xenopus laevis GN=prpf38a PE=2 SV=1 Back     alignment and function description
>sp|Q28H87|PR38A_XENTR Pre-mRNA-splicing factor 38A OS=Xenopus tropicalis GN=prpf38a PE=2 SV=1 Back     alignment and function description
>sp|Q9UUD2|PRP38_SCHPO Pre-mRNA-splicing factor 38 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=prp38 PE=3 SV=1 Back     alignment and function description
>sp|F1QBY1|NIPLB_DANRE Nipped-B-like protein B OS=Danio rerio GN=nipblb PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query337
388513517385 unknown [Medicago truncatula] 0.976 0.854 0.707 7e-82
317106643390 JHL23J11.3 [Jatropha curcas] 0.498 0.430 0.887 9e-74
225450673393 PREDICTED: uncharacterized protein LOC10 0.557 0.478 0.829 1e-72
296089736326 unnamed protein product [Vitis vinifera] 0.557 0.576 0.829 2e-72
356576821383 PREDICTED: uncharacterized protein LOC10 0.501 0.441 0.810 3e-71
147863592235 hypothetical protein VITISV_013614 [Viti 0.545 0.782 0.826 2e-70
363808110381 uncharacterized protein LOC100792466 [Gl 0.501 0.443 0.798 3e-69
357441821385 Pre-mRNA-splicing factor 38A [Medicago t 0.563 0.493 0.789 9e-68
224125088297 predicted protein [Populus trichocarpa] 0.394 0.447 0.902 1e-67
148906831400 unknown [Picea sitchensis] 0.572 0.482 0.723 1e-66
>gi|388513517|gb|AFK44820.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
 Score =  310 bits (794), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 242/342 (70%), Positives = 278/342 (81%), Gaps = 13/342 (3%)

Query: 1   MELDHLGGTFGGNRKPTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRLTG 60
           MELDHLGGT+GGNRKPTPFMCLVMKMLQIQPEK+IV+EFIKNDDYKYVR+LGAFYLRLTG
Sbjct: 52  MELDHLGGTYGGNRKPTPFMCLVMKMLQIQPEKEIVIEFIKNDDYKYVRILGAFYLRLTG 111

Query: 61  TDIDIYRYLEPLYNDYRKLRQKSGDGRFILTHVDEVIDELLTKDYSCDIALPRIKKRWNL 120
           +D D+Y Y EPLYNDYRKLR+K  DG+F LTHVDEVIDELLT DYSCDIA+PRIKKRW L
Sbjct: 112 SDTDVYHYPEPLYNDYRKLRRKLPDGQFALTHVDEVIDELLTTDYSCDIAMPRIKKRWTL 171

Query: 121 ETVGALEPRKSVLEDDFEEEEEKEENDQLDDGLEDGAYERDYHHGRSPTRERDRDRRRDS 180
           E++GALEPR+S LE+DFEEEEE E+N+Q  +       E+DY+ GRSP RERDRDRRRDS
Sbjct: 172 ESLGALEPRQSALEEDFEEEEENEDNEQPAE-----EPEKDYNRGRSPARERDRDRRRDS 226

Query: 181 YRYRDRDYDRDYDRDRGRGRERDRGRDRDRDRDRDRDRERDRDRNRYRLRDDREYGRERD 240
           +R+RDRDYDR+YDRD  R R R R RDRDRDR+++RDR+R+RDR RYRLR++++YGRER+
Sbjct: 227 HRHRDRDYDREYDRDYDRERGRGRDRDRDRDREKERDRDRERDRGRYRLREEKDYGRERE 286

Query: 241 RDREREGRDRERRDRDRGRRRSRSRSRSRSRD-----RKRHARSSMSPRRDRDEPEDVGT 295
                       R R R   RSRSRSR R        RKRHARSS+SPRRD  E    G 
Sbjct: 287 GRERERRDRDRDRGRRRSYSRSRSRSRDRKDHDGGDYRKRHARSSVSPRRDGAED---GE 343

Query: 296 RAEPKKKKEKKEKKDDGTDHPDPEIAEANRIRASLGLKPLKL 337
             + K+KKEKKE+KDDGTDHPDPEIAEANRIRASLGLKPLK+
Sbjct: 344 PKKKKEKKEKKEEKDDGTDHPDPEIAEANRIRASLGLKPLKM 385




Source: Medicago truncatula

Species: Medicago truncatula

Genus: Medicago

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|317106643|dbj|BAJ53148.1| JHL23J11.3 [Jatropha curcas] Back     alignment and taxonomy information
>gi|225450673|ref|XP_002283131.1| PREDICTED: uncharacterized protein LOC100266508 [Vitis vinifera] Back     alignment and taxonomy information
>gi|296089736|emb|CBI39555.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356576821|ref|XP_003556528.1| PREDICTED: uncharacterized protein LOC100794094 [Glycine max] Back     alignment and taxonomy information
>gi|147863592|emb|CAN79776.1| hypothetical protein VITISV_013614 [Vitis vinifera] Back     alignment and taxonomy information
>gi|363808110|ref|NP_001242475.1| uncharacterized protein LOC100792466 [Glycine max] gi|255642457|gb|ACU21492.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|357441821|ref|XP_003591188.1| Pre-mRNA-splicing factor 38A [Medicago truncatula] gi|124359153|gb|ABN05680.1| PRP38 [Medicago truncatula] gi|355480236|gb|AES61439.1| Pre-mRNA-splicing factor 38A [Medicago truncatula] Back     alignment and taxonomy information
>gi|224125088|ref|XP_002319497.1| predicted protein [Populus trichocarpa] gi|222857873|gb|EEE95420.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|148906831|gb|ABR16561.1| unknown [Picea sitchensis] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query337
TAIR|locus:2064811355 AT2G40650 [Arabidopsis thalian 0.394 0.374 0.849 3e-77
UNIPROTKB|Q0P5I6312 PRPF38A "Pre-mRNA-splicing fac 0.394 0.426 0.661 3.3e-47
UNIPROTKB|Q8NAV1312 PRPF38A "Pre-mRNA-splicing fac 0.394 0.426 0.661 3.3e-47
MGI|MGI:1916962312 Prpf38a "PRP38 pre-mRNA proces 0.394 0.426 0.654 5.3e-47
ZFIN|ZDB-GENE-040801-225313 prpf38a "PRP38 pre-mRNA proces 0.394 0.424 0.646 8.6e-47
WB|WBGene00008380320 prp-38 [Caenorhabditis elegans 0.394 0.415 0.578 2e-45
FB|FBgn0050342330 Prp38 "pre-mRNA processing fac 0.394 0.403 0.639 2.8e-41
DICTYBASE|DDB_G0274361515 prpf38a "putative U4/U6.U5 sma 0.391 0.256 0.571 3.2e-41
GENEDB_PFALCIPARUM|PF11_0336386 PF11_0336 "hypothetical protei 0.388 0.339 0.462 1.9e-33
UNIPROTKB|Q8II38386 PF11_0336 "Pre-mRNA splicing f 0.388 0.339 0.462 1.9e-33
TAIR|locus:2064811 AT2G40650 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 627 (225.8 bits), Expect = 3.0e-77, Sum P(3) = 3.0e-77
 Identities = 113/133 (84%), Positives = 126/133 (94%)

Query:     1 MELDHLGGTFGGNRKPTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRLTG 60
             MELDHLGGTFGG+RKPTPF+CL++KMLQIQPEK+IVVEFIKNDDYKYVR+LGAFYLRLTG
Sbjct:    52 MELDHLGGTFGGSRKPTPFLCLILKMLQIQPEKEIVVEFIKNDDYKYVRILGAFYLRLTG 111

Query:    61 TDIDIYRYLEPLYNDYRKLRQKSGDGRFILTHVDEVIDELLTKDYSCDIALPRIKKRWNL 120
             TD+D+YRYLEPLYNDYRK+RQK  DG+F LTHVDEVI+ELLTKDYSCDIA+PR+KKRW L
Sbjct:   112 TDVDVYRYLEPLYNDYRKVRQKLSDGKFSLTHVDEVIEELLTKDYSCDIAMPRLKKRWTL 171

Query:   121 ETVGALEPRKSVL 133
             E  G LEPRKSVL
Sbjct:   172 EQNGLLEPRKSVL 184


GO:0005634 "nucleus" evidence=ISM
GO:0006396 "RNA processing" evidence=ISS
UNIPROTKB|Q0P5I6 PRPF38A "Pre-mRNA-splicing factor 38A" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|Q8NAV1 PRPF38A "Pre-mRNA-splicing factor 38A" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
MGI|MGI:1916962 Prpf38a "PRP38 pre-mRNA processing factor 38 (yeast) domain containing A" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-040801-225 prpf38a "PRP38 pre-mRNA processing factor 38 (yeast) domain containing A" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
WB|WBGene00008380 prp-38 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms
FB|FBgn0050342 Prp38 "pre-mRNA processing factor 38" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0274361 prpf38a "putative U4/U6.U5 small nuclear ribonucleoparticle-associated protein" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
GENEDB_PFALCIPARUM|PF11_0336 PF11_0336 "hypothetical protein, conserved" [Plasmodium falciparum (taxid:5833)] Back     alignment and assigned GO terms
UNIPROTKB|Q8II38 PF11_0336 "Pre-mRNA splicing factor, putative" [Plasmodium falciparum 3D7 (taxid:36329)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query337
pfam03371173 pfam03371, PRP38, PRP38 family 4e-60
PRK12678 672 PRK12678, PRK12678, transcription termination fact 2e-15
PRK12678 672 PRK12678, PRK12678, transcription termination fact 2e-15
PRK12678 672 PRK12678, PRK12678, transcription termination fact 5e-15
TIGR01642 509 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, la 2e-14
TIGR01642 509 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, la 5e-14
TIGR01642 509 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, la 1e-13
PRK12678 672 PRK12678, PRK12678, transcription termination fact 4e-13
TIGR01622 457 TIGR01622, SF-CC1, splicing factor, CC1-like famil 5e-10
PRK12678 672 PRK12678, PRK12678, transcription termination fact 6e-10
PRK12678 672 PRK12678, PRK12678, transcription termination fact 2e-09
PRK12678 672 PRK12678, PRK12678, transcription termination fact 9e-09
PRK12678 672 PRK12678, PRK12678, transcription termination fact 3e-07
TIGR01622 457 TIGR01622, SF-CC1, splicing factor, CC1-like famil 7e-07
TIGR01622 457 TIGR01622, SF-CC1, splicing factor, CC1-like famil 4e-06
PRK10811 1068 PRK10811, rne, ribonuclease E; Reviewed 6e-05
pfam07420377 pfam07420, DUF1509, Protein of unknown function (D 2e-04
PRK12678 672 PRK12678, PRK12678, transcription termination fact 3e-04
TIGR01622 457 TIGR01622, SF-CC1, splicing factor, CC1-like famil 3e-04
TIGR01622 457 TIGR01622, SF-CC1, splicing factor, CC1-like famil 4e-04
PRK10811 1068 PRK10811, rne, ribonuclease E; Reviewed 6e-04
TIGR01642 509 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, la 0.001
pfam02956 525 pfam02956, TT_ORF1, TT viral orf 1 0.001
pfam10243 506 pfam10243, MIP-T3, Microtubule-binding protein MIP 0.002
pfam03154 979 pfam03154, Atrophin-1, Atrophin-1 family 0.002
pfam06495182 pfam06495, Transformer, Fruit fly transformer prot 0.003
>gnl|CDD|217519 pfam03371, PRP38, PRP38 family Back     alignment and domain information
 Score =  190 bits (485), Expect = 4e-60
 Identities = 75/124 (60%), Positives = 96/124 (77%)

Query: 3   LDHLGGTFGGNRKPTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRLTGTD 62
           +DH+GGT+GGNR+P+PF+CL++K+LQIQP K+I+ E + N+D+KY+R LG FYLRLT   
Sbjct: 50  VDHIGGTYGGNRRPSPFLCLLLKLLQIQPSKEIIKELLNNEDFKYIRALGLFYLRLTYPP 109

Query: 63  IDIYRYLEPLYNDYRKLRQKSGDGRFILTHVDEVIDELLTKDYSCDIALPRIKKRWNLET 122
            D+Y++LEP   DYRKLR K GDG F LTH+DE +D+LLTKD  CD  LPRI KR  LE 
Sbjct: 110 ADLYKWLEPYLQDYRKLRPKLGDGDFKLTHMDEFVDDLLTKDRVCDTILPRIPKRIILEE 169

Query: 123 VGAL 126
            G L
Sbjct: 170 DGEL 173


Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is about 180 amino acids in length. PRP38 is a unique component of the U4/U6.U5 tri-small nuclear ribonucleoprotein (snRNP) particle and is necessary for an essential step late in spliceosome maturation. Length = 173

>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho; Provisional Back     alignment and domain information
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho; Provisional Back     alignment and domain information
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho; Provisional Back     alignment and domain information
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit, splicing factor Back     alignment and domain information
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit, splicing factor Back     alignment and domain information
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit, splicing factor Back     alignment and domain information
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho; Provisional Back     alignment and domain information
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family Back     alignment and domain information
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho; Provisional Back     alignment and domain information
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho; Provisional Back     alignment and domain information
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho; Provisional Back     alignment and domain information
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho; Provisional Back     alignment and domain information
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family Back     alignment and domain information
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family Back     alignment and domain information
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed Back     alignment and domain information
>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509) Back     alignment and domain information
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho; Provisional Back     alignment and domain information
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family Back     alignment and domain information
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family Back     alignment and domain information
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed Back     alignment and domain information
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit, splicing factor Back     alignment and domain information
>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1 Back     alignment and domain information
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3 Back     alignment and domain information
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family Back     alignment and domain information
>gnl|CDD|219061 pfam06495, Transformer, Fruit fly transformer protein Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 337
KOG2889204 consensus Predicted PRP38-like splicing factor [Fu 100.0
PF03371172 PRP38: PRP38 family; InterPro: IPR005037 Members o 100.0
KOG2888453 consensus Putative RNA binding protein [General fu 100.0
TIGR01642 509 U2AF_lg U2 snRNP auxilliary factor, large subunit, 97.14
KOG4676 479 consensus Splicing factor, arginine/serine-rich [R 96.82
TIGR01642 509 U2AF_lg U2 snRNP auxilliary factor, large subunit, 96.78
KOG0113335 consensus U1 small nuclear ribonucleoprotein (RRM 96.65
PF1287197 PRP38_assoc: Pre-mRNA-splicing factor 38-associate 96.15
KOG3263196 consensus Nucleic acid binding protein [General fu 95.57
KOG2217 705 consensus U4/U6.U5 snRNP associated protein [RNA p 95.26
KOG2888453 consensus Putative RNA binding protein [General fu 94.61
KOG4368757 consensus Predicted RNA binding protein, contains 81.49
>KOG2889 consensus Predicted PRP38-like splicing factor [Function unknown] Back     alignment and domain information
Probab=100.00  E-value=4.7e-47  Score=337.58  Aligned_cols=148  Identities=64%  Similarity=1.090  Sum_probs=143.2

Q ss_pred             CcccccCCccCCCCCCCchhHHHHHHhccCCCHHHHHHHhcCCCcceeeecccchhhhccchHHHHHHhhhhhhhhhhhh
Q 019667            1 MELDHLGGTFGGNRKPTPFMCLVMKMLQIQPEKDIVVEFIKNDDYKYVRVLGAFYLRLTGTDIDIYRYLEPLYNDYRKLR   80 (337)
Q Consensus         1 ~~l~~iGg~~g~~~~Ps~f~CLL~KLlqi~P~~~iv~~~L~~~d~KYlRaLg~lYlRltg~p~ely~~lEp~l~DyrKlr   80 (337)
                      |+|.||||.||||.+||||+||.+|||||+|+++|+.+||.+.+||||+|||||||||||...++|+||+|||+||+||+
T Consensus        52 m~l~~lgg~yGgn~kpt~fKOG~lkmLqiqpeK~i~~efi~~~~fkY~ralgafYlRLt~~~~~~y~ylepl~nDyRKir  131 (204)
T KOG2889|consen   52 MELLYLGGKYGGNSKPTPFLCLDLKMLQIQPEKEIGSEFIMNHDFKYVRALGAFYLRLTGTDVDVYKYLEPLLNDYRKIR  131 (204)
T ss_pred             HHHHHHHHHhcCCCCCccHHHHHHHHcCCCCCcccchHHHhccchHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHhhh
Confidence            47889999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hccCCCceeeeeHHHHHHHHHhccccccccCCCcccchhHhHhccCCCccccchhhhhHHHhhhhhhc
Q 019667           81 QKSGDGRFILTHVDEVIDELLTKDYSCDIALPRIKKRWNLETVGALEPRKSVLEDDFEEEEEKEENDQ  148 (337)
Q Consensus        81 ~~~~dg~~~~~~~defvd~LL~~~~~~~i~LPRip~R~~lE~~~~LepR~s~Le~~~Ee~eeeeEee~  148 (337)
                      +.+.+|++++||||+|||+||++.+||+|+|||+.+|++||+++.|+|++|.|+++++++++.++++.
T Consensus       132 ~~~~~g~~~l~ylDe~iDdLL~~~r~cdI~Lprl~kr~~le~~~~lep~~s~ld~d~~~~e~~~~~e~  199 (204)
T KOG2889|consen  132 VVNGQGNRTLMYLDEVIDDLLNKSRICDIHLPRLDKRWQLEELDLLEPRKSSLDEDDDDEESSEEEEE  199 (204)
T ss_pred             hhcCCCceeeeeHHHHHHHHhhhcceeccccchhhhhhhhhhhccccccccccccchhhhhhhhhhhh
Confidence            99999999999999999999999999999999999999999999999999999999999887766663



>PF03371 PRP38: PRP38 family; InterPro: IPR005037 Members of this family are related to the pre mRNA splicing factor PRP38 from yeast [], therefore all the members of this family could be involved in splicing Back     alignment and domain information
>KOG2888 consensus Putative RNA binding protein [General function prediction only] Back     alignment and domain information
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor Back     alignment and domain information
>KOG4676 consensus Splicing factor, arginine/serine-rich [RNA processing and modification] Back     alignment and domain information
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor Back     alignment and domain information
>KOG0113 consensus U1 small nuclear ribonucleoprotein (RRM superfamily) [RNA processing and modification] Back     alignment and domain information
>PF12871 PRP38_assoc: Pre-mRNA-splicing factor 38-associated hydrophilic C-term; InterPro: IPR024767 This entry represents a hydrophilic domain found mainly at the C terminus of plant and metazoan pre-mRNA-splicing factor 38 proteins Back     alignment and domain information
>KOG3263 consensus Nucleic acid binding protein [General function prediction only] Back     alignment and domain information
>KOG2217 consensus U4/U6 Back     alignment and domain information
>KOG2888 consensus Putative RNA binding protein [General function prediction only] Back     alignment and domain information
>KOG4368 consensus Predicted RNA binding protein, contains SWAP, RPR and G-patch domains [General function prediction only] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query337
3sqw_A579 ATP-dependent RNA helicase MSS116, mitochondrial; 6e-07
3sqw_A579 ATP-dependent RNA helicase MSS116, mitochondrial; 2e-06
>3sqw_A ATP-dependent RNA helicase MSS116, mitochondrial; RECA fold, RNA dependent ATPase, RNA helicase; HET: ANP; 1.91A {Saccharomyces cerevisiae S288C} Length = 579 Back     alignment and structure
 Score = 49.8 bits (119), Expect = 6e-07
 Identities = 16/92 (17%), Positives = 33/92 (35%), Gaps = 1/92 (1%)

Query: 180 SYRYRDRDYDRDYDRDRGRGRERDRGRDRDRDRDRDRDRERDRDRNRYRLRDDREYGRER 239
           S R+ D+         +     RD       ++  D D + +      +  ++R   R+ 
Sbjct: 488 SRRFLDKLGLSRSPIGKAMFEIRDYSSRDGNNKSYDYDDDSEISFRGNKNYNNRSQNRDY 547

Query: 240 DRDREREGRDRERRDRDRGRRRSRSRSRSRSR 271
           D +  R   +  RR   R   ++   SR+ + 
Sbjct: 548 DDEPFRRSNN-NRRSFSRSNDKNNYSSRNSNI 578


>3sqw_A ATP-dependent RNA helicase MSS116, mitochondrial; RECA fold, RNA dependent ATPase, RNA helicase; HET: ANP; 1.91A {Saccharomyces cerevisiae S288C} Length = 579 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query337
3pgw_S437 U1-70K; protein-RNA complex, U1 snRNA, SM fold, SM 97.89
3pgw_S437 U1-70K; protein-RNA complex, U1 snRNA, SM fold, SM 97.71
3plv_C21 66 kDa U4/U6.U5 small nuclear ribonucleoprotein C; 97.34
>3pgw_S U1-70K; protein-RNA complex, U1 snRNA, SM fold, SM core, RRM, splici SNRNPS, splicing factors; HET: DNA; 4.40A {Homo sapiens} PDB: 3cw1_K 2l5i_A 2l5j_A* Back     alignment and structure
Probab=97.89  E-value=8e-06  Score=81.34  Aligned_cols=13  Identities=23%  Similarity=0.248  Sum_probs=8.4

Q ss_pred             cchHHHHHHhhhh
Q 019667           60 GTDIDIYRYLEPL   72 (337)
Q Consensus        60 g~p~ely~~lEp~   72 (337)
                      .+..+|..+|.+|
T Consensus       114 ~te~~L~~~F~~~  126 (437)
T 3pgw_S          114 TTESKLRREFEVY  126 (437)
T ss_pred             CCHHHHHHHHHHc
Confidence            3456677777764



>3pgw_S U1-70K; protein-RNA complex, U1 snRNA, SM fold, SM core, RRM, splici SNRNPS, splicing factors; HET: DNA; 4.40A {Homo sapiens} PDB: 3cw1_K 2l5i_A 2l5j_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00