Citrus Sinensis ID: 025398


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250---
MTEYWVSQGNKWCDFCKIYISNNPSSIRNHDLGQRHKENVQKKLADMRKENAAKEKEHKETARALEQIEAKAKRSYQKDLANQEARNSNAVALNDHEKWDYDGTSGYYYNESNGLYYDPKSGFYYSDAIGNWVTQEEAYAAIPASLGSKHRPTMKHPFPAFGGGSIVENKDSAKSQNGPARGPVVSSSLNPMRSVKGAPSSLAVGKRKREDNKLKKMAKPVSKEEAAAIKAREAAKKRVEEREKQLLGLYHSQ
ccccHHcccccccccccccccccHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHccccccccccccccccccccccEEEcccccccccccccccEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHcHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHcccccccc
ccHHHccccccccHEEEEEEccccccHHHHHccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccHcccccccccccccEEEcccccEEEcccccEEEcccccEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEccccccccccccccccHHHHcHccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHcccccc
MTEYWVSQGNKWCDFCKiyisnnpssirnhdlgqrHKENVQKKLADMRKENAAKEKEHKETARALEQIEAKAKRSYQKDLANqearnsnavalndhekwdydgtsgyyynesnglyydpksgfyysdaignwvtQEEAYAaipaslgskhrptmkhpfpafgggsivenkdsaksqngpargpvvssslnpmrsvkgapsslavgkrkrednKLKKMAKPVSKEEAAAIKAREAAKKRVEEREKQLLGLYHSQ
MTEYWVSQGNKWCDFCKIYISNNPssirnhdlgqrHKENVQKKLADMRKEnaakekehketARALEQIEAKAKRSYQKDLANQEARnsnavalndhekwdyDGTSGYYYNESNGLYYDPKSGFYYSDAIGNWVTQEEAYAAIPASLGSKHRPTMKHPFPAFGGGSIVENKDSaksqngpargpvvssslnpmrsvkgapsslavgkrkrednklkkmakpvskeeaAAIKAREAAKKRVEEREKQllglyhsq
MTEYWVSQGNKWCDFCKIYISNNPSSIRNHDLGQRHKENVQKKLADMRkenaakekehketaraLEQIEAKAKRSYQKDLANQEARNSNAVALNDHEKWDYDGTSGYYYNESNGLYYDPKSGFYYSDAIGNWVTQEEAYAAIPASLGSKHRPTMKHPFPAFGGGSIVENKDSAKSQNGPARGPVVSSSLNPMRSVKGAPSSLAVGKRKREDNKLKKMAKPVSkeeaaaikareaakkrveerekQLLGLYHSQ
***YWVSQGNKWCDFCKIYIS**********************************************************************ALNDHEKWDYDGTSGYYYNESNGLYYDPKSGFYYSDAIGNWVTQEEAYAAIP**************************************************************************************************************
*TEYWVSQGNKWCDFCKIYISNNPSSIR******RH*ENVQ*************************************************************************LYYDPKSGFYYSDAIGNWVTQ****************************************************************************************************************GLY***
MTEYWVSQGNKWCDFCKIYISNNPSSIRNHDLGQRHKENVQKKLA****************ARALEQIEA***************RNSNAVALNDHEKWDYDGTSGYYYNESNGLYYDPKSGFYYSDAIGNWVTQEEAYAAIPASLGSKHRPTMKHPFPAFGGGSIVENK******************LNPMR************************************************REKQLLGLYHSQ
*TEYWVSQGNKWCDFCKIYISNNPSSIRNHDLGQRHKENVQKKLADMRKENAAKEKEHKETARALEQIEAKAKRSYQKDLANQE*****AVALNDHEKWDYDGTSGYYYNESNGLYYDPKSGFYYSDAIGNWVTQEEAY*************************************************************************************************KR*EEREKQL*GLYH**
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MTEYWVSQGNKWCDFCKIYISNNPSSIRNHDLGQRHxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxAKRSYQKDLANQEARNSNAVALNDHEKWDYDGTSGYYYNESNGLYYDPKSGFYYSDAIGNWVTQEEAYAAIPASLGSKHRPTMKHPFPAFGGGSIVENKDSAKSQNGPARGPVVSSSLNPMRSVKGAPSSLAVGKRKREDNKLKKMAKPVSKEEAAAIKAREAAKKRVEEREKQLLGLYHSQ
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query253 2.2.26 [Sep-21-2011]
Q5F457 398 WW domain-binding protein yes no 0.316 0.201 0.412 1e-11
O75554 376 WW domain-binding protein yes no 0.316 0.212 0.387 7e-11
Q5HZF2 374 WW domain-binding protein yes no 0.316 0.213 0.387 9e-11
Q61048 376 WW domain-binding protein yes no 0.316 0.212 0.375 1e-10
P52756 815 RNA-binding protein 5 OS= no no 0.229 0.071 0.396 9e-06
Q91YE7 815 RNA-binding protein 5 OS= no no 0.237 0.073 0.384 3e-05
Q1RMU5 815 RNA-binding protein 5 OS= no no 0.169 0.052 0.458 7e-05
B2GV05 815 RNA-binding protein 5 OS= no no 0.229 0.071 0.380 7e-05
A4IGK4 838 RNA-binding protein 5 OS= no no 0.205 0.062 0.412 0.0002
A0JMV4 833 RNA-binding protein 5-A O N/A no 0.205 0.062 0.396 0.0003
>sp|Q5F457|WBP4_CHICK WW domain-binding protein 4 OS=Gallus gallus GN=WBP4 PE=2 SV=1 Back     alignment and function desciption
 Score = 70.5 bits (171), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 33/80 (41%), Positives = 55/80 (68%)

Query: 1  MTEYWVSQGNKWCDFCKIYISNNPSSIRNHDLGQRHKENVQKKLADMRKENAAKEKEHKE 60
          M +YW SQ  K+CD+CK +I++N  SI  H+ G+ HKENV K+++++RK++  K KE + 
Sbjct: 1  MADYWKSQPKKFCDYCKCWIADNRPSIDFHERGKNHKENVAKRISEIRKKSMEKAKEEEN 60

Query: 61 TARALEQIEAKAKRSYQKDL 80
           ++    +E  A ++YQ+DL
Sbjct: 61 MSKEFAAMEEAAMKAYQEDL 80




A spliceosome-associated protein; may play a role in cross-intron bridging of U1 and U2 snRNPs in the mammalian A complex.
Gallus gallus (taxid: 9031)
>sp|O75554|WBP4_HUMAN WW domain-binding protein 4 OS=Homo sapiens GN=WBP4 PE=1 SV=1 Back     alignment and function description
>sp|Q5HZF2|WBP4_RAT WW domain-binding protein 4 OS=Rattus norvegicus GN=Wbp4 PE=2 SV=1 Back     alignment and function description
>sp|Q61048|WBP4_MOUSE WW domain-binding protein 4 OS=Mus musculus GN=Wbp4 PE=1 SV=4 Back     alignment and function description
>sp|P52756|RBM5_HUMAN RNA-binding protein 5 OS=Homo sapiens GN=RBM5 PE=1 SV=2 Back     alignment and function description
>sp|Q91YE7|RBM5_MOUSE RNA-binding protein 5 OS=Mus musculus GN=Rbm5 PE=1 SV=1 Back     alignment and function description
>sp|Q1RMU5|RBM5_BOVIN RNA-binding protein 5 OS=Bos taurus GN=RBM5 PE=2 SV=1 Back     alignment and function description
>sp|B2GV05|RBM5_RAT RNA-binding protein 5 OS=Rattus norvegicus GN=Rbm5 PE=2 SV=1 Back     alignment and function description
>sp|A4IGK4|RBM5_XENTR RNA-binding protein 5 OS=Xenopus tropicalis GN=rbm5 PE=2 SV=1 Back     alignment and function description
>sp|A0JMV4|RBM5A_XENLA RNA-binding protein 5-A OS=Xenopus laevis GN=rbm5-a PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query253
118489270250 unknown [Populus trichocarpa x Populus d 0.968 0.98 0.694 1e-91
255561701250 nucleic acid binding protein, putative [ 0.964 0.976 0.692 3e-90
449438428254 PREDICTED: uncharacterized protein LOC10 0.968 0.964 0.664 7e-88
225428731257 PREDICTED: uncharacterized protein LOC10 0.984 0.968 0.689 2e-84
356496424247 PREDICTED: uncharacterized protein LOC10 0.956 0.979 0.644 6e-81
388495468236 unknown [Lotus japonicus] 0.913 0.978 0.637 8e-81
224103545218 predicted protein [Populus trichocarpa] 0.841 0.977 0.694 3e-79
356500315246 PREDICTED: uncharacterized protein LOC10 0.952 0.979 0.620 3e-78
356500317249 PREDICTED: uncharacterized protein LOC10 0.964 0.979 0.616 7e-77
115467074243 Os06g0215200 [Oryza sativa Japonica Grou 0.956 0.995 0.575 7e-75
>gi|118489270|gb|ABK96440.1| unknown [Populus trichocarpa x Populus deltoides] Back     alignment and taxonomy information
 Score =  342 bits (877), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 177/255 (69%), Positives = 202/255 (79%), Gaps = 10/255 (3%)

Query: 1   MTEYWVSQGNKWCDFCKIYISNNPSSIRNHDLGQRHKENVQKKLADMRKENAAKEKEHKE 60
           MTEYWVSQGNKWCDFCKI+ISNNP+SIRNH+LGQRHK+NV KKL  MRK+N AKEK+ KE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPTSIRNHELGQRHKDNVAKKLDSMRKDNIAKEKQQKE 60

Query: 61  TARALEQIEAKAKRSYQKDLAN-QEARNSNAVALND--HEKWDYDGTSGYYYNESNGLYY 117
            ARALEQIEAKA RSYQKD+AN +EA +  A+ + +   EKWDYD TSGYYYN+SNGL+Y
Sbjct: 61  AARALEQIEAKANRSYQKDVANLKEASSLRALDIQEDGQEKWDYDSTSGYYYNQSNGLHY 120

Query: 118 DPKSGFYYSDAIGNWVTQEEAYAAIPASLGSKHR-PTMKHPFPAFGGGSIVENKDSAKSQ 176
           DP SGFYYSDAIG WVTQEEAYAA+  S GS+++  + K P PA    S+ ENK +A  Q
Sbjct: 121 DPNSGFYYSDAIGKWVTQEEAYAAVQISSGSRNKESSFKKPLPASAVSSVKENKVAA--Q 178

Query: 177 NGPARGPVVSSSLNPMRSVKGAPSSLAVGKRKREDNKLKKMAKPVSKEEAAAIKAREAAK 236
           +GP  GPVVS+SLNP RSVKGAPS  AV KRKR D K     K VS EE AA+KAREAA+
Sbjct: 179 SGPPPGPVVSASLNPRRSVKGAPSKFAVNKRKRPDEK----PKAVSVEEKAALKAREAAR 234

Query: 237 KRVEEREKQLLGLYH 251
           KRVEEREK LLGLY 
Sbjct: 235 KRVEEREKSLLGLYQ 249




Source: Populus trichocarpa x Populus deltoides

Species: Populus trichocarpa x Populus deltoides

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255561701|ref|XP_002521860.1| nucleic acid binding protein, putative [Ricinus communis] gi|223538898|gb|EEF40496.1| nucleic acid binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|449438428|ref|XP_004136990.1| PREDICTED: uncharacterized protein LOC101207712 [Cucumis sativus] Back     alignment and taxonomy information
>gi|225428731|ref|XP_002285013.1| PREDICTED: uncharacterized protein LOC100246948 isoform 1 [Vitis vinifera] gi|359475212|ref|XP_003631615.1| PREDICTED: uncharacterized protein LOC100246948 isoform 2 [Vitis vinifera] gi|147811057|emb|CAN63484.1| hypothetical protein VITISV_017085 [Vitis vinifera] gi|297741317|emb|CBI32448.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356496424|ref|XP_003517068.1| PREDICTED: uncharacterized protein LOC100804274 [Glycine max] gi|356534661|ref|XP_003535871.1| PREDICTED: uncharacterized protein LOC100776500 [Glycine max] Back     alignment and taxonomy information
>gi|388495468|gb|AFK35800.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|224103545|ref|XP_002313098.1| predicted protein [Populus trichocarpa] gi|222849506|gb|EEE87053.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356500315|ref|XP_003518978.1| PREDICTED: uncharacterized protein LOC100779677 isoform 1 [Glycine max] Back     alignment and taxonomy information
>gi|356500317|ref|XP_003518979.1| PREDICTED: uncharacterized protein LOC100779677 isoform 2 [Glycine max] Back     alignment and taxonomy information
>gi|115467074|ref|NP_001057136.1| Os06g0215200 [Oryza sativa Japonica Group] gi|51090392|dbj|BAD35314.1| formin-binding protein-related-like [Oryza sativa Japonica Group] gi|113595176|dbj|BAF19050.1| Os06g0215200 [Oryza sativa Japonica Group] gi|125596486|gb|EAZ36266.1| hypothetical protein OsJ_20587 [Oryza sativa Japonica Group] gi|215686790|dbj|BAG89640.1| unnamed protein product [Oryza sativa Japonica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query253
TAIR|locus:2012222242 AT1G49590 [Arabidopsis thalian 0.818 0.855 0.533 4.2e-56
UNIPROTKB|O75554 376 WBP4 "WW domain-binding protei 0.316 0.212 0.35 1.9e-13
UNIPROTKB|Q5F457 398 WBP4 "WW domain-binding protei 0.316 0.201 0.362 2.6e-13
RGD|620033 374 Wbp4 "WW domain binding protei 0.316 0.213 0.35 3.8e-13
UNIPROTKB|F1NXG2 398 WBP4 "WW domain-binding protei 0.316 0.201 0.362 5.2e-13
MGI|MGI:109568 376 Wbp4 "WW domain binding protei 0.316 0.212 0.337 6.6e-13
FB|FBgn0031287338 CG4291 [Drosophila melanogaste 0.359 0.269 0.351 3.8e-12
UNIPROTKB|Q17QQ6 395 WBP4 "Uncharacterized protein" 0.316 0.202 0.362 2.1e-11
UNIPROTKB|F1PCW3 371 WBP4 "Uncharacterized protein" 0.316 0.215 0.35 2.3e-11
ZFIN|ZDB-GENE-050522-342 412 zgc:112384 "zgc:112384" [Danio 0.316 0.194 0.325 2.9e-09
TAIR|locus:2012222 AT1G49590 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 578 (208.5 bits), Expect = 4.2e-56, P = 4.2e-56
 Identities = 118/221 (53%), Positives = 148/221 (66%)

Query:     1 MTEYWVSQGNKWCDFCKIYISNNPSSIRNHDLGQRHKENVQKKLADMRXXXXXXXXXXXX 60
             MTEYWVSQGNKWC+FCKI+I NNP+SIRNHDLG+RH+E V KKL DMR            
Sbjct:     1 MTEYWVSQGNKWCEFCKIWIQNNPTSIRNHDLGKRHRECVDKKLTDMRERSAAKDKELKK 60

Query:    61 XXXXLEQIEAKAKRSYQKDLAN-QEARNSNAVALNDHEKWDYDGTSGYYYNESNGLYYDP 119
                 L+QIEAKA RSYQKD+A  Q+   +N    +    W  D  SGYYYN++NGL+YD 
Sbjct:    61 NEKLLQQIEAKATRSYQKDIATAQQVAKANGAPEDGTSDWMLDSASGYYYNQTNGLHYDS 120

Query:   120 KSGFYYSDAIGNWVTQEEAYAAIPASLGSKHRPTMKHPFPAFGGGSIVENKDSAKSQNGP 179
             +SGFYYSD+IG+WVTQ+EAYAA+  S G+K  P +K P  + G G  V          G 
Sbjct:   121 QSGFYYSDSIGHWVTQDEAYAAVKTSSGTKV-PLVKKPVSSSGAGPSV----------GK 169

Query:   180 ARGPVVSSSLNPMRSVKGAPSSLAVG--KRKREDNKLKKMA 218
               G +V++SLNP R+VKGA SS+ +G  KRKR+D K KK++
Sbjct:   170 PPGRLVTASLNPKRAVKGAASSVDLGNNKRKRQDEKPKKVS 210




GO:0003676 "nucleic acid binding" evidence=IEA
GO:0005634 "nucleus" evidence=ISM;IEA
GO:0008150 "biological_process" evidence=ND
GO:0008270 "zinc ion binding" evidence=IEA
UNIPROTKB|O75554 WBP4 "WW domain-binding protein 4" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|Q5F457 WBP4 "WW domain-binding protein 4" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
RGD|620033 Wbp4 "WW domain binding protein 4" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|F1NXG2 WBP4 "WW domain-binding protein 4" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
MGI|MGI:109568 Wbp4 "WW domain binding protein 4" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
FB|FBgn0031287 CG4291 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
UNIPROTKB|Q17QQ6 WBP4 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|F1PCW3 WBP4 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-050522-342 zgc:112384 "zgc:112384" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query253
pfam0622038 pfam06220, zf-U1, U1 zinc finger 5e-04
smart0045135 smart00451, ZnF_U1, U1-like zinc finger 7e-04
>gnl|CDD|114912 pfam06220, zf-U1, U1 zinc finger Back     alignment and domain information
 Score = 36.6 bits (85), Expect = 5e-04
 Identities = 12/33 (36%), Positives = 25/33 (75%), Gaps = 1/33 (3%)

Query: 11 KWCDFCKIYISNN-PSSIRNHDLGQRHKENVQK 42
           +CD+C  Y++++ PS  ++H+ G++HK+NV+ 
Sbjct: 4  YYCDYCDCYLTHDSPSVRKSHNGGRKHKDNVKD 36


This family consists of several U1 small nuclear ribonucleoprotein C (U1-C) proteins. The U1 small nuclear ribonucleoprotein (U1 snRNP) binds to the pre-mRNA 5' splice site (ss) at early stages of spliceosome assembly. Recruitment of U1 to a class of weak 5' ss is promoted by binding of the protein TIA-1 to uridine-rich sequences immediately downstream from the 5' ss. Binding of TIA-1 in the vicinity of a 5' ss helps to stabilise U1 snRNP recruitment, at least in part, via a direct interaction with U1-C, thus providing one molecular mechanism for the function of this splicing regulator. This domain is probably a zinc-binding. It is found in multiple copies in some members of the family. Length = 38

>gnl|CDD|197732 smart00451, ZnF_U1, U1-like zinc finger Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 253
KOG0150336 consensus Spliceosomal protein FBP21 [RNA processi 100.0
PF0622038 zf-U1: U1 zinc finger; InterPro: IPR013085 Zinc fi 99.54
KOG3454165 consensus U1 snRNP-specific protein C [RNA process 99.21
COG5136188 U1 snRNP-specific protein C [RNA processing and mo 98.52
smart0045135 ZnF_U1 U1-like zinc finger. Family of C2H2-type zi 98.46
PF1217127 zf-C2H2_jaz: Zinc-finger double-stranded RNA-bindi 96.99
KOG4727193 consensus U1-like Zn-finger protein [General funct 96.87
PF1287425 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG 96.35
KOG0154 573 consensus RNA-binding protein RBM5 and related pro 94.98
KOG0717508 consensus Molecular chaperone (DnaJ superfamily) [ 91.56
KOG0227222 consensus Splicing factor 3a, subunit 2 [RNA proce 90.58
KOG2950348 consensus Uncharacterized protein involved in prot 90.25
KOG2384223 consensus Major histocompatibility complex protein 89.09
PF14968336 CCDC84: Coiled coil protein 84 87.25
PF03194254 LUC7: LUC7 N_terminus; InterPro: IPR004882 This fa 86.05
COG5246222 PRP11 Splicing factor 3a, subunit 2 [RNA processin 84.61
PF1523275 DUF4585: Domain of unknown function (DUF4585) 83.96
KOG0796319 consensus Spliceosome subunit [RNA processing and 81.43
PF04988165 AKAP95: A-kinase anchoring protein 95 (AKAP95); In 80.86
>KOG0150 consensus Spliceosomal protein FBP21 [RNA processing and modification] Back     alignment and domain information
Probab=100.00  E-value=2.8e-45  Score=338.21  Aligned_cols=237  Identities=33%  Similarity=0.449  Sum_probs=200.6

Q ss_pred             CcchhhhcCccccccccccccCChhhhhhhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 025398            1 MTEYWVSQGNKWCDFCKIYISNNPSSIRNHDLGQRHKENVQKKLADMRKENAAKEKEHKETARALEQIEAKAKRSYQKDL   80 (253)
Q Consensus         1 MtEYWkS~~K~yCdyCkvwi~dn~~Sir~HE~G~kHk~NV~k~Lrei~k~~~~keKe~~~~~~eL~rIEaaA~~AY~kDl   80 (253)
                      ||||||||+|+|||||+|||.||++||++||+|.||++||+++|++|++++..++++.+++..+|++||++|+++|++||
T Consensus         1 MaeYWkS~~kkfCdyCKiWi~dN~~Sv~~He~GkrHke~V~Kritdi~rks~~kekeekKls~~la~mEaaA~~syaedl   80 (336)
T KOG0150|consen    1 MAEYWKSQPKKFCDYCKIWIKDNPASVRFHERGKRHKENVAKRITDIHRKSLKKEKEEKKLSKELAAMEAAASASYAEDL   80 (336)
T ss_pred             CchhhhhccchhhhhhhhhhcCChHHHHhHhhhhHHHHHHHHHHHHHHHhhHHHHHHHHhhhhHHHHHHHHHHHHHHHhh
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hh-hhhhhccccccCCCCCceeeCCcceeEeCCCceEecC--CCCceeeCCC-----CccccccchhhcccccCCCCCCC
Q 025398           81 AN-QEARNSNAVALNDHEKWDYDGTSGYYYNESNGLYYDP--KSGFYYSDAI-----GNWVTQEEAYAAIPASLGSKHRP  152 (253)
Q Consensus        81 ~~-~~~~~~~~~~~~~~s~~vyD~sSGYYYd~~TG~YYDp--~tg~YY~~~t-----g~W~~~~e~Y~~~~~~~~~~~~~  152 (253)
                      .. +.-..-.++.+...++|.-++.||++++...|.-+||  -+|+++++..     ..|++.++.+........+.. +
T Consensus        81 ~~~g~~~e~~~psk~r~s~tma~s~ss~~~~~~r~~q~d~~~vsg~~~s~a~q~~~~~g~v~~~e~~~~~k~wv~~Kn-e  159 (336)
T KOG0150|consen   81 SYEGLKKEIPGPSKLRPSKTMATSDSSADINTKRGRQDDPPNVSGTHDSPALQEYIPTGLVTKDEANAETKEWVEGKN-E  159 (336)
T ss_pred             hhcchhhhcCCCccCCccccccccccccccccccccccCcccccccccCccchhhccccccchhhhhhhhhhcccccC-C
Confidence            98 3322333445577889999999999999999999999  8999999443     468888888776554443322 1


Q ss_pred             CCCCCCCccCCcccccccCccCCCCCCCCccccccCCCCCcccCCCcccccc--ccccccccccccCCCCCCHHHHHHHH
Q 025398          153 TMKHPFPAFGGGSIVENKDSAKSQNGPARGPVVSSSLNPMRSVKGAPSSLAV--GKRKREDNKLKKMAKPVSKEEAAAIK  230 (253)
Q Consensus       153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~~~~p~~~~kga~s~~a~--~krkr~~~~~~~~~~~~s~ee~~al~  230 (253)
                      ...+  .++.        ..++..-.-+|+...++.++|.+..|++-.++..  ++|+|++++..    .++.|++++++
T Consensus       160 s~~~--yy~n--------~~t~esvwk~P~~~~ts~~~~kr~nk~~q~avDe~nnsr~~qe~~~r----k~~ee~k~~~k  225 (336)
T KOG0150|consen  160 SGPT--YYSN--------KRTNESVWKPPRISFTSRLEPKRANKVPQLAVDEGNNSRERQESKER----KVVEEEKASGK  225 (336)
T ss_pred             CCCC--ccee--------cCCCccccCCCCcccccccChhhhcccchhcccccchhhhhhhcccc----hhhHHHHhhhh
Confidence            1111  1221        1455666788999999999999999997777666  67888888888    89999999999


Q ss_pred             HHHHHHHHHHHHhhhhcccc-cC
Q 025398          231 AREAAKKRVEEREKQLLGLY-HS  252 (253)
Q Consensus       231 areaarkrv~~r~~~~~gly-~~  252 (253)
                      +||++..|+.|+++.|+||| ++
T Consensus       226 ~re~~s~r~sDp~kel~~~~erp  248 (336)
T KOG0150|consen  226 EREKSSDRESDPPKELLGLYERP  248 (336)
T ss_pred             hhhhhhcccCCChHHHHHHhhcc
Confidence            99999999999999999999 44



>PF06220 zf-U1: U1 zinc finger; InterPro: IPR013085 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>KOG3454 consensus U1 snRNP-specific protein C [RNA processing and modification] Back     alignment and domain information
>COG5136 U1 snRNP-specific protein C [RNA processing and modification] Back     alignment and domain information
>smart00451 ZnF_U1 U1-like zinc finger Back     alignment and domain information
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length Back     alignment and domain information
>KOG4727 consensus U1-like Zn-finger protein [General function prediction only] Back     alignment and domain information
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A Back     alignment and domain information
>KOG0154 consensus RNA-binding protein RBM5 and related proteins, contain G-patch and RRM domains [General function prediction only] Back     alignment and domain information
>KOG0717 consensus Molecular chaperone (DnaJ superfamily) [Posttranslational modification, protein turnover, chaperones] Back     alignment and domain information
>KOG0227 consensus Splicing factor 3a, subunit 2 [RNA processing and modification] Back     alignment and domain information
>KOG2950 consensus Uncharacterized protein involved in protein-protein interaction, contains polyproline-binding GYF domain [General function prediction only] Back     alignment and domain information
>KOG2384 consensus Major histocompatibility complex protein BAT4, contains G-patch and ankyrin domains [General function prediction only] Back     alignment and domain information
>PF14968 CCDC84: Coiled coil protein 84 Back     alignment and domain information
>PF03194 LUC7: LUC7 N_terminus; InterPro: IPR004882 This family consists of several LUC7 protein homologues that are restricted to eukaryotes Back     alignment and domain information
>COG5246 PRP11 Splicing factor 3a, subunit 2 [RNA processing and modification] Back     alignment and domain information
>PF15232 DUF4585: Domain of unknown function (DUF4585) Back     alignment and domain information
>KOG0796 consensus Spliceosome subunit [RNA processing and modification] Back     alignment and domain information
>PF04988 AKAP95: A-kinase anchoring protein 95 (AKAP95); InterPro: IPR007071 A-kinase (or PKA)-anchoring protein AKAP95 is implicated in mitotic chromosome condensation by acting as a targeting molecule for the condensin complex Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query253
3cw1_L77 U1 small nuclear ribonucleoprotein C; PRE-mRNA spl 3e-05
>3cw1_L U1 small nuclear ribonucleoprotein C; PRE-mRNA splicing, spliceosome, RNA-binding domain, SM fold, finger, RNA recognition motif, 5' splice site; 5.49A {Homo sapiens} PDB: 1uw2_A 2vrd_A Length = 77 Back     alignment and structure
 Score = 40.4 bits (94), Expect = 3e-05
 Identities = 15/58 (25%), Positives = 29/58 (50%), Gaps = 5/58 (8%)

Query: 12 WCDFCKIYISNNPSSIR-NHDLGQRHKENV----QKKLADMRKENAAKEKEHKETARA 64
          +CD+C  Y++++  S+R  H  G++HKENV     K + +  +    K     +  + 
Sbjct: 5  YCDYCDTYLTHDSPSVRKTHCSGRKHKENVKDYYCKWMEEQAQSLIDKTTAAFQQGKI 62


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query253
3cw1_L77 U1 small nuclear ribonucleoprotein C; PRE-mRNA spl 99.56
1zu1_A127 DSRBP-ZFA, RNA binding protein ZFA; zinc finger pr 96.86
1zu1_A127 DSRBP-ZFA, RNA binding protein ZFA; zinc finger pr 96.29
1zr9_A124 Zinc finger protein 593; DNA binding, structural g 94.76
1e0n_A27 Hypothetical protein; YJQ8WW domain, WW domain, sa 92.73
2yrk_A55 Zinc finger homeobox protein 4; structure genomics 85.21
4dgw_A402 PRE-mRNA-splicing factor PRP9; zinc finger; 3.11A 84.52
>3cw1_L U1 small nuclear ribonucleoprotein C; PRE-mRNA splicing, spliceosome, RNA-binding domain, SM fold, finger, RNA recognition motif, 5' splice site; 5.49A {Homo sapiens} PDB: 1uw2_A 2vrd_A Back     alignment and structure
Probab=99.56  E-value=3.1e-15  Score=113.89  Aligned_cols=43  Identities=30%  Similarity=0.663  Sum_probs=39.5

Q ss_pred             cCccccccccccccCChhh-hhhhhchhhHHHHHHHHHHHHHHH
Q 025398            8 QGNKWCDFCKIYISNNPSS-IRNHDLGQRHKENVQKKLADMRKE   50 (253)
Q Consensus         8 ~~K~yCdyCkvwi~dn~~S-ir~HE~G~kHk~NV~k~Lrei~k~   50 (253)
                      +||||||||+|||++|.++ +++|++|++|+.||+.+|+++.++
T Consensus         1 mPkYyCdYCd~~lt~Ds~s~Rk~H~~G~kH~~nv~~yy~~~~~~   44 (77)
T 3cw1_L            1 MPKFYCDYCDTYLTHDSPSVRKTHCSGRKHKENVKDYYCKWMEE   44 (77)
T ss_pred             CCCcccccCCceecCCCHHHHHHHHccHHHHHHHHHHHHHHHHH
Confidence            4899999999999877766 689999999999999999999975



>1zu1_A DSRBP-ZFA, RNA binding protein ZFA; zinc finger protein, helix-loop-helix, helix-turn-helix; NMR {Xenopus laevis} SCOP: g.37.1.4 g.37.1.4 Back     alignment and structure
>1zu1_A DSRBP-ZFA, RNA binding protein ZFA; zinc finger protein, helix-loop-helix, helix-turn-helix; NMR {Xenopus laevis} SCOP: g.37.1.4 g.37.1.4 Back     alignment and structure
>1zr9_A Zinc finger protein 593; DNA binding, structural genomics, PSI, protein structure initiative, center for eukaryotic structural genomics, CESG; NMR {Homo sapiens} SCOP: g.37.1.4 Back     alignment and structure
>1e0n_A Hypothetical protein; YJQ8WW domain, WW domain, saccharomyces cerevisae, YJQ8 protein; NMR {Saccharomyces cerevisiae} SCOP: b.72.1.1 Back     alignment and structure
>2yrk_A Zinc finger homeobox protein 4; structure genomics, ZF-C2H2 domain, ZFH-4, structural genomics, NPPSFA; NMR {Mus musculus} SCOP: g.37.1.4 Back     alignment and structure
>4dgw_A PRE-mRNA-splicing factor PRP9; zinc finger; 3.11A {Saccharomyces cerevisiae} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 253
d2vrda161 g.37.1.4 (A:1-61) Spliceosomal protein U1C {Human 2e-05
>d2vrda1 g.37.1.4 (A:1-61) Spliceosomal protein U1C {Human (Homo sapiens) [TaxId: 9606]} Length = 61 Back     information, alignment and structure

class: Small proteins
fold: beta-beta-alpha zinc fingers
superfamily: beta-beta-alpha zinc fingers
family: HkH motif-containing C2H2 finger
domain: Spliceosomal protein U1C
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 39.0 bits (91), Expect = 2e-05
 Identities = 18/58 (31%), Positives = 32/58 (55%), Gaps = 3/58 (5%)

Query: 11 KWCDFCKIYISNNPSSIR-NHDLGQRHKENVQKKLADMRKENAAKEKEHKETARALEQ 67
           +CD+C  Y++++  S+R  H  G++HKENV+       +E A    +  +T  A +Q
Sbjct: 4  FYCDYCDTYLTHDSPSVRKTHCSGRKHKENVKDYYQKWMEEQAQSLID--KTTAAFQQ 59


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query253
d2vrda161 Spliceosomal protein U1C {Human (Homo sapiens) [Ta 99.47
d1zr9a167 Zinc finger protein 593, ZNF593 {Human (Homo sapie 96.74
d1zu1a255 dsRNA-binding protein ZFa (ZNF346, JAZ) {African c 95.78
d1zu1a172 dsRNA-binding protein ZFa (ZNF346, JAZ) {African c 92.48
d1pina134 Mitotic rotamase PIN1 {Human (Homo sapiens) [TaxId 91.39
d2yrka148 Zinc finger homeobox protein 4, ZFHX4 {Human (Homo 91.3
>d2vrda1 g.37.1.4 (A:1-61) Spliceosomal protein U1C {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: Small proteins
fold: beta-beta-alpha zinc fingers
superfamily: beta-beta-alpha zinc fingers
family: HkH motif-containing C2H2 finger
domain: Spliceosomal protein U1C
species: Human (Homo sapiens) [TaxId: 9606]
Probab=99.47  E-value=4.9e-16  Score=111.94  Aligned_cols=43  Identities=30%  Similarity=0.700  Sum_probs=38.9

Q ss_pred             cCccccccccccccCChhhh-hhhhchhhHHHHHHHHHHHHHHH
Q 025398            8 QGNKWCDFCKIYISNNPSSI-RNHDLGQRHKENVQKKLADMRKE   50 (253)
Q Consensus         8 ~~K~yCdyCkvwi~dn~~Si-r~HE~G~kHk~NV~k~Lrei~k~   50 (253)
                      +||||||||++||++|++|+ ++|++|++|++||..+++.+.++
T Consensus         1 MpryyCdYC~~~lt~ds~s~RK~H~~G~kH~~n~~~yy~~~~~~   44 (61)
T d2vrda1           1 MPKFYCDYCDTYLTHDSPSVRKTHCSGRKHKENVKDYYQKWMEE   44 (61)
T ss_dssp             CCSCEETTTTEECSCCSHHHHHHHTCCTHHHHHHHHHTSSSSSS
T ss_pred             CCchhhhhhcchhccCChhHHHHHhccHHHHHHHHHHHHHHHHH
Confidence            58999999999999998885 79999999999999999977655



>d1zr9a1 g.37.1.4 (A:28-94) Zinc finger protein 593, ZNF593 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1zu1a2 g.37.1.4 (A:74-128) dsRNA-binding protein ZFa (ZNF346, JAZ) {African clawed frog (Xenopus laevis) [TaxId: 8355]} Back     information, alignment and structure
>d1zu1a1 g.37.1.4 (A:2-73) dsRNA-binding protein ZFa (ZNF346, JAZ) {African clawed frog (Xenopus laevis) [TaxId: 8355]} Back     information, alignment and structure
>d1pina1 b.72.1.1 (A:6-39) Mitotic rotamase PIN1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2yrka1 g.37.1.4 (A:8-55) Zinc finger homeobox protein 4, ZFHX4 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure