Psyllid ID: psy12225


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390------
MEMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYEGKEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPNLKIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQLFQVLTEKRPDKLGPGMMASTHMYEVPGAGASAPGAPVKRTLPSSVELALDPSELDLVDTDAMAARYEQQMREQTSGLQKEDLSDMLAEHVARQKNKRKRQQTTQDNKQAKKYKEFKF
cHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHccccccccccccccEEccccHHHHcccccccccHHHHHHHHccccccccccccccccccccHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHcccccccccccccccccccccccccccccccccEEcccccccccccccccccccEEEcccccccccccccccccEEEccccccccccccccccccccEEEEEcccccccccHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccc
cHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHcccccccccccEEEcccEEEEcHcccccccccHHHHHHHcccccccccHccccccccccHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccccccccccccccccccHHccHcccccccccccccccccccccccccccccccccccccccEEEEccccccccccccccccEEEcccccccccccccccccEEEEccccccccccccccccccccEEEEEcHHHHHcccHHHHHHHHHHHHHHHcccccHHHHHHHHHHHHHHHHHHHHHHcccccccccccEEEEcc
memraslqdKEEQKTLKSKMRErvrpklgkidiDYQKLHDAFFkwqtkprlsihgdlyyegkefetklkekkpgdlSAELRTalgmpvgptahktfsretvipppwliamqrygpppsypnlkipglnapipqgcsfgyhaggwgkppvdetgkplygdvfgtggwgkppvdetgkplygdvfgtsaetktaevneediektawgeleseseeeseeddeeeeeeeegedesglitpaeglmtpsgitsvpagletpdsIELRKKKIEsdmetneaPQLFQVLTekrpdklgpgmmasthmyevpgagasapgapvkrtlpssvelaldpseldlvDTDAMAARYEQQMREQTSGLQKEDLSDMLAEHVARQKNKRKRQQTTQDNKQAKKYKEFKF
memraslqdkeeqktlkskmrervrpklgkididyQKLHDAFfkwqtkprlSIHGDLYYEGKEfetklkekkpgdLSAELRTALGmpvgptahktfsrETVIPPPWLIAMQRYGPPPSYPNLKIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTsaetktaevneediektawgeleseseeeseeddeeeeeeeegedesglitpaeglmtpsgitsvpagletpdSIELRKKKIEsdmetneapqlfqvLTEKRPDKLGPGMMASTHMYEVPGAGASAPGAPVKRTLPSSVELALDPSELDLVDTDAMAARYEQQMREqtsglqkeDLSDMLAEHVarqknkrkrqqttqdnkqakkykefkf
MEMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYEGKEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPNLKIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWgeleseseeeseeddeeeeeeeegedesglITPAEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQLFQVLTEKRPDKLGPGMMASTHMYEVpgagasapgapVKRTLPSSVELALDPSELDLVDTDAMAARYEQQMREQTSGLQKEDLSDMLAEHVArqknkrkrqqTTQDNKQAKKYKEFKF
***************************LGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYEGKEF****************************HKTFSRETVIPPPWLIAMQRYGPPPSYPNLKIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGDVF*********************************************************************************************************************************************************************************************************************
******************************IDIDYQKLHDAFFKWQTKPRLSIHGDLYYEGKEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPNLKIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGG***********PL****************************************************************T**GIT****************************************************MY*************************LDPSELDLVDT*******************************************************FKF
*******************MRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYEGKEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPNLKIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTA*****************************GLITPAEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQLFQVLTEKRPDKLGPGMMASTHMYEVPGAGASAPGAPVKRTLPSSVELALDPSELDLVDTDAMAARYEQ***********EDLSDMLAEH****************************
**************TLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYEGKEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPNLKIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGG***********PLYGDVF*TS**************************************************************S**AGLETPDSIELRKK***********PQLFQ*LTEKRP*KLG***M*************************SSVELALDPSELDLVDTDAMAARYEQQMREQT****KEDLSDMLAEHVARQKNKR*************KY****F
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYEGKEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPNLKIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGExxxxxxxxxxxxxxxxxxxxxxxxxSGLITPAEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQLFQVLTEKRPDKLGPGMMASTHMYEVPGAGASAPGAPVKRTLPSSVELALDPSELDLVDTDAMAARYEQQMREQTSGLQKEDLSDMLAEHVARQKNKRKRQQTTQDNKQAKKYKEFKF
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query396 2.2.26 [Sep-21-2011]
Q13435895 Splicing factor 3B subuni yes N/A 0.909 0.402 0.616 1e-118
Q9UUI3601 Pre-mRNA-splicing factor yes N/A 0.752 0.495 0.421 5e-58
Q02554436 Cold sensitive U2 snRNA s yes N/A 0.292 0.266 0.406 7e-19
>sp|Q13435|SF3B2_HUMAN Splicing factor 3B subunit 2 OS=Homo sapiens GN=SF3B2 PE=1 SV=2 Back     alignment and function desciption
 Score =  426 bits (1095), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 244/396 (61%), Positives = 294/396 (74%), Gaps = 36/396 (9%)

Query: 2   EMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYEG 61
           EMR +LQ+KEEQKT+KSKMRE+VRPK+GKIDIDYQKLHDAFFKWQTKP+L+IHGDLYYEG
Sbjct: 535 EMREALQEKEEQKTMKSKMREKVRPKMGKIDIDYQKLHDAFFKWQTKPKLTIHGDLYYEG 594

Query: 62  KEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPN 121
           KEFET+LKEKKPGDLS ELR +LGMPVGP AHK       +PPPWLIAMQRYGPPPSYPN
Sbjct: 595 KEFETRLKEKKPGDLSDELRISLGMPVGPNAHK-------VPPPWLIAMQRYGPPPSYPN 647

Query: 122 LKIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYGD 181
           LKIPGLN+PIP+ CSFGYHAGGW                      GKPPVDETGKPLYGD
Sbjct: 648 LKIPGLNSPIPESCSFGYHAGGW----------------------GKPPVDETGKPLYGD 685

Query: 182 VFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAE-G 240
           VFGT+A     +  EE+I++T WGELE   EE SEE++EEE +E++  DE+G ITPA+ G
Sbjct: 686 VFGTNAAEFQTKTEEEEIDRTPWGELEPSDEESSEEEEEEESDEDK-PDETGFITPADSG 744

Query: 241 LMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQLFQVLTEKRPDKLGPGMMASTH 300
           L+TP G +SVPAG+ETP+ IELRKKKIE  M+ +E PQLF VL EKR   +G  MM STH
Sbjct: 745 LITPGGFSSVPAGMETPELIELRKKKIEEAMDGSETPQLFTVLPEKRTATVGGAMMGSTH 804

Query: 301 MYEVPGAGASAPGAPVKRTLPSSVELALDPSELDLVDTDAMAARYEQQMREQTSGLQKED 360
           +Y++    +    AP  +     VE+AL P EL+L D  AM  +YE+ +REQ + ++KED
Sbjct: 805 IYDMSTVMSRKGPAPELQ----GVEVALAPEELEL-DPMAMTQKYEEHVREQQAQVEKED 859

Query: 361 LSDMLAEHVARQKNKRKRQQTTQDNKQAKKYKEFKF 396
            SDM+AEH A+QK K+++ Q       +KKYKEFKF
Sbjct: 860 FSDMVAEHAAKQKQKKRKAQPQDSRGGSKKYKEFKF 895




Subunit of the splicing factor SF3B required for 'A' complex assembly formed by the stable binding of U2 snRNP to the branchpoint sequence (BPS) in pre-mRNA. Sequence independent binding of SF3A/SF3B complex upstream of the branch site is essential, it may anchor U2 snRNP to the pre-mRNA. May also be involved in the assembly of the 'E' complex. Belongs also to the minor U12-dependent spliceosome, which is involved in the splicing of rare class of nuclear pre-mRNA intron.
Homo sapiens (taxid: 9606)
>sp|Q9UUI3|SA145_SCHPO Pre-mRNA-splicing factor sap145 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=sap145 PE=1 SV=1 Back     alignment and function description
>sp|Q02554|CUS1_YEAST Cold sensitive U2 snRNA suppressor 1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=CUS1 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query396
242011371 728 Splicing factor 3B subunit, putative [Pe 0.921 0.501 0.693 1e-147
19920622 749 CG3605, isoform A [Drosophila melanogast 0.919 0.485 0.693 1e-141
198474836 758 GA17553 [Drosophila pseudoobscura pseudo 0.921 0.481 0.685 1e-141
195342165 724 GM18174 [Drosophila sechellia] gi|194132 0.919 0.502 0.693 1e-141
195114720 747 GI17098 [Drosophila mojavensis] gi|19391 0.921 0.488 0.686 1e-141
195470953 749 GE14955 [Drosophila yakuba] gi|194173872 0.919 0.485 0.693 1e-141
195148468 758 GL19575 [Drosophila persimilis] gi|19410 0.921 0.481 0.685 1e-141
194759105 749 GF14752 [Drosophila ananassae] gi|190615 0.919 0.485 0.688 1e-141
189239495 685 PREDICTED: similar to CG3605 CG3605-PA [ 0.906 0.524 0.675 1e-140
194855273 752 GG24466 [Drosophila erecta] gi|190660377 0.914 0.481 0.688 1e-140
>gi|242011371|ref|XP_002426426.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis] gi|212510525|gb|EEB13688.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis] Back     alignment and taxonomy information
 Score =  526 bits (1354), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 281/405 (69%), Positives = 322/405 (79%), Gaps = 40/405 (9%)

Query: 1   MEMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYE 60
           MEMRA+L +KE+Q+TLK+KMRER RPKLGKIDIDYQKLHDAFFKWQTKP+++IHGDLYYE
Sbjct: 355 MEMRAALTEKEDQRTLKAKMRERARPKLGKIDIDYQKLHDAFFKWQTKPKMTIHGDLYYE 414

Query: 61  GKEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYP 120
           GKEFETKLKEKKPGDL+ ELRTALGMPVGP+AHK       +PPPWLIAMQRYGPPPSYP
Sbjct: 415 GKEFETKLKEKKPGDLTDELRTALGMPVGPSAHK-------VPPPWLIAMQRYGPPPSYP 467

Query: 121 NLKIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLYG 180
           NLKIPGLNAPIP+GCSFGYHAGGWGKPPVDE+GKPLYGDVFGT G   P           
Sbjct: 468 NLKIPGLNAPIPEGCSFGYHAGGWGKPPVDESGKPLYGDVFGTLGTIIP----------- 516

Query: 181 DVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEEEEGEDESGLITPAEG 240
                      +E  +E++++T WGELESESEE  EE++EEEEE+ + EDE+GL+TPAEG
Sbjct: 517 -----------SEGEDEEVDRTMWGELESESEESEEEEEEEEEEQPK-EDETGLVTPAEG 564

Query: 241 LMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQLFQVLTEKRPDKLGPGMMASTH 300
           L+TPSGITS+PAGLETP+ IELRKKKIESDME +E P L+QVL EKR D++G  MMASTH
Sbjct: 565 LVTPSGITSIPAGLETPEIIELRKKKIESDMEGSETPALYQVLPEKRVDRIGGAMMASTH 624

Query: 301 MYEV---------PGAGASAPGAPVKRTLPSSVELALDPSELDLVDTDAMAARYEQQMRE 351
           +YE+            G+ AP  P +      VELALDPSELDLVDTDAMA RYEQQ+RE
Sbjct: 625 VYEMNPTNATTTTTTTGSGAPLPPRRAGGSGMVELALDPSELDLVDTDAMAVRYEQQIRE 684

Query: 352 QTSGLQKEDLSDMLAEHVARQKNKRKRQQTTQDNKQAKKYKEFKF 396
           Q S LQKEDLSDM+ EHV+RQKNKRKR Q  Q+ K  KKYKEF+F
Sbjct: 685 QQSQLQKEDLSDMVVEHVSRQKNKRKRAQ-IQETKPTKKYKEFRF 728




Source: Pediculus humanus corporis

Species: Pediculus humanus

Genus: Pediculus

Family: Pediculidae

Order: Phthiraptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|19920622|ref|NP_608739.1| CG3605, isoform A [Drosophila melanogaster] gi|442625643|ref|NP_001259979.1| CG3605, isoform B [Drosophila melanogaster] gi|16769632|gb|AAL29035.1| LD45152p [Drosophila melanogaster] gi|22945358|gb|AAF51159.2| CG3605, isoform A [Drosophila melanogaster] gi|440213248|gb|AGB92516.1| CG3605, isoform B [Drosophila melanogaster] Back     alignment and taxonomy information
>gi|198474836|ref|XP_001356831.2| GA17553 [Drosophila pseudoobscura pseudoobscura] gi|198138569|gb|EAL33897.2| GA17553 [Drosophila pseudoobscura pseudoobscura] Back     alignment and taxonomy information
>gi|195342165|ref|XP_002037672.1| GM18174 [Drosophila sechellia] gi|194132522|gb|EDW54090.1| GM18174 [Drosophila sechellia] Back     alignment and taxonomy information
>gi|195114720|ref|XP_002001915.1| GI17098 [Drosophila mojavensis] gi|193912490|gb|EDW11357.1| GI17098 [Drosophila mojavensis] Back     alignment and taxonomy information
>gi|195470953|ref|XP_002087771.1| GE14955 [Drosophila yakuba] gi|194173872|gb|EDW87483.1| GE14955 [Drosophila yakuba] Back     alignment and taxonomy information
>gi|195148468|ref|XP_002015196.1| GL19575 [Drosophila persimilis] gi|194107149|gb|EDW29192.1| GL19575 [Drosophila persimilis] Back     alignment and taxonomy information
>gi|194759105|ref|XP_001961790.1| GF14752 [Drosophila ananassae] gi|190615487|gb|EDV31011.1| GF14752 [Drosophila ananassae] Back     alignment and taxonomy information
>gi|189239495|ref|XP_975513.2| PREDICTED: similar to CG3605 CG3605-PA [Tribolium castaneum] Back     alignment and taxonomy information
>gi|194855273|ref|XP_001968510.1| GG24466 [Drosophila erecta] gi|190660377|gb|EDV57569.1| GG24466 [Drosophila erecta] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query396
UNIPROTKB|Q13435895 SF3B2 "Splicing factor 3B subu 0.484 0.214 0.722 1.3e-77
UNIPROTKB|E9PPJ0878 SF3B2 "Splicing factor 3B subu 0.484 0.218 0.722 1.3e-77
UNIPROTKB|F1MC31896 SF3B2 "Uncharacterized protein 0.484 0.214 0.722 1.3e-77
UNIPROTKB|E2RL65895 SF3B2 "Uncharacterized protein 0.484 0.214 0.722 1.3e-77
UNIPROTKB|F1RU38879 SF3B2 "Uncharacterized protein 0.484 0.218 0.722 1.3e-77
ZFIN|ZDB-GENE-070928-1826 sf3b2 "splicing factor 3b, sub 0.487 0.233 0.710 7.1e-77
FB|FBgn0031493749 CG3605 [Drosophila melanogaste 0.482 0.255 0.733 1e-75
WB|WBGene00021004602 W03F9.10 [Caenorhabditis elega 0.474 0.312 0.638 5.8e-66
UNIPROTKB|O16997602 W03F9.10 "Protein W03F9.10" [C 0.474 0.312 0.638 5.8e-66
DICTYBASE|DDB_G0284555625 sf3b2 "splicing factor 3B subu 0.371 0.235 0.591 3.3e-61
UNIPROTKB|Q13435 SF3B2 "Splicing factor 3B subunit 2" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
 Score = 781 (280.0 bits), Expect = 1.3e-77, P = 1.3e-77
 Identities = 146/202 (72%), Positives = 165/202 (81%)

Query:     2 EMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYEG 61
             EMR +LQ+KEEQKT+KSKMRE+VRPK+GKIDIDYQKLHDAFFKWQTKP+L+IHGDLYYEG
Sbjct:   535 EMREALQEKEEQKTMKSKMREKVRPKMGKIDIDYQKLHDAFFKWQTKPKLTIHGDLYYEG 594

Query:    62 KEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPN 121
             KEFET+LKEKKPGDLS ELR +LGMPVGP AHK       +PPPWLIAMQRYGPPPSYPN
Sbjct:   595 KEFETRLKEKKPGDLSDELRISLGMPVGPNAHK-------VPPPWLIAMQRYGPPPSYPN 647

Query:   122 LKIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGG---WGKPPVDETGKPL 178
             LKIPGLN+PIP+ CSFGYHAGGWGKPPVDETGKPLYGDVFGT       K   +E  +  
Sbjct:   648 LKIPGLNSPIPESCSFGYHAGGWGKPPVDETGKPLYGDVFGTNAAEFQTKTEEEEIDRTP 707

Query:   179 YGDVFGTSAETKTAEVNEEDIE 200
             +G++  +  E+   E  EE  E
Sbjct:   708 WGELEPSDEESSEEEEEEESDE 729


GO:0003676 "nucleic acid binding" evidence=IEA
GO:0019048 "virus-host interaction" evidence=IEA
GO:0071013 "catalytic step 2 spliceosome" evidence=IDA
GO:0005689 "U12-type spliceosomal complex" evidence=IDA
GO:0000398 "mRNA splicing, via spliceosome" evidence=IC;TAS
GO:0006397 "mRNA processing" evidence=TAS
GO:0005681 "spliceosomal complex" evidence=IDA
GO:0005654 "nucleoplasm" evidence=TAS
GO:0008380 "RNA splicing" evidence=TAS
GO:0010467 "gene expression" evidence=TAS
GO:0005634 "nucleus" evidence=IDA
GO:0005730 "nucleolus" evidence=IDA
UNIPROTKB|E9PPJ0 SF3B2 "Splicing factor 3B subunit 2" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1MC31 SF3B2 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|E2RL65 SF3B2 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|F1RU38 SF3B2 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-070928-1 sf3b2 "splicing factor 3b, subunit 2" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
FB|FBgn0031493 CG3605 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
WB|WBGene00021004 W03F9.10 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms
UNIPROTKB|O16997 W03F9.10 "Protein W03F9.10" [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms
DICTYBASE|DDB_G0284555 sf3b2 "splicing factor 3B subunit 2" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q13435SF3B2_HUMANNo assigned EC number0.61610.90900.4022yesN/A

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query396
COG5182429 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA 3e-44
pfam04037129 pfam04037, DUF382, Domain of unknown function (DUF 8e-35
pfam0404648 pfam04046, PSP, PSP 9e-21
smart0058154 smart00581, PSP, proline-rich domain in spliceosom 4e-20
TIGR009271096 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger 8e-05
pfam02724 583 pfam02724, CDC45, CDC45-like protein 0.002
>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and modification] Back     alignment and domain information
 Score =  158 bits (401), Expect = 3e-44
 Identities = 90/289 (31%), Positives = 130/289 (44%), Gaps = 76/289 (26%)

Query: 2   EMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYEG 61
           +MR  ++++E   +L+ ++RERV+PK+G +D+DY+KLHDAFF    KP LS  G+ Y E 
Sbjct: 209 QMRRMMREREAGMSLRERIRERVQPKMGGLDVDYRKLHDAFFSLGPKPYLSKFGEFYEEV 268

Query: 62  KEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPN 121
                 +K+K+PG +SAELR ALG+  G             PPPWL  MQ++G PPSYP+
Sbjct: 269 DNDYRFVKKKRPGAISAELREALGIDSGT------------PPPWLFNMQKHGMPPSYPD 316

Query: 122 LKIPGLN-APIP-QGCSFGYHAGGWGKPPVDETGKPLYGDVFGTGGWGKPPVDETGKPLY 179
           LKIPGLN APIP +G  +GY   GW +P      +                         
Sbjct: 317 LKIPGLNWAPIPLEGDVYGYQPPGWHEP----LFEV------------------------ 348

Query: 180 GDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEEEEE--EEGEDESGLITP 237
                                +TA GEL     EE   D+E+      E  + E  +   
Sbjct: 349 -------------------GPETAEGELLQFEAEEIYNDEEQYTRPVYENEDFEERVDDV 389

Query: 238 AEGLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAP---QLFQVL 283
           + G                P ++E+  K+     +  +A     L+ VL
Sbjct: 390 SNG----------GLAEFAPAAVEVAPKEDAKVTKRGKAKREKLLYSVL 428


Length = 429

>gnl|CDD|112836 pfam04037, DUF382, Domain of unknown function (DUF382) Back     alignment and domain information
>gnl|CDD|202864 pfam04046, PSP, PSP Back     alignment and domain information
>gnl|CDD|128850 smart00581, PSP, proline-rich domain in spliceosome associated proteins Back     alignment and domain information
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger Back     alignment and domain information
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 396
KOG2330|consensus500 100.0
COG5182429 CUS1 Splicing factor 3b, subunit 2 [RNA processing 100.0
KOG2330|consensus500 100.0
PF04037129 DUF382: Domain of unknown function (DUF382) ; Inte 99.95
smart0058154 PSP proline-rich domain in spliceosome associated 99.87
PF0404648 PSP: PSP; InterPro: IPR006568 PSP is a proline-ric 99.8
COG5182429 CUS1 Splicing factor 3b, subunit 2 [RNA processing 99.73
PF04037129 DUF382: Domain of unknown function (DUF382) ; Inte 99.27
KOG2673|consensus485 97.76
smart0058154 PSP proline-rich domain in spliceosome associated 91.74
PF0404648 PSP: PSP; InterPro: IPR006568 PSP is a proline-ric 91.37
>KOG2330|consensus Back     alignment and domain information
Probab=100.00  E-value=4.3e-99  Score=754.82  Aligned_cols=299  Identities=65%  Similarity=1.114  Sum_probs=260.1

Q ss_pred             ChhhhhhhhhhhhHHhHHHHHHhhCCCCCccCCchhhhhhhcCCCCCCCCccccccccccccchhhccccCCCCCCcHHH
Q psy12225          1 MEMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYEGKEFETKLKEKKPGDLSAEL   80 (396)
Q Consensus         1 ~~mr~~~~~~e~~~~lK~k~Rerv~pk~g~ldiDy~~lhDaffk~qtkP~l~~~gd~Yyegke~e~~~~~~~Pg~Ls~eL   80 (396)
                      ++||++++|+|+.++||++||+|||||||+||||||+||||||+|||||.||+||++|||||||+++.++++||.||+||
T Consensus       201 ~emR~a~~e~e~~~sLk~kmRerv~PkmGkidiDyqkLhdaFFk~qtkp~lt~~Ge~yyegke~e~~~k~k~PG~iS~eL  280 (500)
T KOG2330|consen  201 QEMREALLEREAGKSLKEKMRERVRPKMGKIDIDYQKLHDAFFKWQTKPYLTKFGELYYEGKELEAMVKEKKPGDISDEL  280 (500)
T ss_pred             HHHHHHHHhhhccccHHHHHHHhhCcccccccchHHHHHHHHHhcCCCcceeecceeeecchhHHHHHhhcCccchhHHH
Confidence            58999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHhCCCCCCCCCcccccccCCCChHHHHhhhcCCCCCCCCcccCCCCCCCCCCCCCCccCCCCCCCCCCCCCCCCCCCc
Q psy12225         81 RTALGMPVGPTAHKTFSRETVIPPPWLIAMQRYGPPPSYPNLKIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGDV  160 (396)
Q Consensus        81 r~ALg~~~~~~~~k~~~re~~~Pppw~~~mq~~g~pPsyp~lkiPGLn~pip~g~~~g~~~g~wGkPP~de~~~plyGdv  160 (396)
                      |.||||+.|            +||||||||||||||||||+|||||||||||+||+||||+||||  |||++|+||||||
T Consensus       281 r~aLgmp~g------------~pPPWl~aMqryGpPpsYPdlkIpGLNapIPeg~s~Gyh~gGWG--pVDe~g~PLygDV  346 (500)
T KOG2330|consen  281 RIALGMPVG------------TPPPWLIAMQRYGPPPSYPDLKIPGLNAPIPEGCSFGYHAGGWG--PVDEFGKPLYGDV  346 (500)
T ss_pred             HHHhCCCCC------------CCChHHHHhhhcCCCCCCCcccCCCCCCCCCcccccccccCCCc--cccccCCccchhc
Confidence            999999998            99999999999999999999999999999999999999999999  8998888888888


Q ss_pred             cCCCCCCCCCCCCCCCCcccccccccccccccccchhhhhhccccccCCCCccCCcccchhh-hhhccccccCCCCCCCC
Q psy12225        161 FGTGGWGKPPVDETGKPLYGDVFGTSAETKTAEVNEEDIEKTAWGELESESEEESEEDDEEE-EEEEEGEDESGLITPAE  239 (396)
Q Consensus       161 FG~gg~GKPPvDE~GrPLYGdVFG~~~~~~~~~~~~e~vek~~WGEL~~EeEEeeEEEEEEE-EeEEeeede~g~~tp~~  239 (396)
                      ||+...                      ..........+++.+||+|.++.++++||.+||| ++.++-.++.++++|++
T Consensus       347 fG~~~p----------------------~~~~~t~es~~~rn~wgel~~e~~E~~EEreeee~~d~~~~~e~gg~~dp~d  404 (500)
T KOG2330|consen  347 FGLNIP----------------------EHHNGTKESEIERNHWGELESEEEESSEEREEEEREDKNDASENGGAVDPAD  404 (500)
T ss_pred             cccccc----------------------cccccccccccccccccccccccchhhhhhhhhhhhhhhhhhhccccCCccc
Confidence            885442                      1122223445678999999986555554432222 22222345567888887


Q ss_pred             CcccCCCccccCCCCCCCchhhhhccccccccCCCCCCCCeEeccccccCCCCCCCCCCcceeEccCCCCCCCCCCCCCC
Q psy12225        240 GLMTPSGITSVPAGLETPDSIELRKKKIESDMETNEAPQLFQVLTEKRPDKLGPGMMASTHMYEVPGAGASAPGAPVKRT  319 (396)
Q Consensus       240 Gl~TPsGl~SvpsGleTP~~idlrk~~~~s~~eg~etP~LYtVLpEk~~~~~~~~~~GS~h~Ydv~~~~~~~~~~~~~~~  319 (396)
                       ++||+||+|+|+|+|||+.|+|||..+.     .++++|||||||+++++++++||||+|+|||+..++.         
T Consensus       405 -~~~~~~Lts~ptgiEtpd~iELRK~k~e-----e~~r~LYqVLpEK~t~~igg~mmgstH~Ydis~~~a~---------  469 (500)
T KOG2330|consen  405 -DIVPSGLTSFPTGIETPDAIELRKKKRE-----EETRALYQVLPEKRTSRIGGTMMGSTHTYDISTGTAD---------  469 (500)
T ss_pred             -cccccccccCCccccChhHHHHHhhccc-----cccHHHHHhccccccccccceeeccceEEeccccccc---------
Confidence             9999999999999999999999998865     4678999999999999888899999999999875443         


Q ss_pred             CCCCeeeecCcccccCCCHHHHHHHHHHHHHHhc
Q psy12225        320 LPSSVELALDPSELDLVDTDAMAARYEQQMREQT  353 (396)
Q Consensus       320 ~~~~v~v~l~p~~l~~~~~~~~~~~ye~~~~~~~  353 (396)
                      +..||+|||||++|+. +..+|  +||+++++|+
T Consensus       470 k~~GV~~sL~Peeld~-d~~~l--~yee~~reqk  500 (500)
T KOG2330|consen  470 KTPGVEFSLDPEELDA-DGMAL--RYEEQRREQK  500 (500)
T ss_pred             cCCCceeecChhhhcc-chhhH--HHHHHHhhcC
Confidence            2347999999999997 77788  9999999873



>COG5182 CUS1 Splicing factor 3b, subunit 2 [RNA processing and modification] Back     alignment and domain information
>KOG2330|consensus Back     alignment and domain information
>PF04037 DUF382: Domain of unknown function (DUF382) ; InterPro: IPR007180 This domain is specific to the human splicing factor 3b subunit 2 and its orthologs Back     alignment and domain information
>smart00581 PSP proline-rich domain in spliceosome associated proteins Back     alignment and domain information
>PF04046 PSP: PSP; InterPro: IPR006568 PSP is a proline-rich domain of unknown function found in spliceosome associated proteins Back     alignment and domain information
>COG5182 CUS1 Splicing factor 3b, subunit 2 [RNA processing and modification] Back     alignment and domain information
>PF04037 DUF382: Domain of unknown function (DUF382) ; InterPro: IPR007180 This domain is specific to the human splicing factor 3b subunit 2 and its orthologs Back     alignment and domain information
>KOG2673|consensus Back     alignment and domain information
>smart00581 PSP proline-rich domain in spliceosome associated proteins Back     alignment and domain information
>PF04046 PSP: PSP; InterPro: IPR006568 PSP is a proline-rich domain of unknown function found in spliceosome associated proteins Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query396
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 8e-05
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 44.1 bits (103), Expect = 8e-05
 Identities = 59/492 (11%), Positives = 122/492 (24%), Gaps = 138/492 (28%)

Query: 1   MEMRASLQDKEEQKTLKSKMRERVRPKLGKIDIDYQKLHDAFFKWQTKPRLSIHGDLYYE 60
           M+        + +  L               + D + + D       K  LS        
Sbjct: 7   MDFETGEHQYQYKDIL-----SVFEDAFVD-NFDCKDVQDM-----PKSILS-------- 47

Query: 61  GKEFETKLKEKKPGDLSAELRTALGMPVGPTAHKTFSRETVIPP--PWLIAMQRYGPPPS 118
            +E +  +  K     +  L   L +       + F  E V+     +L  M        
Sbjct: 48  KEEIDHIIMSKDAVSGTLRLFWTL-LSKQEEMVQKFV-EEVLRINYKFL--MSPIKTEQR 103

Query: 119 YPNL----------KIPGLNAPIPQGCSFGYHAGGWGKPPVDETGKPLYGD-------VF 161
            P++          ++   N    +     Y+       P  +  + L          + 
Sbjct: 104 QPSMMTRMYIEQRDRLYNDNQVFAK-----YNV--SRLQPYLKLRQALLELRPAKNVLID 156

Query: 162 GTGGWGKPPV-------DETGKPLYGDVF-GTSAETKTAEVNEEDIEKTAWGELESESEE 213
           G  G GK  V        +    +   +F        + E   E ++K    +++     
Sbjct: 157 GVLGSGKTWVALDVCLSYKVQCKMDFKIFWLNLKNCNSPETVLEMLQKL-LYQIDPNWTS 215

Query: 214 ESEEDDEEEEEEEEGEDE-SGLITPAEG-------------------------LMTPSGI 247
            S+     +      + E   L+                              L+T +  
Sbjct: 216 RSDHSSNIKLRIHSIQAELRRLLKSKPYENCLLVLLNVQNAKAWNAFNLSCKILLT-TRF 274

Query: 248 TSVPAGLETPDSIELRKKKIESDMETNEAPQLFQVLTEKRPDKLGPGMMASTHMYEVPGA 307
             V   L    +  +        +  +E   L     + RP  L P  + +T+   +   
Sbjct: 275 KQVTDFLSAATTTHISLDHHSMTLTPDEVKSLLLKYLDCRPQDL-PREVLTTNPRRLSII 333

Query: 308 GASAPGAP-------------VKRTLPSSVELALDPSEL-----DLV----DTDAMAARY 345
             S                  +   + SS+   L+P+E       L              
Sbjct: 334 AESIRDGLATWDNWKHVNCDKLTTIIESSLN-VLEPAEYRKMFDRLSVFPPSAHIPTILL 392

Query: 346 E---QQMREQTSGLQKEDLSDMLAE-H----VARQKNKRK--------RQQTTQDNKQAK 389
                 + +        D+  ++ + H    V +Q  +            +   +N+ A 
Sbjct: 393 SLIWFDVIK-------SDVMVVVNKLHKYSLVEKQPKESTISIPSIYLELKVKLENEYAL 445

Query: 390 ------KYKEFK 395
                  Y   K
Sbjct: 446 HRSIVDHYNIPK 457


Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00