Psyllid ID: psy9417


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-
ELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIDAIIHQARLQFKYKDVYAPRFARNLYRTFGYVFQCPDEDRKK
ccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHcccccccccc
ccHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccEEEHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHccccccccc
ellsinstkppiskakITGIVKLALKATHYYKYVVQSVEKFIWKCKsqyklpglyvigewnqellsinstkppiskakITGIVKLALKATHYYKYVVQSVEKFIWKCKsqyklpglyVIDAIIHQARLQFKykdvyaprfarnlyrtfgyvfqcpdedrkk
ellsinstkppiskakITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIGEWNQELLSinstkppiskAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIDAIIHQARLQFKYKDVYAPRFARNLYRTfgyvfqcpdedrkk
ELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIDAIIHQARLQFKYKDVYAPRFARNLYRTFGYVFQCPDEDRKK
**************AKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIDAIIHQARLQFKYKDVYAPRFARNLYRTFGYVFQC*******
***S****KPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIGEWNQELLSIN*******KAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIDAIIHQARLQFKYKDVYAPRFARNLYRTFGYVFQ*P******
ELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIDAIIHQARLQFKYKDVYAPRFARNLYRTFGYVFQCPDEDRKK
*LLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIDAIIHQARLQFKYKDVYAPRFARNLYRTFGYVFQCPD*****
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
ELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIDAIIHQARLQFKYKDVYAPRFARNLYRTFGYVFQCPDEDRKK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query161 2.2.26 [Sep-21-2011]
Q9UPN6 1271 Protein SCAF8 OS=Homo sap yes N/A 0.652 0.082 0.575 5e-31
Q6DID3 1268 Protein SCAF8 OS=Mus musc yes N/A 0.652 0.082 0.575 7e-31
Q63623 1268 Protein SCAF8 OS=Rattus n yes N/A 0.652 0.082 0.566 1e-29
O95104 1147 Splicing factor, arginine no N/A 0.652 0.091 0.556 1e-28
>sp|Q9UPN6|SCAF8_HUMAN Protein SCAF8 OS=Homo sapiens GN=SCAF8 PE=1 SV=1 Back     alignment and function desciption
 Score =  133 bits (334), Expect = 5e-31,   Method: Composition-based stats.
 Identities = 61/106 (57%), Positives = 82/106 (77%), Gaps = 1/106 (0%)

Query: 57  IGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGL 116
           +  +N EL S+N  KPPISKAK+T I K A+KA  +YK+VVQSVEKFI KCK +YK+PGL
Sbjct: 4   VKTFNSELYSLNDYKPPISKAKMTQITKAAIKAIKFYKHVVQSVEKFIQKCKPEYKVPGL 63

Query: 117 YVIDAIIHQARLQF-KYKDVYAPRFARNLYRTFGYVFQCPDEDRKK 161
           YVID+I+ Q+R QF + KDV+APRF+ N+  TF  +++CP +D+ K
Sbjct: 64  YVIDSIVRQSRHQFGQEKDVFAPRFSNNIISTFQNLYRCPGDDKSK 109




May play a role in mRNA processing.
Homo sapiens (taxid: 9606)
>sp|Q6DID3|SCAF8_MOUSE Protein SCAF8 OS=Mus musculus GN=Scaf8 PE=1 SV=1 Back     alignment and function description
>sp|Q63623|SCAF8_RAT Protein SCAF8 OS=Rattus norvegicus GN=Scaf8 PE=1 SV=1 Back     alignment and function description
>sp|O95104|SFR15_HUMAN Splicing factor, arginine/serine-rich 15 OS=Homo sapiens GN=SCAF4 PE=1 SV=3 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query161
444732459 1022 Protein SCAF8 [Tupaia chinensis] 0.788 0.124 0.475 6e-33
157167923 1337 hypothetical protein AaeL_AAEL002986 [Ae 0.633 0.076 0.611 3e-31
170050233 1014 conserved hypothetical protein [Culex qu 0.633 0.100 0.611 5e-31
347969782 1392 AGAP003369-PA [Anopheles gambiae str. PE 0.633 0.073 0.601 1e-30
312381414 1644 hypothetical protein AND_06286 [Anophele 0.621 0.060 0.613 1e-30
348517487 1364 PREDICTED: protein SCAF8-like [Oreochrom 0.652 0.076 0.566 1e-29
402867872 1316 PREDICTED: protein SCAF8 isoform 3 [Papi 0.652 0.079 0.575 2e-29
402867868 1349 PREDICTED: protein SCAF8 isoform 1 [Papi 0.652 0.077 0.575 2e-29
332825315 1316 PREDICTED: protein SCAF8 isoform 4 [Pan 0.652 0.079 0.575 2e-29
221044370 1316 unnamed protein product [Homo sapiens] 0.652 0.079 0.575 2e-29
>gi|444732459|gb|ELW72751.1| Protein SCAF8 [Tupaia chinensis] Back     alignment and taxonomy information
 Score =  145 bits (365), Expect = 6e-33,   Method: Composition-based stats.
 Identities = 77/162 (47%), Positives = 99/162 (61%), Gaps = 35/162 (21%)

Query: 1   ELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIGEW 60
           EL S+N  KPPISKAK+T I K A+KA   Y                             
Sbjct: 10  ELYSLNDYKPPISKAKMTQITKAAIKAIKLY----------------------------- 40

Query: 61  NQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVID 120
                S+N  KPPISKAK+T I K A+KA  +YK+VVQSVEKFI KCK +YK+PGLYVID
Sbjct: 41  -----SLNDYKPPISKAKMTQITKAAIKAIKFYKHVVQSVEKFIQKCKPEYKVPGLYVID 95

Query: 121 AIIHQARLQF-KYKDVYAPRFARNLYRTFGYVFQCPDEDRKK 161
           +I+ Q+R QF + KDV+APRF+ N+  TF  +++CP +D+ K
Sbjct: 96  SIVRQSRHQFGQEKDVFAPRFSNNIISTFQNLYRCPGDDKSK 137




Source: Tupaia chinensis

Species: Tupaia chinensis

Genus: Tupaia

Family: Tupaiidae

Order: Scandentia

Class: Mammalia

Phylum: Chordata

Superkingdom: Eukaryota

>gi|157167923|ref|XP_001662905.1| hypothetical protein AaeL_AAEL002986 [Aedes aegypti] gi|108881522|gb|EAT45747.1| AAEL002986-PA [Aedes aegypti] Back     alignment and taxonomy information
>gi|170050233|ref|XP_001859912.1| conserved hypothetical protein [Culex quinquefasciatus] gi|167871912|gb|EDS35295.1| conserved hypothetical protein [Culex quinquefasciatus] Back     alignment and taxonomy information
>gi|347969782|ref|XP_314272.5| AGAP003369-PA [Anopheles gambiae str. PEST] gi|333469268|gb|EAA09643.5| AGAP003369-PA [Anopheles gambiae str. PEST] Back     alignment and taxonomy information
>gi|312381414|gb|EFR27170.1| hypothetical protein AND_06286 [Anopheles darlingi] Back     alignment and taxonomy information
>gi|348517487|ref|XP_003446265.1| PREDICTED: protein SCAF8-like [Oreochromis niloticus] Back     alignment and taxonomy information
>gi|402867872|ref|XP_003898053.1| PREDICTED: protein SCAF8 isoform 3 [Papio anubis] Back     alignment and taxonomy information
>gi|402867868|ref|XP_003898051.1| PREDICTED: protein SCAF8 isoform 1 [Papio anubis] Back     alignment and taxonomy information
>gi|332825315|ref|XP_003311605.1| PREDICTED: protein SCAF8 isoform 4 [Pan troglodytes] Back     alignment and taxonomy information
>gi|221044370|dbj|BAH13862.1| unnamed protein product [Homo sapiens] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query161
UNIPROTKB|F1SB23 1347 SCAF8 "Uncharacterized protein 0.714 0.085 0.537 7.3e-28
UNIPROTKB|F1LPT8 1267 Rbm16 "Protein Rbm16" [Rattus 0.633 0.080 0.592 8.5e-28
MGI|MGI:1925212 1268 Scaf8 "SR-related CTD-associat 0.633 0.080 0.592 8.5e-28
UNIPROTKB|Q9UPN6 1271 SCAF8 "Protein SCAF8" [Homo sa 0.633 0.080 0.592 8.5e-28
ZFIN|ZDB-GENE-030131-4817 1286 wu:fd99c08 "wu:fd99c08" [Danio 0.633 0.079 0.592 1.4e-27
UNIPROTKB|F1NW30 1288 Gga.54017 "Uncharacterized pro 0.670 0.083 0.568 1.8e-27
UNIPROTKB|F1MYK6 1268 LOC540182 "Uncharacterized pro 0.633 0.080 0.572 4.7e-27
UNIPROTKB|F1PC86 1267 SCAF8 "Uncharacterized protein 0.608 0.077 0.595 1.6e-26
RGD|708362 1268 Scaf8 "SR-related CTD-associat 0.633 0.080 0.582 1.6e-26
UNIPROTKB|B7Z888 1337 SCAF8 "Protein SCAF8" [Homo sa 0.608 0.073 0.595 1.8e-26
UNIPROTKB|F1SB23 SCAF8 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
 Score = 325 (119.5 bits), Expect = 7.3e-28, P = 7.3e-28
 Identities = 65/121 (53%), Positives = 88/121 (72%)

Query:    47 SQYKLPGLYV-----IGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVE 101
             S++  PGL       +  +N EL S+N  KPPISKAK+T I K A+KA  +YK+VVQSVE
Sbjct:    67 SRFPGPGLCTDNMEAVKTFNSELYSLNDYKPPISKAKMTQITKAAIKAIKFYKHVVQSVE 126

Query:   102 KFIWKCKSQYKLPGLYVIDAIIHQARLQF-KYKDVYAPRFARNLYRTFGYVFQCPDEDRK 160
             KFI KCK +YK+PGLYVID+I+ Q+R QF + KDV+APRF+ N+  TF  +++CP +D+ 
Sbjct:   127 KFIQKCKPEYKVPGLYVIDSIVRQSRHQFGQEKDVFAPRFSNNIISTFQNLYRCPGDDKS 186

Query:   161 K 161
             K
Sbjct:   187 K 187


GO:0043175 "RNA polymerase core enzyme binding" evidence=IEA
GO:0005634 "nucleus" evidence=IEA
GO:0003676 "nucleic acid binding" evidence=IEA
GO:0000166 "nucleotide binding" evidence=IEA
UNIPROTKB|F1LPT8 Rbm16 "Protein Rbm16" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
MGI|MGI:1925212 Scaf8 "SR-related CTD-associated factor 8" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|Q9UPN6 SCAF8 "Protein SCAF8" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-030131-4817 wu:fd99c08 "wu:fd99c08" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
UNIPROTKB|F1NW30 Gga.54017 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|F1MYK6 LOC540182 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|F1PC86 SCAF8 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
RGD|708362 Scaf8 "SR-related CTD-associated factor 8" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|B7Z888 SCAF8 "Protein SCAF8" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q6DID3SCAF8_MOUSENo assigned EC number0.57540.65210.0828yesN/A
Q63623SCAF8_RATNo assigned EC number0.56600.65210.0828yesN/A
Q9UPN6SCAF8_HUMANNo assigned EC number0.57540.65210.0826yesN/A

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query161
cd03562114 cd03562, CID, CID (CTD-Interacting Domain) domain 8e-15
smart00582124 smart00582, RPR, domain present in proteins, which 4e-14
cd03562114 cd03562, CID, CID (CTD-Interacting Domain) domain 3e-10
smart00582124 smart00582, RPR, domain present in proteins, which 4e-07
pfam0481864 pfam04818, CTD_bind, RNA polymerase II-binding dom 9e-05
>gnl|CDD|239621 cd03562, CID, CID (CTD-Interacting Domain) domain family; CID is present in several RNA-processing factors such as Pcf11 and Nrd1 Back     alignment and domain information
 Score = 66.1 bits (162), Expect = 8e-15
 Identities = 26/89 (29%), Positives = 46/89 (51%), Gaps = 7/89 (7%)

Query: 74  ISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIDAIIHQARLQFKYK 133
            S+  I  + KLA++   + K +V+ +EK I KC  + KLP LY++D+I+     ++K  
Sbjct: 16  NSQPSIQTLTKLAIENRKHAKEIVEIIEKHIKKCPPEQKLPLLYLLDSIVKNVGRKYKEF 75

Query: 134 DVYAPRFARNLYRTFGYVFQ-CPDEDRKK 161
                 F+  L   F   ++   ++ RKK
Sbjct: 76  ------FSEFLVPLFLDAYEKVDEKTRKK 98


Pcf11 is a conserved and essential subunit of the yeast cleavage factor IA, which is required for polyadenylation-dependent 3'-RNA processing and transcription termination. Nrd1 is implicated in polyadenylation-independent 3'-RNA processing. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II. During transcription, Pol II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr 1-Ser 2-Pro 3-Thr 4-Ser 5-Pro 6-Ser 7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices. Length = 114

>gnl|CDD|214731 smart00582, RPR, domain present in proteins, which are involved in regulation of nuclear pre-mRNA Back     alignment and domain information
>gnl|CDD|239621 cd03562, CID, CID (CTD-Interacting Domain) domain family; CID is present in several RNA-processing factors such as Pcf11 and Nrd1 Back     alignment and domain information
>gnl|CDD|214731 smart00582, RPR, domain present in proteins, which are involved in regulation of nuclear pre-mRNA Back     alignment and domain information
>gnl|CDD|218279 pfam04818, CTD_bind, RNA polymerase II-binding domain Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 161
KOG0132|consensus 894 100.0
KOG0132|consensus 894 99.93
cd03562114 CID CID (CTD-Interacting Domain) domain family; CI 99.86
KOG2071|consensus 579 99.73
smart00582121 RPR domain present in proteins, which are involved 99.7
cd03562114 CID CID (CTD-Interacting Domain) domain family; CI 98.97
smart00582121 RPR domain present in proteins, which are involved 98.72
PF0481864 CTD_bind: RNA polymerase II-binding domain.; Inter 98.6
cd00197115 VHS_ENTH_ANTH VHS, ENTH and ANTH domain superfamil 97.97
KOG2669|consensus 325 97.5
PF12243139 CTK3: CTD kinase subunit gamma CTK3 97.09
KOG2071|consensus 579 96.31
PF00790140 VHS: VHS domain; InterPro: IPR002014 The VHS domai 95.46
cd03561133 VHS VHS domain family; The VHS domain is present i 94.73
KOG0151|consensus 877 94.49
cd03572122 ENTH_epsin_related ENTH domain, Epsin Related fami 88.16
smart00288133 VHS Domain present in VPS-27, Hrs and STAM. Unpubl 86.8
PF01417125 ENTH: ENTH domain; InterPro: IPR001026 The ENTH (E 86.24
>KOG0132|consensus Back     alignment and domain information
Probab=100.00  E-value=9.9e-35  Score=273.74  Aligned_cols=107  Identities=54%  Similarity=0.874  Sum_probs=104.6

Q ss_pred             HHHHhhHHHhhhhhcCCCCCcHHHHHHHHHHHHHhhHHHHHHHHHHHHHHhhcCCCCcchhHHHHHHHHHHHhhhhc-cc
Q psy9417          55 YVIGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIDAIIHQARLQFK-YK  133 (161)
Q Consensus        55 yvIDef~~eL~sL~~~k~~iSrskI~~LT~lA~enik~~k~IV~~Ie~~I~k~~p~~KL~~LYVIDSIvRn~~~~~~-~k  133 (161)
                      .+|.+|+.||+||.++|+||||+||..||++|++|++.|+|||+.||+||+||+|+||||+|||||||||+++||++ +|
T Consensus         2 e~v~~Fn~eL~SL~DsK~~IS~sKi~~ITkaAikaIk~ykhVVqsVeKfi~kCkpe~Kl~gLYVIDSIVRqsrhq~~~~k   81 (894)
T KOG0132|consen    2 DAVKEFNGELDSLEDSKPGISGSKILKITKAAIKAIKLYKHVVQSVEKFIKKCKPEYKLPGLYVIDSIVRQSRHQFGKEK   81 (894)
T ss_pred             hHHHHHHHHHHHhhccCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccccCeeEEehHHHHHHHHhhcccc
Confidence            47899999999999999999999999999999999999999999999999999999999999999999999999999 99


Q ss_pred             CccHHhhHHHHHHHHhhhhcCCCcccCC
Q psy9417         134 DVYAPRFARNLYRTFGYVFQCPDEDRKK  161 (161)
Q Consensus       134 d~Y~~rF~~nl~~tf~~~~~c~~~dk~k  161 (161)
                      |+|++||.+|+..||+++|+|+.+||++
T Consensus        82 d~F~prf~~n~~~tf~~L~~c~~edks~  109 (894)
T KOG0132|consen   82 DVFGPRFSKNFTGTFQNLYECPQEDKSD  109 (894)
T ss_pred             cccCCccchhHHHHHHHHHhcCHHHHHH
Confidence            9999999999999999999999999864



>KOG0132|consensus Back     alignment and domain information
>cd03562 CID CID (CTD-Interacting Domain) domain family; CID is present in several RNA-processing factors such as Pcf11 and Nrd1 Back     alignment and domain information
>KOG2071|consensus Back     alignment and domain information
>smart00582 RPR domain present in proteins, which are involved in regulation of nuclear pre-mRNA Back     alignment and domain information
>cd03562 CID CID (CTD-Interacting Domain) domain family; CID is present in several RNA-processing factors such as Pcf11 and Nrd1 Back     alignment and domain information
>smart00582 RPR domain present in proteins, which are involved in regulation of nuclear pre-mRNA Back     alignment and domain information
>PF04818 CTD_bind: RNA polymerase II-binding domain Back     alignment and domain information
>cd00197 VHS_ENTH_ANTH VHS, ENTH and ANTH domain superfamily; composed of proteins containing a VHS, ENTH or ANTH domain Back     alignment and domain information
>KOG2669|consensus Back     alignment and domain information
>PF12243 CTK3: CTD kinase subunit gamma CTK3 Back     alignment and domain information
>KOG2071|consensus Back     alignment and domain information
>PF00790 VHS: VHS domain; InterPro: IPR002014 The VHS domain is a ~140 residues long domain, whose name is derived from its occurrence in VPS-27, Hrs and STAM Back     alignment and domain information
>cd03561 VHS VHS domain family; The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule) Back     alignment and domain information
>KOG0151|consensus Back     alignment and domain information
>cd03572 ENTH_epsin_related ENTH domain, Epsin Related family; composed of hypothetical proteins containing an ENTH-like domain Back     alignment and domain information
>smart00288 VHS Domain present in VPS-27, Hrs and STAM Back     alignment and domain information
>PF01417 ENTH: ENTH domain; InterPro: IPR001026 The ENTH (Epsin N-terminal homology) domain is approximately 150 amino acids in length and is always found located at the N-termini of proteins Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query161
2diw_A152 Solution Structure Of The Rpr Domain Of Putative Rn 5e-31
3d9p_A145 Snapshots Of The Rna Processing Factor Scaf8 Bound 5e-31
3d9j_A145 Snapshots Of The Rna Processing Factor Scaf8 Bound 6e-31
3d9i_A145 Snapshots Of The Rna Processing Factor Scaf8 Bound 2e-30
3d9o_A145 Snapshots Of The Rna Processing Factor Scaf8 Bound 7e-30
>pdb|2DIW|A Chain A, Solution Structure Of The Rpr Domain Of Putative Rna- Binding Protein 16 Length = 152 Back     alignment and structure

Iteration: 1

Score = 130 bits (326), Expect = 5e-31, Method: Compositional matrix adjust. Identities = 61/109 (55%), Positives = 83/109 (76%), Gaps = 1/109 (0%) Query: 54 LYVIGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKL 113 + + +N EL S+N KPPISKAK+T I K A+KA +YK+VVQSVEKFI KCK +YK+ Sbjct: 10 MEAVKTFNSELYSLNDYKPPISKAKMTQITKAAIKAIKFYKHVVQSVEKFIQKCKPEYKV 69 Query: 114 PGLYVIDAIIHQARLQF-KYKDVYAPRFARNLYRTFGYVFQCPDEDRKK 161 PGLYVID+I+ Q+R QF + KDV+APRF+ N+ TF +++CP +D+ K Sbjct: 70 PGLYVIDSIVRQSRHQFGQEKDVFAPRFSNNIISTFQNLYRCPGDDKSK 118
>pdb|3D9P|A Chain A, Snapshots Of The Rna Processing Factor Scaf8 Bound To Different Phosphorylated Forms Of The Carboxy-Terminal Domain Of Rna-Polymerase Ii Length = 145 Back     alignment and structure
>pdb|3D9J|A Chain A, Snapshots Of The Rna Processing Factor Scaf8 Bound To Different Phosphorylated Forms Of The Carboxy-Terminal Domain Of Rna-Polymerase Ii Length = 145 Back     alignment and structure
>pdb|3D9I|A Chain A, Snapshots Of The Rna Processing Factor Scaf8 Bound To Different Phosphorylated Forms Of The Carboxy-Terminal Domain Of Rna-Polymerase Ii Length = 145 Back     alignment and structure
>pdb|3D9O|A Chain A, Snapshots Of The Rna Processing Factor Scaf8 Bound To Different Phosphorylated Forms Of The Carboxy-Terminal Domain Of Rna-Polymerase Ii Length = 145 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query161
3d9j_A145 RNA-binding protein 16; SCAF8, RNA polymerase II C 9e-27
3d9j_A145 RNA-binding protein 16; SCAF8, RNA polymerase II C 4e-13
3clj_A157 Protein NRD1; CTD-interacting domain, nucleus, pho 7e-21
3clj_A157 Protein NRD1; CTD-interacting domain, nucleus, pho 6e-13
1sz9_A144 PCF11 protein; RNA polymerase II CTD interacting d 2e-16
1sz9_A144 PCF11 protein; RNA polymerase II CTD interacting d 1e-05
4fld_A135 Regulation of nuclear PRE-mRNA domain-containing 1 1e-12
4fld_A135 Regulation of nuclear PRE-mRNA domain-containing 1 2e-04
4flb_A132 Regulation of nuclear PRE-mRNA domain-containing; 4e-11
4flb_A132 Regulation of nuclear PRE-mRNA domain-containing; 2e-04
2km4_A142 Regulator of TY1 transposition protein 103; CTD-in 2e-08
>3d9j_A RNA-binding protein 16; SCAF8, RNA polymerase II CTD interacting domain, arm repeats phospho-CTD, phosphoprotein, transcription; 1.60A {Homo sapiens} PDB: 3d9k_A* 3d9l_A* 3d9m_A* 3d9n_A* 3d9p_A* 3d9i_A 3d9o_A* 2diw_A Length = 145 Back     alignment and structure
 Score = 97.5 bits (242), Expect = 9e-27
 Identities = 61/107 (57%), Positives = 82/107 (76%), Gaps = 1/107 (0%)

Query: 56  VIGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPG 115
            +  +N EL S+N  KPPISKAK+T I K A+KA  +YK+VVQSVEKFI KCK +YK+PG
Sbjct: 3   AVKTFNSELYSLNDYKPPISKAKMTQITKAAIKAIKFYKHVVQSVEKFIQKCKPEYKVPG 62

Query: 116 LYVIDAIIHQARLQF-KYKDVYAPRFARNLYRTFGYVFQCPDEDRKK 161
           LYVID+I+ Q+R QF + KDV+APRF+ N+  TF  +++CP +D+ K
Sbjct: 63  LYVIDSIVRQSRHQFGQEKDVFAPRFSNNIISTFQNLYRCPGDDKSK 109


>3d9j_A RNA-binding protein 16; SCAF8, RNA polymerase II CTD interacting domain, arm repeats phospho-CTD, phosphoprotein, transcription; 1.60A {Homo sapiens} PDB: 3d9k_A* 3d9l_A* 3d9m_A* 3d9n_A* 3d9p_A* 3d9i_A 3d9o_A* 2diw_A Length = 145 Back     alignment and structure
>3clj_A Protein NRD1; CTD-interacting domain, nucleus, phosphoprotein, RNA polymer binding protein, RNA binding protein; 2.10A {Saccharomyces cerevisiae} Length = 157 Back     alignment and structure
>3clj_A Protein NRD1; CTD-interacting domain, nucleus, phosphoprotein, RNA polymer binding protein, RNA binding protein; 2.10A {Saccharomyces cerevisiae} Length = 157 Back     alignment and structure
>1sz9_A PCF11 protein; RNA polymerase II CTD interacting domain, arm repeats, transcription; 2.10A {Saccharomyces cerevisiae} SCOP: a.118.9.4 PDB: 1sza_A* 2bf0_X Length = 144 Back     alignment and structure
>1sz9_A PCF11 protein; RNA polymerase II CTD interacting domain, arm repeats, transcription; 2.10A {Saccharomyces cerevisiae} SCOP: a.118.9.4 PDB: 1sza_A* 2bf0_X Length = 144 Back     alignment and structure
>4flb_A Regulation of nuclear PRE-mRNA domain-containing; structural genomics consortium, SGC, protein binding; 1.80A {Homo sapiens} Length = 132 Back     alignment and structure
>4flb_A Regulation of nuclear PRE-mRNA domain-containing; structural genomics consortium, SGC, protein binding; 1.80A {Homo sapiens} Length = 132 Back     alignment and structure
>2km4_A Regulator of TY1 transposition protein 103; CTD-interacting domain, RNA polymerase II binding protein, phosphoprotein; NMR {Saccharomyces cerevisiae} PDB: 2l0i_A* Length = 142 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query161
3d9j_A145 RNA-binding protein 16; SCAF8, RNA polymerase II C 99.97
3clj_A157 Protein NRD1; CTD-interacting domain, nucleus, pho 99.97
1sz9_A144 PCF11 protein; RNA polymerase II CTD interacting d 99.93
3clj_A157 Protein NRD1; CTD-interacting domain, nucleus, pho 99.79
4fu3_A135 Regulation of nuclear PRE-mRNA domain-containing 1 99.7
4flb_A132 Regulation of nuclear PRE-mRNA domain-containing; 99.69
3d9j_A145 RNA-binding protein 16; SCAF8, RNA polymerase II C 99.67
1sz9_A144 PCF11 protein; RNA polymerase II CTD interacting d 99.37
2km4_A142 Regulator of TY1 transposition protein 103; CTD-in 99.22
4fu3_A135 Regulation of nuclear PRE-mRNA domain-containing 1 98.66
4flb_A132 Regulation of nuclear PRE-mRNA domain-containing; 98.62
2km4_A142 Regulator of TY1 transposition protein 103; CTD-in 96.9
1vdy_A140 Hypothetical protein (RAFL09-17-B18); structural g 90.16
1x5b_A163 Signal transducing adaptor molecule 2; VHS domain, 86.02
1mhq_A148 ADP-ribosylation factor binding protein GGA2; supe 81.51
3ldz_A140 STAM-1, signal transducing adapter molecule 1; ubi 80.92
>3d9j_A RNA-binding protein 16; SCAF8, RNA polymerase II CTD interacting domain, arm repeats phospho-CTD, phosphoprotein, transcription; 1.60A {Homo sapiens} PDB: 3d9k_A* 3d9l_A* 3d9m_A* 3d9n_A* 3d9p_A* 3d9i_A 3d9o_A* 2diw_A Back     alignment and structure
Probab=99.97  E-value=1.1e-31  Score=208.00  Aligned_cols=105  Identities=57%  Similarity=0.968  Sum_probs=100.9

Q ss_pred             HHHhhHHHhhhhhcCCCCCcHHHHHHHHHHHHHhhHHHHHHHHHHHHHHhhcCCCCcchhHHHHHHHHHHHhhhhc-ccC
Q psy9417          56 VIGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIDAIIHQARLQFK-YKD  134 (161)
Q Consensus        56 vIDef~~eL~sL~~~k~~iSrskI~~LT~lA~enik~~k~IV~~Ie~~I~k~~p~~KL~~LYVIDSIvRn~~~~~~-~kd  134 (161)
                      ++++|++.|++|.+++|+|||++|++||.+|++|+++|++||+.++++|++|+|++|||+|||+||||||++++.+ ++|
T Consensus         3 ~v~~f~~~L~~L~~~k~~~S~~~I~~LT~~a~~~~~~~~~IV~~~~~~i~k~~~~~KL~~LYL~DsIvrn~~~k~~~~~~   82 (145)
T 3d9j_A            3 AVKTFNSELYSLNDYKPPISKAKMTQITKAAIKAIKFYKHVVQSVEKFIQKCKPEYKVPGLYVIDSIVRQSRHQFGQEKD   82 (145)
T ss_dssp             HHHHHHHHHHGGGGSCSSCCHHHHHHHHHHHHHTGGGHHHHHHHHHHHHHHSCGGGHHHHHHHHHHHHHHHHHHHCTTTC
T ss_pred             HHHHHHHHHHHHHhccCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccceeehhhhhHHHHHHhhhhcCCccc
Confidence            6899999999999999999999999999999999999999999999999999999999999999999999999987 578


Q ss_pred             ccHHhhHHHHHHHHhhhhcCCCcccC
Q psy9417         135 VYAPRFARNLYRTFGYVFQCPDEDRK  160 (161)
Q Consensus       135 ~Y~~rF~~nl~~tf~~~~~c~~~dk~  160 (161)
                      .|+++|+++|.++|.++|+|++++|.
T Consensus        83 ~f~~~F~~~L~~~f~~~~~~~~~~r~  108 (145)
T 3d9j_A           83 VFAPRFSNNIISTFQNLYRCPGDDKS  108 (145)
T ss_dssp             SHHHHHHHHHHHHHHHHTSSCGGGHH
T ss_pred             cHHHHHHHHHHHHHHHHHhCCHHHHH
Confidence            99999999999999999999987764



>3clj_A Protein NRD1; CTD-interacting domain, nucleus, phosphoprotein, RNA polymer binding protein, RNA binding protein; 2.10A {Saccharomyces cerevisiae} Back     alignment and structure
>1sz9_A PCF11 protein; RNA polymerase II CTD interacting domain, arm repeats, transcription; 2.10A {Saccharomyces cerevisiae} SCOP: a.118.9.4 PDB: 1sza_A* 2bf0_X Back     alignment and structure
>3clj_A Protein NRD1; CTD-interacting domain, nucleus, phosphoprotein, RNA polymer binding protein, RNA binding protein; 2.10A {Saccharomyces cerevisiae} Back     alignment and structure
>4fu3_A Regulation of nuclear PRE-mRNA domain-containing 1B; structural genomics consortium, SGC, domain swapping, transc; HET: MSE; 1.90A {Homo sapiens} PDB: 4fld_A* Back     alignment and structure
>4flb_A Regulation of nuclear PRE-mRNA domain-containing; structural genomics consortium, SGC, protein binding; 1.80A {Homo sapiens} Back     alignment and structure
>3d9j_A RNA-binding protein 16; SCAF8, RNA polymerase II CTD interacting domain, arm repeats phospho-CTD, phosphoprotein, transcription; 1.60A {Homo sapiens} PDB: 3d9k_A* 3d9l_A* 3d9m_A* 3d9n_A* 3d9p_A* 3d9i_A 3d9o_A* 2diw_A Back     alignment and structure
>1sz9_A PCF11 protein; RNA polymerase II CTD interacting domain, arm repeats, transcription; 2.10A {Saccharomyces cerevisiae} SCOP: a.118.9.4 PDB: 1sza_A* 2bf0_X Back     alignment and structure
>2km4_A Regulator of TY1 transposition protein 103; CTD-interacting domain, RNA polymerase II binding protein, phosphoprotein; NMR {Saccharomyces cerevisiae} PDB: 2l0i_A* Back     alignment and structure
>4fu3_A Regulation of nuclear PRE-mRNA domain-containing 1B; structural genomics consortium, SGC, domain swapping, transc; HET: MSE; 1.90A {Homo sapiens} PDB: 4fld_A* Back     alignment and structure
>4flb_A Regulation of nuclear PRE-mRNA domain-containing; structural genomics consortium, SGC, protein binding; 1.80A {Homo sapiens} Back     alignment and structure
>2km4_A Regulator of TY1 transposition protein 103; CTD-interacting domain, RNA polymerase II binding protein, phosphoprotein; NMR {Saccharomyces cerevisiae} PDB: 2l0i_A* Back     alignment and structure
>1vdy_A Hypothetical protein (RAFL09-17-B18); structural genomics, riken structural genomics/proteomics initiative, RSGI, unknown function; NMR {Arabidopsis thaliana} PDB: 2dcp_A Back     alignment and structure
>1x5b_A Signal transducing adaptor molecule 2; VHS domain, ubiquitin binding, STAM2, structural genomics, NPPSFA; NMR {Homo sapiens} PDB: 2l0t_B Back     alignment and structure
>1mhq_A ADP-ribosylation factor binding protein GGA2; super helix, protein transport; 2.20A {Homo sapiens} SCOP: a.118.9.2 Back     alignment and structure
>3ldz_A STAM-1, signal transducing adapter molecule 1; ubiquitin-binding, cytoplasm, UBL conjugation, endosome, membrane, protein transport, SH3 domain; 2.60A {Homo sapiens} SCOP: a.118.9.0 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 161
d1szaa_144 a.118.9.4 (A:) PCF11 protein {Baker's yeast (Sacch 6e-12
d1szaa_144 a.118.9.4 (A:) PCF11 protein {Baker's yeast (Sacch 8e-04
>d1szaa_ a.118.9.4 (A:) PCF11 protein {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Length = 144 Back     information, alignment and structure

class: All alpha proteins
fold: alpha-alpha superhelix
superfamily: ENTH/VHS domain
family: RPR domain (SMART 00582 )
domain: PCF11 protein
species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
 Score = 57.8 bits (139), Expect = 6e-12
 Identities = 26/106 (24%), Positives = 44/106 (41%), Gaps = 10/106 (9%)

Query: 56  VIGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPG 115
           ++ ++N  L  +     PI    IT + KLA +     +Y V ++E  I KC  + KL  
Sbjct: 8   IVKDFNSILEELTFNSRPI----ITTLTKLAEENISCAQYFVDAIESRIEKCMPKQKLYA 63

Query: 116 LYVIDAIIHQARLQFKYKDVYAPRFARNLYRTFGYVFQCPDEDRKK 161
            Y +D+I             Y   F+RNL+  +   +   D   + 
Sbjct: 64  FYALDSICKNVG------SPYTIYFSRNLFNLYKRTYLLVDNTTRT 103


>d1szaa_ a.118.9.4 (A:) PCF11 protein {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Length = 144 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query161
d1szaa_144 PCF11 protein {Baker's yeast (Saccharomyces cerevi 99.84
d1szaa_144 PCF11 protein {Baker's yeast (Saccharomyces cerevi 98.92
d1dvpa1145 Hrs {Fruit fly (Drosophila melanogaster) [TaxId: 7 81.72
>d1szaa_ a.118.9.4 (A:) PCF11 protein {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
class: All alpha proteins
fold: alpha-alpha superhelix
superfamily: ENTH/VHS domain
family: RPR domain (SMART 00582 )
domain: PCF11 protein
species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=99.84  E-value=1.2e-21  Score=148.96  Aligned_cols=94  Identities=27%  Similarity=0.405  Sum_probs=86.9

Q ss_pred             hHHHHHhhHHHhhhhhcCCCCCcHHHHHHHHHHHHHhhHHHHHHHHHHHHHHhhcCCCCcchhHHHHHHHHHHHhhhhcc
Q psy9417          53 GLYVIGEWNQELLSINSTKPPISKAKITGIVKLALKATHYYKYVVQSVEKFIWKCKSQYKLPGLYVIDAIIHQARLQFKY  132 (161)
Q Consensus        53 ~lyvIDef~~eL~sL~~~k~~iSrskI~~LT~lA~enik~~k~IV~~Ie~~I~k~~p~~KL~~LYVIDSIvRn~~~~~~~  132 (161)
                      +-.++++|+..|++|++    +||++|++||.+|++|.+++++||++|++++++|+|++||+.||++|||++|++++   
T Consensus         5 ~~~~~~~f~~~L~~L~~----ns~~~I~~Lt~~a~~~~~~a~~Iv~~i~~~i~~~~~~~KL~~LYLiddI~~n~~~~---   77 (144)
T d1szaa_           5 TEVIVKDFNSILEELTF----NSRPIITTLTKLAEENISCAQYFVDAIESRIEKCMPKQKLYAFYALDSICKNVGSP---   77 (144)
T ss_dssp             HHHHHHHHHHHHTTCSS----CCHHHHHHHHHHHHHCGGGHHHHHHHHHHHHHHSCHHHHHHHHHHHHHHHHHTCTT---
T ss_pred             HHHHHHHHHHHHHHHhh----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccchhhhhhHHHHHHHhHHH---
Confidence            34688999999999964    79999999999999999999999999999999999999999999999999999876   


Q ss_pred             cCccHHhhHHHHHHHHhhhhcCCC
Q psy9417         133 KDVYAPRFARNLYRTFGYVFQCPD  156 (161)
Q Consensus       133 kd~Y~~rF~~nl~~tf~~~~~c~~  156 (161)
                         |.+.|++.|.++|.++|.+.+
T Consensus        78 ---y~~~f~~~l~~~f~~~y~~~~   98 (144)
T d1szaa_          78 ---YTIYFSRNLFNLYKRTYLLVD   98 (144)
T ss_dssp             ---HHHHHHTTHHHHHHHHHTTSC
T ss_pred             ---HHHHHHHHHHHHHHHHHHcCC
Confidence               779999999999999996544



>d1szaa_ a.118.9.4 (A:) PCF11 protein {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1dvpa1 a.118.9.2 (A:1-145) Hrs {Fruit fly (Drosophila melanogaster) [TaxId: 7227]} Back     information, alignment and structure