Citrus Sinensis ID: 020866


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320
MTEEKEEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGTNAKNG
ccHHHHHHHccccccccccccccccccccccccccccccEEEEEcccccHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEccccEEEEEccccccccHHHHHHHHHHHHHccccccccEEEEEEEccEEEcccHHHHHHHHHHHHHHHHHHHHcccccccEEEEEEEEccccccccccccccccccccccccHHHHHHHHHHHHHHcccccEEEcccccEEEEEEEEEcccccccccEEEEEEccccccccccccHHHHHccccccccc
ccHHHHHHHcccHHHHHHHEcccccccccccHHHHcccccEEEEccccccccHHHHHHHHHHHHccccccccccHHcccccHcccHHcccccccccccccccccccccccccccEcccccccccEEEEEEEcccEEEEEEccccccccHHHHHHHHHHHHcccccccccHHHcccccHHHccccHHHHHHHHHHHHHHHHHcccccccccEEEEEEEcccccccccccccccccccccHHHccHHHHHHHHHHHHHHHHHHcEEccccccEEEEEEEccccccccccEEEEEEEccHHHHcccccEEEEEcccccccccc
mteekeeeelkpweqhsrvisiprfdynspssllqrshsgflLTCTIKREKSATKEAISILHKRlessntagdskrrkvctddmggkcadgaeinsieedsagglqknechssvktdtntetdfdMSLVKLTRNGLllltfprehspntiNIVSNIFQslgsgslkspvwchrifpiqatCVLKEKELQATVSKLVLQFVNDeqnklsrpvkfavgynrrgfeekqnkipkdtkdsdvsalldrnkCFTIVAAAVKEVVsdsavdlkspelSVLVEllpisglpseLLVVGVsilpqklvttkprlSIRALVSGTNAKNG
mteekeeeelkpweqhsrvisiPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILhkrlessntagdskrrkvctDDMGGKCADGAEINSIeedsagglqknechssvktdtntetdfdmsLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFvndeqnklsrpvkfavgynrrgfeekqnkipkdtkdsdvsaLLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILpqklvttkprlsiralvsgtnakng
MTeekeeeeLKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGTNAKNG
*************************************HSGFLLTCTIK***************************************************************************FDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNR********************ALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRAL*********
***********************************RSHSGFLLTCTIKREKSATKEAISILHKRLESSNT*******************************************************MSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVND******R**KFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSI************
**************QHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLES***************DMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGTNAKNG
********ELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLES******************************************************TDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEK*********DSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVS*******
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MTEEKEEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGTNAKNG
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query320
255561747336 conserved hypothetical protein [Ricinus 0.937 0.892 0.627 1e-105
225424542337 PREDICTED: uncharacterized protein LOC10 0.993 0.943 0.6 1e-104
356575470320 PREDICTED: uncharacterized protein LOC10 0.943 0.943 0.583 4e-96
42561854348 uncharacterized protein [Arabidopsis tha 0.975 0.896 0.564 1e-90
24030215373 unknown protein [Arabidopsis thaliana] 0.975 0.836 0.564 2e-90
388515009369 unknown [Lotus japonicus] 0.95 0.823 0.554 3e-90
297849234348 hypothetical protein ARALYDRAFT_471019 [ 0.965 0.887 0.556 8e-88
449487893345 PREDICTED: uncharacterized protein LOC10 0.937 0.869 0.562 3e-87
283132369336 hypothetical protein [Lotus japonicus] 0.884 0.842 0.546 2e-83
357136974359 PREDICTED: uncharacterized protein LOC10 0.959 0.855 0.504 4e-80
>gi|255561747|ref|XP_002521883.1| conserved hypothetical protein [Ricinus communis] gi|223538921|gb|EEF40519.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 199/317 (62%), Positives = 237/317 (74%), Gaps = 17/317 (5%)

Query: 10  LKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRL---- 65
           +KPWEQH+ +ISIPRFDYN+PS+LL  SHSGFL+TC+IKREKSATKE +SIL K +    
Sbjct: 29  MKPWEQHAAIISIPRFDYNAPSALLHNSHSGFLITCSIKREKSATKEVMSILEKYIGSYT 88

Query: 66  -ESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDF 124
            +SSN +   KRRK     MGG CA G E   + ED            S +T T  ET F
Sbjct: 89  KDSSNGSQGIKRRKTL---MGGTCAQGMESKDVSEDPD--------QVSEETHTVEETGF 137

Query: 125 DMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLK 184
            +SLVKLTR+GLLLL F  E+SP+   IVSNIFQ + SGSLKSP+WCHRIFPIQATC L 
Sbjct: 138 TLSLVKLTRSGLLLLNFVGENSPDATEIVSNIFQRIESGSLKSPLWCHRIFPIQATCCLD 197

Query: 185 EKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDT-KDSDVSALLD 243
           EKEL+  VSKLVL+F+ND+ NK  RP+K+AVGYNRRG EE Q K  KDT KDS + +LLD
Sbjct: 198 EKELRTVVSKLVLRFINDKANKFERPIKYAVGYNRRGIEETQAKNVKDTSKDSALCSLLD 257

Query: 244 RNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTK 303
           RNKCF +VA+AVK+V+SDSAVDLKSPELS+LVELLP+SG+P+  LV  VS+LPQ LV+ K
Sbjct: 258 RNKCFDVVASAVKDVISDSAVDLKSPELSILVELLPLSGVPNGSLVAAVSVLPQNLVSVK 317

Query: 304 PRLSIRALVSGTNAKNG 320
           PRL I+ LVS  NAK G
Sbjct: 318 PRLCIKPLVSDANAKKG 334




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225424542|ref|XP_002285300.1| PREDICTED: uncharacterized protein LOC100267955 [Vitis vinifera] gi|296081413|emb|CBI16846.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356575470|ref|XP_003555863.1| PREDICTED: uncharacterized protein LOC100797260 [Glycine max] Back     alignment and taxonomy information
>gi|42561854|ref|NP_172400.2| uncharacterized protein [Arabidopsis thaliana] gi|14334964|gb|AAK59659.1| unknown protein [Arabidopsis thaliana] gi|332190304|gb|AEE28425.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|24030215|gb|AAN41286.1| unknown protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|388515009|gb|AFK45566.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|297849234|ref|XP_002892498.1| hypothetical protein ARALYDRAFT_471019 [Arabidopsis lyrata subsp. lyrata] gi|297338340|gb|EFH68757.1| hypothetical protein ARALYDRAFT_471019 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|449487893|ref|XP_004157853.1| PREDICTED: uncharacterized protein LOC101227649 [Cucumis sativus] Back     alignment and taxonomy information
>gi|283132369|dbj|BAI63590.1| hypothetical protein [Lotus japonicus] Back     alignment and taxonomy information
>gi|357136974|ref|XP_003570077.1| PREDICTED: uncharacterized protein LOC100839515 isoform 1 [Brachypodium distachyon] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query320
TAIR|locus:2203033348 AT1G09290 "AT1G09290" [Arabido 0.959 0.882 0.567 4.4e-84
TAIR|locus:2203033 AT1G09290 "AT1G09290" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 842 (301.5 bits), Expect = 4.4e-84, P = 4.4e-84
 Identities = 181/319 (56%), Positives = 221/319 (69%)

Query:    10 LKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKRLESSN 69
             L PWEQHS +ISIPRFDY +PSSLL  SHSGFL+TC IKREKSATKE +SIL K + S +
Sbjct:    32 LTPWEQHSSIISIPRFDYKAPSSLLHHSHSGFLVTCNIKREKSATKEVMSILGKYIGSMH 91

Query:    70 -------TAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTET 122
                     +  SK++KVC  +       G +   +E D+     +N     +K   N E 
Sbjct:    92 EEKPEVLNSTASKKQKVCAQETE---EGGEKTVPLENDALQETGENPNVEDLKL-ANEEH 147

Query:   123 DFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCV 182
             +  MSLVKLT++GLLL TFP E+SPNT NIVS +FQS+ SG+LK+P+WCHRIFP+QATC 
Sbjct:   148 NSLMSLVKLTKSGLLLFTFPVENSPNTTNIVSRVFQSMESGALKAPIWCHRIFPVQATCG 207

Query:   183 LKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSA-L 241
             L EKEL+ TVSKLV +FVND+ N LS+PVKFA GY RRG EE + KI KD  D  V   L
Sbjct:   208 LTEKELRETVSKLVQRFVNDKDNTLSKPVKFAAGYQRRGAEETKGKIRKDASDVLVQCPL 267

Query:   242 LDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVT 301
             LDR KCF  VAA VK++V DS VDLKSPEL VLVELLP+S + S   V  VS+LP +LV+
Sbjct:   268 LDRIKCFETVAAGVKDIVPDSVVDLKSPELCVLVELLPLSRISSGSFVAAVSVLPHRLVS 327

Query:   302 TKPRLSIRALVSGTNAKNG 320
             TKP+L I+ LV  +  K G
Sbjct:   328 TKPKLCIKPLVPESKHKKG 346


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.314   0.130   0.366    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      320       313   0.00081  116 3  11 23  0.44    34
                                                     33  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  598 (64 KB)
  Total size of DFA:  195 KB (2111 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  27.13u 0.14s 27.27t   Elapsed:  00:00:01
  Total cpu time:  27.13u 0.14s 27.27t   Elapsed:  00:00:01
  Start:  Fri May 10 13:06:47 2013   End:  Fri May 10 13:06:48 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query320
cd11717158 cd11717, THUMP_THUMPD1_like, THUMP domain-containi 5e-13
COG1818175 COG1818, COG1818, Predicted RNA-binding protein, c 0.003
>gnl|CDD|212586 cd11717, THUMP_THUMPD1_like, THUMP domain-containing protein 1-like Back     alignment and domain information
 Score = 65.7 bits (161), Expect = 5e-13
 Identities = 35/168 (20%), Positives = 64/168 (38%), Gaps = 34/168 (20%)

Query: 129 VKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKEL 188
           V+     ++ +       P  + +V  IF+   S   K   +  R+ PI  TC    +E+
Sbjct: 25  VETGVKNVVFIKTRPPVDP--VELVEKIFEDAASTKKKRTRFIQRLIPIDVTCKASLEEI 82

Query: 189 QATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCF 248
           +    +L+ +     +     P  FAV    R      NK+ +D                
Sbjct: 83  EKLAKELLKKHFPTAE----PPKTFAVECKSRNN----NKLSRD---------------- 118

Query: 249 TIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILP 296
            ++ A  + V     VDLK+P+  +LVE++          V G+S++ 
Sbjct: 119 EVIKAVAELVPEIHKVDLKNPDKVILVEVIK--------NVCGISVVK 158


This family contains THUMP domain-only proteins including THUMP domain-containing protein 1 and Saccharomyces cerevisiae Tan1. Tan1 is non essential and has been shown to be required for the formation of the modified nucleoside N(4)-acetylcytidine (ac(4)C) in tRNA. To date, there is no functional information available about THUMPD1. The THUMP domain is named after thiouridine synthases, methylases and PSUSs. The domain consists of about 110 amino acid residues. It is predicted to be an RNA-binding domain and probably functions by delivering a variety of RNA modification enzymes to their targets. Length = 158

>gnl|CDD|224731 COG1818, COG1818, Predicted RNA-binding protein, contains THUMP domain [General function prediction only] Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 320
KOG3943291 consensus THUMP domain-containing proteins [Genera 100.0
PF02926144 THUMP: THUMP domain; InterPro: IPR004114 The THUMP 99.58
COG1818175 Predicted RNA-binding protein, contains THUMP doma 99.49
PRK01565 394 thiamine biosynthesis protein ThiI; Provisional 98.47
PRK01269 482 tRNA s(4)U8 sulfurtransferase; Provisional 98.22
PRK08384 381 thiamine biosynthesis protein ThiI; Provisional 98.18
COG0301 383 ThiI Thiamine biosynthesis ATP pyrophosphatase [Co 97.91
TIGR00342 371 thiazole biosynthesis/tRNA modification protein Th 97.8
TIGR01177 329 conserved hypothetical protein TIGR01177. This fam 93.34
COG0116 381 Predicted N6-adenine-specific DNA methylase [DNA r 90.11
PRK11783 702 rlmL 23S rRNA m(2)G2445 methyltransferase; Provisi 87.74
>KOG3943 consensus THUMP domain-containing proteins [General function prediction only] Back     alignment and domain information
Probab=100.00  E-value=8.5e-47  Score=340.65  Aligned_cols=239  Identities=20%  Similarity=0.311  Sum_probs=187.0

Q ss_pred             CCCCCCCceEEEEcCCChhhHHHHHHHHHHHHhhhcc--CCCCCcccccccccCCCCCCCCCCcccChhhhhhhHHhhh-
Q 020866           32 SLLQRSHSGFLLTCTIKREKSATKEAISILHKRLESS--NTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKN-  108 (320)
Q Consensus        32 ~~l~~g~~G~lvTC~~~rEk~A~~E~~~lL~e~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ie~~~~i~kEl~-  108 (320)
                      ..|+||.+|||+||+ ++||||++|||.||+|.....  +++....+++.   +.|.|+.+.+    .|.+ +-..|+. 
T Consensus        37 ~~lepG~~Gi~~tC~-~HErQA~QEaysl~~ElygD~~~G~~~~~~~E~a---p~gsE~~~~~----aeaE-lkdEe~gd  107 (291)
T KOG3943|consen   37 RQLEPGLQGILITCN-MHERQAVQEAYSLLNELYGDDMYGPEKFTDKEQA---PSGSEGEDDD----AEAE-LKDEEVGD  107 (291)
T ss_pred             cccCCCcceEEEEcc-chHHHHHHHHHHHHHHHhcCcccCchhccCcccC---CCCCcCCCCh----HHHh-hhhhhhHH
Confidence            379999999999999 599999999999999986554  33321111100   1111111111    1111 0001111 


Q ss_pred             hcccCCCCCCCCCCccccceeecCCcceEEEEccCCCCCCHHHHHHHHHHHHcC--CCCCCCceEEeeeeeeeeeecCHH
Q 020866          109 ECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGS--GSLKSPVWCHRIFPIQATCVLKEK  186 (320)
Q Consensus       109 elk~~~kr~~~~~~~~~f~~v~~~~~gv~fi~f~~~~~~dP~~lv~~I~~d~~~--~~~~~tR~i~RliPI~~tc~a~~~  186 (320)
                      +|++..     .++...|++|+++|+|+.||+  +..++-|.++|+.|++++.+  ++.++|||+|||+||+.||.|.++
T Consensus       108 dLk~~~-----~~~~~P~~Fv~~~~~Cv~f~~--t~KniVpe~~v~~i~~dm~elk~k~kRtR~~Qr~~Pi~~tc~a~le  180 (291)
T KOG3943|consen  108 DLKAST-----EMRLRPFQFVESGANCVVFIR--TLKNIVPEKLVHHILQDMYELKTKKKRTRVIQRMLPISGTCKAFLE  180 (291)
T ss_pred             HHhhhh-----hhhcCchhhhhccCceEEeec--ccCccCchHHHHHHHHHHHhhhccchhhhhhhhhccccchHHhhHH
Confidence            233222     111222999999999999999  88899999999999999965  557899999999999999999999


Q ss_pred             HHHHHHHHHHHhhcccccccCCCCceEEEEEcccCccccccCCCCCCCCcccccCCChhHHHHHHHHhhhhcCCCceeeC
Q 020866          187 ELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDL  266 (320)
Q Consensus       187 ~i~~~~~~ll~~~F~~~~~~~~~~~tfaI~~~~Rn~~e~~~~~~~~~~~~~~~~~l~R~~vI~~VA~~V~~~~~~hkVdL  266 (320)
                      .|.+++..+|.+||+..  ...+..||+|+|++||++                 ++.|++||++|+.+|.-.++..+|||
T Consensus       181 ~m~k~a~~VI~p~fkap--~tgK~~tf~VE~k~RNn~-----------------~v~r~~vi~~V~~~Vc~l~se~~VdL  241 (291)
T KOG3943|consen  181 DMKKYAETVIEPWFKAP--NTGKKGTFQVEYKSRNNS-----------------HVNREEVIREVAGIVCTLNSENKVDL  241 (291)
T ss_pred             HHHHHHHHhhcccccCC--CCCcCceEEEEEEecccc-----------------chhHHHHHHHHHHHHHhcCccceeec
Confidence            99999999999999863  335888999999999987                 99999999999999998888999999


Q ss_pred             CCCCeEEEEEEeeCCCCCCCceEEEEEeeCCCccccccceeeeeccCCC
Q 020866          267 KSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGT  315 (320)
Q Consensus       267 knPd~vIlVEV~k~~~~~~~~~vcgiSVv~~y~~~~K~~l~~~~l~~~~  315 (320)
                      +|||++|+|||++  .      +.|||||++|+-++||||  ++++.-+
T Consensus       242 ~n~D~t~~Ve~~k--s------~I~~~Vv~dy~~~RKYnl--q~~~~s~  280 (291)
T KOG3943|consen  242 TNPDYTVVVEIIK--A------VICLSVVKDYMLFRKYNL--QEVVKSP  280 (291)
T ss_pred             cCCCeEEEEEeee--c------eeeeeehHHHHHHHHhhH--HHhhccc
Confidence            9999999999999  7      899999999999999997  6665433



>PF02926 THUMP: THUMP domain; InterPro: IPR004114 The THUMP domain is shared by 4-thiouridine, pseudouridine synthases and RNA methylases[] and is probably an RNA-binding domain that adopts an alpha/beta fold similar to that found in the C-terminal domain of translation initiation factor 3 and ribosomal protein S8 Back     alignment and domain information
>COG1818 Predicted RNA-binding protein, contains THUMP domain [General function prediction only] Back     alignment and domain information
>PRK01565 thiamine biosynthesis protein ThiI; Provisional Back     alignment and domain information
>PRK01269 tRNA s(4)U8 sulfurtransferase; Provisional Back     alignment and domain information
>PRK08384 thiamine biosynthesis protein ThiI; Provisional Back     alignment and domain information
>COG0301 ThiI Thiamine biosynthesis ATP pyrophosphatase [Coenzyme metabolism] Back     alignment and domain information
>TIGR00342 thiazole biosynthesis/tRNA modification protein ThiI Back     alignment and domain information
>TIGR01177 conserved hypothetical protein TIGR01177 Back     alignment and domain information
>COG0116 Predicted N6-adenine-specific DNA methylase [DNA replication, recombination, and repair] Back     alignment and domain information
>PRK11783 rlmL 23S rRNA m(2)G2445 methyltransferase; Provisional Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query320
2dir_A98 Thump domain-containing protein 1; structural geno 1e-09
>2dir_A Thump domain-containing protein 1; structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} SCOP: d.308.1.3 Length = 98 Back     alignment and structure
 Score = 53.7 bits (129), Expect = 1e-09
 Identities = 16/119 (13%), Positives = 41/119 (34%), Gaps = 30/119 (25%)

Query: 179 ATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDV 238
           ++     ++++      +  +             F + Y  R                  
Sbjct: 5   SSGKAFLEDMKKYAETFLEPWFKAPN-----KGTFQIVYKSRNNSH-------------- 45

Query: 239 SALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQ 297
              ++R +    +A  V  + S++ VDL +P+ +V+VE++          V  +S++  
Sbjct: 46  ---VNREEVIRELAGIVCTLNSENKVDLTNPQYTVVVEIIKA--------VCCLSVVKS 93


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query320
2dir_A98 Thump domain-containing protein 1; structural geno 99.77
1vbk_A 307 Hypothetical protein PH1313; structural genomics, 98.38
2c5s_A 413 THII, probable thiamine biosynthesis protein THII; 98.23
3tma_A 354 Methyltransferase; thump domain; 2.05A {Thermus th 95.96
3k0b_A 393 Predicted N6-adenine-specific DNA methylase; methy 95.69
3ldg_A 384 Putative uncharacterized protein SMU.472; YPSC, me 94.88
3v97_A 703 Ribosomal RNA large subunit methyltransferase L; Y 94.86
3tm4_A 373 TRNA (guanine N2-)-methyltransferase TRM14; rossma 93.97
3ldu_A 385 Putative methylase; structural genomics, PSI-2, pr 93.31
>2dir_A Thump domain-containing protein 1; structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} SCOP: d.308.1.3 Back     alignment and structure
Probab=99.77  E-value=1.6e-18  Score=138.43  Aligned_cols=94  Identities=17%  Similarity=0.289  Sum_probs=85.3

Q ss_pred             eeeeecCHHHHHHHHHHHHHhhcccccccCCCCceEEEEEcccCccccccCCCCCCCCcccccCCChhHHHHHHHHhhhh
Q 020866          178 QATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKE  257 (320)
Q Consensus       178 ~~tc~a~~~~i~~~~~~ll~~~F~~~~~~~~~~~tfaI~~~~Rn~~e~~~~~~~~~~~~~~~~~l~R~~vI~~VA~~V~~  257 (320)
                      ..+|.+++++|.+.|..++.++|+.     .++.||+|.+++|+++                 .|++++|+..|+++|.+
T Consensus         4 ~~~~~~~~e~I~~~a~~~~~~~f~~-----~~~~TF~V~~kR~~k~-----------------~~~S~ei~~~vG~~i~~   61 (98)
T 2dir_A            4 GSSGKAFLEDMKKYAETFLEPWFKA-----PNKGTFQIVYKSRNNS-----------------HVNREEVIRELAGIVCT   61 (98)
T ss_dssp             CCCCCHHHHHHHHHHHHHHHHHHST-----TCCCEEEEEEECSSCC-----------------SSCHHHHHHHHHHHHHH
T ss_pred             eEEEECCHHHHHHHHHHHHHHhcCc-----CCCceEEEEEEeCCCC-----------------CCCHHHHHHHHHHHHHh
Confidence            4689999999999999999999863     2457999999999976                 89999999999999987


Q ss_pred             cCCCceeeCCCCCeEEEEEEeeCCCCCCCceEEEEEeeCCCccc
Q 020866          258 VVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVT  301 (320)
Q Consensus       258 ~~~~hkVdLknPd~vIlVEV~k~~~~~~~~~vcgiSVv~~y~~~  301 (320)
                      ..++++|||+|||++|.|||.+  +      .|||||+++|+..
T Consensus        62 ~~~~~kVdL~nPd~~I~VEI~~--~------~~~isv~~~y~~~   97 (98)
T 2dir_A           62 LNSENKVDLTNPQYTVVVEIIK--A------VCCLSVVKSGPSS   97 (98)
T ss_dssp             HCTTCEECSSSCSEEEEEEEET--T------EEEEEEEECCCTT
T ss_pred             hCCCCeeEcCCCCEEEEEEEeC--C------EEEEEEccccccC
Confidence            6678999999999999999999  8      9999999999753



>1vbk_A Hypothetical protein PH1313; structural genomics, riken structural genomics/proteomics initiative, RSGI, NPPSFA; 1.90A {Pyrococcus horikoshii} SCOP: c.26.2.6 d.308.1.1 Back     alignment and structure
>2c5s_A THII, probable thiamine biosynthesis protein THII; RNA-binding protein, RNA binding protein, tRNA modification, 4-thiouridine synthase; HET: AMP; 2.5A {Bacillus anthracis} SCOP: c.26.2.6 d.308.1.1 Back     alignment and structure
>3tma_A Methyltransferase; thump domain; 2.05A {Thermus thermophilus} Back     alignment and structure
>3k0b_A Predicted N6-adenine-specific DNA methylase; methylase,PF01170, putative RNA methylase, PSI,MCSG, structu genomics; 1.50A {Listeria monocytogenes str} Back     alignment and structure
>3ldg_A Putative uncharacterized protein SMU.472; YPSC, methyltransferase, transferase; HET: SAH; 1.96A {Streptococcus mutans} Back     alignment and structure
>3v97_A Ribosomal RNA large subunit methyltransferase L; YCBY, RNA methyltransferase, ribosome RNA, SAH, RLML; HET: SAH OSU; 2.20A {Escherichia coli} PDB: 3v8v_A* Back     alignment and structure
>3tm4_A TRNA (guanine N2-)-methyltransferase TRM14; rossmann fold, thump domain, tRNA methyltransferase; HET: SAM; 1.95A {Pyrococcus furiosus} PDB: 3tlj_A* 3tm5_A* Back     alignment and structure
>3ldu_A Putative methylase; structural genomics, PSI-2, protein structure initiative, midwest center for structural genomics, MCSG; HET: MSE GTP; 1.70A {Clostridium difficile} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 320
d2dira185 d.308.1.3 (A:8-92) THUMP domain-containing protein 6e-04
>d2dira1 d.308.1.3 (A:8-92) THUMP domain-containing protein 1 {Human (Homo sapiens) [TaxId: 9606]} Length = 85 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: THUMP domain
superfamily: THUMP domain-like
family: Minimal THUMP
domain: THUMP domain-containing protein 1
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 35.9 bits (83), Expect = 6e-04
 Identities = 16/110 (14%), Positives = 39/110 (35%), Gaps = 30/110 (27%)

Query: 186 KELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRN 245
           ++++      +  +             F + Y  R                     ++R 
Sbjct: 5   EDMKKYAETFLEPWFKAPNKG-----TFQIVYKSRNNSH-----------------VNRE 42

Query: 246 KCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSIL 295
           +    +A  V  + S++ VDL +P+ +V+VE++          V  +S++
Sbjct: 43  EVIRELAGIVCTLNSENKVDLTNPQYTVVVEIIKA--------VCCLSVV 84


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query320
d2dira185 THUMP domain-containing protein 1 {Human (Homo sap 99.84
d1vbka2175 Hypothetical protein PH1313, N-terminal domain {Ar 98.91
d2c5sa2171 Thiamine biosynthesis protein ThiI, N-terminal dom 98.75
>d2dira1 d.308.1.3 (A:8-92) THUMP domain-containing protein 1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: THUMP domain
superfamily: THUMP domain-like
family: Minimal THUMP
domain: THUMP domain-containing protein 1
species: Human (Homo sapiens) [TaxId: 9606]
Probab=99.84  E-value=4.9e-21  Score=147.92  Aligned_cols=85  Identities=19%  Similarity=0.331  Sum_probs=79.1

Q ss_pred             ecCHHHHHHHHHHHHHhhcccccccCCCCceEEEEEcccCccccccCCCCCCCCcccccCCChhHHHHHHHHhhhhcCCC
Q 020866          182 VLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSD  261 (320)
Q Consensus       182 ~a~~~~i~~~~~~ll~~~F~~~~~~~~~~~tfaI~~~~Rn~~e~~~~~~~~~~~~~~~~~l~R~~vI~~VA~~V~~~~~~  261 (320)
                      +|++++|.+.|..|+.+||+.     .++.||+|.+++|+++                 .|+|++|+..||++|.+..++
T Consensus         1 ~A~ledi~k~a~~l~~~~~~~-----~~~~TF~V~~krr~~~-----------------~l~s~ev~~~vg~~v~~~~~~   58 (85)
T d2dira1           1 KAFLEDMKKYAETFLEPWFKA-----PNKGTFQIVYKSRNNS-----------------HVNREEVIRELAGIVCTLNSE   58 (85)
T ss_dssp             CHHHHHHHHHHHHHHHHHHST-----TCCCEEEEEEECSSCC-----------------SSCHHHHHHHHHHHHHHHCTT
T ss_pred             CCCHHHHHHHHHHHHHHHhcc-----CCCceEEEEEEeCCCC-----------------CCCHHHHHHHHHHHHHhhCCC
Confidence            478999999999999999974     4688999999999976                 999999999999999877789


Q ss_pred             ceeeCCCCCeEEEEEEeeCCCCCCCceEEEEEeeC
Q 020866          262 SAVDLKSPELSVLVELLPISGLPSELLVVGVSILP  296 (320)
Q Consensus       262 hkVdLknPd~vIlVEV~k~~~~~~~~~vcgiSVv~  296 (320)
                      |+|||+|||++|+|||++  +      +||||||+
T Consensus        59 ~~VdL~~Pd~~I~Vev~~--~------~~~iSvv~   85 (85)
T d2dira1          59 NKVDLTNPQYTVVVEIIK--A------VCCLSVVK   85 (85)
T ss_dssp             CEECSSSCSEEEEEEEET--T------EEEEEEEE
T ss_pred             ceeeccCCCEEEEEEEeC--C------EEEEEEeC
Confidence            999999999999999999  9      99999984



>d1vbka2 d.308.1.1 (A:1-175) Hypothetical protein PH1313, N-terminal domain {Archaeon Pyrococcus horikoshii [TaxId: 53953]} Back     information, alignment and structure
>d2c5sa2 d.308.1.1 (A:3-173) Thiamine biosynthesis protein ThiI, N-terminal domain {Bacillus anthracis [TaxId: 1392]} Back     information, alignment and structure