Citrus Sinensis ID: 020116


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-
MTEEKEEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKYVGHNNIENSQRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGTNAKNG
ccHHHHHHHccccccccccccccccccccccccccccccEEEEEccccccccHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEcccEEEEEccccccccHHHHHHHHHHHHccccccccEEEEEEccccccccccHHHHHHHHHHHHHHHHHHHHccccccEEEEEEEEcccccccccccccccccccccccccHHHHHHHHHHHHHHHccccccccccccEEEEEEEEEcccccccccEEEEEEccccccccccccEEEHHccccccccc
ccHHHHHHHcccHHHHHHHEcccccccccccHHHHcccccEEEEccccccccHHHHHHHHHHHHHcccccccccccccccccHHHHHcEEHHHccccHcccccccccccccccccccccccccccccccccccccEEEEEEEcccEEEEEEccccccccHHHHHHHHHHHHcccccccccHHHcccccHHHccccHHHHHHHHHHHHHHHHHcccccccccEEEEEEEcccccccccccccccccccccHHHccHHHHHHHHHHHHHHHHHHcEEccccccEEEEEEEccccccccccEEEEEEEccHHHHcccccEEEEEcccccccccc
mteekeeeelkpweqhsrvisiprfdynspssllqrshsgflLTCTIKREKSATKEAISILHKYVghnniensqrlessntagdskrrkvctddmggkcadgaeinsieedsagglqknechssvktdtntetdfdMSLVKLTRNGLllltfprehspntiNIVSNIFQslgsgslkspvwchrifpiqatCVLKEKELQATVSKLVLQFVNDeqnklsrpvkfavgynrrgfeekqnkipkdtkdsdvsalldrnkCFTIVAAAVKEVVsdsavdlkspelSVLVEllpisglpseLLVVGVsilpqklvttkprlSIRALVSGTNAKNG
mteekeeeelkpweqhsrvisiPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKYvghnniensqrlessntagdskrrkvCTDDMGGKCADGAEINSIeedsagglqknechssvktdtntetdfdmsLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFvndeqnklsrpvkfavgynrrgfeekqnkipkdtkdsdvsaLLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILpqklvttkprlsiralvsgtnakng
MTeekeeeeLKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKYVGHNNIENSQRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGTNAKNG
************************F***********SHSGFLLTCTIKREKSATKEAISILHKYVGHN******************************************************************FDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNR********************ALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRAL*********
****************SRVISIPRFDYNSP**LLQRSHSGFLLTCTIKREKSATKEAISILHKYVGH**********************************************************************SLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVND**********FAVGYNRRGFE***********DSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIR***********
**************QHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKYVGHNNIENSQ*******************DMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGTNAKNG
********ELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKYVGHNN************************DM*************************************TDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEK*********DSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVS*******
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MTEEKEEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKYVGHNNIENSQRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGTNAKNG
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query331
225424542337 PREDICTED: uncharacterized protein LOC10 0.993 0.976 0.614 1e-107
255561747336 conserved hypothetical protein [Ricinus 0.921 0.907 0.625 1e-106
356575470320 PREDICTED: uncharacterized protein LOC10 0.939 0.971 0.579 1e-97
42561854348 uncharacterized protein [Arabidopsis tha 0.963 0.916 0.570 1e-93
24030215373 unknown protein [Arabidopsis thaliana] 0.963 0.855 0.570 2e-93
388515009369 unknown [Lotus japonicus] 0.951 0.853 0.557 5e-92
297849234348 hypothetical protein ARALYDRAFT_471019 [ 0.954 0.908 0.562 2e-91
449487893345 PREDICTED: uncharacterized protein LOC10 0.939 0.901 0.577 5e-91
357136974359 PREDICTED: uncharacterized protein LOC10 0.960 0.885 0.517 1e-83
283132369336 hypothetical protein [Lotus japonicus] 0.854 0.842 0.527 8e-82
>gi|225424542|ref|XP_002285300.1| PREDICTED: uncharacterized protein LOC100267955 [Vitis vinifera] gi|296081413|emb|CBI16846.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 206/335 (61%), Positives = 253/335 (75%), Gaps = 6/335 (1%)

Query: 1   MTEEKEEEELKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISI 60
           M+EE+ EE +KPWEQHS VISIPRFDYN+PSSLL  SHSGFL+TCTIKREKSATKEA+ I
Sbjct: 1   MSEEEREEGMKPWEQHSAVISIPRFDYNAPSSLLDHSHSGFLVTCTIKREKSATKEAMPI 60

Query: 61  LHKYVGHNNIENSQRLESSNTAGDSKRRKVCTDDMGGKCADGAE----INSIEEDSAGGL 116
           L KYVG  +  +S+ LESS+    +KRRK+CT+++  +C +  E     N+  ED  G L
Sbjct: 61  LEKYVGSFSSCSSESLESSDANATTKRRKICTEEIDEECVNSVENKTASNNCGED-GGEL 119

Query: 117 QKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSL 176
            K+   SS   D   E    +SLVKLTR+GLLL  FPR +S +T+++VS I +SL SGS+
Sbjct: 120 SKDAGVSSANRDAIVENGHVLSLVKLTRSGLLLFVFPRNNSVDTVDVVSQIIRSLQSGSV 179

Query: 177 KSPVWCHRIFPIQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEK 236
           K P+WCHRIFPIQATC L EKEL   V+KLV+QFVN+EQNK +RP+KFAVGYNRRG EE 
Sbjct: 180 KPPLWCHRIFPIQATCRLDEKELHEVVTKLVVQFVNNEQNKFARPIKFAVGYNRRGIEET 239

Query: 237 QNKIPKDT-KDSDVSALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLP 295
           + KIPK T +D +  ALLDR KCF++VA AVK  VSDS VDLKSPELSVLVELLP+S +P
Sbjct: 240 EMKIPKSTPRDCNSHALLDRKKCFSVVATAVKGAVSDSVVDLKSPELSVLVELLPLSRVP 299

Query: 296 SELLVVGVSILPQKLVTTKPRLSIRALVSGTNAKN 330
           +  +VV VS+LPQ L+TTKPRL I+AL+S T   N
Sbjct: 300 NGSMVVAVSVLPQNLITTKPRLCIKALLSDTKVGN 334




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255561747|ref|XP_002521883.1| conserved hypothetical protein [Ricinus communis] gi|223538921|gb|EEF40519.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|356575470|ref|XP_003555863.1| PREDICTED: uncharacterized protein LOC100797260 [Glycine max] Back     alignment and taxonomy information
>gi|42561854|ref|NP_172400.2| uncharacterized protein [Arabidopsis thaliana] gi|14334964|gb|AAK59659.1| unknown protein [Arabidopsis thaliana] gi|332190304|gb|AEE28425.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|24030215|gb|AAN41286.1| unknown protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|388515009|gb|AFK45566.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|297849234|ref|XP_002892498.1| hypothetical protein ARALYDRAFT_471019 [Arabidopsis lyrata subsp. lyrata] gi|297338340|gb|EFH68757.1| hypothetical protein ARALYDRAFT_471019 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|449487893|ref|XP_004157853.1| PREDICTED: uncharacterized protein LOC101227649 [Cucumis sativus] Back     alignment and taxonomy information
>gi|357136974|ref|XP_003570077.1| PREDICTED: uncharacterized protein LOC100839515 isoform 1 [Brachypodium distachyon] Back     alignment and taxonomy information
>gi|283132369|dbj|BAI63590.1| hypothetical protein [Lotus japonicus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query331
TAIR|locus:2203033348 AT1G09290 "AT1G09290" [Arabido 0.948 0.902 0.578 2.3e-87
TAIR|locus:2203033 AT1G09290 "AT1G09290" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 873 (312.4 bits), Expect = 2.3e-87, P = 2.3e-87
 Identities = 187/323 (57%), Positives = 227/323 (70%)

Query:    10 LKPWEQHSRVISIPRFDYNSPSSLLQRSHSGFLLTCTIKREKSATKEAISILHKYVGHNN 69
             L PWEQHS +ISIPRFDY +PSSLL  SHSGFL+TC IKREKSATKE +SIL KY+G  +
Sbjct:    32 LTPWEQHSSIISIPRFDYKAPSSLLHHSHSGFLVTCNIKREKSATKEVMSILGKYIGSMH 91

Query:    70 IENSQRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIEEDSAGGLQKNECHSSVKTDT 129
              E  + L S  TA  SK++KVC  +       G +   +E D+     +N     +K   
Sbjct:    92 EEKPEVLNS--TA--SKKQKVCAQETE---EGGEKTVPLENDALQETGENPNVEDLKL-A 143

Query:   130 NTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQ 189
             N E +  MSLVKLT++GLLL TFP E+SPNT NIVS +FQS+ SG+LK+P+WCHRIFP+Q
Sbjct:   144 NEEHNSLMSLVKLTKSGLLLFTFPVENSPNTTNIVSRVFQSMESGALKAPIWCHRIFPVQ 203

Query:   190 ATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDV 249
             ATC L EKEL+ TVSKLV +FVND+ N LS+PVKFA GY RRG EE + KI KD  D  V
Sbjct:   204 ATCGLTEKELRETVSKLVQRFVNDKDNTLSKPVKFAAGYQRRGAEETKGKIRKDASDVLV 263

Query:   250 SA-LLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQ 308
                LLDR KCF  VAA VK++V DS VDLKSPEL VLVELLP+S + S   V  VS+LP 
Sbjct:   264 QCPLLDRIKCFETVAAGVKDIVPDSVVDLKSPELCVLVELLPLSRISSGSFVAAVSVLPH 323

Query:   309 KLVTTKPRLSIRALVSGTNAKNG 331
             +LV+TKP+L I+ LV  +  K G
Sbjct:   324 RLVSTKPKLCIKPLVPESKHKKG 346


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.314   0.130   0.366    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      331       324   0.00086  116 3  11 23  0.44    34
                                                     33  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  600 (64 KB)
  Total size of DFA:  203 KB (2115 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  28.19u 0.12s 28.31t   Elapsed:  00:00:02
  Total cpu time:  28.19u 0.12s 28.31t   Elapsed:  00:00:02
  Start:  Sat May 11 02:33:36 2013   End:  Sat May 11 02:33:38 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query331
cd11717158 cd11717, THUMP_THUMPD1_like, THUMP domain-containi 4e-13
COG1818175 COG1818, COG1818, Predicted RNA-binding protein, c 0.003
>gnl|CDD|212586 cd11717, THUMP_THUMPD1_like, THUMP domain-containing protein 1-like Back     alignment and domain information
 Score = 65.7 bits (161), Expect = 4e-13
 Identities = 35/168 (20%), Positives = 64/168 (38%), Gaps = 34/168 (20%)

Query: 140 VKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGSGSLKSPVWCHRIFPIQATCVLKEKEL 199
           V+     ++ +       P  + +V  IF+   S   K   +  R+ PI  TC    +E+
Sbjct: 25  VETGVKNVVFIKTRPPVDP--VELVEKIFEDAASTKKKRTRFIQRLIPIDVTCKASLEEI 82

Query: 200 QATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCF 259
           +    +L+ +     +     P  FAV    R      NK+ +D                
Sbjct: 83  EKLAKELLKKHFPTAE----PPKTFAVECKSRNN----NKLSRD---------------- 118

Query: 260 TIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILP 307
            ++ A  + V     VDLK+P+  +LVE++          V G+S++ 
Sbjct: 119 EVIKAVAELVPEIHKVDLKNPDKVILVEVIK--------NVCGISVVK 158


This family contains THUMP domain-only proteins including THUMP domain-containing protein 1 and Saccharomyces cerevisiae Tan1. Tan1 is non essential and has been shown to be required for the formation of the modified nucleoside N(4)-acetylcytidine (ac(4)C) in tRNA. To date, there is no functional information available about THUMPD1. The THUMP domain is named after thiouridine synthases, methylases and PSUSs. The domain consists of about 110 amino acid residues. It is predicted to be an RNA-binding domain and probably functions by delivering a variety of RNA modification enzymes to their targets. Length = 158

>gnl|CDD|224731 COG1818, COG1818, Predicted RNA-binding protein, contains THUMP domain [General function prediction only] Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 331
KOG3943291 consensus THUMP domain-containing proteins [Genera 100.0
PF02926144 THUMP: THUMP domain; InterPro: IPR004114 The THUMP 99.6
COG1818175 Predicted RNA-binding protein, contains THUMP doma 99.57
PRK01565 394 thiamine biosynthesis protein ThiI; Provisional 98.51
PRK01269 482 tRNA s(4)U8 sulfurtransferase; Provisional 98.31
PRK08384 381 thiamine biosynthesis protein ThiI; Provisional 98.19
TIGR00342 371 thiazole biosynthesis/tRNA modification protein Th 98.0
COG0301 383 ThiI Thiamine biosynthesis ATP pyrophosphatase [Co 97.76
COG0116 381 Predicted N6-adenine-specific DNA methylase [DNA r 94.47
TIGR01177 329 conserved hypothetical protein TIGR01177. This fam 94.18
PRK11783 702 rlmL 23S rRNA m(2)G2445 methyltransferase; Provisi 90.35
>KOG3943 consensus THUMP domain-containing proteins [General function prediction only] Back     alignment and domain information
Probab=100.00  E-value=6.6e-45  Score=330.84  Aligned_cols=243  Identities=19%  Similarity=0.311  Sum_probs=186.0

Q ss_pred             CcccCCCCceEEEEeCCChhhhHHHHHHHHHHHhh-hcccCcccccccCCCccCCccccccccCCCCCCcCCCcccCccc
Q 020116           31 SSLLQRSHSGFLLTCTIKREKSATKEAISILHKYV-GHNNIENSQRLESSNTAGDSKRRKVCTDDMGGKCADGAEINSIE  109 (331)
Q Consensus        31 ~~~L~~G~~GilvTC~~grEk~A~~E~~~LLne~a-d~ly~~~~~~~e~~~~~~~~~~~k~~~~~~~~e~~d~~~~e~e~  109 (331)
                      ...|+||.+|||+||+ ++||||++|+|.||+|.. |..||++.  .+.++. .          +.|.|..++ ..|.|.
T Consensus        36 ~~~lepG~~Gi~~tC~-~HErQA~QEaysl~~ElygD~~~G~~~--~~~~E~-a----------p~gsE~~~~-~aeaEl  100 (291)
T KOG3943|consen   36 PRQLEPGLQGILITCN-MHERQAVQEAYSLLNELYGDDMYGPEK--FTDKEQ-A----------PSGSEGEDD-DAEAEL  100 (291)
T ss_pred             ccccCCCcceEEEEcc-chHHHHHHHHHHHHHHHhcCcccCchh--ccCccc-C----------CCCCcCCCC-hHHHhh
Confidence            3579999999999999 599999999999999975 45665431  001111 1          111111000 111110


Q ss_pred             cccccccccccccCCCcCCCCCCccccccceeccccceEEEEeccCCCCCHHHHHHHHHHhhcc--CCcCCceeEEEeee
Q 020116          110 EDSAGGLQKNECHSSVKTDTNTETDFDMSLVKLTRNGLLLLTFPREHSPNTINIVSNIFQSLGS--GSLKSPVWCHRIFP  187 (331)
Q Consensus       110 ~die~~~ke~e~~k~~krf~~~~~~~~f~~vkl~~~Gv~fi~f~~~~~~dP~~lv~~I~~dv~~--~~~~~tRfi~Rl~P  187 (331)
                      .|.+.+    .++++.-    -+....|.+|+++|+|+.||....  ++.|.++|+.|++|+.+  ++.+++||+|||+|
T Consensus       101 kdEe~g----ddLk~~~----~~~~~P~~Fv~~~~~Cv~f~~t~K--niVpe~~v~~i~~dm~elk~k~kRtR~~Qr~~P  170 (291)
T KOG3943|consen  101 KDEEVG----DDLKAST----EMRLRPFQFVESGANCVVFIRTLK--NIVPEKLVHHILQDMYELKTKKKRTRVIQRMLP  170 (291)
T ss_pred             hhhhhH----HHHhhhh----hhhcCchhhhhccCceEEeecccC--ccCchHHHHHHHHHHHhhhccchhhhhhhhhcc
Confidence            011111    1122100    001124789999999999999865  57999999999999987  56789999999999


Q ss_pred             eeeEeecCHHHHHHHHHHHHHhhcccccccCCCCceeEEEEcccCccccccCCCCCCCCccccccCChhHHHHHHHHHhh
Q 020116          188 IQATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVK  267 (331)
Q Consensus       188 I~~tc~A~~~~i~~~~~~l~~~~F~~~~~~~~~p~tfaI~~~~Rn~~~~e~~~~~~n~~~~~~~~l~R~~vI~~VA~~V~  267 (331)
                      |++||.|.+++|.+++..+|.+||+....  .+-.||+|+|++||           |      +++.|++||+.|+.+|.
T Consensus       171 i~~tc~a~le~m~k~a~~VI~p~fkap~t--gK~~tf~VE~k~RN-----------n------~~v~r~~vi~~V~~~Vc  231 (291)
T KOG3943|consen  171 ISGTCKAFLEDMKKYAETVIEPWFKAPNT--GKKGTFQVEYKSRN-----------N------SHVNREEVIREVAGIVC  231 (291)
T ss_pred             ccchHHhhHHHHHHHHHHhhcccccCCCC--CcCceEEEEEEecc-----------c------cchhHHHHHHHHHHHHH
Confidence            99999999999999999999999975531  35579999999999           6      99999999999999999


Q ss_pred             hcCCCCeeeCCCCCeEEEEEEeeCCCCCCCccEEEEEeecCCccccccceeeeeccCCCC
Q 020116          268 EVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLVTTKPRLSIRALVSGTN  327 (331)
Q Consensus       268 ~~~~~hkVdLknPd~~IlVEV~k~~~~~~~~~v~giSVv~~y~~~~K~~l~~~~l~~~~~  327 (331)
                      ..++..+|||+|||++|+|||++  .      ++||||+.+|+-++||||  ++++.-+.
T Consensus       232 ~l~se~~VdL~n~D~t~~Ve~~k--s------~I~~~Vv~dy~~~RKYnl--q~~~~s~~  281 (291)
T KOG3943|consen  232 TLNSENKVDLTNPDYTVVVEIIK--A------VICLSVVKDYMLFRKYNL--QEVVKSPK  281 (291)
T ss_pred             hcCccceeeccCCCeEEEEEeee--c------eeeeeehHHHHHHHHhhH--HHhhcccC
Confidence            99999999999999999999999  7      999999999999999998  66665443



>PF02926 THUMP: THUMP domain; InterPro: IPR004114 The THUMP domain is shared by 4-thiouridine, pseudouridine synthases and RNA methylases[] and is probably an RNA-binding domain that adopts an alpha/beta fold similar to that found in the C-terminal domain of translation initiation factor 3 and ribosomal protein S8 Back     alignment and domain information
>COG1818 Predicted RNA-binding protein, contains THUMP domain [General function prediction only] Back     alignment and domain information
>PRK01565 thiamine biosynthesis protein ThiI; Provisional Back     alignment and domain information
>PRK01269 tRNA s(4)U8 sulfurtransferase; Provisional Back     alignment and domain information
>PRK08384 thiamine biosynthesis protein ThiI; Provisional Back     alignment and domain information
>TIGR00342 thiazole biosynthesis/tRNA modification protein ThiI Back     alignment and domain information
>COG0301 ThiI Thiamine biosynthesis ATP pyrophosphatase [Coenzyme metabolism] Back     alignment and domain information
>COG0116 Predicted N6-adenine-specific DNA methylase [DNA replication, recombination, and repair] Back     alignment and domain information
>TIGR01177 conserved hypothetical protein TIGR01177 Back     alignment and domain information
>PRK11783 rlmL 23S rRNA m(2)G2445 methyltransferase; Provisional Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query331
2dir_A98 Thump domain-containing protein 1; structural geno 1e-09
>2dir_A Thump domain-containing protein 1; structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} SCOP: d.308.1.3 Length = 98 Back     alignment and structure
 Score = 53.7 bits (129), Expect = 1e-09
 Identities = 16/119 (13%), Positives = 41/119 (34%), Gaps = 30/119 (25%)

Query: 190 ATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDV 249
           ++     ++++      +  +             F + Y  R                  
Sbjct: 5   SSGKAFLEDMKKYAETFLEPWFKAPN-----KGTFQIVYKSRNNSH-------------- 45

Query: 250 SALLDRNKCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQ 308
              ++R +    +A  V  + S++ VDL +P+ +V+VE++          V  +S++  
Sbjct: 46  ---VNREEVIRELAGIVCTLNSENKVDLTNPQYTVVVEIIKA--------VCCLSVVKS 93


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query331
2dir_A98 Thump domain-containing protein 1; structural geno 99.76
1vbk_A 307 Hypothetical protein PH1313; structural genomics, 98.41
2c5s_A 413 THII, probable thiamine biosynthesis protein THII; 98.39
3tma_A 354 Methyltransferase; thump domain; 2.05A {Thermus th 96.37
3k0b_A 393 Predicted N6-adenine-specific DNA methylase; methy 96.3
3ldg_A 384 Putative uncharacterized protein SMU.472; YPSC, me 95.81
3v97_A 703 Ribosomal RNA large subunit methyltransferase L; Y 95.32
3tm4_A 373 TRNA (guanine N2-)-methyltransferase TRM14; rossma 95.15
3ldu_A 385 Putative methylase; structural genomics, PSI-2, pr 94.69
>2dir_A Thump domain-containing protein 1; structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} SCOP: d.308.1.3 Back     alignment and structure
Probab=99.76  E-value=2.1e-18  Score=138.62  Aligned_cols=93  Identities=17%  Similarity=0.303  Sum_probs=84.7

Q ss_pred             eeEeecCHHHHHHHHHHHHHhhcccccccCCCCceeEEEEcccCccccccCCCCCCCCccccccCChhHHHHHHHHHhhh
Q 020116          189 QATCVLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKE  268 (331)
Q Consensus       189 ~~tc~A~~~~i~~~~~~l~~~~F~~~~~~~~~p~tfaI~~~~Rn~~~~e~~~~~~n~~~~~~~~l~R~~vI~~VA~~V~~  268 (331)
                      ..+|.+++++|.+.|..++.++|+.     .++.||+|++++|+           +      +.|++++|++.||++|.+
T Consensus         4 ~~~~~~~~e~I~~~a~~~~~~~f~~-----~~~~TF~V~~kR~~-----------k------~~~~S~ei~~~vG~~i~~   61 (98)
T 2dir_A            4 GSSGKAFLEDMKKYAETFLEPWFKA-----PNKGTFQIVYKSRN-----------N------SHVNREEVIRELAGIVCT   61 (98)
T ss_dssp             CCCCCHHHHHHHHHHHHHHHHHHST-----TCCCEEEEEEECSS-----------C------CSSCHHHHHHHHHHHHHH
T ss_pred             eEEEECCHHHHHHHHHHHHHHhcCc-----CCCceEEEEEEeCC-----------C------CCCCHHHHHHHHHHHHHh
Confidence            4789999999999999999999852     24469999999999           5      599999999999999998


Q ss_pred             cCCCCeeeCCCCCeEEEEEEeeCCCCCCCccEEEEEeecCCcc
Q 020116          269 VVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSILPQKLV  311 (331)
Q Consensus       269 ~~~~hkVdLknPd~~IlVEV~k~~~~~~~~~v~giSVv~~y~~  311 (331)
                      ..++++|||+|||++|.|||.+  +      .|||||+++|+.
T Consensus        62 ~~~~~kVdL~nPd~~I~VEI~~--~------~~~isv~~~y~~   96 (98)
T 2dir_A           62 LNSENKVDLTNPQYTVVVEIIK--A------VCCLSVVKSGPS   96 (98)
T ss_dssp             HCTTCEECSSSCSEEEEEEEET--T------EEEEEEEECCCT
T ss_pred             hCCCCeeEcCCCCEEEEEEEeC--C------EEEEEEcccccc
Confidence            7778999999999999999999  8      999999999975



>1vbk_A Hypothetical protein PH1313; structural genomics, riken structural genomics/proteomics initiative, RSGI, NPPSFA; 1.90A {Pyrococcus horikoshii} SCOP: c.26.2.6 d.308.1.1 Back     alignment and structure
>2c5s_A THII, probable thiamine biosynthesis protein THII; RNA-binding protein, RNA binding protein, tRNA modification, 4-thiouridine synthase; HET: AMP; 2.5A {Bacillus anthracis} SCOP: c.26.2.6 d.308.1.1 Back     alignment and structure
>3tma_A Methyltransferase; thump domain; 2.05A {Thermus thermophilus} Back     alignment and structure
>3k0b_A Predicted N6-adenine-specific DNA methylase; methylase,PF01170, putative RNA methylase, PSI,MCSG, structu genomics; 1.50A {Listeria monocytogenes str} Back     alignment and structure
>3ldg_A Putative uncharacterized protein SMU.472; YPSC, methyltransferase, transferase; HET: SAH; 1.96A {Streptococcus mutans} Back     alignment and structure
>3v97_A Ribosomal RNA large subunit methyltransferase L; YCBY, RNA methyltransferase, ribosome RNA, SAH, RLML; HET: SAH OSU; 2.20A {Escherichia coli} PDB: 3v8v_A* Back     alignment and structure
>3tm4_A TRNA (guanine N2-)-methyltransferase TRM14; rossmann fold, thump domain, tRNA methyltransferase; HET: SAM; 1.95A {Pyrococcus furiosus} PDB: 3tlj_A* 3tm5_A* Back     alignment and structure
>3ldu_A Putative methylase; structural genomics, PSI-2, protein structure initiative, midwest center for structural genomics, MCSG; HET: MSE GTP; 1.70A {Clostridium difficile} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 331
d2dira185 d.308.1.3 (A:8-92) THUMP domain-containing protein 7e-04
>d2dira1 d.308.1.3 (A:8-92) THUMP domain-containing protein 1 {Human (Homo sapiens) [TaxId: 9606]} Length = 85 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: THUMP domain
superfamily: THUMP domain-like
family: Minimal THUMP
domain: THUMP domain-containing protein 1
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 35.9 bits (83), Expect = 7e-04
 Identities = 16/110 (14%), Positives = 39/110 (35%), Gaps = 30/110 (27%)

Query: 197 KELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRN 256
           ++++      +  +             F + Y  R                     ++R 
Sbjct: 5   EDMKKYAETFLEPWFKAPNKG-----TFQIVYKSRNNSH-----------------VNRE 42

Query: 257 KCFTIVAAAVKEVVSDSAVDLKSPELSVLVELLPISGLPSELLVVGVSIL 306
           +    +A  V  + S++ VDL +P+ +V+VE++          V  +S++
Sbjct: 43  EVIRELAGIVCTLNSENKVDLTNPQYTVVVEIIKA--------VCCLSVV 84


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query331
d2dira185 THUMP domain-containing protein 1 {Human (Homo sap 99.82
d1vbka2175 Hypothetical protein PH1313, N-terminal domain {Ar 98.99
d2c5sa2171 Thiamine biosynthesis protein ThiI, N-terminal dom 98.87
>d2dira1 d.308.1.3 (A:8-92) THUMP domain-containing protein 1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: THUMP domain
superfamily: THUMP domain-like
family: Minimal THUMP
domain: THUMP domain-containing protein 1
species: Human (Homo sapiens) [TaxId: 9606]
Probab=99.82  E-value=1.5e-20  Score=146.08  Aligned_cols=85  Identities=19%  Similarity=0.349  Sum_probs=78.8

Q ss_pred             ecCHHHHHHHHHHHHHhhcccccccCCCCceeEEEEcccCccccccCCCCCCCCccccccCChhHHHHHHHHHhhhcCCC
Q 020116          193 VLKEKELQATVSKLVLQFVNDEQNKLSRPVKFAVGYNRRGFEEKQNKIPKDTKDSDVSALLDRNKCFTIVAAAVKEVVSD  272 (331)
Q Consensus       193 ~A~~~~i~~~~~~l~~~~F~~~~~~~~~p~tfaI~~~~Rn~~~~e~~~~~~n~~~~~~~~l~R~~vI~~VA~~V~~~~~~  272 (331)
                      +|++++|.+.|..++++||+.     .++.||+|.+++|+           |      +.|+|++|++.||++|.+..++
T Consensus         1 ~A~ledi~k~a~~l~~~~~~~-----~~~~TF~V~~krr~-----------~------~~l~s~ev~~~vg~~v~~~~~~   58 (85)
T d2dira1           1 KAFLEDMKKYAETFLEPWFKA-----PNKGTFQIVYKSRN-----------N------SHVNREEVIRELAGIVCTLNSE   58 (85)
T ss_dssp             CHHHHHHHHHHHHHHHHHHST-----TCCCEEEEEEECSS-----------C------CSSCHHHHHHHHHHHHHHHCTT
T ss_pred             CCCHHHHHHHHHHHHHHHhcc-----CCCceEEEEEEeCC-----------C------CCCCHHHHHHHHHHHHHhhCCC
Confidence            488999999999999999962     36789999999999           6      7999999999999999888889


Q ss_pred             CeeeCCCCCeEEEEEEeeCCCCCCCccEEEEEeec
Q 020116          273 SAVDLKSPELSVLVELLPISGLPSELLVVGVSILP  307 (331)
Q Consensus       273 hkVdLknPd~~IlVEV~k~~~~~~~~~v~giSVv~  307 (331)
                      |+|||+|||++|+|||++  +      +||||||.
T Consensus        59 ~~VdL~~Pd~~I~Vev~~--~------~~~iSvv~   85 (85)
T d2dira1          59 NKVDLTNPQYTVVVEIIK--A------VCCLSVVK   85 (85)
T ss_dssp             CEECSSSCSEEEEEEEET--T------EEEEEEEE
T ss_pred             ceeeccCCCEEEEEEEeC--C------EEEEEEeC
Confidence            999999999999999999  9      99999984



>d1vbka2 d.308.1.1 (A:1-175) Hypothetical protein PH1313, N-terminal domain {Archaeon Pyrococcus horikoshii [TaxId: 53953]} Back     information, alignment and structure
>d2c5sa2 d.308.1.1 (A:3-173) Thiamine biosynthesis protein ThiI, N-terminal domain {Bacillus anthracis [TaxId: 1392]} Back     information, alignment and structure