Citrus Sinensis ID: 008521


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520-------530-------540-------550-------560---
MQPLFNAFANQTQSNSSTSTLPQMQAFRPNMLQNSVPMQPQTGIMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMFPVHGNFGNMPLNVSQGQLLGPNLSNLPQQFNQNMGFSNGQMCLQSVNQLLSMQMSNPCLFGNPQFGLVNQNQQNFVQPIKDVSGNNTLKSCSSEPRQGQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRERMHHNIQGFEDENDDEEEDNDNGEQMKVYIAGKGNTNDEHVRLEEEEGEIID
cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccHHHHHHHHHHHHHcccccHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccHHHHHHHHHHccccccccccccccccccccccccccccHHHHccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccEEEccccccccHHHHHHcccccccccccccccccccccccccccccccccccccccccHHHccccccccHHHHcccccccHHHHHcHHHHccccc
cccccccccccHccccccccccccccccHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHccccccEEEcHHHHHHHHHHHHHcccccHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHcccccccccccccccccHcccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHccccccccccEEEEEcccccccEEcccccccccccccccccEEEEEEcccccccccccccccccccccccccccccccccccEEEEcccccccccccccccccEcc
MQPLFNafanqtqsnsststlpqmqafrpnmlqnsvpmqpqtgimnpqvpfplsnsnmpataplmnqnnfmnasnplliaqsnhlglpqlghmfpvhgnfgnmplnvsqgqllgpnlsnlpqqfnqnmgfsngqMCLQSVNQLLSmqmsnpclfgnpqfglvnqnqqnfvqpikdvsgnntlkscsseprqgqnlqpsafmRSQRNfiqngqgsisksnskkvsgkNFIKNRTKEFSQGfqkshfhqmhngkrkfgfyndqkpkghgngkAIKFGlanqmnqpqekKRKSLALTYTEQEVKQWCEERrknyptkaninkkltekqsnsdvIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENtmplnkrgrfqnnydkrgrykrkdqfskeqkladkdssntssfnkkkPTLLQKLLSADVKRDKSHLLQTFRFMAINSffkdwpekpvnfplvmvkdggaageVVEEKSlllgeedsegkgRRMVEHfehdenrermhhniqgfedenddeeedndngEQMKVYIAgkgntndehvrleeeegeiid
MQPLFNAFanqtqsnsstsTLPQMQAFRPNMLQNSVPMQPQTGIMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMFPVHGNFGNMPLNVSQGQLLGPNLSNLPQQFNQNMGFSNGQMCLQSVNQLLSMQMSNPCLFGNPQFGLVNQNQQNFVQPIKDVSGNNTLKSCSSEPRQGQNLQPSAFMRSQRNFIQngqgsisksnskkvsGKNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGlanqmnqpqekKRKSLALTYTEQEVKqwceerrknyptkaninkkltekqsnsdviDKEAKMRREQLKEILAKqaelgvevaeipsyylldskkqVRARventmplnkrgrfqnnydkrgrykrkdqfskeqkladkdssntssfnkkkptLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVeekslllgeedsegkgrRMVEHfehdenrermhhniqgfedenddeeEDNDNGEQMKVYIagkgntndehvrleeeegeiid
MQPLFNAFAnqtqsnsststLPQMQAFRPNMLQNSVPMQPQTGIMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMFPVHGNFGNMPLNVSQGQLLGPNLSNLPQQFNQNMGFSNGQMCLQSVNQLLSMQMSNPCLFGNPQFGLVNQNQQNFVQPIKDVSGNNTLKSCSSEPRQGQNLQPSAFMRSQRNFIQNGQgsisksnskkvsgknFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRERMHHNIQGFedenddeeedndnGEQMKVYIAGKGNTNDEHVRLEEEEGEIID
***************************************************************************PLLIAQSNHLGLPQLGHMFPVHGNFGNMPLNVSQGQLLG*************MGFSNGQMCLQSVNQLLSMQMSNPCLFGNPQFGLVNQNQQNFV************************************************************************************************************************************W*****************************************ILAKQAELGVEVAEIPSYYLLDS********************************************************************V*RDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEV***********************************************************************************
*************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************************ADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLV************************************************************************************LEE****I**
********************LPQMQAFRPNMLQNSVPMQPQTGIMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMFPVHGNFGNMPLNVSQGQLLGPNLSNLPQQFNQNMGFSNGQMCLQSVNQLLSMQMSNPCLFGNPQFGLVNQNQQNFVQPIKDVSGNNTL************LQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRY************************KKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENRERMHHNIQGFED***********GEQMKVYIAGKGNTNDEHVRL*********
*****************TSTLPQMQAFRPNMLQNSVPMQPQTGIMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMFPVHGNFGNMPLNVSQGQLLGPNLSNLPQQFNQNMGFSNGQMCLQSVNQLLSMQMSNPCLFGNPQFGLVNQNQQNFVQPIKDVSGNNTLKSCSSEPRQGQNLQPSAFMRSQR**********************************************************************************RKSLALTYTEQEVKQWCEERRKNYPTKANINKKLT******DVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYL**********************************************************KKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEE****KGRRMVEHFEH***********************DNDNGEQMKVYIAGKGNTNDEHVRLEEEE**II*
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MQPLFNAFANQTQSNSSTSTLPQMQAFRPNMLQNSVPMQPQTGIMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMFPVHGNFGNMPLNVSQGQLLGPNLSNLPQQFNQNMGFSNGQMCLQSVNQLLSMQMSNPCLFGNPQFGLVNQNQQNFVQPIKDVSGNNTLKSCSSEPRQGQNLQPSAFMRSQRNFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNGKAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSEGKGRRMVEHFEHDENxxxxxxxxxxxxxxxxxxxxxxxxxxxxKVYIAGKGNTNDEHVRLEEEEGEIID
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query563
147775405605 hypothetical protein VITISV_043067 [Viti 0.793 0.738 0.407 5e-78
449446325599 PREDICTED: uncharacterized protein LOC10 0.841 0.791 0.396 2e-77
449494110552 PREDICTED: uncharacterized LOC101213190 0.833 0.849 0.395 4e-77
224100559659 predicted protein [Populus trichocarpa] 0.611 0.522 0.447 6e-76
224113339 674 predicted protein [Populus trichocarpa] 0.921 0.770 0.363 2e-74
356553962 822 PREDICTED: uncharacterized protein LOC10 0.861 0.590 0.360 3e-65
255554855273 conserved hypothetical protein [Ricinus 0.380 0.783 0.587 4e-63
145334515470 uncharacterized protein [Arabidopsis tha 0.678 0.812 0.420 4e-62
358347440553 hypothetical protein MTR_101s0034 [Medic 0.749 0.763 0.373 5e-62
297807865454 hypothetical protein ARALYDRAFT_488722 [ 0.646 0.801 0.391 7e-62
>gi|147775405|emb|CAN73830.1| hypothetical protein VITISV_043067 [Vitis vinifera] Back     alignment and taxonomy information
 Score =  298 bits (763), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 224/550 (40%), Positives = 282/550 (51%), Gaps = 103/550 (18%)

Query: 35  SVPMQPQTGIMNPQVPFPLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMF 94
           ++P   Q G++ PQ   P    NMP      NQ  FM    PL   Q+NHLG+P LG   
Sbjct: 40  ALPNPLQMGMLXPQTSIPFI-PNMP------NQFGFM----PL---QNNHLGMPHLGSAQ 85

Query: 95  PVHGNFG--NMPLNVSQ-----------GQLLGPNLSNLPQQFNQNMGFSNGQMCLQSVN 141
              G+F   N   N+ Q             L G N + LP          N Q CLQ+ N
Sbjct: 86  SYQGSFNPQNCVPNLGQSAXRSNPSQPPANLFGHNSAILP---------GNLQFCLQNXN 136

Query: 142 QLLSMQMSNPCLFG------------------NPQFGLVNQNQQNFVQPIKDVS------ 177
           Q   M M N    G                   PQ      NQQ     +  V       
Sbjct: 137 QNFPMPMPNLNQVGPYNIPQASHHMFQLAQALGPQSSAFIPNQQFVPMCVSGVGENVHQD 196

Query: 178 ---------GNNTLK-----------------SCSSEPRQ-GQNLQPSAFMRSQRNFIQN 210
                    G+N LK                 + S +P+Q   N QPS FM+SQ NF+++
Sbjct: 197 NHKLGPPAMGSNALKQPPIANPQLQGNWPTMATSSVQPQQTWNNQQPSNFMKSQGNFVKD 256

Query: 211 GQGSISKSNSKKVSGKNFIKN-RTKEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHGNG 269
              + S  N K  S KNF+ N + K    G +KS  H M NG+ K G  N+ + KG  N 
Sbjct: 257 AGVNNSNPNWKNSSRKNFMXNPKGKNSHWGSRKSQLHHMQNGRGKAGISNENRGKGLSNN 316

Query: 270 KAIKFGLANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQSNSD 329
            A      N   Q + +K++ L L YTEQE++ W EER+KNYP+K N+ KK  EK +NS+
Sbjct: 317 MAGNLCRPNFTYQDKVEKKRPLPLNYTEQEIQNWREERKKNYPSKINLEKKSAEKLTNSE 376

Query: 330 VIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDS-KKQVRARVENT-MPLNKRGRF 387
           VI+ E K RR+QLKEILAKQAELGVEVAEIP +YL DS K+QV  R EN      K+ RF
Sbjct: 377 VIEAEVKSRRQQLKEILAKQAELGVEVAEIPPHYLSDSEKQQVHGREENNKKAFGKKERF 436

Query: 388 QNNYDKRGRYKRK--------DQFSKEQKLADKDSSNTSS-----FNKKKPTLLQKLLSA 434
           QN  +KR R+ RK        D F+K+Q+LA  DS +T++      NKKK TLLQKLLS 
Sbjct: 437 QNRGNKRRRHDRKQWQRHDQEDGFTKKQRLAGTDSGDTNASNQPPLNKKKQTLLQKLLST 496

Query: 435 DVKRDKSHLLQTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSLLLGEEDSE 494
           D+KRDK HLLQ FRFMA+NSFFKDWPEKP+ FPLV VK+ G  GEVV+ KS    +   +
Sbjct: 497 DIKRDKRHLLQVFRFMAMNSFFKDWPEKPLKFPLVAVKETGCQGEVVDRKSSPTSKGVPQ 556

Query: 495 GKGRRMVEHF 504
           G  +   E F
Sbjct: 557 GGRKTXAEEF 566




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449446325|ref|XP_004140922.1| PREDICTED: uncharacterized protein LOC101213190 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449494110|ref|XP_004159451.1| PREDICTED: uncharacterized LOC101213190 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224100559|ref|XP_002311923.1| predicted protein [Populus trichocarpa] gi|222851743|gb|EEE89290.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224113339|ref|XP_002316461.1| predicted protein [Populus trichocarpa] gi|222865501|gb|EEF02632.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356553962|ref|XP_003545319.1| PREDICTED: uncharacterized protein LOC100786384 [Glycine max] Back     alignment and taxonomy information
>gi|255554855|ref|XP_002518465.1| conserved hypothetical protein [Ricinus communis] gi|223542310|gb|EEF43852.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|145334515|ref|NP_001078603.1| uncharacterized protein [Arabidopsis thaliana] gi|145358174|ref|NP_197345.2| uncharacterized protein [Arabidopsis thaliana] gi|60547897|gb|AAX23912.1| hypothetical protein At5g18440 [Arabidopsis thaliana] gi|71905555|gb|AAZ52755.1| hypothetical protein At5g18440 [Arabidopsis thaliana] gi|71905557|gb|AAZ52756.1| hypothetical protein At5g18440 [Arabidopsis thaliana] gi|332005180|gb|AED92563.1| uncharacterized protein [Arabidopsis thaliana] gi|332005181|gb|AED92564.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|358347440|ref|XP_003637765.1| hypothetical protein MTR_101s0034 [Medicago truncatula] gi|355503700|gb|AES84903.1| hypothetical protein MTR_101s0034 [Medicago truncatula] Back     alignment and taxonomy information
>gi|297807865|ref|XP_002871816.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp. lyrata] gi|297317653|gb|EFH48075.1| hypothetical protein ARALYDRAFT_488722 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query563
TAIR|locus:2146238470 NUFIP "AT5G18440" [Arabidopsis 0.740 0.887 0.402 3.1e-67
ZFIN|ZDB-GENE-030131-6203406 zgc:171476 "zgc:171476" [Danio 0.065 0.091 0.475 1.8e-05
RGD|1359440486 Nufip1 "nuclear fragile X ment 0.154 0.179 0.350 2.1e-05
UNIPROTKB|Q9UHK0495 NUFIP1 "Nuclear fragile X ment 0.101 0.115 0.446 2.4e-05
UNIPROTKB|J9P0P5487 NUFIP1 "Uncharacterized protei 0.101 0.117 0.446 7.5e-05
UNIPROTKB|F1PK39488 NUFIP1 "Uncharacterized protei 0.101 0.116 0.446 7.6e-05
UNIPROTKB|F1RJZ3494 NUFIP1 "Uncharacterized protei 0.101 0.115 0.446 7.7e-05
UNIPROTKB|E1C0G7360 NUFIP1 "Uncharacterized protei 0.060 0.094 0.459 0.00013
UNIPROTKB|E1C0G8434 NUFIP1 "Uncharacterized protei 0.060 0.078 0.459 0.00021
UNIPROTKB|F1NXC7464 NUFIP1 "Uncharacterized protei 0.060 0.073 0.459 0.00025
TAIR|locus:2146238 NUFIP "AT5G18440" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 683 (245.5 bits), Expect = 3.1e-67, P = 3.1e-67
 Identities = 186/462 (40%), Positives = 251/462 (54%)

Query:    52 PLSNSNMPATAPLMNQNNFMNASNPLLIAQSNHLGLPQLGHMFPVHGN-FGNMPLNVSQG 110
             P  NS  P   P   Q+         +   SNH      G+  P++ N  G M   +   
Sbjct:     3 PPYNSFQPQARPYRPQHQLQQQQQ--INGFSNHQQQQHNGYQNPMNANQLGMMNPQMMNN 60

Query:   111 QLLGPNLS--NLP--QQFNQNMGFSNGQMCLQS-VNQLLSMQMSNPCLFGNPQFGLVNQN 165
              ++G N+   N+P   QF  NM     Q  + + +NQLL      P L GN QF + N N
Sbjct:    61 PMMGHNMPMPNMPIHPQFFNNMPQQLPQFAMPNHINQLL------PNLLGNLQFAVANSN 114

Query:   166 QQ-----NFVQPIKD---VSGNNTLKSCSS-----EPRQGQNLQ--PSAFM--RSQRNFI 208
                    NF QP  +    S    L S +S      P   QN Q  P  F   R Q   +
Sbjct:   115 LMGHSLPNFFQPSLEPHAFSSRPQLNSFNSLPYPPVPNPHQNHQSGPPGFSEPRPQGQSV 174

Query:   209 QNGQXXXXXXXXXXXXXXXFIKNRT-KEFSQGFQKSHFHQMHNGKRKFGFYNDQKPKGHG 267
              N                 F K++  K   QGFQ+   HQ  NGKRK GF  D +  G G
Sbjct:   175 DN---TNGSGPNGNDFRNKFPKHQNFKGPGQGFQRPQLHQADNGKRKSGFNKDHR--GKG 229

Query:   268 NGKAIKFGL--ANQMNQPQEKKRKSLALTYTEQEVKQWCEERRKNYPTKANINKKLTEKQ 325
             N   +K GL  ++  N  +EKKR S AL YT +EV+QW E RRKNYPTK  + KK+ +K 
Sbjct:   230 NNNKMKTGLDGSDTGNIAKEKKR-SYALMYTPREVQQWREARRKNYPTKFLVEKKV-KKN 287

Query:   326 SNSDVIDKEAKMRREQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPLNKRG 385
              ++ ++D+EAKMRR+QL+E+LAKQAELGVEVAE+PS+YL ++ +QV     N     ++G
Sbjct:   288 VSASILDEEAKMRRQQLREVLAKQAELGVEVAEVPSHYLSNNDEQVNGDRGNNN--GRKG 345

Query:   386 RFQNNYDKRGRYKRKDQF-SKEQKLADKDSSNTSSFNKKKPTLLQKLLSADVKRDKSHLL 444
             RFQNN   + R+ RKD+F +K+ +L DK SS  SS   +KPTLL+KLLSAD+KRDKS LL
Sbjct:   346 RFQNNRRNKRRHDRKDKFDNKKPRLEDKKSSQDSSITTRKPTLLEKLLSADIKRDKSQLL 405

Query:   445 QTFRFMAINSFFKDWPEKPVNFPLVMVKDGGAAGEVVEEKSL 486
             Q FRFM +NS  K++PE+P+  PL+ VK+ G   + +E+ S+
Sbjct:   406 QVFRFMVMNSLLKEFPEQPLKLPLITVKETGCE-DAMEDPSI 446




GO:0003674 "molecular_function" evidence=ND
GO:0000491 "small nucleolar ribonucleoprotein complex assembly" evidence=IMP
GO:0005634 "nucleus" evidence=IDA
GO:0048046 "apoplast" evidence=IDA
ZFIN|ZDB-GENE-030131-6203 zgc:171476 "zgc:171476" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
RGD|1359440 Nufip1 "nuclear fragile X mental retardation protein interacting protein 1" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|Q9UHK0 NUFIP1 "Nuclear fragile X mental retardation-interacting protein 1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|J9P0P5 NUFIP1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|F1PK39 NUFIP1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|F1RJZ3 NUFIP1 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|E1C0G7 NUFIP1 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|E1C0G8 NUFIP1 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|F1NXC7 NUFIP1 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query563
pfam1045357 pfam10453, NUFIP1, Nuclear fragile X mental retard 3e-07
>gnl|CDD|220763 pfam10453, NUFIP1, Nuclear fragile X mental retardation-interacting protein 1 (NUFIP1) Back     alignment and domain information
 Score = 47.0 bits (112), Expect = 3e-07
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 296 TEQEVKQWCEERRKNYPTKANINKKLTEKQSN 327
           T +E+ +W EER+KNYPTKANI KK   ++  
Sbjct: 20  TPEEIAKWIEERKKNYPTKANIEKKKKAEEER 51


Proteins in this family have been implicated in the assembly of the large subunit of the ribosome and in telomere maintenance. Some proteins in this family contain a CCCH zinc finger. This family contains a protein called human fragile X mental retardation-interacting protein 1, which is known to bind RNA and is phosphorylated upon DNA damage. Length = 57

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 563
PF1045356 NUFIP1: Nuclear fragile X mental retardation-inter 99.11
PF0064227 zf-CCCH: Zinc finger C-x8-C-x5-C-x3-H type (and si 87.32
>PF10453 NUFIP1: Nuclear fragile X mental retardation-interacting protein 1 (NUFIP1); InterPro: IPR019496 Nuclear fragile X mental retardation-interacting protein 1 (Nufip1) has been implicated in the assembly of the large subunit of the ribosome [] and in telomere maintenance [] Back     alignment and domain information
Probab=99.11  E-value=7.9e-11  Score=94.23  Aligned_cols=31  Identities=61%  Similarity=0.901  Sum_probs=28.2

Q ss_pred             cccCHHHHHHHHHHHHhcCCChHHHHHHHHH
Q 008521          293 LTYTEQEVKQWCEERRKNYPTKANINKKLTE  323 (563)
Q Consensus       293 l~~t~eeI~~WreERrKNyPT~anIe~K~~e  323 (563)
                      .++|++||++|||||||||||+++|++|+++
T Consensus        16 ~L~t~eeI~~W~eERrk~~PT~~~i~~k~~~   46 (56)
T PF10453_consen   16 KLQTPEEIAKWIEERRKNYPTKANIEKKKAE   46 (56)
T ss_pred             ecCCHHHHHHHHHHHHHcCCcHHHHHHHHHH
Confidence            3479999999999999999999999998765



It is known to bind RNA [] and is phosphorylated upon DNA damage []. This entry represents a conserved domain found within Nufip1. Some proteins containing this region also contain a CCCH zinc finger.

>PF00642 zf-CCCH: Zinc finger C-x8-C-x5-C-x3-H type (and similar); InterPro: IPR000571 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query563
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 7e-13
3lvg_D190 LCB, clathrin light chain B; SELF assembly, coated 6e-05
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 70.7 bits (172), Expect = 7e-13
 Identities = 76/551 (13%), Positives = 152/551 (27%), Gaps = 184/551 (33%)

Query: 97  HGNFGNMPLNVSQGQLLGPN-LSNLPQQFNQNMGFSNGQMCLQSVNQLLSMQMSNPCLFG 155
           H +  +M     + Q    + LS     F  N    + Q   +S+  L   ++ +  +  
Sbjct: 1   HHHHHHMDFETGEHQYQYKDILSVFEDAFVDNFDCKDVQDMPKSI--LSKEEIDH--IIM 56

Query: 156 NPQ--------FGLVNQNQQNFVQP-IKDVSGNNT--LKSCSSEPRQGQNLQPSAFMR-- 202
           +          F  +   Q+  VQ  +++V   N   L      P + +  QPS   R  
Sbjct: 57  SKDAVSGTLRLFWTLLSKQEEMVQKFVEEVLRINYKFLM----SPIKTEQRQPSMMTRMY 112

Query: 203 -SQRNFIQNGQGSISKSNSKKVSGKNFIKNRTKEFSQGFQKSHFHQMHNGKRKFGF---- 257
             QR+ + N     +K N  ++     ++    E     +      +       G     
Sbjct: 113 IEQRDRLYNDNQVFAKYNVSRLQPYLKLRQALLELRPA-KNVLIDGV------LGSGKTW 165

Query: 258 -----YNDQKPKGHGNGKA--IKFG--------------LANQMNQP-QEKKRKSLALTY 295
                    K +   + K   +                 L  Q++     +   S  +  
Sbjct: 166 VALDVCLSYKVQCKMDFKIFWLNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKL 225

Query: 296 TEQEVKQWCEE--RRKNYPT--------------KA-NIN-KKL--TEKQSNSDVIDKEA 335
               ++       + K Y                 A N++ K L  T  +  +D +    
Sbjct: 226 RIHSIQAELRRLLKSKPYENCLLVLLNVQNAKAWNAFNLSCKILLTTRFKQVTDFLSAAT 285

Query: 336 KMR-----------REQLKEILAKQAELGVEVAEIPSYYLLDSKKQVRARVENTMPL--- 381
                          +++K +L K   L     ++P              V  T P    
Sbjct: 286 TTHISLDHHSMTLTPDEVKSLLLKY--LDCRPQDLPR------------EVLTTNPRRLS 331

Query: 382 -------NKRGRFQNNYDKRGRYKRKDQFSKEQKLADKDSSNTSSFNKKKPTLLQKLLSA 434
                  +    +  N+    ++   D+ +   +         SS N  +P   +K+   
Sbjct: 332 IIAESIRDGLATWD-NW----KHVNCDKLTTIIE---------SSLNVLEPAEYRKM--- 374

Query: 435 DVKRDKSHLLQTFRFMAINSFFKD-----------WPEKPVNFPLVMVKDGGAAGEVVEE 483
                       F  +++  F              W +   +  +V+V         + +
Sbjct: 375 ------------FDRLSV--FPPSAHIPTILLSLIWFDVIKSDVMVVVNK-------LHK 413

Query: 484 KSLLLGEEDSEGKGRRMVEH---FE---HDENRERMHHNI-------QGFEDENDDEEED 530
            SL+  E+    K   +       E     EN   +H +I       + F D +D     
Sbjct: 414 YSLV--EKQP--KESTISIPSIYLELKVKLENEYALHRSIVDHYNIPKTF-DSDDLIPPY 468

Query: 531 NDNGEQMKVYI 541
            D       Y 
Sbjct: 469 LDQ------YF 473


>3lvg_D LCB, clathrin light chain B; SELF assembly, coated PIT, cytoplasmic vesicle, membrane, Ca structural protein; 7.94A {Bos taurus} Length = 190 Back     alignment and structure

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query563
d2cqea156 Zinc finger CCCH domain-containing protein C19orf7 92.98
d1m9oa_40 Tristetraproline (ttp, tis11, nup475) {Mouse (Mus 86.09
d1rgoa136 Butyrate response factor 2 (Tis11D) {Human (Homo s 82.32
>d2cqea1 g.66.1.1 (A:458-513) Zinc finger CCCH domain-containing protein C19orf7 (KIAA1064) {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
class: Small proteins
fold: CCCH zinc finger
superfamily: CCCH zinc finger
family: CCCH zinc finger
domain: Zinc finger CCCH domain-containing protein C19orf7 (KIAA1064)
species: Human (Homo sapiens) [TaxId: 9606]
Probab=92.98  E-value=0.012  Score=45.05  Aligned_cols=24  Identities=8%  Similarity=-0.003  Sum_probs=19.7

Q ss_pred             CcceeccCCCCcccCCCCccchhhh
Q 008521          384 RGRFQNNYDKRGRYKRKDQFSKEQK  408 (563)
Q Consensus       384 ~CKF~nrGg~RgRcgRKcRFsh~~k  408 (563)
                      +||||.+|+ +|=-|.+|+|+|.+-
T Consensus         2 PCK~yHtg~-kCy~gd~C~FSH~pL   25 (56)
T d2cqea1           2 PCKLYHTTG-NCINGDDCMFSHDPL   25 (56)
T ss_dssp             BCSHHHHTS-CCSSCTTCSSBCCCC
T ss_pred             CcceeecCC-cccCCCcccccCCcc
Confidence            699999987 455789999999654



>d1m9oa_ g.66.1.1 (A:) Tristetraproline (ttp, tis11, nup475) {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1rgoa1 g.66.1.1 (A:151-186) Butyrate response factor 2 (Tis11D) {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure