Citrus Sinensis ID: 043680


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-----
MPSPTLLLPLLSSSPPSLFLFRLGQSHSEAVEAAYKVAIARCEHEKLKSADACLMEVIGEKNPEHFFVATQDVDLRKKLQEVPGVPLIFGLRNALLLEPPSSFQRKFVKTSEEARSCMTKSEFKKLKKSTKNILETKEIGDSSNKNEELENQKLEMQADKKTHYARKGMGVKDRPQFKRKRAKAPNPLSCKKKKNHENPSTSSGK
ccccccccccccccHHHHHHHHHcccHHHHHHHHHHcHHccccccccccHHHHHHHHHHcccccEEEEEEccHHHHHHHcccccccEEEEEccEEEEccccHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccc
cccHHHHHHHHcccccHHHHHHccHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHcccccccEEEEEccHHHHHHHHccccccEEEEEccEEEEccccHHHHHHHHHHHHHHccccHHHHHHHHHHccccccccccccccccHccHHHHHHHcccccccHHccccccccccccccccccccccccccccccccccccccccc
mpsptlllpllsssppslflfRLGQSHSEAVEAAYKVAIARCEHEKLKSADACLMEVigeknpehffvaTQDVDLRKklqevpgvplifglrnallleppssfqrKFVKTSEEARSCMTKSEFKKLKKSTKNILetkeigdssnkneeLENQKLEMQADKKThyarkgmgvkdrpqfkrkrakapnplsckkkknhenpstssgk
MPSPTLLLPLLSSSPPSLFLFRLGQSHSEAVEAAYKVAIARCEHEKLKSADACLMEVIGEKNPEHFFVATQDVDLRKKLQEVPGVPLIFGLRNALLLEPPSSFQRKFVKtseearscmtksefkklkkstkniletkeigdssnkneelenQKLEMqadkkthyarkgmgvkdrpqfkrkrakapnplsckkkknhenpstssgk
MpsptlllpllsssppslflfRLGQSHSEAVEAAYKVAIARCEHEKLKSADACLMEVIGEKNPEHFFVATQDVDLRKKLQEVPGVPLIFGLRNALLLEPPSSFQRKFVKTSEEARSCMtksefkklkkstkNILETKEIGDSSnkneelenqkleMQADKKTHYARKGMGVKDRPQFKRKRAKAPNPLSCKKKKNHENPSTSSGK
*****************LFLFRLGQSHSEAVEAAYKVAIARCEHEKLKSADACLMEVIGEKNPEHFFVATQDVDLRKKLQEVPGVPLIFGLRNALLLE***********************************************************************************************************
****TLLLPLLSSSPPSLFLFRLGQSHSEAVEAAYKVAIARCEHEKLKSADACLMEVIGEKNPEHFFVATQDVDLRKKLQEVPGVPLIFGLRNALLL************************************************************************************************************
MPSPTLLLPLLSSSPPSLFLFRLGQSHSEAVEAAYKVAIARCEHEKLKSADACLMEVIGEKNPEHFFVATQDVDLRKKLQEVPGVPLIFGLRNALLLEPPSSFQR***********************STKNILETKEIGDSSNKNEELENQKLEMQADKKTHYARKGMGVKD************NPLSC***************
MPSPTLLLPLLSSSPPSLFLFRLGQSHSEAVEAAYKVAIARCEHEKLKSADACLMEVIGEKNPEHFFVATQDVDLRKKLQEVPGVPLIFGLRNALLLEPPSSFQRKFVKTSEEARSCMT****K*********************************************************************************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MPSPTLLLPLLSSSPPSLFLFRLGQSHSEAVEAAYKVAIARCEHEKLKSADACLMEVIGEKNPEHFFVATQDVDLRKKLQEVPGVPLIFGLRNALLLEPPSSFQRKFVKTSEEARSCMTKSEFKKLKKSTKNILETKExxxxxxxxxxxxxxxxxxxxxxxxxxxxKGMGVKDRPQFKRKRAKAPNPLSCKKKKNHENPSTSSGK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query205 2.2.26 [Sep-21-2011]
Q9BRU9249 rRNA-processing protein U yes no 0.717 0.590 0.324 2e-13
Q9CX11249 rRNA-processing protein U yes no 0.517 0.425 0.376 1e-11
Q08DU1248 rRNA-processing protein U yes no 0.512 0.423 0.351 2e-10
Q12339254 rRNA-processing protein U yes no 0.570 0.460 0.356 2e-09
O74862260 rRNA-processing protein u yes no 0.760 0.6 0.327 1e-08
>sp|Q9BRU9|UTP23_HUMAN rRNA-processing protein UTP23 homolog OS=Homo sapiens GN=UTP23 PE=1 SV=2 Back     alignment and function desciption
 Score = 75.9 bits (185), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 59/182 (32%), Positives = 85/182 (46%), Gaps = 35/182 (19%)

Query: 23  LGQSHSEAVEAAYKVAIARCEHEK-LKSADACLMEVIGEKNPEHFFVATQDVDLRKKLQE 81
           LG+    A   A K  +  C H K   S   CL+ ++ E NP H+FVATQD +L  K+++
Sbjct: 72  LGKDLYGAKLIAQKCQVRNCPHFKNAVSGSECLLSMVEEGNPHHYFVATQDQNLSVKVKK 131

Query: 82  VPGVPLIFGLRNALLLEPPSSFQRKFVKTSEEARSCMTKSEFKKLKKSTKNILETKEIGD 141
            PGVPL+F ++N ++L+ PS     FVK  E  +                          
Sbjct: 132 KPGVPLMFIIQNTMVLDKPSPKTIAFVKAVESGQLV------------------------ 167

Query: 142 SSNKNEELENQKLEMQADKKTHYARKGMGVKDRPQFKRKRAKAPNPLSCKKKKNHENPST 201
           S ++ E +++ K E    K T  +R+          KRK+   PNPLSC KKK  + P T
Sbjct: 168 SVHEKESIKHLKEEQGLVKNTEQSRRK---------KRKKISGPNPLSCLKKKK-KAPDT 217

Query: 202 SS 203
            S
Sbjct: 218 QS 219




Involved in rRNA-processing and ribosome biogenesis.
Homo sapiens (taxid: 9606)
>sp|Q9CX11|UTP23_MOUSE rRNA-processing protein UTP23 homolog OS=Mus musculus GN=Utp23 PE=2 SV=1 Back     alignment and function description
>sp|Q08DU1|UTP23_BOVIN rRNA-processing protein UTP23 homolog OS=Bos taurus GN=UTP23 PE=2 SV=1 Back     alignment and function description
>sp|Q12339|UTP23_YEAST rRNA-processing protein UTP23 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=UTP23 PE=1 SV=1 Back     alignment and function description
>sp|O74862|UTP23_SCHPO rRNA-processing protein utp23 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=utp23 PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query205
224132294251 predicted protein [Populus trichocarpa] 0.887 0.725 0.597 8e-58
356501304285 PREDICTED: rRNA-processing protein UTP23 0.858 0.617 0.636 3e-57
225451633284 PREDICTED: rRNA-processing protein UTP23 0.892 0.644 0.622 1e-56
356554407285 PREDICTED: rRNA-processing protein UTP23 0.858 0.617 0.625 3e-56
147802219 621 hypothetical protein VITISV_029908 [Viti 0.892 0.294 0.518 7e-52
357454221289 rRNA-processing protein UTP23-like prote 0.951 0.674 0.535 1e-51
357490851289 rRNA-processing protein UTP23-like prote 0.951 0.674 0.531 3e-51
255578672290 rRNA-processing protein UTP23, putative 0.878 0.620 0.607 1e-50
449452036280 PREDICTED: rRNA-processing protein UTP23 0.858 0.628 0.551 3e-46
449486590 399 PREDICTED: LOW QUALITY PROTEIN: rRNA-pro 0.858 0.441 0.545 5e-46
>gi|224132294|ref|XP_002328233.1| predicted protein [Populus trichocarpa] gi|222837748|gb|EEE76113.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  228 bits (582), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 110/184 (59%), Positives = 146/184 (79%), Gaps = 2/184 (1%)

Query: 20  LFRLGQSHSEAVEAAYKVAIARCEHEKLKSADACLMEVIGEKNPEHFFVATQDVDLRKKL 79
           L RLG+S++E+++AA  + IARC+HE+ K+A+ C++E+IGE NP+HF+V TQD D+RKK 
Sbjct: 70  LKRLGKSYTESLQAANTLMIARCDHEQNKNAEGCIVEIIGENNPDHFYVGTQDTDMRKKF 129

Query: 80  QEVPGVPLIFGLRNALLLEPPSSFQRKFVKTSEEARSCMTKSEFKKLKKSTKNILETKEI 139
           QEVPGVPLIFGLRNAL L+PPS+FQR+F K SEE RS MT+ E   LKK TK+++E  EI
Sbjct: 130 QEVPGVPLIFGLRNALFLQPPSAFQRQFAKNSEEERSHMTEKEVALLKKRTKDLVENWEI 189

Query: 140 GDSSNKNEELENQKLEMQADKKTHYARKGMGVKDRPQFKRKRAKAPNPLSCKKKKNHENP 199
           GDSS++N   E++ LEMQ  K  + +RKGM VKDRPQFKR +AK PNPLS +KKK+ +N 
Sbjct: 190 GDSSDENGGPEDENLEMQPQK--YSSRKGMKVKDRPQFKRNKAKGPNPLSVQKKKSRQNT 247

Query: 200 STSS 203
           ++ S
Sbjct: 248 NSMS 251




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|356501304|ref|XP_003519465.1| PREDICTED: rRNA-processing protein UTP23 homolog [Glycine max] Back     alignment and taxonomy information
>gi|225451633|ref|XP_002276803.1| PREDICTED: rRNA-processing protein UTP23 homolog [Vitis vinifera] gi|296082256|emb|CBI21261.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356554407|ref|XP_003545538.1| PREDICTED: rRNA-processing protein UTP23 homolog [Glycine max] Back     alignment and taxonomy information
>gi|147802219|emb|CAN68267.1| hypothetical protein VITISV_029908 [Vitis vinifera] Back     alignment and taxonomy information
>gi|357454221|ref|XP_003597391.1| rRNA-processing protein UTP23-like protein [Medicago truncatula] gi|87241272|gb|ABD33130.1| Protein of unknown function DUF652 [Medicago truncatula] gi|355486439|gb|AES67642.1| rRNA-processing protein UTP23-like protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|357490851|ref|XP_003615713.1| rRNA-processing protein UTP23-like protein [Medicago truncatula] gi|355517048|gb|AES98671.1| rRNA-processing protein UTP23-like protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|255578672|ref|XP_002530196.1| rRNA-processing protein UTP23, putative [Ricinus communis] gi|223530289|gb|EEF32186.1| rRNA-processing protein UTP23, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|449452036|ref|XP_004143766.1| PREDICTED: rRNA-processing protein UTP23 homolog [Cucumis sativus] Back     alignment and taxonomy information
>gi|449486590|ref|XP_004157341.1| PREDICTED: LOW QUALITY PROTEIN: rRNA-processing protein UTP23 homolog [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query205
TAIR|locus:2062303281 MEE21 "AT2G34570" [Arabidopsis 0.839 0.612 0.461 1.1e-34
UNIPROTKB|Q08DU1248 UTP23 "rRNA-processing protein 0.453 0.375 0.361 6.4e-19
UNIPROTKB|Q9BRU9249 UTP23 "rRNA-processing protein 0.453 0.373 0.393 8.1e-19
MGI|MGI:1925831249 Utp23 "UTP23, small subunit (S 0.439 0.361 0.395 1.7e-18
ZFIN|ZDB-GENE-050417-353257 utp23 "UTP23, small subunit (S 0.346 0.276 0.402 2.5e-16
CGD|CAL0001138269 orf19.3724 [Candida albicans ( 0.380 0.289 0.325 5.5e-14
UNIPROTKB|Q59V16269 CaO19.3724 "Putative uncharact 0.380 0.289 0.325 5.5e-14
SGD|S000005530254 UTP23 "Component of the small 0.814 0.657 0.281 5.4e-10
POMBASE|SPCC18.12c260 utp23 "rRNA processing protein 0.751 0.592 0.308 1.7e-09
FB|FBgn0031361244 CG17652 [Drosophila melanogast 0.409 0.344 0.272 2.5e-05
TAIR|locus:2062303 MEE21 "AT2G34570" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 376 (137.4 bits), Expect = 1.1e-34, P = 1.1e-34
 Identities = 84/182 (46%), Positives = 107/182 (58%)

Query:    22 RLGQSHSEAVEAAYKVAIARCEHEKLKSADACLMEVIGEKNPEHFFVATQDVDLRKKLQE 81
             +LG+  +E++EAA  +  A CEHE+ K+AD CL EVIG +N EHFF+ TQD + R+KLQ+
Sbjct:    72 KLGKDFAESLEAAQTLNTATCEHEEAKTADECLSEVIGVQNTEHFFLGTQDAEFRRKLQQ 131

Query:    82 VPGVPLIFGLRNALLLEPPSSFQRKFVKTSEEARSCMXXXXXXXXXXXXXNILETKEIGD 141
                VPL+FGLRN LL++ PS FQR+  K SE  R  M              +  T +I  
Sbjct:   132 ESIVPLVFGLRNILLIDQPSDFQRQSAKDSENKRLTMTDTEKKLL------VKRTAKIIA 185

Query:   142 SSXXXXXXXXXXXXMQADKKTHYARKGMGVKDRPQFKRKRAKAPNPLSCKKKKNHENPST 201
             S+            M     T   + G+GVKDRPQFKR RAK PNPLSC KKK  ENP +
Sbjct:   186 SNRKEATIANEEWGMPRVVST---KNGLGVKDRPQFKRNRAKGPNPLSCMKKKK-ENPQS 241

Query:   202 SS 203
              S
Sbjct:   242 KS 243




GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0009793 "embryo development ending in seed dormancy" evidence=IMP
GO:0009220 "pyrimidine ribonucleotide biosynthetic process" evidence=RCA
UNIPROTKB|Q08DU1 UTP23 "rRNA-processing protein UTP23 homolog" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|Q9BRU9 UTP23 "rRNA-processing protein UTP23 homolog" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
MGI|MGI:1925831 Utp23 "UTP23, small subunit (SSU) processome component, homolog (yeast)" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-050417-353 utp23 "UTP23, small subunit (SSU) processome component, homolog (yeast)" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
CGD|CAL0001138 orf19.3724 [Candida albicans (taxid:5476)] Back     alignment and assigned GO terms
UNIPROTKB|Q59V16 CaO19.3724 "Putative uncharacterized protein" [Candida albicans SC5314 (taxid:237561)] Back     alignment and assigned GO terms
SGD|S000005530 UTP23 "Component of the small subunit processome, involved in 40S ribosomal" [Saccharomyces cerevisiae (taxid:4932)] Back     alignment and assigned GO terms
POMBASE|SPCC18.12c utp23 "rRNA processing protein Utp23 (predicted)" [Schizosaccharomyces pombe (taxid:4896)] Back     alignment and assigned GO terms
FB|FBgn0031361 CG17652 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query205
cd09866147 cd09866, PIN_Fcf1-Utp23-H, PIN domain of rRNA-proc 5e-28
pfam04900101 pfam04900, Fcf1, Fcf1 2e-20
cd08553120 cd08553, PIN_Fcf1-like, PIN domain of rRNA-process 4e-17
cd09865148 cd09865, PIN_Utp23, PIN domain of rRNA-processing 5e-10
cd09864143 cd09864, PIN_Fcf1, PIN domain of rRNA-processing p 3e-05
COG1412136 COG1412, COG1412, Uncharacterized proteins of PilT 5e-04
cd09854125 cd09854, PIN_VapC-Smg6_family, PIN domains of VapC 0.003
>gnl|CDD|189036 cd09866, PIN_Fcf1-Utp23-H, PIN domain of rRNA-processing protein Fcf1- and Utp23-like homologs found in eukaryotes except fungi Back     alignment and domain information
 Score =  102 bits (257), Expect = 5e-28
 Identities = 37/77 (48%), Positives = 53/77 (68%), Gaps = 1/77 (1%)

Query: 23  LGQSHSEAVEAAYKVAIARCEHEKLK-SADACLMEVIGEKNPEHFFVATQDVDLRKKLQE 81
           LG     A+  A +  +A+C HEK   SA  CL  +IG+ NPEH+FVATQD +LR+KL++
Sbjct: 71  LGPDLYGALLIAKRFQVAKCGHEKSPVSASECLKSMIGDNNPEHYFVATQDKELREKLRK 130

Query: 82  VPGVPLIFGLRNALLLE 98
           +PGVPL++   N ++LE
Sbjct: 131 IPGVPLLYLHNNTIVLE 147


PIN domain homologs of Fcf1/Utp24 (FAF1-copurifying factor 1/U three-associated protein 24) and Utp23, essential proteins involved in pre-rRNA processing and 40S ribosomal subunit assembly, are included in this subfamily. Fcf1 is a component of the small subunit (SSU) processome and an essential nucleolar protein required for processing of the 18S pre-rRNA at sites A0-A2. These PIN (PilT N terminus) domains are homologs of flap endonuclease-1 (FEN1)-like PIN domains, but apparently lack the H3TH domain or extensive arch/clamp region seen in the latter. PIN domains typically contain three or four conserved acidic residues (putative metal-binding, active site residues). The Fcf1-Utp23 homolog PIN domain subfamily has three of these conserved acidic residues rather than the four seen in the Fcf1 PIN domain subfamily. Length = 147

>gnl|CDD|218319 pfam04900, Fcf1, Fcf1 Back     alignment and domain information
>gnl|CDD|189021 cd08553, PIN_Fcf1-like, PIN domain of rRNA-processing proteins, Fcf1 (Utp24, YDR339C), Utp23 (YOR004W), and other eukaryotic homologs Back     alignment and domain information
>gnl|CDD|189035 cd09865, PIN_Utp23, PIN domain of rRNA-processing protein, Utp23 (YOR004W), and other fungal homologs Back     alignment and domain information
>gnl|CDD|189034 cd09864, PIN_Fcf1, PIN domain of rRNA-processing protein, Fcf1 (Utp24, YDR339C), and other eukaryotic homologs Back     alignment and domain information
>gnl|CDD|224330 COG1412, COG1412, Uncharacterized proteins of PilT N-term Back     alignment and domain information
>gnl|CDD|189024 cd09854, PIN_VapC-Smg6_family, PIN domains of VapC and Smg6 ribonucleases, ribosome assembly factor NOB1, rRNA-processing protein Fcf1, Archaeoglobus fulgidus AF0591 protein, and homologs Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 205
KOG3164236 consensus Uncharacterized proteins of PilT N-term. 100.0
PF04900101 Fcf1: Fcf1; InterPro: IPR006984 This family is com 99.96
KOG3165195 consensus Predicted nucleic-acid-binding protein, 99.93
COG1412136 Uncharacterized proteins of PilT N-term./Vapc supe 99.83
PRK13764 602 ATPase; Provisional 94.73
PF13638133 PIN_4: PIN domain; PDB: 2HWW_C 2HWX_A 2DOK_B 2HWY_ 92.32
smart00670111 PINc Large family of predicted nucleotide-binding 90.13
PF05991166 NYN_YacP: YacP-like NYN domain; InterPro: IPR01029 80.68
>KOG3164 consensus Uncharacterized proteins of PilT N-term Back     alignment and domain information
Probab=100.00  E-value=2.4e-48  Score=330.32  Aligned_cols=161  Identities=36%  Similarity=0.511  Sum_probs=138.4

Q ss_pred             cceeccchHHHHHHhhccchHHHHHHHhhcceeecCCCCCCCHHHHHHHHHhccCCccEEEEccCHHHHHHhhcCCCCcE
Q 043680            8 LPLLSSSPPSLFLFRLGQSHSEAVEAAYKVAIARCEHEKLKSADACLMEVIGEKNPEHFFVATQDVDLRKKLQEVPGVPL   87 (205)
Q Consensus         8 ~~l~tT~CVl~ELe~LG~~~~~Al~iaK~f~~~kC~H~~~~~a~~CI~~~v~~~N~~~yiVATQD~~Lr~~LR~ipGVPl   87 (205)
                      +-|+||.|||.|||.+|.++.||+.||++|++++|+|.++.+|++||.+||+.+|+|||||||||++||+.||.+|||||
T Consensus        57 vKL~tTqCvikele~~g~~l~ga~~iAK~fe~~~C~H~~~~s~seCl~svv~~~Nk~~YvvATQD~el~~kLr~~pgvPl  136 (236)
T KOG3164|consen   57 VKLMTTQCVIKELEELGKDLYGAKGIAKQFEIRNCNHKDARSPSECLRSVVRISNKHHYVVATQDQELRRKLRKEPGVPL  136 (236)
T ss_pred             CeeeehHHHHHHHHHhCcchhhhHHHHHHHhHhcCCCCCCCCHHHHHHHHHhccCCceEEEecCCHHHHHHHhcCCCCce
Confidence            56899999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             EEecCceEEEeCCChhhHHHHHHHHHhhccCCHHHHHHHhhhhhhhhhhhhcCCCCCchhhHHHHHHHHhhhhhhhhhhc
Q 043680           88 IFGLRNALLLEPPSSFQRKFVKTSEEARSCMTKSEFKKLKKSTKNILETKEIGDSSNKNEELENQKLEMQADKKTHYARK  167 (205)
Q Consensus        88 iyi~~~~~~LE~PS~as~~~~~~~e~~kl~~~~~E~~~lk~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  167 (205)
                      ||+.+++|+||+||++|..+++.+|+++|. +..|.+.++++.....+.            ...|.              
T Consensus       137 i~~~r~t~vld~~S~at~~~sk~se~~~L~-~~~e~e~~~k~~~e~~~l------------~~~E~--------------  189 (236)
T KOG3164|consen  137 IYLKRNTLVLDAPSQATAKYSKDSEEKKLT-SDNEKEIDKKLLEEKGAL------------KGKET--------------  189 (236)
T ss_pred             EEEecceEEecCcchhhHHhhcchhhhhcc-ccchHHHHHHHHHHhhcc------------cchhh--------------
Confidence            999999999999999999999989999994 877877776553321110            11111              


Q ss_pred             cCCCCCCCccccCCCCCCCCCCcccccCCCCCC
Q 043680          168 GMGVKDRPQFKRKRAKAPNPLSCKKKKNHENPS  200 (205)
Q Consensus       168 ~~~~~~~~~~KrK~~KgPNPLS~KKKKkk~~~~  200 (205)
                           ....+|||+|||||||||||+++++..+
T Consensus       190 -----s~~~kk~k~~k~pNpLs~kkk~k~~~~~  217 (236)
T KOG3164|consen  190 -----SNKEKKRKGPKGPNPLSCKKKKKKKSNT  217 (236)
T ss_pred             -----hhhhhcCCCCCCCCCccccccccccccc
Confidence                 0124577799999999999998876654



/Vapc superfamily [General function prediction only]

>PF04900 Fcf1: Fcf1; InterPro: IPR006984 This family is comprises of uncharacterised eukaryotic proteins Back     alignment and domain information
>KOG3165 consensus Predicted nucleic-acid-binding protein, contains PIN domain [General function prediction only] Back     alignment and domain information
>COG1412 Uncharacterized proteins of PilT N-term Back     alignment and domain information
>PRK13764 ATPase; Provisional Back     alignment and domain information
>PF13638 PIN_4: PIN domain; PDB: 2HWW_C 2HWX_A 2DOK_B 2HWY_B 2WP8_J Back     alignment and domain information
>smart00670 PINc Large family of predicted nucleotide-binding domains Back     alignment and domain information
>PF05991 NYN_YacP: YacP-like NYN domain; InterPro: IPR010298 This family consists of several hypothetical bacterial proteins as well as some uncharacterised sequences from Arabidopsis thaliana Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query205
1o4w_A147 PIN (PILT N-terminus) domain; structural genomics, 9e-06
>1o4w_A PIN (PILT N-terminus) domain; structural genomics, joint CEN structural genomics, JCSG, protein structure initiative, PS translation; 1.90A {Archaeoglobus fulgidus} SCOP: c.120.1.1 Length = 147 Back     alignment and structure
 Score = 42.9 bits (101), Expect = 9e-06
 Identities = 16/74 (21%), Positives = 31/74 (41%), Gaps = 8/74 (10%)

Query: 20  LFRLGQSHSEAVEAAYKVAIA---RCEH-EKLKSADACLMEVIGEKNPEHFFVATQDVDL 75
           L +L  S     + A + A+      E  E     D  L+E   +       + T D +L
Sbjct: 65  LEKLEMSLRGKEKVAARFALKLLEHFEVVETESEGDPSLIEAAEKYG---CILITNDKEL 121

Query: 76  RKKLQEVPGVPLIF 89
           ++K ++  G+P+ +
Sbjct: 122 KRKAKQ-RGIPVGY 134


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query205
1o4w_A147 PIN (PILT N-terminus) domain; structural genomics, 98.96
3ix7_A134 Uncharacterized protein TTHA0540; unknown function 97.91
3i8o_A142 KH domain-containing protein MJ1533; APC89320.5, m 96.69
2dok_A186 Telomerase-binding protein EST1A; telomerase-assoc 96.43
2lcq_A165 Putative toxin VAPC6; PIN domain, Zn ribbon domain 84.77
>1o4w_A PIN (PILT N-terminus) domain; structural genomics, joint CEN structural genomics, JCSG, protein structure initiative, PS translation; 1.90A {Archaeoglobus fulgidus} SCOP: c.120.1.1 Back     alignment and structure
Probab=98.96  E-value=1.1e-09  Score=86.20  Aligned_cols=80  Identities=18%  Similarity=0.219  Sum_probs=67.0

Q ss_pred             cceeccchHHHHHHhhcc--------chHHHHHHHhhcceeecCCCCCCCHHHHHHHHHhccCCccEEEEccCHHHHHHh
Q 043680            8 LPLLSSSPPSLFLFRLGQ--------SHSEAVEAAYKVAIARCEHEKLKSADACLMEVIGEKNPEHFFVATQDVDLRKKL   79 (205)
Q Consensus         8 ~~l~tT~CVl~ELe~LG~--------~~~~Al~iaK~f~~~kC~H~~~~~a~~CI~~~v~~~N~~~yiVATQD~~Lr~~L   79 (205)
                      ..++++.||+.||+.+..        ....++.++..++++.|+ . + + |.||.+++...+   ++|+|+|.+|++++
T Consensus        53 ~~~~I~~~vl~EL~~~~~~~~~~~~~~~~~~~~ll~~~~vv~~~-~-~-~-D~~Ila~A~~~~---~~LvT~D~~l~~~a  125 (147)
T 1o4w_A           53 SRFLITASVKRELEKLEMSLRGKEKVAARFALKLLEHFEVVETE-S-E-G-DPSLIEAAEKYG---CILITNDKELKRKA  125 (147)
T ss_dssp             EEEEEEHHHHHHHHHHHHHCCHHHHHHHHHHHHHHTTSEEECCS-S-C-S-HHHHHHHHHHHT---CEEECCCHHHHHHH
T ss_pred             CCEEEcHHHHHHHHHHHhccCcchHHHHHHHHHHHhcCcEeeCC-C-C-c-hHHHHHHHHHhC---CEEEECCHHHHHHH
Confidence            578899999999999874        334567788888888887 2 3 6 999999997653   89999999999999


Q ss_pred             hcCCCCcEEEecCceE
Q 043680           80 QEVPGVPLIFGLRNAL   95 (205)
Q Consensus        80 R~ipGVPliyi~~~~~   95 (205)
                      +.. |||++|++++.+
T Consensus       126 ~~~-Gv~vi~~~~~~~  140 (147)
T 1o4w_A          126 KQR-GIPVGYLKEDKR  140 (147)
T ss_dssp             HHT-TCCEECCCCCCC
T ss_pred             HHC-CCeEEEecCCcE
Confidence            998 999999998764



>3ix7_A Uncharacterized protein TTHA0540; unknown function, thermus thermophilus HB8, structural genom 2, protein structure initiative; HET: MSE; 2.15A {Thermus thermophilus} Back     alignment and structure
>3i8o_A KH domain-containing protein MJ1533; APC89320.5, methanocaldococcus jannaschii DSM 2661, structural genomics, PSI-2; 2.64A {Methanocaldococcus jannaschii} Back     alignment and structure
>2dok_A Telomerase-binding protein EST1A; telomerase-associated protein, unknown function; 1.80A {Homo sapiens} PDB: 2hww_A 2hwx_A Back     alignment and structure
>2lcq_A Putative toxin VAPC6; PIN domain, Zn ribbon domain, ribosome biogenesis, metal BIN protein; NMR {Pyrococcus horikoshii} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 205
d1o4wa_125 c.120.1.1 (A:) Hypothetical protein AF0591 {Archae 5e-09
>d1o4wa_ c.120.1.1 (A:) Hypothetical protein AF0591 {Archaeon Archaeoglobus fulgidus [TaxId: 2234]} Length = 125 Back     information, alignment and structure

class: Alpha and beta proteins (a/b)
fold: PIN domain-like
superfamily: PIN domain-like
family: PIN domain
domain: Hypothetical protein AF0591
species: Archaeon Archaeoglobus fulgidus [TaxId: 2234]
 Score = 50.2 bits (120), Expect = 5e-09
 Identities = 15/71 (21%), Positives = 31/71 (43%), Gaps = 2/71 (2%)

Query: 20  LFRLGQSHSEAVEAAYKVAIARCEH-EKLKSADACLMEVIGEKNPEHFFVATQDVDLRKK 78
           L +L  S     + A + A+   EH E +++       +I         + T D +L++K
Sbjct: 44  LEKLEMSLRGKEKVAARFALKLLEHFEVVETESEGDPSLIEAAEKYGCILITNDKELKRK 103

Query: 79  LQEVPGVPLIF 89
            +   G+P+ +
Sbjct: 104 AK-QRGIPVGY 113


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query205
d1o4wa_125 Hypothetical protein AF0591 {Archaeon Archaeoglobu 99.74
>d1o4wa_ c.120.1.1 (A:) Hypothetical protein AF0591 {Archaeon Archaeoglobus fulgidus [TaxId: 2234]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: PIN domain-like
superfamily: PIN domain-like
family: PIN domain
domain: Hypothetical protein AF0591
species: Archaeon Archaeoglobus fulgidus [TaxId: 2234]
Probab=99.74  E-value=2.8e-19  Score=137.44  Aligned_cols=87  Identities=21%  Similarity=0.242  Sum_probs=76.3

Q ss_pred             cceeccchHHHHHHhhccchHHHHHHHhhcceeecCCC----CCCCHHHHHHHHHhccCCccEEEEccCHHHHHHhhcCC
Q 043680            8 LPLLSSSPPSLFLFRLGQSHSEAVEAAYKVAIARCEHE----KLKSADACLMEVIGEKNPEHFFVATQDVDLRKKLQEVP   83 (205)
Q Consensus         8 ~~l~tT~CVl~ELe~LG~~~~~Al~iaK~f~~~kC~H~----~~~~a~~CI~~~v~~~N~~~yiVATQD~~Lr~~LR~ip   83 (205)
                      ..+++|.||+.||+.|+.++.++..++++|....|+|.    .+.++++||++++..   ++|||||||.+||++++. +
T Consensus        32 ~~~~ip~~Vi~EL~~L~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~Dd~il~~a~~---~~~iv~T~D~~Lr~~~~~-~  107 (125)
T d1o4wa_          32 SRFLITASVKRELEKLEMSLRGKEKVAARFALKLLEHFEVVETESEGDPSLIEAAEK---YGCILITNDKELKRKAKQ-R  107 (125)
T ss_dssp             EEEEEEHHHHHHHHHHHHHCCHHHHHHHHHHHHHHTTSEEECCSSCSHHHHHHHHHH---HTCEEECCCHHHHHHHHH-T
T ss_pred             ceEEecHHHHHHHHHHHhcccHHHHHHHHHHHHHHHhCCCCCCCCcHHHHHHHHHhh---CCeEEEecCHHHHHHHHH-c
Confidence            46799999999999999999999999998888888873    456899999998864   579999999999999997 9


Q ss_pred             CCcEEEecCceE-EEe
Q 043680           84 GVPLIFGLRNAL-LLE   98 (205)
Q Consensus        84 GVPliyi~~~~~-~LE   98 (205)
                      |||+||++++.+ .||
T Consensus       108 GVpvi~l~~~~~l~~~  123 (125)
T d1o4wa_         108 GIPVGYLKEDKRVFVE  123 (125)
T ss_dssp             TCCEECCCCCCCCCCC
T ss_pred             CCcEEEEeCCceEEEE
Confidence            999999999864 455