Citrus Sinensis ID: 015565


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400----
MGNFTVSLSSYGPSEGRCLPCMGVSGRAVAVSSCSSRGDFRILSQVKIGAVKYSSKGASFACRASSGSYRRNPDFSRQNRNGYRGRNRQNEERDSIDDESELLSSRNGPILSLSNSPKFQATSTPGPREKEIVELFRKVQVQLRERAAAREDKKTESMQGQRKESETVDSLLKLLRKHSVEQGKRKSSSGSSNDFSLEQDEPNGSHIEDKSTSFFDSSSRARTESREPNVSSFTRPASNFRRRSPVTQFKYQPVYSSEETENSISRLDSSGKGKKSSLEIHTDPKRESESEPELETEPEASFADGEVYDEASDVESNVITDGSLYDEVLENEPNIIDDSVYRVSEEGESSDNDEDENKQEEIENGDLSALKLPELRALAKSRGVKGFSKMKKAVLLELLAGSSV
cccEEEEccccccccccccccccccccEEccccccccccccccccEEEccccccccccEEEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHccHHHHHcccccccHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccHHHHHHHHHHccccccccccHHHHHHHHHcccc
cccEEEEccccccccccccccccccccEEEEcccccccccccEEEEEEccEEEccccEEEEEEcccccccccccccHHccccccccccccccccccccccccHccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHccccccccHHHHHHHHHHcccccccEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEccEEccccccccccccHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHccccccccccccccccccHHHccHHHHHHHHHHcccccccHccHHHHHHHHccccc
mgnftvslssygpsegrclpcmgvsgrAVAVSscssrgdfrilSQVKIGavkysskgasfacrassgsyrrnpdfsrqnrngyrgrnrqneerdsiddesellssrngpilslsnspkfqatstpgpreKEIVELFRKVQVQLRERAAARedkktesmqgqrkesETVDSLLKLLRKHSveqgkrksssgssndfsleqdepngshiedkstsffdsssrartesrepnvssftrpasnfrrrspvtqfkyqpvysseetensisrldssgkgkkssleihtdpkresesepeletepeasfadgevydeasdvesnvitdgslydevlenepniiddsvyrvseegessdndedenkqeeiengdlsalklPELRALAKSRGVKGFSKMKKAVLLELLAGSSV
MGNFTVSlssygpsegRCLPCMGVSGRAVAVSSCSSRGDFRILSQVKIgavkysskgasfacrassgsyrrnpdfsrqnrngyrgrnrqneerdsiDDESELLSsrngpilslsnspkfqatstpgprekEIVELFRKVQVQLRERaaaredkktesmqgqrkesetvDSLLKLLRKhsveqgkrksssgssndfsleqdepngshiedkstsffdsssrartesrepnvssftrpasnfrrrspvtqfkyqpvysseetensisrldssgkgkkssleihtdpkresesepeletepeasfadgevYDEASDVESNVITdgslydevlenepniiddsvyrVSEEgessdndedenkqeeiengdlsalklpELRALAksrgvkgfskmKKAVLLellagssv
MGNFTVSLSSYGPSEGRCLPCMGVSGRAVAVSSCSSRGDFRILSQVKIGAVKYSSKGASFACRASSGSYRRNPDFSrqnrngyrgrnrqneerdsiddesellssrNGPILSLSNSPKFQATSTPGPREKEIVELFRKVQVQLRERAAAREDKKTESMQGQRKESETVDSLLKLLRKHSVEQGKRKSSSGSSNDFSLEQDEPNGSHIEDKSTSFFDSSSRARTESREPNVSSFTRPASNFRRRSPVTQFKYQPVYSSEETENSISRLDssgkgkkssLEIHTDPKResesepeletepeASFADGEVYDEASDVESNVITDGSLYDEVLENEPNIIDDSVYRVseegessdndedenkqeeienGDLSALKLPELRALAKSRGVKGFSKMKKAVLLELLAGSSV
****************RCLPCMGVSGRAVAVSSCSSRGDFRILSQVKIGAVKYSSK******************************************************************************LF*************************************************************************************************************************************************************************************VITDGSLYDEVLE***NII********************************************************AVLL********
***FTVSLSSYGPSEGRCLPCMGVSGRAVAVSSCSSRGDFRILSQVKIGAVKYSSKGAS*************************************************************************************************************************************************************************************************************************************************************************************************************************PELRALAKSRGVKGFSKMKKAVLLELLAG***
MGNFTVSLSSYGPSEGRCLPCMGVSGRAVAVSSCSSRGDFRILSQVKIGAVKYSSKGASFACRASSGSYRRNPDFSRQNRNGYRGR*****************SSRNGPILSLSNSP**********REKEIVELFRKVQVQLRE**********************VDSLLKLLRK****************************HIEDKSTSF*********************PASNFRRRSPVTQFKYQP***********************************************SFADGEVYDEASDVESNVITDGSLYDEVLENEPNIIDDSVYRV******************IENGDLSALKLPELRALAKSRGVKGFSKMKKAVLLELLAGSSV
*GNFTVSLSSYGPSEGRCLPCMGVSGRAVAVSSCSSRGDFRILSQVKIGAVKYSSKGASFACRAS************************************************************GPREKEIVELFRKVQVQLRERAAA*******************DS*****RKH**********************************************************************FK*********************************************************************************************************************SALKLPELRALAKSRGVKGFSKMKKAVLLELLAG***
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGNFTVSLSSYGPSEGRCLPCMGVSGRAVAVSSCSSRGDFRILSQVKIGAVKYSSKGASFACRASSGSYRRNPDFSRQNRNGYRGRNRQNEERDSIDDESELLSSRNGPILSLSNSPKFQATSTPGPREKEIVELFRKVQVQLRERAAAREDKKTESMQGQRKESETVDSLLKLLRKHSVEQGKRKSSSGSSNDFSLEQDEPNGSHIEDKSTSFFDSSSRARTESREPNVSSFTRPASNFRRRSPVTQFKYQPVYSSEETENSISRLDSSGKGKKSSLEIHTDPKRESESEPELETEPEASFADGEVYDEASDVESNVITDGSLYDEVLENEPNIIDDSVYRVSEEGESSDNDEDENKQEEIENGDLSALKLPELRALAKSRGVKGFSKMKKAVLLELLAGSSV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query404 2.2.26 [Sep-21-2011]
Q94K75401 Rho-N domain-containing p yes no 0.940 0.947 0.466 1e-66
Q8L4E7375 SAP-like protein BP-73 OS yes no 0.883 0.952 0.370 1e-44
>sp|Q94K75|RHON1_ARATH Rho-N domain-containing protein 1, chloroplastic OS=Arabidopsis thaliana GN=RHON1 PE=1 SV=1 Back     alignment and function desciption
 Score =  253 bits (647), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 195/418 (46%), Positives = 251/418 (60%), Gaps = 38/418 (9%)

Query: 2   GNFTVS---LSSYGPSEGRCLPCMGVSGRAVAVSSCSSRGDFRILSQVKIGAVKYSSKGA 58
           G F ++   +  Y  S+ RC     VS R +A+  CSS  D       K G +K     +
Sbjct: 5   GTFHLTSDYVPGYTLSDSRCFFNSAVSRRTLAILPCSSCLDH------KNGRLKSVPNRS 58

Query: 59  SFACRASSGSYRRNPDFSRQNRNGYRGRNRQNEERDSIDDE-SELLSSRNGPILSLSNSP 117
           SF CRASSG YRRNPDFSR N++GYRG NRQ+  R+  D E S++LSSRNGP+ +LS+SP
Sbjct: 59  SFVCRASSGGYRRNPDFSRLNKHGYRGNNRQSGGREDFDIENSDMLSSRNGPLFNLSSSP 118

Query: 118 KFQATSTPGPREKEIVELFRKVQVQLRERAAAREDKKTESM--QGQRKESETVDSLLKLL 175
           KFQATS+PGPREKEIVELFRKVQ QLR RAAA++++K      +GQ KESETVDSLLKLL
Sbjct: 119 KFQATSSPGPREKEIVELFRKVQAQLRARAAAKKEEKKIEEASKGQGKESETVDSLLKLL 178

Query: 176 RKHSVEQGKRKSSSGSSNDFSLEQDEPNGSHI--EDKSTSFFDSSSRARTESREPNVSSF 233
           RKHS EQ KR+ S  SS      Q E  G  +  +D++ +   S       +++ N SSF
Sbjct: 179 RKHSGEQSKRQVSKFSS------QGEVQGDTVDKQDRTGNLVTSG------NKDNNASSF 226

Query: 234 TRPASNFRRRSPVTQFKYQPVYSSEETENSISRLDSSGKGKKSSLEIHTDPKR----ESE 289
           TRP S+FRR+SPV + +  P YSSE T +  S    +   KK ++E+H +P+     E E
Sbjct: 227 TRPTSSFRRKSPVPRSQSPPAYSSEATFDQSSSYSVTWTQKKDTVELHDEPEHEPAYEHE 286

Query: 290 SEPELETEPEASFA----DGEVYDEASDVESNVITDGSLYDEVLENEPNIIDDSVYRVSE 345
            EPE E+EP         D E+  E+S        D   +D VL  +  I+D       +
Sbjct: 287 HEPENESEPGPVTTMLEPDSELKPESSSFYQEEEDDDVTFD-VLSQDDGILD---VLSDD 342

Query: 346 EGESSDNDEDENKQEEIENGDLSALKLPELRALAKSRGVKGFSKMKKAVLLELLAGSS 403
           +    D DED ++ EE    DLS LKL ELR +AKSRG+KG SKMKKA L+ELL   S
Sbjct: 343 DESLDDADEDSDEAEEEAVKDLSELKLVELRGIAKSRGLKGLSKMKKAELVELLGSDS 400




Binds to and supports processing of specific plastid RNAs. Associates via its C-terminal Rho-N domain to single stranded regions of 16S and 23S rRNAs or to rbcL mRNAs. May be involved in targeting transcripts to RNases such as RNE or RNase J.
Arabidopsis thaliana (taxid: 3702)
>sp|Q8L4E7|BP73_ORYSJ SAP-like protein BP-73 OS=Oryza sativa subsp. japonica GN=BP-73 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query404
224066243389 predicted protein [Populus trichocarpa] 0.898 0.933 0.566 1e-113
255582495385 conserved hypothetical protein [Ricinus 0.903 0.948 0.559 1e-109
297744156351 unnamed protein product [Vitis vinifera] 0.841 0.968 0.566 1e-104
225438113377 PREDICTED: SAP-like protein BP-73-like [ 0.891 0.954 0.580 1e-103
449468556405 PREDICTED: SAP-like protein BP-73-like i 0.918 0.916 0.476 1e-83
224082822389 predicted protein [Populus trichocarpa] 0.858 0.892 0.480 2e-83
449468554410 PREDICTED: SAP-like protein BP-73-like i 0.900 0.887 0.479 6e-83
356537005374 PREDICTED: SAP-like protein BP-73-like [ 0.868 0.938 0.455 4e-76
449525124330 PREDICTED: SAP-like protein BP-73-like, 0.710 0.869 0.510 9e-71
18390623401 Rho termination factor [Arabidopsis thal 0.940 0.947 0.466 9e-65
>gi|224066243|ref|XP_002302043.1| predicted protein [Populus trichocarpa] gi|222843769|gb|EEE81316.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 237/418 (56%), Positives = 291/418 (69%), Gaps = 55/418 (13%)

Query: 5   TVSLSSYGPSEGRCLPCMGVSGRAVAVSSCSSRGDFRILSQVKIGAVKYSSKGASFACRA 64
           T +L  YG SEGRCLPC G+SGRAVAVS CSSRG   + SQ + G +K SS+ +S  C+ 
Sbjct: 9   TKNLPGYGSSEGRCLPCSGISGRAVAVSPCSSRGVHYVHSQAQFGKMKCSSRASSIVCK- 67

Query: 65  SSGSYRRNPDFSRQNRNGY-RGRNRQNEERDSID--DESELLSSRNGPILSLSNSPKFQA 121
           SSG +RRNPDFSRQN+ G+ R RNRQNEE +  +  DES+LL+S+NGP+LSLS +PKFQA
Sbjct: 68  SSGGHRRNPDFSRQNKQGFSRNRNRQNEEGNGFENLDESDLLTSKNGPLLSLSGTPKFQA 127

Query: 122 TSTPGPREKEIVELFRKVQVQLRERAAAREDKKTESMQGQRKESETVDSLLKLLRKHSVE 181
           T+ PGPREKEIVELFRKVQ QLRERAA +E+KK E+ QG+ +E+ETVDSLLKLLRKHSVE
Sbjct: 128 TAAPGPREKEIVELFRKVQAQLRERAAVKEEKKVEASQGKGRENETVDSLLKLLRKHSVE 187

Query: 182 QGKRKSSSGSSNDFSLEQDEPNGSHIEDKSTSFFDSSSRARTESREPNVSSFTRPASNFR 241
           QGK+K+S+ SS D SL+Q E NG++ + K TSFFDSS + R +  EP ++SFTRP SNFR
Sbjct: 188 QGKKKTSNISSGDLSLDQPE-NGTYKKAKGTSFFDSSKKERNDVLEP-ITSFTRPPSNFR 245

Query: 242 RRSPVTQFKYQPVYSSEETENSISRLDSSGKGKKSSLEIHTDPKRESESEPELETEPE-- 299
           R+SPV Q K+QP+YSSE+  NS S L+ +G+ KK   EI  D  +E E +PEL+ E E  
Sbjct: 246 RKSPVPQVKFQPIYSSEDPVNSTSHLNLNGE-KKQQFEILPDTTQELELDPELDAEEEHE 304

Query: 300 ----------ASFADGEVYDEASDVESNVITDGSLYDEVLENEPNIIDDSVYRVSEEGES 349
                      SFA G+V+DE S                                 EGES
Sbjct: 305 LDSEPEPEPEPSFAGGDVFDELS---------------------------------EGES 331

Query: 350 SDNDE---DENKQEEIENGDLSALKLPELRALAKSRGVKGFSKMKKAVLLELLAGSSV 404
           SD D    D  KQ+ IE+ DLS+LKLPELRALAKSRGVKGFSKMKK  L+ELL+GSS+
Sbjct: 332 SDMDNVDGDGEKQQLIEHEDLSSLKLPELRALAKSRGVKGFSKMKKGELVELLSGSSM 389




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255582495|ref|XP_002532033.1| conserved hypothetical protein [Ricinus communis] gi|223528303|gb|EEF30349.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|297744156|emb|CBI37126.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|225438113|ref|XP_002277984.1| PREDICTED: SAP-like protein BP-73-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|449468556|ref|XP_004151987.1| PREDICTED: SAP-like protein BP-73-like isoform 2 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224082822|ref|XP_002306853.1| predicted protein [Populus trichocarpa] gi|222856302|gb|EEE93849.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449468554|ref|XP_004151986.1| PREDICTED: SAP-like protein BP-73-like isoform 1 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356537005|ref|XP_003537022.1| PREDICTED: SAP-like protein BP-73-like [Glycine max] Back     alignment and taxonomy information
>gi|449525124|ref|XP_004169569.1| PREDICTED: SAP-like protein BP-73-like, partial [Cucumis sativus] Back     alignment and taxonomy information
>gi|18390623|ref|NP_563761.1| Rho termination factor [Arabidopsis thaliana] gi|13877951|gb|AAK44053.1|AF370238_1 unknown protein [Arabidopsis thaliana] gi|17065622|gb|AAL33805.1| unknown protein [Arabidopsis thaliana] gi|332189835|gb|AEE27956.1| Rho termination factor [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query404
TAIR|locus:2038560401 RHON1 "Rho-N 1" [Arabidopsis t 0.943 0.950 0.400 1.1e-55
TAIR|locus:2055846309 AT2G31150 [Arabidopsis thalian 0.368 0.482 0.481 1.5e-26
TAIR|locus:2124127245 AT4G18740 [Arabidopsis thalian 0.089 0.146 0.5 2.8e-05
TAIR|locus:2038560 RHON1 "Rho-N 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 574 (207.1 bits), Expect = 1.1e-55, P = 1.1e-55
 Identities = 167/417 (40%), Positives = 208/417 (49%)

Query:     2 GNFTVSLSSYGP----SEGRCLPCMGVSGRAVAVSSCSSRGDFRILSQVKIGAVKYSSKG 57
             G F ++ S Y P    S+ RC     VS R +A+  CSS  D       K G +K     
Sbjct:     5 GTFHLT-SDYVPGYTLSDSRCFFNSAVSRRTLAILPCSSCLDH------KNGRLKSVPNR 57

Query:    58 ASFACRASSGSYRRNPDFSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-NGPILSLSNS 116
             +SF CRASSG YRRNPDFS                               NGP+ +LS+S
Sbjct:    58 SSFVCRASSGGYRRNPDFSRLNKHGYRGNNRQSGGREDFDIENSDMLSSRNGPLFNLSSS 117

Query:   117 PKFQATSTPGPREKEIVELFRKVQVQLRERAAAR-EDKKTE-SMQGQRKESETVDSLLKL 174
             PKFQATS+PGPREKEIVELFRKVQ QLR RAAA+ E+KK E + +GQ KESETVDSLLKL
Sbjct:   118 PKFQATSSPGPREKEIVELFRKVQAQLRARAAAKKEEKKIEEASKGQGKESETVDSLLKL 177

Query:   175 LRKHSVEQGKRKSSSGSSNDFSLEQDEPNGSHIEDKSTSFFDSSSRARTESREPNVSSFT 234
             LRKHS EQ KR+ S  SS      Q E  G  + DK      + +   + +++ N SSFT
Sbjct:   178 LRKHSGEQSKRQVSKFSS------QGEVQGDTV-DKQDR---TGNLVTSGNKDNNASSFT 227

Query:   235 RPASNFRRRSPVTQFKYQPVYSSEETENSISRLDXXXXXXXXXLEIHTDPKRXXXXXXXX 294
             RP S+FRR+SPV + +  P YSSE T +  S            +E+H +P+         
Sbjct:   228 RPTSSFRRKSPVPRSQSPPAYSSEATFDQSSSYSVTWTQKKDTVELHDEPEHEPAYEHEH 287

Query:   295 XXXXXASFA--------DGEVYDEASDVESNVITDGSLYDEVLENEPNIIDDSVYRVXXX 346
                  +           D E+  E+S        D   +D VL  +  I+D         
Sbjct:   288 EPENESEPGPVTTMLEPDSELKPESSSFYQEEEDDDVTFD-VLSQDDGILD---VLSDDD 343

Query:   347 XXXXXXXXXXXXXXXXXXGDLSALKLPELRALAKSRGVKGFSKMKKAVLLELLAGSS 403
                                DLS LKL ELR +AKSRG+KG SKMKKA L+ELL   S
Sbjct:   344 ESLDDADEDSDEAEEEAVKDLSELKLVELRGIAKSRGLKGLSKMKKAELVELLGSDS 400




GO:0005634 "nucleus" evidence=ISM
GO:0009507 "chloroplast" evidence=IMP;IDA
GO:0009737 "response to abscisic acid stimulus" evidence=IDA
GO:0009793 "embryo development ending in seed dormancy" evidence=RCA
GO:0016226 "iron-sulfur cluster assembly" evidence=RCA
GO:0042793 "transcription from plastid promoter" evidence=RCA
GO:0045893 "positive regulation of transcription, DNA-dependent" evidence=RCA
GO:0005515 "protein binding" evidence=IPI
GO:0010239 "chloroplast mRNA processing" evidence=IMP
GO:0019843 "rRNA binding" evidence=IDA
GO:0043621 "protein self-association" evidence=IPI
GO:1901259 "chloroplast rRNA processing" evidence=IMP
TAIR|locus:2055846 AT2G31150 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2124127 AT4G18740 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q94K75RHON1_ARATHNo assigned EC number0.46650.94050.9476yesno
Q8L4E7BP73_ORYSJNo assigned EC number0.37090.88360.952yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query404
PRK12678 672 PRK12678, PRK12678, transcription termination fact 5e-06
smart0095943 smart00959, Rho_N, Rho termination factor, N-termi 2e-05
pfam0749843 pfam07498, Rho_N, Rho termination factor, N-termin 4e-05
PRK14537230 PRK14537, PRK14537, 50S ribosomal protein L20/unkn 0.004
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho; Provisional Back     alignment and domain information
 Score = 48.4 bits (116), Expect = 5e-06
 Identities = 19/33 (57%), Positives = 24/33 (72%)

Query: 365 GDLSALKLPELRALAKSRGVKGFSKMKKAVLLE 397
           G L+ +KLPELRALAK  G+KG S M+K  L+ 
Sbjct: 21  GGLAGMKLPELRALAKQLGIKGTSGMRKGELIA 53


Length = 672

>gnl|CDD|198027 smart00959, Rho_N, Rho termination factor, N-terminal domain Back     alignment and domain information
>gnl|CDD|203652 pfam07498, Rho_N, Rho termination factor, N-terminal domain Back     alignment and domain information
>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion protein; Provisional Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 404
PF0749843 Rho_N: Rho termination factor, N-terminal domain; 98.77
PRK12678 672 transcription termination factor Rho; Provisional 98.33
PRK14537230 50S ribosomal protein L20/unknown domain fusion pr 98.08
PF0203735 SAP: SAP domain; InterPro: IPR003034 The SAP (afte 97.1
PF1294935 HeH: HeH/LEM domain; PDB: 2OUT_A. 96.93
smart0051335 SAP Putative DNA-binding (bihelical) motif predict 96.13
TIGR00578584 ku70 ATP-dependent DNA helicase ii, 70 kDa subunit 93.44
PF1028138 Ish1: Putative stress-responsive nuclear envelope 90.77
>PF07498 Rho_N: Rho termination factor, N-terminal domain; InterPro: IPR011112 The Rho termination factor disengages newly transcribed RNA from its DNA template at certain, specific transcripts Back     alignment and domain information
Probab=98.77  E-value=3e-09  Score=76.73  Aligned_cols=34  Identities=41%  Similarity=0.692  Sum_probs=29.5

Q ss_pred             CCCCCHHHHHHHHHHcCCCCCCCCCHHHHHHHHh
Q 015565          367 LSALKLPELRALAKSRGVKGFSKMKKAVLLELLA  400 (404)
Q Consensus       367 Ls~MtvaELK~LAKe~GIKGySkMKKaELIeaLq  400 (404)
                      |..|++.||++||+++||+||++|+|.|||.+|.
T Consensus         2 L~~~~~~eL~~iAk~lgI~~~~~~~K~eLI~~Il   35 (43)
T PF07498_consen    2 LKSMTLSELREIAKELGIEGYSKMRKQELIFAIL   35 (43)
T ss_dssp             HHCS-HHHHHHHHHCTT-TTGCCS-HHHHHHHHH
T ss_pred             cccCCHHHHHHHHHHcCCCCCCcCCHHHHHHHHH
Confidence            6789999999999999999999999999999984



It is thought that two copies of Rho bind to RNA and that Rho functions as a hexamer of protomers []. This domain is found to the N terminus of the RNA binding domain (IPR011113 from INTERPRO).; GO: 0006353 transcription termination, DNA-dependent; PDB: 1A8V_B 1PVO_A 1PV4_D 3ICE_A 1XPU_C 1XPO_D 1XPR_F 2A8V_B 2HT1_B 1A63_A ....

>PRK12678 transcription termination factor Rho; Provisional Back     alignment and domain information
>PRK14537 50S ribosomal protein L20/unknown domain fusion protein; Provisional Back     alignment and domain information
>PF02037 SAP: SAP domain; InterPro: IPR003034 The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins involved in chromosomal organisation [], including in apoptosis [] Back     alignment and domain information
>PF12949 HeH: HeH/LEM domain; PDB: 2OUT_A Back     alignment and domain information
>smart00513 SAP Putative DNA-binding (bihelical) motif predicted to be involved in chromosomal organisation Back     alignment and domain information
>TIGR00578 ku70 ATP-dependent DNA helicase ii, 70 kDa subunit (ku70) Back     alignment and domain information
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query404
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-10
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 61.8 bits (149), Expect = 2e-10
 Identities = 55/387 (14%), Positives = 110/387 (28%), Gaps = 106/387 (27%)

Query: 55  SKGASFACRASSGSYRRNPDF-----SRQNRNGYRGRNRQNEERDSIDDESELLSSRNGP 109
           SK      +      R N  F       + R          E+RD + +++++ +  N  
Sbjct: 73  SKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMTRMYIEQRDRLYNDNQVFAKYNVS 132

Query: 110 ILSLSNSPKFQATSTPGPREKEIVELFRKVQVQLRER------------------AAARE 151
                             R +  ++L R+  ++LR                         
Sbjct: 133 ------------------RLQPYLKL-RQALLELRPAKNVLIDGVLGSGKTWVALDVCLS 173

Query: 152 DKKTESMQGQ-----RKESETVDSLLKLLRK--HSVEQGKRKSSSGSSNDFSLEQDEPNG 204
            K    M  +      K   + +++L++L+K  + ++      S  SSN   L       
Sbjct: 174 YKVQCKMDFKIFWLNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNI-KLRIHS--- 229

Query: 205 SHIEDKSTSFFDSSSRART-----ESREPNV-SSFTRPASNFRRRSPVTQFKYQPVYSSE 258
             I+ +      S             +     ++F     N   +  +T           
Sbjct: 230 --IQAELRRLLKSKPYENCLLVLLNVQNAKAWNAF-----NLSCKILLT----------- 271

Query: 259 ETEN-SISRLDSSGKGKKSSLEIHTDPKRESESEPELETEPEASFADGEVYD---EASDV 314
            T    ++   S+      SL+ H+      E +  L       + D    D   E    
Sbjct: 272 -TRFKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLL-----LKYLDCRPQDLPREVLTT 325

Query: 315 ESNVITDGSLYDEVLENEPNIIDDSVYRVSEEGESSDNDEDENKQEEIENGDLSALKLPE 374
               +   S+  E + +     D+  ++            + +K   I    L+ L+  E
Sbjct: 326 NPRRL---SIIAESIRDGLATWDN--WK----------HVNCDKLTTIIESSLNVLEPAE 370

Query: 375 LRALAKSRGVKGFSKMKK--AVLLELL 399
            R +     V  F        +LL L+
Sbjct: 371 YRKMFDRLSV--FPPSAHIPTILLSLI 395


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query404
2out_A131 MU-like prophage flumu protein GP35, protein HI150 98.17
2do1_A55 Nuclear protein HCC-1; SAP domain, structural geno 96.9
1h1j_S51 THO1 protein; SAP domain, DNA binding; NMR {Saccha 96.76
1zrj_A50 E1B-55KDA-associated protein 5 isoform C; SAP doma 96.54
2kvu_A75 MKL/myocardin-like protein 1; SAP motif, DNA/RNA b 96.33
1jjr_A151 KU70, thyroid autoantigen; DNA repair protein, pro 95.52
2rnn_A114 E3 SUMO-protein ligase SIZ1; SUMO ligase, DNA bind 95.15
1jjr_A151 KU70, thyroid autoantigen; DNA repair protein, pro 95.0
1jey_A609 KU70; double-strand DNA break repair, non-homologo 89.93
2ld7_A94 Histone deacetylase complex subunit SAP30; transcr 86.74
>2out_A MU-like prophage flumu protein GP35, protein HI1507 in MU-like prophage flumu region...; structural genomics, hypothetical protein; NMR {Haemophilus influenzae} SCOP: a.140.3.2 d.344.1.1 Back     alignment and structure
Probab=98.17  E-value=1.4e-06  Score=75.90  Aligned_cols=39  Identities=28%  Similarity=0.393  Sum_probs=36.5

Q ss_pred             cCCCCCCCHHHHHHHHHHcCCCCCCCCCHHHHHHHHhcC
Q 015565          364 NGDLSALKLPELRALAKSRGVKGFSKMKKAVLLELLAGS  402 (404)
Q Consensus       364 ~~dLs~MtvaELK~LAKe~GIKGySkMKKaELIeaLq~~  402 (404)
                      ..+|..|||+|||++++++||+++++|+|.|||++|...
T Consensus        86 ~~~l~~lTV~ELKa~Lde~gIe~~ssakKaELiaLL~~a  124 (131)
T 2out_A           86 STDLNTFTVEQLKAQLTERGITFKQSATKAELIALFAPA  124 (131)
T ss_dssp             CCSSTTCCSHHHHHHHHHHTCCCSSSCCSHHHHHHCCSC
T ss_pred             cCccccccHHHHHHHHHHcCCcccccCCHHHHHHHHHHh
Confidence            468999999999999999999999999999999999763



>2do1_A Nuclear protein HCC-1; SAP domain, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} SCOP: a.140.2.1 Back     alignment and structure
>1h1j_S THO1 protein; SAP domain, DNA binding; NMR {Saccharomyces cerevisiae} SCOP: a.140.2.1 PDB: 2wqg_A Back     alignment and structure
>1zrj_A E1B-55KDA-associated protein 5 isoform C; SAP domain, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} SCOP: a.140.2.1 Back     alignment and structure
>2kvu_A MKL/myocardin-like protein 1; SAP motif, DNA/RNA binding, structural genomics, northeast structural genomics consortium (NESG), PSI-2; NMR {Homo sapiens} PDB: 2kw9_A Back     alignment and structure
>2rnn_A E3 SUMO-protein ligase SIZ1; SUMO ligase, DNA binding, sumoylation, metal-binding, nucLeu phosphoprotein, UBL conjugation pathway; NMR {Saccharomyces cerevisiae} Back     alignment and structure
>1jey_A KU70; double-strand DNA break repair, non-homologous END-joining, protein/nucleic acid complex, alpha/beta domain, beta barrel; HET: DNA; 2.50A {Homo sapiens} SCOP: b.131.1.1 c.62.1.3 PDB: 1jeq_A* 3rzx_B Back     alignment and structure
>2ld7_A Histone deacetylase complex subunit SAP30; transcription; NMR {Mus musculus} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 404
d1jeqa151 a.140.2.1 (A:559-609) DNA binding C-terminal domai 8e-04
>d1jeqa1 a.140.2.1 (A:559-609) DNA binding C-terminal domain of ku70 {Human (Homo sapiens) [TaxId: 9606]} Length = 51 Back     information, alignment and structure

class: All alpha proteins
fold: LEM/SAP HeH motif
superfamily: SAP domain
family: SAP domain
domain: DNA binding C-terminal domain of ku70
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 35.0 bits (81), Expect = 8e-04
 Identities = 15/45 (33%), Positives = 23/45 (51%), Gaps = 2/45 (4%)

Query: 355 DENKQEEIENGDLSALKLPELRALAKSRGVKGFSKMKKAVLLELL 399
           +E  +  I  G L    +P L+   ++ G+K  S +KK  LLE L
Sbjct: 3   EEELKTHISKGTLGKFTVPMLKEACRAYGLK--SGLKKQELLEAL 45


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query404
d1a62a147 Rho termination factor, N-terminal domain {Escheri 98.14
d1jeqa151 DNA binding C-terminal domain of ku70 {Human (Homo 97.66
d1h1js_44 S/mar DNA-binding protein Tho1 {Baker's yeast (Sac 97.57
d1zrja137 Heterogeneous nuclear ribonucleoprotein U-like pro 96.8
d2do1a142 Nuclear protein hcc-1 {Human (Homo sapiens) [TaxId 89.77
>d1a62a1 a.140.3.1 (A:1-47) Rho termination factor, N-terminal domain {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
class: All alpha proteins
fold: LEM/SAP HeH motif
superfamily: Rho N-terminal domain-like
family: Rho termination factor, N-terminal domain
domain: Rho termination factor, N-terminal domain
species: Escherichia coli [TaxId: 562]
Probab=98.14  E-value=5.9e-07  Score=64.45  Aligned_cols=35  Identities=20%  Similarity=0.406  Sum_probs=32.8

Q ss_pred             CCCCCCHHHHHHHHHHcCCCCCCCCCHHHHHHHHh
Q 015565          366 DLSALKLPELRALAKSRGVKGFSKMKKAVLLELLA  400 (404)
Q Consensus       366 dLs~MtvaELK~LAKe~GIKGySkMKKaELIeaLq  400 (404)
                      +|..++++||.+||+++||++|++|+|.|||-+|-
T Consensus         5 eLk~k~~~EL~~iA~~lgIen~~~lrKQeLif~IL   39 (47)
T d1a62a1           5 ELKNTPVSELITLGENMGLENLARMRKQDIIFAIL   39 (47)
T ss_dssp             HHHTSCHHHHHHHHHTTTCCCCTTSCHHHHHHHHH
T ss_pred             HHhcCCHHHHHHHHHHcCCCCcccccHHHHHHHHH
Confidence            47889999999999999999999999999999883



>d1jeqa1 a.140.2.1 (A:559-609) DNA binding C-terminal domain of ku70 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1h1js_ a.140.2.1 (S:) S/mar DNA-binding protein Tho1 {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1zrja1 a.140.2.1 (A:1-37) Heterogeneous nuclear ribonucleoprotein U-like protein 1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2do1a1 a.140.2.1 (A:5-46) Nuclear protein hcc-1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure