Citrus Sinensis ID: 012498
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
| GI | Length | Definition | Q cover | H cover | Identity | E-value |
|---|---|---|---|---|---|---|
| Query | 462 | | | | | |
| 296083035 | 673 | unnamed protein product [Vitis vinifera] | 0.976 | 0.670 | 0.684 | 1e-160 | |
| 359475478 | 721 | PREDICTED: uncharacterized protein LOC10 | 0.974 | 0.624 | 0.683 | 1e-159 | |
| 255583327 | 658 | conserved hypothetical protein [Ricinus | 0.965 | 0.677 | 0.696 | 1e-159 | |
| 356495819 | 653 | PREDICTED: uncharacterized protein LOC10 | 0.956 | 0.676 | 0.651 | 1e-145 | |
| 238479244 | 652 | myosin heavy chain-related protein [Arab | 0.976 | 0.691 | 0.604 | 1e-138 | |
| 297831932 | 629 | predicted protein [Arabidopsis lyrata su | 0.969 | 0.712 | 0.608 | 1e-138 | |
| 15225987 | 629 | myosin heavy chain-related protein [Arab | 0.967 | 0.710 | 0.607 | 1e-137 | |
| 357480651 | 586 | hypothetical protein MTR_5g005010 [Medic | 0.987 | 0.778 | 0.591 | 1e-137 | |
| 449438615 | 641 | PREDICTED: uncharacterized protein LOC10 | 0.930 | 0.670 | 0.610 | 1e-133 | |
| 449479020 | 641 | PREDICTED: uncharacterized LOC101211967 | 0.932 | 0.672 | 0.607 | 1e-132 |
| >gi|296083035|emb|CBI22439.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Score = 570 bits (1468), Expect = e-160, Method: Compositional matrix adjust.
Identities = 312/456 (68%), Positives = 365/456 (80%), Gaps = 5/456 (1%)
Query: 7 EKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEI 66
EK N++E+L+ARIQQLEHERDELRKDIEQLC+QQAGPSYL VATRMHFQRTAGLEQEIE
Sbjct: 3 EKSNDNESLIARIQQLEHERDELRKDIEQLCIQQAGPSYLVVATRMHFQRTAGLEQEIEN 62
Query: 67 LKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAE 126
LK+K+AA REN NLQEELSEAYRIK QLADLH AEV KN++AEKQ+KFFQGC+AAAFAE
Sbjct: 63 LKKKVAASTRENLNLQEELSEAYRIKSQLADLHNAEVSKNLDAEKQIKFFQGCVAAAFAE 122
Query: 127 RDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186
RD+S+MEAEKAKEKEEL Q+FN FQ R++EL S+N++ K+ N L+ DL QE+ NE+F
Sbjct: 123 RDHSIMEAEKAKEKEELALQEFNNFQKRIKELESDNLKQKQLNVALQIDLANQEDQNETF 182
Query: 187 KEVINKFYEIRQQSLEVLE-TSWEDKCACLLLDSAEMWSFN---DTSTSKYISALEDELE 242
K+VINKF+EIRQ SLE E +W+DKC CLL D EMWSFN +TSTSKYI ALE+ELE
Sbjct: 183 KKVINKFFEIRQYSLEAFEDANWDDKCGCLLSDPVEMWSFNTNEETSTSKYIDALEEELE 242
Query: 243 KTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVV 302
R+SV+NLQ+KLR+GLEIENHLKK V ELEK+ I S + N I+ L HSQ R+HVV
Sbjct: 243 MVRNSVDNLQNKLRVGLEIENHLKKKVSELEKQKIISHQMFKNRISGLLHYHSQHRLHVV 302
Query: 303 NSLEEGRSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNL 362
N L+EG+SH+KSI DV+EEK + D R QN QR+ KL E ECRDVH++ D N
Sbjct: 303 NLLDEGKSHLKSIIDVVEEKIRQL-DADREQNLEPPQRDLKLYENECRDVHVSIVGDHNS 361
Query: 363 VSQRNDPAYCDIEADRKGEASETLAQALQEKVAALLLLSQQEERHLLERNVNSALQKKIE 422
V++ N P + D G+ASE AQA+QEKVAALLLLSQQEERHLLE NVN LQKK+E
Sbjct: 362 VAESNIPGLKNNVIDGMGDASEAFAQAMQEKVAALLLLSQQEERHLLESNVNIVLQKKME 421
Query: 423 ELQRNLFQVTTEKVKALMELAQLKQDYQLLQEYDSY 458
ELQRNL QVT EKVKALMELAQLKQ+YQLLQE S+
Sbjct: 422 ELQRNLLQVTNEKVKALMELAQLKQEYQLLQEKISH 457
|
Source: Vitis vinifera Species: Vitis vinifera Genus: Vitis Family: Vitaceae Order: Vitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
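The fractions in the hit table can be reproduced from the statistics line of the alignment above. The sketch below is not the report's own pipeline: the function names are hypothetical, and the coverage formula (alignment columns minus gap columns, divided by sequence length) is an assumption chosen because it matches the values listed for the Vitis vinifera hit.

```python
import re

# Statistics line copied from the alignment of hit gi|296083035.
STATS = "Identities = 312/456 (68%), Positives = 365/456 (80%), Gaps = 5/456 (1%)"


def parse_stats(line):
    """Return {'Identities': (312, 456), ...} from a BLAST statistics line."""
    return {
        name: (int(num), int(den))
        for name, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line)
    }


def summarize(line, query_len, hit_len):
    """Compute identity and coverage fractions for one alignment."""
    stats = parse_stats(line)
    identical, aln_len = stats["Identities"]
    gaps = stats.get("Gaps", (0, aln_len))[0]
    # Assumption: coverage counts only the non-gap alignment columns.
    ungapped = aln_len - gaps
    return {
        "identity": round(identical / aln_len, 3),
        "q_cover": round(ungapped / query_len, 3),
        "h_cover": round(ungapped / hit_len, 3),
    }


# Query length 462 and hit length 673 are taken from the hit table.
summary = summarize(STATS, query_len=462, hit_len=673)
print(summary)
```

Under that assumption the result agrees with the table row for this hit (Q cover 0.976, H cover 0.670, Identity 0.684).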
| >gi|359475478|ref|XP_002264967.2| PREDICTED: uncharacterized protein LOC100251573 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255583327|ref|XP_002532426.1| conserved hypothetical protein [Ricinus communis] gi|223527875|gb|EEF29967.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|356495819|ref|XP_003516769.1| PREDICTED: uncharacterized protein LOC100775191 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|238479244|ref|NP_001154517.1| myosin heavy chain-related protein [Arabidopsis thaliana] gi|330251226|gb|AEC06320.1| myosin heavy chain-related protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|297831932|ref|XP_002883848.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297329688|gb|EFH60107.1| predicted protein [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|15225987|ref|NP_179075.1| myosin heavy chain-related protein [Arabidopsis thaliana] gi|334184229|ref|NP_001189526.1| myosin heavy chain-related protein [Arabidopsis thaliana] gi|3810592|gb|AAC69374.1| unknown protein [Arabidopsis thaliana] gi|330251225|gb|AEC06319.1| myosin heavy chain-related protein [Arabidopsis thaliana] gi|330251227|gb|AEC06321.1| myosin heavy chain-related protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|357480651|ref|XP_003610611.1| hypothetical protein MTR_5g005010 [Medicago truncatula] gi|355511946|gb|AES93569.1| hypothetical protein MTR_5g005010 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|449438615|ref|XP_004137083.1| PREDICTED: uncharacterized protein LOC101211967 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|449479020|ref|XP_004155483.1| PREDICTED: uncharacterized LOC101211967 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
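Several of the definition lines above bundle more than one database record for the same protein. A minimal sketch of splitting such a line into its records follows; it assumes every record has the form `gi|number|database|accession| description [organism]`, and the names `DEFLINE` and `parse_defline` are illustrative only.

```python
import re

# One record: gi|<number>|<db>|<accession>| <description> [<organism>]
DEFLINE = re.compile(r"gi\|(\d+)\|(\w+)\|([^|]+)\|\s*(.*?)\s*\[([^\]]+)\]")


def parse_defline(line):
    """Return one dict per gi record found in an NCBI-style definition line."""
    return [
        {
            "gi": gi,
            "db": db,
            "accession": accession,
            "description": description,
            "organism": organism,
        }
        for gi, db, accession, description, organism in DEFLINE.findall(line)
    ]


# Definition line of the Ricinus communis hit, copied from the list above.
line = (
    ">gi|255583327|ref|XP_002532426.1| conserved hypothetical protein "
    "[Ricinus communis] gi|223527875|gb|EEF29967.1| conserved hypothetical "
    "protein [Ricinus communis]"
)
records = parse_defline(line)
for record in records:
    print(record["gi"], record["db"], record["accession"], record["organism"])
```

Records whose description itself contains square brackets would need a stricter pattern; none of the lines in this report do.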
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
| ID | Length | Definition | Q cover | H cover | Identity | E-value |
|---|---|---|---|---|---|---|
| Query | 462 | | | | | |
| TAIR|locus:2064247 | 652 | MEE13 "AT2G14680" [Arabidopsis | 0.954 | 0.676 | 0.594 | 3.6e-128 | |
| UNIPROTKB|F1N1F8 | 3077 | CENPF "Uncharacterized protein | 0.880 | 0.132 | 0.204 | 1.1e-10 | |
| UNIPROTKB|F1PRS3 | 1869 | CENPF "Uncharacterized protein | 0.794 | 0.196 | 0.222 | 3.5e-09 | |
| UNIPROTKB|F1NNI6 | 3103 | LOC100858793 "Uncharacterized | 0.904 | 0.134 | 0.215 | 1e-08 | |
| UNIPROTKB|P49454 | 3210 | CENPF "Centromere protein F" [ | 0.880 | 0.126 | 0.221 | 1.3e-08 | |
| UNIPROTKB|F1NDQ5 | 1891 | CCDC88A "Uncharacterized prote | 0.924 | 0.225 | 0.215 | 1.6e-08 | |
| UNIPROTKB|F8W7S5 | 751 | RRBP1 "Ribosome-binding protei | 0.803 | 0.494 | 0.254 | 1.8e-08 | |
| UNIPROTKB|Q02224 | 2701 | CENPE "Centromere-associated p | 0.865 | 0.148 | 0.205 | 2.4e-08 | |
| ZFIN|ZDB-GENE-041111-270 | 1398 | eea1 "early endosome antigen 1 | 0.911 | 0.301 | 0.233 | 2.4e-08 | |
| UNIPROTKB|Q9P2E9 | 1410 | RRBP1 "Ribosome-binding protei | 0.803 | 0.263 | 0.254 | 4e-08 |
| TAIR|locus:2064247 MEE13 "AT2G14680" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 1258 (447.9 bits), Expect = 3.6e-128, P = 3.6e-128
Identities = 265/446 (59%), Positives = 328/446 (73%)
Query: 15 LMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAAC 74
L ARI+QLEHER+EL+KDIEQLCMQQ GPS L VA RMHFQRTA LEQEIE LK K+AAC
Sbjct: 11 LKARIKQLEHERNELQKDIEQLCMQQGGPSILGVAARMHFQRTASLEQEIESLKLKLAAC 70
Query: 75 ARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMXX 134
RE NLQEEL+EAYR+K QLADLHA EV KN+EAEKQV+FFQG +AAAF+ERD SVM
Sbjct: 71 TREKHNLQEELAEAYRVKAQLADLHAGEVAKNLEAEKQVRFFQGSVAAAFSERDKSVMEA 130
Query: 135 XXXXXXXXLMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFY 194
+MSQK +E + RLEELSS+ + K+ N TL+ DL K EE ++ VI KFY
Sbjct: 131 EKAEENAEMMSQKLSEIEMRLEELSSDCLVQKRLNDTLQADLAKLEEQTRTYAGVIEKFY 190
Query: 195 EIRQQSL-EVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQS 253
+IR+ SL E LE S +KCA LL D E W+FND STS Y++ALE EL K +++V+NLQS
Sbjct: 191 DIRKASLCESLEMSLHEKCASLLDDPIESWTFNDPSTSDYVAALEGELGKVKNTVDNLQS 250
Query: 254 KLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRSHIK 313
KLR+GLEIENHLKK VR LEKK I +D I N I ++R HSQLR +++ L E +IK
Sbjct: 251 KLRVGLEIENHLKKRVRALEKKNIVADGLIVNGITDIRHHHSQLRAYIIALLNEEGLYIK 310
Query: 314 SISDVIEEKTQ-HCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNLVSQRNDPAYC 372
SIS +EEK + H +V QN Q + K DE ECRDVH+ ++ V++ + +
Sbjct: 311 SISKNVEEKLKLHSSEV---QNVVPPQHDLKPDESECRDVHMTTVVESCQVTKLAEASIA 367
Query: 373 DIEADRKGEASETLAQALQEKVAALLLLSQQEERHLLERNVNSALQKKIEELQRNLFQVT 432
I A+ +G+ASE AQALQEKV ALLLLSQQEERHL E NVN+ALQ+K++ELQRN+ QVT
Sbjct: 368 KIMAESRGDASEAFAQALQEKVGALLLLSQQEERHLHEENVNAALQQKVDELQRNILQVT 427
Query: 433 TEKVKALMELAQLKQDYQLLQEYDSY 458
EKV+ LMELAQL+Q+YQ L+E+ ++
Sbjct: 428 NEKVRTLMELAQLRQEYQSLREHGTF 453
|
|
| UNIPROTKB|F1N1F8 CENPF "Uncharacterized protein" [Bos taurus (taxid:9913)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F1PRS3 CENPF "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F1NNI6 LOC100858793 "Uncharacterized protein" [Gallus gallus (taxid:9031)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|P49454 CENPF "Centromere protein F" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F1NDQ5 CCDC88A "Uncharacterized protein" [Gallus gallus (taxid:9031)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F8W7S5 RRBP1 "Ribosome-binding protein 1" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q02224 CENPE "Centromere-associated protein E" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
| ZFIN|ZDB-GENE-041111-270 eea1 "early endosome antigen 1" [Danio rerio (taxid:7955)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q9P2E9 RRBP1 "Ribosome-binding protein 1" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by EzyPred Server 
Original result from EzyPred Server
Failed to connect to the EzyPred server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Failed to connect to the STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
| ID | Length | Definition | E-value |
|---|---|---|---|
| Query | 462 | | |
| COG1196 | 1163 | COG1196, Smc, Chromosome segregation ATPases [Cell | 4e-05 | |
| TIGR02168 | 1179 | TIGR02168, SMC_prok_B, chromosome segregation prot | 5e-04 | |
| TIGR02169 | 1164 | TIGR02169, SMC_prok_A, chromosome segregation prot | 7e-04 | |
| COG1196 | 1163 | COG1196, Smc, Chromosome segregation ATPases [Cell | 0.002 |
| >gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] | Back alignment and domain information |
|---|
Score = 45.9 bits (109), Expect = 4e-05
Identities = 51/207 (24%), Positives = 92/207 (44%), Gaps = 8/207 (3%)
Query: 2 DENSKEKENESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMH--FQRTAG 59
E E E L +R+++LE E +EL +++E+L + + + + A
Sbjct: 722 KRELAALEEELEQLQSRLEELEEELEELEEELEELQERLE-----ELEEELESLEEALAK 776
Query: 60 LEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGC 119
L++EIE L++K A E L+EEL EA R L + + E++++ +
Sbjct: 777 LKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEE 836
Query: 120 MAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQ 179
+ + D E E+ +++ E + ++ E + EEL E EL+++ L +L +
Sbjct: 837 IEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELREL 896
Query: 180 EELNESFKEVINKFYEIRQQSLEVLET 206
E KE I K R + LE
Sbjct: 897 ESELAELKEEIEK-LRERLEELEAKLE 922
|
Length = 1163 |
| >gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common bacterial type | Back alignment and domain information |
|---|
| >gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC, primarily archaeal type | Back alignment and domain information |
|---|
| >gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning] | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
| ID | Length | Definition | Probability |
|---|---|---|---|
| Query | 462 | | |
| PF00038 | 312 | Filament: Intermediate filament protein; InterPro: | 97.38 | |
| PRK09039 | 343 | hypothetical protein; Validated | 97.27 | |
| PF10174 | 775 | Cast: RIM-binding protein of the cytomatrix active | 96.78 | |
| PHA02562 | 562 | 46 endonuclease subunit; Provisional | 96.43 | |
| KOG0161 | 1930 | consensus Myosin class II heavy chain [Cytoskeleto | 95.18 | |
| COG1196 | 1163 | Smc Chromosome segregation ATPases [Cell division | 94.72 | |
| PF05667 | 594 | DUF812: Protein of unknown function (DUF812); Inte | 93.7 | |
| TIGR02168 | 1179 | SMC_prok_B chromosome segregation protein SMC, com | 93.69 | |
| TIGR02169 | 1164 | SMC_prok_A chromosome segregation protein SMC, pri | 93.6 | |
| TIGR00606 | 1311 | rad50 rad50. This family is based on the phylogeno | 92.62 | |
| PRK10884 | 206 | SH3 domain-containing protein; Provisional | 92.12 | |
| PF15070 | 617 | GOLGA2L5: Putative golgin subfamily A member 2-lik | 92.04 | |
| TIGR02168 | 1179 | SMC_prok_B chromosome segregation protein SMC, com | 92.02 | |
| PRK04863 | 1486 | mukB cell division protein MukB; Provisional | 91.87 | |
| PF08614 | 194 | ATG16: Autophagy protein 16 (ATG16); InterPro: IPR | 91.82 | |
| PF00038 | 312 | Filament: Intermediate filament protein; InterPro: | 91.71 | |
| TIGR02169 | 1164 | SMC_prok_A chromosome segregation protein SMC, pri | 91.45 | |
| PF10174 | 775 | Cast: RIM-binding protein of the cytomatrix active | 91.16 | |
| PF12325 | 120 | TMF_TATA_bd: TATA element modulatory factor 1 TATA | 90.64 | |
| PRK02224 | 880 | chromosome segregation protein; Provisional | 90.43 | |
| PRK09039 | 343 | hypothetical protein; Validated | 89.63 | |
| PRK02224 | 880 | chromosome segregation protein; Provisional | 89.57 | |
| PF05557 | 722 | MAD: Mitotic checkpoint protein; InterPro: IPR0086 | 88.12 | |
| KOG0161 | 1930 | consensus Myosin class II heavy chain [Cytoskeleto | 85.96 | |
| PRK11637 | 428 | AmiB activator; Provisional | 85.88 | |
| PF10168 | 717 | Nup88: Nuclear pore component; InterPro: IPR019321 | 85.46 | |
| PF00261 | 237 | Tropomyosin: Tropomyosin; InterPro: IPR000533 Trop | 85.26 | |
| PRK03918 | 880 | chromosome segregation protein; Provisional | 84.17 | |
| KOG0612 | 1317 | consensus Rho-associated, coiled-coil containing p | 84.13 | |
| PF04912 | 388 | Dynamitin: Dynamitin ; InterPro: IPR006996 Dynamit | 83.91 | |
| PF12718 | 143 | Tropomyosin_1: Tropomyosin like; InterPro: IPR0005 | 83.5 | |
| PRK03918 | 880 | chromosome segregation protein; Provisional | 82.48 | |
| COG1196 | 1163 | Smc Chromosome segregation ATPases [Cell division | 81.99 | |
| PF04849 | 306 | HAP1_N: HAP1 N-terminal conserved region; InterPro | 81.05 | |
| KOG0995 | 581 | consensus Centromere-associated protein HEC1 [Cell | 81.04 | |
| TIGR03185 | 650 | DNA_S_dndD DNA sulfur modification protein DndD. T | 80.56 | |
| PF09726 | 697 | Macoilin: Transmembrane protein; InterPro: IPR0191 | 80.42 |
| >PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) are proteins which are primordial components of the cytoskeleton and the nuclear envelope | Back alignment and domain information |
|---|
Probab=97.38 E-value=0.13 Score=48.79 Aligned_cols=117 Identities=17% Similarity=0.238 Sum_probs=87.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC-CchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHH
Q 012498 12 SEALMARIQQLEHERDELRKDIEQLCMQQAG-PSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYR 90 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaG-pgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYR 90 (462)
-++-+.||..||.+...|...|..+---.+. ||-+ -...+.+|..|+.++..++.++-.|+-++..+..
T Consensus 13 la~YIekVr~LE~~N~~Le~~i~~~~~~~~~~~~~~----------~~~ye~el~~lr~~id~~~~eka~l~~e~~~l~~ 82 (312)
T PF00038_consen 13 LASYIEKVRFLEQENKRLESEIEELREKKGEEVSRI----------KEMYEEELRELRRQIDDLSKEKARLELEIDNLKE 82 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH---------HHH----------HHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHhcccccCccc----------ccchhhHHHHhHHhhhhHHHHhhHHhhhhhhHHH
Confidence 4677899999999999999999999876422 2211 2456888999999999999999999999998877
Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhH
Q 012498 91 IKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAK 138 (462)
Q Consensus 91 iK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaK 138 (462)
--..+-.-|..+...+..+|.++.=+..-+-.+.+.|...=-+++-.+
T Consensus 83 e~~~~r~k~e~e~~~~~~le~el~~lrk~ld~~~~~r~~le~~i~~L~ 130 (312)
T PF00038_consen 83 ELEDLRRKYEEELAERKDLEEELESLRKDLDEETLARVDLENQIQSLK 130 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHhHHHHHHHHHH
Confidence 666666667788889999999988888777777777766544444444
|
They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided into five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coil alpha-helices, with at least two short characteristic interruptions; an N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionarily and structurally related, they have limited sequence homology except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A .... |
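The summary line that opens the PF00038 alignment above (`Probab=97.38 E-value=0.13 ...`) is a row of `key=value` tokens and is easy to read programmatically. The sketch below is illustrative only: `parse_hh_header` is a hypothetical helper, and reading `Sum_probs / Aligned_cols` as a mean per-column posterior probability is an interpretation, not something this report states.

```python
def parse_hh_header(line):
    """Split an HHsearch hit summary line into a dict of floats.

    Percentages such as 'Identities=17%' are converted to fractions.
    """
    fields = {}
    for token in line.split():
        key, _, value = token.partition("=")
        if value.endswith("%"):
            fields[key] = float(value.rstrip("%")) / 100
        else:
            fields[key] = float(value)
    return fields


# Summary line copied from the PF00038 (Filament) hit.
HIT = ("Probab=97.38 E-value=0.13 Score=48.79 Aligned_cols=117 "
       "Identities=17% Similarity=0.238 Sum_probs=87.2")

hit = parse_hh_header(HIT)
# Assumed interpretation: average posterior probability per aligned column.
mean_posterior = hit["Sum_probs"] / hit["Aligned_cols"]
print(hit["Probab"], hit["Identities"], round(mean_posterior, 3))
```

For this hit the high probability alongside only 17% identity is the pattern typically seen for generic coiled-coil similarity, which is worth keeping in mind before transferring any annotation from the hit.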
| >PRK09039 hypothetical protein; Validated | Back alignment and domain information |
|---|
| >PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion | Back alignment and domain information |
|---|
| >PHA02562 46 endonuclease subunit; Provisional | Back alignment and domain information |
|---|
| >KOG0161 consensus Myosin class II heavy chain [Cytoskeleton] | Back alignment and domain information |
|---|
| >COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning] | Back alignment and domain information |
|---|
| >PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function | Back alignment and domain information |
|---|
| >TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type | Back alignment and domain information |
|---|
| >TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type | Back alignment and domain information |
|---|
| >TIGR00606 rad50 rad50 | Back alignment and domain information |
|---|
| >PRK10884 SH3 domain-containing protein; Provisional | Back alignment and domain information |
|---|
| >PF15070 GOLGA2L5: Putative golgin subfamily A member 2-like protein 5 | Back alignment and domain information |
|---|
| >TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type | Back alignment and domain information |
|---|
| >PRK04863 mukB cell division protein MukB; Provisional | Back alignment and domain information |
|---|
| >PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells | Back alignment and domain information |
|---|
| >PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) are proteins which are primordial components of the cytoskeleton and the nuclear envelope | Back alignment and domain information |
|---|
| >TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type | Back alignment and domain information |
|---|
| >PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion | Back alignment and domain information |
|---|
| >PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes | Back alignment and domain information |
|---|
| >PRK02224 chromosome segregation protein; Provisional | Back alignment and domain information |
|---|
| >PRK09039 hypothetical protein; Validated | Back alignment and domain information |
|---|
| >PRK02224 chromosome segregation protein; Provisional | Back alignment and domain information |
|---|
| >PF05557 MAD: Mitotic checkpoint protein; InterPro: IPR008672 This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins | Back alignment and domain information |
|---|
| >KOG0161 consensus Myosin class II heavy chain [Cytoskeleton] | Back alignment and domain information |
|---|
| >PRK11637 AmiB activator; Provisional | Back alignment and domain information |
|---|
| >PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs | Back alignment and domain information |
|---|
| >PF00261 Tropomyosin: Tropomyosin; InterPro: IPR000533 Tropomyosins are a family of closely related proteins present in muscle and non-muscle cells | Back alignment and domain information |
|---|
| >PRK03918 chromosome segregation protein; Provisional | Back alignment and domain information |
|---|
| >KOG0612 consensus Rho-associated, coiled-coil containing protein kinase [Signal transduction mechanisms] | Back alignment and domain information |
|---|
| >PF04912 Dynamitin: Dynamitin; InterPro: IPR006996 Dynamitin is a subunit of the microtubule-dependent motor complex; it is also implicated in cell adhesion by binding to macrophage-enriched myristoylated alanine-rich C kinase substrate (MacMARCKS) | Back alignment and domain information |
|---|
| >PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins are a family of closely related proteins present in muscle and non-muscle cells | Back alignment and domain information |
|---|
| >PRK03918 chromosome segregation protein; Provisional | Back alignment and domain information |
|---|
| >COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning] | Back alignment and domain information |
|---|
| >PF04849 HAP1_N: HAP1 N-terminal conserved region; InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues | Back alignment and domain information |
|---|
| >KOG0995 consensus Centromere-associated protein HEC1 [Cell cycle control, cell division, chromosome partitioning] | Back alignment and domain information |
|---|
| >TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD | Back alignment and domain information |
|---|
| >PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
| ID | Length | Definition | E-value |
|---|---|---|---|
| Query | 462 | | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 3e-13 | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 8e-10 | |
| 1i84_S | 1184 | Smooth muscle myosin heavy chain; muscle protein, | 5e-09 | |
| 1i84_S | 1184 | Smooth muscle myosin heavy chain; muscle protein, | 1e-04 | |
| 1i84_S | 1184 | Smooth muscle myosin heavy chain; muscle protein, | 7e-04 | |
| 2dfs_A | 1080 | Myosin-5A; myosin-V, inhibited state, cryoelectron | 9e-05 | |
| 2dfs_A | 1080 | Myosin-5A; myosin-V, inhibited state, cryoelectron | 1e-04 | |
| 2dfs_A | 1080 | Myosin-5A; myosin-V, inhibited state, cryoelectron | 3e-04 | |
| 3vkg_A | 3245 | Dynein heavy chain, cytoplasmic; AAA+ protein, mol | 7e-04 |
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Score = 71.4 bits (174), Expect = 3e-13
Identities = 67/509 (13%), Positives = 145/509 (28%), Gaps = 112/509 (22%)
Query: 2 DENSKEKENESEALMAR-----IQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQR 56
+ + K+ ++ ++++++ I + + L +Q V
Sbjct: 32 NFDCKDVQDMPKSILSKEEIDHIIMSKDAVSGTLRLFWTLLSKQEEMVQKFVEE------ 85
Query: 57 TAGLEQEIEILKQKIAACARENSNLQEELSEA----------------YRIKGQLADLHA 100
L + L I R+ S + E R++ L
Sbjct: 86 --VLRINYKFLMSPIKTEQRQPSMMTRMYIEQRDRLYNDNQVFAKYNVSRLQ-PYLKLRQ 142
Query: 101 AEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSS 160
A + + K V G + + + + + + F F L+ +S
Sbjct: 143 A--LLELRPAKNV-LIDG-VLGS----GKTWVALDVCLSYKVQCKMDFKIFWLNLKNCNS 194
Query: 161 ENIELKKQNATLRFDLEKQ-EELNESFKEVINKFYEIRQQSLEVLETSWEDKCACLL-LD 218
L+ L + ++ ++ + + + I+ + +L + CLL L
Sbjct: 195 PETVLEMLQ-KLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLL--KSKPYENCLLVLL 251
Query: 219 SAEMWSFNDTSTSKYISALEDE---LEKTR-SSVENLQSKLRMGLEIENHLKKSVRELEK 274
+ +K +A L TR V + S +H ++ E
Sbjct: 252 --------NVQNAKAWNAFNLSCKILLTTRFKQVTDFLSAATTTHISLDHHSMTLTPDEV 303
Query: 275 KIIHSDKFISNAIAELRLCHSQLRVH------VVNSLEEGRSHIKSISDVIEEKTQHCDD 328
K + K++ +L L + + S+ +G + + V +K +
Sbjct: 304 KSLLL-KYLDCRPQDLP--REVLTTNPRRLSIIAESIRDGLATWDNWKHVNCDKLTTIIE 360
Query: 329 V-IRGQNTGTYQRE-TKLDEFECRDVHINNDADTNLVSQRNDPA--YCDIEADRKGEASE 384
+ Y++ +L F HI P I D
Sbjct: 361 SSLNVLEPAEYRKMFDRLSVFP-PSAHI--------------PTILLSLIWFDVIKSDVM 405
Query: 385 TLAQALQEKVAALLLLSQQEERHLLERNVNSALQKKIEELQRNLFQVTTEKVKALMEL-A 443
+ L + L+ +Q + + I + L K++ L
Sbjct: 406 VVVNKLHKY----SLVEKQPKESTIS----------IPSIYLEL----KVKLENEYALHR 447
Query: 444 QLKQDYQLLQEY----------DSYFYFH 462
+ Y + + + D YFY H
Sbjct: 448 SIVDHYNIPKTFDSDDLIPPYLDQYFYSH 476
|
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
| >1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 | Back alignment and structure |
|---|
| >1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 | Back alignment and structure |
|---|
| >1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A Length = 1184 | Back alignment and structure |
|---|
| >2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 | Back alignment and structure |
|---|
| >2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 | Back alignment and structure |
|---|
| >2dfs_A Myosin-5A; myosin-V, inhibited state, cryoelectron tomograp contractIle protein-transport protein complex; 24.00A {Gallus gallus} Length = 1080 | Back alignment and structure |
|---|
| >3vkg_A Dynein heavy chain, cytoplasmic; AAA+ protein, molecular motor, microtubles, motor protein; HET: ADP SPM; 2.81A {Dictyostelium discoideum} PDB: 3vkh_A* 3vkh_C* Length = 3245 | Back alignment and structure |
|---|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
| ID | Length | Definition | Probability |
|---|---|---|---|
| Query | 462 | | |
| 3oja_B | 597 | Anopheles plasmodium-responsive leucine-rich REPE | 89.38 | |
| 1c1g_A | 284 | Tropomyosin; contractIle protein; 7.00A {Sus scrof | 88.92 | |
| 3s4r_A | 93 | Vimentin; alpha-helix, cytoskeleton, intermediate | 88.47 | |
| 1i84_S | 1184 | Smooth muscle myosin heavy chain; muscle protein, | 86.91 | |
| 3o0z_A | 168 | RHO-associated protein kinase 1; coiled-coil, tran | 84.32 | |
| 3oja_B | 597 | Anopheles plasmodium-responsive leucine-rich REPE | 81.26 |
| >3oja_B Anopheles plasmodium-responsive leucine-rich REPE 1; coiled-coil, helix-loop-helix, leucine-rich repeat, protein; HET: NAG MAN; 2.70A {Anopheles gambiae} | Back alignment and structure |
|---|
Probab=89.38 E-value=3.5 Score=40.49 Aligned_cols=63 Identities=10% Similarity=0.133 Sum_probs=41.4
Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhh
Q 012498 136 KAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQ 198 (462)
Q Consensus 136 kaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~ 198 (462)
+.++..++.-+++..+++..++++..+.++|+.-..++.+.+..+.+...+...+++=|+-|.
T Consensus 527 ~~~~~~~~~~~~~~~~~~~~~~le~~~~~~~~~~~~l~~e~~~~~~~~~~l~~~~~~~~~~~~ 589 (597)
T 3oja_B 527 ARRTEADAKQKETEDLEQENIALEKQLDNKRAKQAELRQETSLKRQKVKQLEAKKNRNPDTRR 589 (597)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTC-------
T ss_pred HHHHhhhhhhcchhhHHhhhHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc
Confidence 333444445566677777777888888888888888888888888888888888887777765
|
| >1c1g_A Tropomyosin; contractIle protein; 7.00A {Sus scrofa} SCOP: h.1.5.1 PDB: 2tma_A 2w49_A 2w4u_A | Back alignment and structure |
|---|
| >3s4r_A Vimentin; alpha-helix, cytoskeleton, intermediate filament, structural; 2.45A {Homo sapiens} PDB: 3ssu_A | Back alignment and structure |
|---|
| >1i84_S Smooth muscle myosin heavy chain; muscle protein, myosin subfragment 2, heavy meromyosin, essential light chain, motor protein; HET: MLY; 20.00A {Gallus gallus} SCOP: i.15.1.1 PDB: 3j04_A 3dtp_B 3dtp_A | Back alignment and structure |
|---|
| >3o0z_A RHO-associated protein kinase 1; coiled-coil, transferase; HET: MSE; 2.33A {Homo sapiens} | Back alignment and structure |
|---|
| >3oja_B Anopheles plasmodium-responsive leucine-rich REPE 1; coiled-coil, helix-loop-helix, leucine-rich repeat, protein; HET: NAG MAN; 2.70A {Anopheles gambiae} | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00