Citrus Sinensis ID: 001655
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in SWISS-PROT Database Detected by BLAST 
Original result of BLAST against SWISS-PROT Database
ID ![]() | Alignment graph ![]() | Length ![]() |
Definition ![]() |
RBH(Q2H) ![]() |
RBH(H2Q) ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 1036 | 2.2.26 [Sep-21-2011] | |||||||
| Q9M0V3 | 1088 | DNA damage-binding protei | yes | no | 1.0 | 0.952 | 0.918 | 0.0 | |
| Q6QNU4 | 1090 | DNA damage-binding protei | N/A | no | 1.0 | 0.950 | 0.923 | 0.0 | |
| Q6E7D1 | 1095 | DNA damage-binding protei | N/A | no | 1.0 | 0.946 | 0.923 | 0.0 | |
| O49552 | 1088 | DNA damage-binding protei | no | no | 1.0 | 0.952 | 0.892 | 0.0 | |
| Q3U1J4 | 1140 | DNA damage-binding protei | yes | no | 0.991 | 0.900 | 0.529 | 0.0 | |
| P33194 | 1140 | DNA damage-binding protei | N/A | no | 0.991 | 0.900 | 0.528 | 0.0 | |
| A1A4K3 | 1140 | DNA damage-binding protei | yes | no | 0.991 | 0.900 | 0.529 | 0.0 | |
| Q16531 | 1140 | DNA damage-binding protei | yes | no | 0.991 | 0.900 | 0.528 | 0.0 | |
| Q5R649 | 1140 | DNA damage-binding protei | yes | no | 0.991 | 0.900 | 0.527 | 0.0 | |
| Q805F9 | 1140 | DNA damage-binding protei | yes | no | 0.991 | 0.900 | 0.526 | 0.0 |
| >sp|Q9M0V3|DDB1A_ARATH DNA damage-binding protein 1a OS=Arabidopsis thaliana GN=DDB1A PE=1 SV=1 | Back alignment and function desciption |
|---|
Score = 1974 bits (5114), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 952/1036 (91%), Positives = 1005/1036 (97%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDPESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQN+LDNGADLLIPVPPPLCGVLII
Sbjct: 173 AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSITKAYGRVD DGSRYLLGDHAG++HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSASAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGMIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473 SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+AAVGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLAAVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF ++ TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFQMDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTF ICSL NQS +EESEMHFVRLLDDQTFEF+STYPLD+FEYGCSIL
Sbjct: 713 EHARRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF++D NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFN
Sbjct: 773 SCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
+YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893 LYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLE 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
|
Component of light signal transduction machinery. Involved in repression of photomorphogenesis in darkness by participating in the CDD complex, a complex probably required to regulate the activity of ubiquitin conjugating enzymes (E2s). Repression of photomorphogenesis is probably mediated by ubiquitination and subsequent degradation of photomorphogenesis-promoting factors such as HY5, HYH and LAF1. Plays a role in DNA repair by forming with DDB2 the UV-damaged DNA-binding protein complex (UV-DDB). Component of the CUL4-RBX1-DDB1-PRL1 E3 ubiquitin-protein ligase complex. Arabidopsis thaliana (taxid: 3702) |
| >sp|Q6QNU4|DDB1_SOLLC DNA damage-binding protein 1 OS=Solanum lycopersicum GN=DDB1 PE=1 SV=1 | Back alignment and function description |
|---|
Score = 1967 bits (5095), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 959/1038 (92%), Positives = 1004/1038 (96%), Gaps = 2/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGE QD LFIATERYKFCVLQWD E+SE+ITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKDARHVKTYEV+LKDKDF+EGPW+QNNLDNGA LLIPVPPPLCGVLII
Sbjct: 173 PKPTIVVLYQDNKDARHVKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSIT+AYGRVDADGSRYLLGDH GLLHLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSASAFKAIPIRPSITRAYGRVDADGSRYLLGDHNGLLHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLQPDTKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS+TDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSATDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+DTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 YDTFLVVSFISETRVLAMNLEDELEETEIEGFNSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SSTSR+L+NEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDG+L EVK+A+L+Y+I
Sbjct: 473 SSTSRDLKNEWFAPVGYSVNVATANATQVLLATGGGHLVYLEIGDGVLNEVKYAKLDYDI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP+YS IAAVGMWTDISVRI+SLPDLNLITKE LGGEIIPRSVL+C+FEG
Sbjct: 533 SCLDINPIGENPNYSNIAAVGMWTDISVRIYSLPDLNLITKEQLGGEIIPRSVLMCSFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+L+M TGELTDRKKVSLGTQPITLRTFSSK+TTHVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFVLSMSTGELTDRKKVSLGTQPITLRTFSSKDTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFN AAFPDSLAIAKEGELTIGTID+IQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNVAAFPDSLAIAKEGELTIGTIDEIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRI HQEQ+RTFA+CS+K QS A++ EMHFVRLLDDQTFEFISTYPLD FEYGCSI
Sbjct: 713 EHARRISHQEQTRTFALCSVKYTQSNADDPEMHFVRLLDDQTFEFISTYPLDQFEYGCSI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYC+GTAYV+PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 773 LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
NGKLLAAINQKIQLYKW R+D G+RELQ+ECGHHGHILALYVQTRGDFIVVGDLMKSIS
Sbjct: 833 NGKLLAAINQKIQLYKWASREDGGSRELQTECGHHGHILALYVQTRGDFIVVGDLMKSIS 892
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LLI+KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER R
Sbjct: 893 LLIFKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERSR 952
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QYLFLE
Sbjct: 953 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLE 1012
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQTNLRKVIKGVGGL+HEQWRSF NEKKTVDAKNFLDGDLIESFLDLSR RM+EISK M
Sbjct: 1013 KLQTNLRKVIKGVGGLSHEQWRSFYNEKKTVDAKNFLDGDLIESFLDLSRNRMEEISKAM 1072
Query: 1019 NVSVEELCKRVEELTRLH 1036
+V VEEL KRVEELTRLH
Sbjct: 1073 SVPVEELMKRVEELTRLH 1090
|
Component of light signal transduction machinery. Involved in fruit pigmentation and fruit nutritional quality. Acts as a negative regulator of fruit pigmentation. Probably acts by participating in the CDD complex, a complex probably required to regulate the activity of ubiquitin conjugating enzymes. Repression of photomorphogenesis is probably mediated by ubiquitination and subsequent degradation of photomorphogenesis-promoting factors such as HY5. Solanum lycopersicum (taxid: 4081) |
| >sp|Q6E7D1|DDB1_SOLCE DNA damage-binding protein 1 OS=Solanum cheesmanii GN=DDB1 PE=3 SV=1 | Back alignment and function description |
|---|
Score = 1964 bits (5088), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 959/1038 (92%), Positives = 1004/1038 (96%), Gaps = 2/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGE QD LFIATERYKFCVLQWD E+SE+ITRAMGDVSDRIG
Sbjct: 58 MLDVPIYGRIATLELFRPHGETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSDRIG 117
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 118 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 177
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKDARHVKTYEV+LKDKDF+EGPW+QNNLDNGA LLIPVPPPLCGVLII
Sbjct: 178 PKPTIVVLYQDNKDARHVKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLII 237
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSIT+AYGRVDADGSRYLLGDH GLLHLLVITHEKEKVTGL
Sbjct: 238 GEETIVYCSASAFKAIPIRPSITRAYGRVDADGSRYLLGDHNGLLHLLVITHEKEKVTGL 297
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 298 KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLQPDTKGSYVEVLERYVNLGPIV 357
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS+TDDP
Sbjct: 358 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSATDDP 417
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+DTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 418 YDTFLVVSFISETRVLAMNLEDELEETEIEGFNSQVQTLFCHDAVYNQLVQVTSNSVRLV 477
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SSTSR+L+NEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDG+L EVK+A+L+Y+I
Sbjct: 478 SSTSRDLKNEWFAPVGYSVNVATANATQVLLATGGGHLVYLEIGDGVLNEVKYAKLDYDI 537
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP+YS IAAVGMWTDISVRI+SLPDLNLITKE LGGEIIPRSVL+C+FEG
Sbjct: 538 SCLDINPIGENPNYSNIAAVGMWTDISVRIYSLPDLNLITKEQLGGEIIPRSVLMCSFEG 597
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+L+M TGELTDRKKVSLGTQPITLRTFSSK+TTHVFAASDRPTV
Sbjct: 598 ISYLLCALGDGHLLNFVLSMSTGELTDRKKVSLGTQPITLRTFSSKDTTHVFAASDRPTV 657
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFN AAFPDSLAIAKEGELTIGTID+IQKLHIRSIPLG
Sbjct: 658 IYSSNKKLLYSNVNLKEVSHMCPFNVAAFPDSLAIAKEGELTIGTIDEIQKLHIRSIPLG 717
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRI HQEQ+RTFA+CS+K QS A++ EMHFVRLLDDQTFEFISTYPLD FEYGCSI
Sbjct: 718 EHARRISHQEQTRTFALCSVKYTQSNADDPEMHFVRLLDDQTFEFISTYPLDQFEYGCSI 777
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYC+GTAYV+PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 778 LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 837
Query: 780 NGKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
NGKLLAAINQKIQLYKW R+D G+RELQ+ECGHHGHILALYVQTRGDFIVVGDLMKSIS
Sbjct: 838 NGKLLAAINQKIQLYKWASREDGGSRELQTECGHHGHILALYVQTRGDFIVVGDLMKSIS 897
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LLI+KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER R
Sbjct: 898 LLIFKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERSR 957
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QYLFLE
Sbjct: 958 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLE 1017
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQTNLRKVIKGVGGL+HEQWRSF NEKKTVDAKNFLDGDLIESFLDLSR RM+EISK M
Sbjct: 1018 KLQTNLRKVIKGVGGLSHEQWRSFYNEKKTVDAKNFLDGDLIESFLDLSRNRMEEISKAM 1077
Query: 1019 NVSVEELCKRVEELTRLH 1036
+V VEEL KRVEELTRLH
Sbjct: 1078 SVPVEELMKRVEELTRLH 1095
|
Component of light signal transduction machinery. Involved in fruit pigmentation and fruit nutritional quality. Acts as a negative regulator of fruit pigmentation. Probably acts by participating in the CDD complex, a complex probably required to regulate the activity of ubiquitin conjugating enzymes. Repression of photomorphogenesis is probably mediated by ubiquitination and subsequent degradation of photomorphogenesis-promoting factors such as HY5. Solanum cheesmanii (taxid: 142759) |
| >sp|O49552|DDB1B_ARATH DNA damage-binding protein 1b OS=Arabidopsis thaliana GN=DDB1B PE=2 SV=2 | Back alignment and function description |
|---|
Score = 1920 bits (4973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 925/1036 (89%), Positives = 989/1036 (95%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVP+YGRIAT+ELFRPHGEAQDFLF+ATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 ILDVPLYGRIATMELFRPHGEAQDFLFVATERYKFCVLQWDYESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCR+IGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRVIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTI VLYQDNKDARHVKTYEV+LKDK+FVEGPWSQNNLDNGADLLIPVP PLCGVLII
Sbjct: 173 TKPTIAVLYQDNKDARHVKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDAKGSYVE+LE+YVNLGPIV
Sbjct: 293 KIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKGSYVEILEKYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMN+EDELEETEIEGF S+ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNIEDELEETEIEGFLSEVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELRN+W +P G+SVNVATANASQVLLATGGGHLVYLEIGDG LTEVKH LEYE+
Sbjct: 473 SSTTRELRNKWDAPAGFSVNVATANASQVLLATGGGHLVYLEIGDGTLTEVKHVLLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+AAVGMWTDISVRIF LPDL LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLAAVGMWTDISVRIFVLPDLTLITKEELGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF L+ G+L DRKKVSLGT+PITLRTFSSK+ THVFAASDRP V
Sbjct: 593 ISYLLCALGDGHLLNFQLDTSCGKLRDRKKVSLGTRPITLRTFSSKSATHVFAASDRPAV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYS+NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IP+G
Sbjct: 653 IYSNNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPIG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTFAI L+N+ AEESE HFVRLLD Q+FEF+S+YPLD FE GCSIL
Sbjct: 713 EHARRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQSFEFLSSYPLDAFECGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI EKETKGAVYSLNAFN
Sbjct: 773 SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLA+INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI VGDLMKSISLL
Sbjct: 833 GKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIAVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
IYKHEEGAIEERARDYNANWM+AVEIL+DDIYLG +N FN+FTV+KN+EGATDEER R+E
Sbjct: 893 IYKHEEGAIEERARDYNANWMTAVEILNDDIYLGTDNCFNIFTVKKNNEGATDEERARME 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYH+GEFVNRFRHGSLVM+LPDSD+GQIPTVIFGTV+G+IGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIPTVIFGTVSGMIGVIASLPQEQYAFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
QT+LRKVIKGVGGL+HEQWRSFNNEK+T +AK +LDGDLIESFLDLSR +M+EISK M+V
Sbjct: 1013 QTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESFLDLSRGKMEEISKGMDV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
|
Component of light signal transduction machinery. Involved in repression of photomorphogenesis in darkness (By similarity). Plays a role in DNA repair by forming with DDB2 the UV-damaged DNA-binding protein complex (UV-DDB). Arabidopsis thaliana (taxid: 3702) |
| >sp|Q3U1J4|DDB1_MOUSE DNA damage-binding protein 1 OS=Mus musculus GN=Ddb1 PE=1 SV=2 | Back alignment and function description |
|---|
Score = 1150 bits (2975), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
|
Required for DNA repair. Binds to DDB2 to form the UV-damaged DNA-binding protein complex (the UV-DDB complex). The UV-DDB complex may recognize UV-induced DNA damage and recruit proteins of the nucleotide excision repair pathway (the NER pathway) to initiate DNA repair. The UV-DDB complex preferentially binds to cyclobutane pyrimidine dimers (CPD), 6-4 photoproducts (6-4 PP), apurinic sites and short mismatches. Also appears to function as a component of numerous distinct DCX (DDB1-CUL4-X-box) E3 ubiquitin-protein ligase complexes which mediate the ubiquitination and subsequent proteasomal degradation of target proteins. The functional specificity of the DCX E3 ubiquitin-protein ligase complex is determined by the variable substrate recognition component recruited by DDB1. DCX(DDB2) (also known as DDB1-CUL4-ROC1, CUL4-DDB-ROC1 and CUL4-DDB-RBX1) may ubiquitinate histone H2A, histone H3 and histone H4 at sites of UV-induced DNA damage. The ubiquitination of histones may facilitate their removal from the nucleosome and promote subsequent DNA repair. DCX(DDB2) also ubiquitinates XPC, which may enhance DNA-binding by XPC and promote NER. DCX(DTL) plays a role in PCNA-dependent polyubiquitination of CDT1 and MDM2-dependent ubiquitination of TP53 in response to radiation-induced DNA damage and during DNA replication. DCX(ERCC8) (the CSA complex) plays a role in transcription-coupled repair (TCR). May also play a role in ubiquitination of CDKN1B/p27kip when associated with CUL4 and SKP2. Mus musculus (taxid: 10090) |
| >sp|P33194|DDB1_CHLAE DNA damage-binding protein 1 OS=Chlorocebus aethiops GN=DDB1 PE=1 SV=1 | Back alignment and function description |
|---|
Score = 1150 bits (2974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
|
Required for DNA repair. Binds to DDB2 to form the UV-damaged DNA-binding protein complex (the UV-DDB complex). The UV-DDB complex may recognize UV-induced DNA damage and recruit proteins of the nucleotide excision repair pathway (the NER pathway) to initiate DNA repair. The UV-DDB complex preferentially binds to cyclobutane pyrimidine dimers (CPD), 6-4 photoproducts (6-4 PP), apurinic sites and short mismatches. Also appears to function as a component of numerous distinct DCX (DDB1-CUL4-X-box) E3 ubiquitin-protein ligase complexes which mediate the ubiquitination and subsequent proteasomal degradation of target proteins. The functional specificity of the DCX E3 ubiquitin-protein ligase complex is determined by the variable substrate recognition component recruited by DDB1. DCX(DDB2) (also known as DDB1-CUL4-ROC1, CUL4-DDB-ROC1 and CUL4-DDB-RBX1) may ubiquitinate histone H2A, histone H3 and histone H4 at sites of UV-induced DNA damage. The ubiquitination of histones may facilitate their removal from the nucleosome and promote subsequent DNA repair. DCX(DDB2) also ubiquitinates XPC, which may enhance DNA-binding by XPC and promote NER. DCX(DTL) plays a role in PCNA-dependent polyubiquitination of CDT1 and MDM2-dependent ubiquitination of TP53 in response to radiation-induced DNA damage and during DNA replication. DCX(ERCC8) (the CSA complex) plays a role in transcription-coupled repair (TCR). May also play a role in ubiquitination of CDKN1B/p27kip when associated with CUL4 and SKP2. Chlorocebus aethiops (taxid: 9534) |
| >sp|A1A4K3|DDB1_BOVIN DNA damage-binding protein 1 OS=Bos taurus GN=DDB1 PE=2 SV=1 | Back alignment and function description |
|---|
Score = 1150 bits (2974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARIAKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
|
Required for DNA repair. Binds to DDB2 to form the UV-damaged DNA-binding protein complex (the UV-DDB complex). The UV-DDB complex may recognize UV-induced DNA damage and recruit proteins of the nucleotide excision repair pathway (the NER pathway) to initiate DNA repair. The UV-DDB complex preferentially binds to cyclobutane pyrimidine dimers (CPD), 6-4 photoproducts (6-4 PP), apurinic sites and short mismatches. Also appears to function as a component of numerous distinct DCX (DDB1-CUL4-X-box) E3 ubiquitin-protein ligase complexes which mediate the ubiquitination and subsequent proteasomal degradation of target proteins. The functional specificity of the DCX E3 ubiquitin-protein ligase complex is determined by the variable substrate recognition component recruited by DDB1. DCX(DDB2) (also known as DDB1-CUL4-ROC1, CUL4-DDB-ROC1 and CUL4-DDB-RBX1) may ubiquitinate histone H2A, histone H3 and histone H4 at sites of UV-induced DNA damage. The ubiquitination of histones may facilitate their removal from the nucleosome and promote subsequent DNA repair. DCX(DDB2) also ubiquitinates XPC, which may enhance DNA-binding by XPC and promote NER. DCX(DTL) plays a role in PCNA-dependent polyubiquitination of CDT1 and MDM2-dependent ubiquitination of TP53 in response to radiation-induced DNA damage and during DNA replication. DCX(ERCC8) (the CSA complex) plays a role in transcription-coupled repair (TCR). May also play a role in ubiquitination of CDKN1B/p27kip when associated with CUL4 and SKP2. Bos taurus (taxid: 9913) |
| >sp|Q16531|DDB1_HUMAN DNA damage-binding protein 1 OS=Homo sapiens GN=DDB1 PE=1 SV=1 | Back alignment and function description |
|---|
Score = 1149 bits (2972), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
|
Required for DNA repair. Binds to DDB2 to form the UV-damaged DNA-binding protein complex (the UV-DDB complex). The UV-DDB complex may recognize UV-induced DNA damage and recruit proteins of the nucleotide excision repair pathway (the NER pathway) to initiate DNA repair. The UV-DDB complex preferentially binds to cyclobutane pyrimidine dimers (CPD), 6-4 photoproducts (6-4 PP), apurinic sites and short mismatches. Also appears to function as a component of numerous distinct DCX (DDB1-CUL4-X-box) E3 ubiquitin-protein ligase complexes which mediate the ubiquitination and subsequent proteasomal degradation of target proteins. The functional specificity of the DCX E3 ubiquitin-protein ligase complex is determined by the variable substrate recognition component recruited by DDB1. DCX(DDB2) (also known as DDB1-CUL4-ROC1, CUL4-DDB-ROC1 and CUL4-DDB-RBX1) may ubiquitinate histone H2A, histone H3 and histone H4 at sites of UV-induced DNA damage. The ubiquitination of histones may facilitate their removal from the nucleosome and promote subsequent DNA repair. DCX(DDB2) also ubiquitinates XPC, which may enhance DNA-binding by XPC and promote NER. DCX(DTL) plays a role in PCNA-dependent polyubiquitination of CDT1 and MDM2-dependent ubiquitination of TP53 in response to radiation-induced DNA damage and during DNA replication. DCX(ERCC8) (the CSA complex) plays a role in transcription-coupled repair (TCR). May also play a role in ubiquitination of CDKN1B/p27kip when associated with CUL4 and SKP2. Homo sapiens (taxid: 9606) |
| >sp|Q5R649|DDB1_PONAB DNA damage-binding protein 1 OS=Pongo abelii GN=DDB1 PE=2 SV=1 | Back alignment and function description |
|---|
Score = 1147 bits (2968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/1094 (52%), Positives = 758/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNVCILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVY + FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYPMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
|
Required for DNA repair. Binds to DDB2 to form the UV-damaged DNA-binding protein complex (the UV-DDB complex). The UV-DDB complex may recognize UV-induced DNA damage and recruit proteins of the nucleotide excision repair pathway (the NER pathway) to initiate DNA repair. The UV-DDB complex preferentially binds to cyclobutane pyrimidine dimers (CPD), 6-4 photoproducts (6-4 PP), apurinic sites and short mismatches. Also appears to function as a component of numerous distinct DCX (DDB1-CUL4-X-box) E3 ubiquitin-protein ligase complexes which mediate the ubiquitination and subsequent proteasomal degradation of target proteins. The functional specificity of the DCX E3 ubiquitin-protein ligase complex is determined by the variable substrate recognition component recruited by DDB1. DCX(DDB2) (also known as DDB1-CUL4-ROC1, CUL4-DDB-ROC1 and CUL4-DDB-RBX1) may ubiquitinate histone H2A, histone H3 and histone H4 at sites of UV-induced DNA damage. The ubiquitination of histones may facilitate their removal from the nucleosome and promote subsequent DNA repair. DCX(DDB2) also ubiquitinates XPC, which may enhance DNA-binding by XPC and promote NER. DCX(DTL) plays a role in PCNA-dependent polyubiquitination of CDT1 and MDM2-dependent ubiquitination of TP53 in response to radiation-induced DNA damage and during DNA replication. DCX(ERCC8) (the CSA complex) plays a role in transcription-coupled repair (TCR). May also play a role in ubiquitination of CDKN1B/p27kip when associated with CUL4 and SKP2. Pongo abelii (taxid: 9601) |
| >sp|Q805F9|DDB1_CHICK DNA damage-binding protein 1 OS=Gallus gallus GN=DDB1 PE=2 SV=1 | Back alignment and function description |
|---|
Score = 1147 bits (2967), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 576/1094 (52%), Positives = 761/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+ A +ELFRP GE++D LFI T +Y C+L++ + ++ ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKTAVMELFRPKGESKDLLFILTAKYNACILEYKQNGDNIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + + +AFNIRLEELQV+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRENKELKAFNIRLEELQVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDSHREMDNMLVLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N++QV++A G L YLEI L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPNGKNISVASCNSNQVVVAVGRA-LYYLEIRPQELRQIN 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLDI P+G+ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 CTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F L+++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + + +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATV 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKIVEELTRIH 1140
|
Required for DNA repair. Binds to DDB2 to form the UV-damaged DNA-binding protein complex (the UV-DDB complex). The UV-DDB complex may recognize UV-induced DNA damage and recruit proteins of the nucleotide excision repair pathway (the NER pathway) to initiate DNA repair. Also appears to function as a component of numerous distinct DCX (DDB1-CUL4-X-box) E3 ubiquitin-protein ligase complexes which mediate the ubiquitination and subsequent proteasomal degradation of target proteins. The functional specificity of the DCX E3 ubiquitin-protein ligase complex is determined by the variable substrate recognition component recruited by DDB1. Gallus gallus (taxid: 9031) |
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 1036 | ||||||
| 225443990 | 1089 | PREDICTED: DNA damage-binding protein 1 | 1.0 | 0.951 | 0.957 | 0.0 | |
| 224061051 | 1088 | predicted protein [Populus trichocarpa] | 1.0 | 0.952 | 0.949 | 0.0 | |
| 356512636 | 1089 | PREDICTED: DNA damage-binding protein 1a | 1.0 | 0.951 | 0.938 | 0.0 | |
| 356525401 | 1089 | PREDICTED: DNA damage-binding protein 1- | 1.0 | 0.951 | 0.936 | 0.0 | |
| 312283457 | 1088 | unnamed protein product [Thellungiella h | 1.0 | 0.952 | 0.924 | 0.0 | |
| 297809743 | 1088 | UV-damaged DNA-binding protein 1A [Arabi | 1.0 | 0.952 | 0.924 | 0.0 | |
| 225443992 | 1068 | PREDICTED: DNA damage-binding protein 1 | 0.979 | 0.950 | 0.937 | 0.0 | |
| 15235577 | 1088 | DNA damage-binding protein 1a [Arabidops | 1.0 | 0.952 | 0.918 | 0.0 | |
| 350537001 | 1090 | DNA damage-binding protein 1 [Solanum ly | 1.0 | 0.950 | 0.923 | 0.0 | |
| 55976392 | 1095 | RecName: Full=DNA damage-binding protein | 1.0 | 0.946 | 0.923 | 0.0 |
| >gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Score = 2034 bits (5270), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 993/1037 (95%), Positives = 1020/1037 (98%), Gaps = 1/1037 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SE+ITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAENSEVITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPW+QNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173 SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWAQNNLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSASAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA VY+GSSYGDSQLIK++LQPDAKGSYVEVLERYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKIHLQPDAKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
DTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA+Y+QLVQVTS SVRLV
Sbjct: 413 HDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVYDQLVQVTSSSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
STSRELRNEWK+P GYSVNVATANA+QVLLATGGGHLVYLEIGDG LTEVKHAQLEY+I
Sbjct: 473 GSTSRELRNEWKAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGTLTEVKHAQLEYDI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP++SQ+AAVGMWTDISVRIFSLPDLNLITKE+LGGEIIPRSVLLC+FEG
Sbjct: 533 SCLDINPIGENPNFSQLAAVGMWTDISVRIFSLPDLNLITKEYLGGEIIPRSVLLCSFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
I YLLCALGDGHLLNFLLNM TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593 IPYLLCALGDGHLLNFLLNMSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG+LTIGTIDDIQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGDLTIGTIDDIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRICHQEQSRTFAICSLK NQS E+SEMHF+RLLDDQTFEFISTYPLDTFEYGCSI
Sbjct: 713 EHARRICHQEQSRTFAICSLKYNQSSTEDSEMHFIRLLDDQTFEFISTYPLDTFEYGCSI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 773 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKWMLRDDGTRELQSE GHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 833 NGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGHILALYVQTRGDFIVVGDLMKSISL 892
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN+FTVRKNSEGATDEERGRL
Sbjct: 893 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNIFTVRKNSEGATDEERGRL 952
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QY+FLEK
Sbjct: 953 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEK 1012
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ NLRKVIKGVGGL+HEQWRSFNNEKKTVDAKNFLDGDLIE+FLDL+RTRMDEISK M
Sbjct: 1013 LQANLRKVIKGVGGLSHEQWRSFNNEKKTVDAKNFLDGDLIETFLDLNRTRMDEISKAMA 1072
Query: 1020 VSVEELCKRVEELTRLH 1036
VSVEELCKRVEELTRLH
Sbjct: 1073 VSVEELCKRVEELTRLH 1089
|
Source: Vitis vinifera Species: Vitis vinifera Genus: Vitis Family: Vitaceae Order: Vitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa] gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 2023 bits (5242), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 984/1036 (94%), Positives = 1014/1036 (97%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAETSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLHGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDF+EGPWSQNNLDNGADLLIPVPPP CGVLII
Sbjct: 173 SKPTIVVLYQDNKDARHVKTYEVALKDKDFIEGPWSQNNLDNGADLLIPVPPPFCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSAN F+AIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANVFRAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNL PDAKG+YVEVL+RYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLHPDAKGTYVEVLDRYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS TDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSLTDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMN+EDELEETEIEGFCSQ QTLFCH A++NQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNIEDELEETEIEGFCSQVQTLFCHCAVFNQLVQVTSSSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELR EW +P GYS+NVATANA+QVLLATGGGHLVYLEIGDG LT+ KHAQLE EI
Sbjct: 473 SSTTRELRQEWNAPSGYSINVATANATQVLLATGGGHLVYLEIGDGTLTQAKHAQLECEI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP+YSQ+AAVGMWTDISVRIFSLPDLNLITKE LGGEIIPRSVLLC+FEG
Sbjct: 533 SCLDINPIGENPNYSQLAAVGMWTDISVRIFSLPDLNLITKEPLGGEIIPRSVLLCSFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
I+YLLCALGDGHLLNFLLN+ TG L DRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593 IAYLLCALGDGHLLNFLLNLSTGGLKDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL+IGTIDDIQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELSIGTIDDIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQSRTF+ICS+KNQS AEESEMHF+RLLDDQTFEFISTYPLDTFEYGCSIL
Sbjct: 713 EHARRICHQEQSRTFSICSMKNQSNAEESEMHFIRLLDDQTFEFISTYPLDTFEYGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN
Sbjct: 773 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE
Sbjct: 893 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL
Sbjct: 953 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q+NLRKVIKGVGGL+HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR+RMDEISK M +
Sbjct: 1013 QSNLRKVIKGVGGLSHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRSRMDEISKAMEI 1072
Query: 1021 SVEELCKRVEELTRLH 1036
SVEELCKRVEELTRLH
Sbjct: 1073 SVEELCKRVEELTRLH 1088
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine max] | Back alignment and taxonomy information |
|---|
Score = 1996 bits (5170), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 973/1037 (93%), Positives = 1010/1037 (97%), Gaps = 1/1037 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQD+LFIATERYKFCVLQWD+E++EL+TRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDYLFIATERYKFCVLQWDSETAELVTRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDF+EGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173 SKPTIVVLYQDNKDARHVKTYEVALKDKDFLEGPWSQNNLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDH GLL LLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDPDGSRYLLGDHTGLLSLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIE LGETSIASTISYLDNA VYIGSSYGDSQLIKLNLQPDAKGSYVE LERYVNLGPIV
Sbjct: 293 KIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKLNLQPDAKGSYVEGLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVHNQLVQVTSSSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELRNEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDGIL EVKHAQLEYEI
Sbjct: 473 SSTTRELRNEWHAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGILQEVKHAQLEYEI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP++S +AAVGMWTDISVRIFSLPDL+LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGENPNHSHLAAVGMWTDISVRIFSLPDLSLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+LN TGEL DRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFMLNTSTGELADRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRICHQEQSRTFAICSLK N + E+SEMHFVRLLDDQTFEFISTY LDT+EYGC I
Sbjct: 713 EHARRICHQEQSRTFAICSLKYNPASGEDSEMHFVRLLDDQTFEFISTYSLDTYEYGCFI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
+SCSFSDD+NVYYCVGTAYVLPEENEPTKGRILVF VEDGKLQLIAEKETKGAVY LNAF
Sbjct: 773 ISCSFSDDNNVYYCVGTAYVLPEENEPTKGRILVFAVEDGKLQLIAEKETKGAVYCLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKW+LRDDGT ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 833 NGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 892
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
LIYKHEEGAIEERARDYNANWMSAVEI+DDDIYLGAEN+FNLFTVRKNSEGATDEERGRL
Sbjct: 893 LIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLGAENSFNLFTVRKNSEGATDEERGRL 952
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT+NGVIGVIASLPHEQY+FLEK
Sbjct: 953 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTINGVIGVIASLPHEQYVFLEK 1012
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ+NLRKVIKGVGGL+HEQWRSFNNEKKTV+A+NFLDGDLIESFLDL+R++MDEISK ++
Sbjct: 1013 LQSNLRKVIKGVGGLSHEQWRSFNNEKKTVEARNFLDGDLIESFLDLNRSKMDEISKALD 1072
Query: 1020 VSVEELCKRVEELTRLH 1036
VSVEELCKRVEELTRLH
Sbjct: 1073 VSVEELCKRVEELTRLH 1089
|
Source: Glycine max Species: Glycine max Genus: Glycine Family: Fabaceae Order: Fabales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine max] | Back alignment and taxonomy information |
|---|
Score = 1994 bits (5165), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 971/1037 (93%), Positives = 1010/1037 (97%), Gaps = 1/1037 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQD+LFIATERYKFCVLQWD+E+ EL+TRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDYLFIATERYKFCVLQWDSETGELVTRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173 SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDH GL+ LLVI HEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDPDGSRYLLGDHTGLVSLLVIIHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIE LGETSIASTISYLDNA VY+GSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV
Sbjct: 293 KIEPLGETSIASTISYLDNAFVYVGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVHNQLVQVTSSSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+R+LRNEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDGIL EVKHAQLEYEI
Sbjct: 473 SSTTRDLRNEWHAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGILQEVKHAQLEYEI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP++S +AAVGMWTDISVRIFSLPDL+LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGENPNHSNLAAVGMWTDISVRIFSLPDLSLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+LN TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFMLNTSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRICHQEQSRTFAICSLK N + E+SEMHFVRLLDDQTFEFISTY LDT+EYGC I
Sbjct: 713 EHARRICHQEQSRTFAICSLKYNPASGEDSEMHFVRLLDDQTFEFISTYSLDTYEYGCFI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
+SCSFSDD+NVYYCVGTAYVLPEENEPTKGRI+VF VEDGKLQLIAEKETKGAVY LNAF
Sbjct: 773 ISCSFSDDNNVYYCVGTAYVLPEENEPTKGRIIVFAVEDGKLQLIAEKETKGAVYCLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKW+LRDDGT ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 833 NGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 892
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
LIYKHEEGAIEERARDYNANWMSAVEI+DDDIYLGAEN+FNLFTVRKNSEGATDEERGRL
Sbjct: 893 LIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLGAENSFNLFTVRKNSEGATDEERGRL 952
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT+NGVIGVIASLPHEQY+FLEK
Sbjct: 953 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTINGVIGVIASLPHEQYVFLEK 1012
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ+NLRKVIKGVGGL+HEQWRSFNNEKKTV+A+NFLDGDLIESFLDL+R++MDEISK ++
Sbjct: 1013 LQSNLRKVIKGVGGLSHEQWRSFNNEKKTVEARNFLDGDLIESFLDLNRSKMDEISKAVD 1072
Query: 1020 VSVEELCKRVEELTRLH 1036
VSVEELCKRVEELTRLH
Sbjct: 1073 VSVEELCKRVEELTRLH 1089
|
Source: Glycine max Species: Glycine max Genus: Glycine Family: Fabaceae Order: Fabales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila] | Back alignment and taxonomy information |
|---|
Score = 1983 bits (5137), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 958/1036 (92%), Positives = 1006/1036 (97%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVP+YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG
Sbjct: 53 MLDVPMYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173 AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGA+KDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAFKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRVLAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473 SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHAVLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+A+VGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLASVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF L+ TG+L DRKKVSLGTQPITLRTFSSKN THVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFQLDTTTGQLKDRKKVSLGTQPITLRTFSSKNATHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTF ICSL NQ+ AEESEMHFVRLLDDQ+FEF+STYPLD FEYGCSIL
Sbjct: 713 EHARRICHQEQTRTFGICSLGNQTNAEESEMHFVRLLDDQSFEFVSTYPLDAFEYGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKG+VYSLNAFN
Sbjct: 773 SCSFADDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGSVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLE 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY+FLEKL
Sbjct: 953 VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYMFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
|
Source: Thellungiella halophila Species: Eutrema halophilum Genus: Eutrema Family: Brassicaceae Order: Brassicales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata] gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
Score = 1982 bits (5134), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 958/1036 (92%), Positives = 1006/1036 (97%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173 AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473 SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+A+VGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLASVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF L+ TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFQLDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTF ICSL NQS AEESEMHFVRLLDDQTFEF+STYPLD+FEYGCSIL
Sbjct: 713 EHARRICHQEQTRTFGICSLGNQSNAEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFN
Sbjct: 773 SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
+YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893 LYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLVTVKKNSEGATDEERGRLE 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
|
Source: Arabidopsis lyrata subsp. lyrata Species: Arabidopsis lyrata Genus: Arabidopsis Family: Brassicaceae Order: Brassicales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Score = 1977 bits (5121), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 972/1037 (93%), Positives = 999/1037 (96%), Gaps = 22/1037 (2%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SE+ITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAENSEVITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQ VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQ---------------------VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 151
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPW+QNNLDNGADLLIPVPPPLCGVLII
Sbjct: 152 SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWAQNNLDNGADLLIPVPPPLCGVLII 211
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 212 GEETIVYCSASAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 271
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA VY+GSSYGDSQLIK++LQPDAKGSYVEVLERYVNLGPIV
Sbjct: 272 KIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKIHLQPDAKGSYVEVLERYVNLGPIV 331
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 332 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 391
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
DTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA+Y+QLVQVTS SVRLV
Sbjct: 392 HDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVYDQLVQVTSSSVRLV 451
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
STSRELRNEWK+P GYSVNVATANA+QVLLATGGGHLVYLEIGDG LTEVKHAQLEY+I
Sbjct: 452 GSTSRELRNEWKAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGTLTEVKHAQLEYDI 511
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP++SQ+AAVGMWTDISVRIFSLPDLNLITKE+LGGEIIPRSVLLC+FEG
Sbjct: 512 SCLDINPIGENPNFSQLAAVGMWTDISVRIFSLPDLNLITKEYLGGEIIPRSVLLCSFEG 571
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
I YLLCALGDGHLLNFLLNM TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 572 IPYLLCALGDGHLLNFLLNMSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 631
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG+LTIGTIDDIQKLHIRSIPLG
Sbjct: 632 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGDLTIGTIDDIQKLHIRSIPLG 691
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRICHQEQSRTFAICSLK NQS E+SEMHF+RLLDDQTFEFISTYPLDTFEYGCSI
Sbjct: 692 EHARRICHQEQSRTFAICSLKYNQSSTEDSEMHFIRLLDDQTFEFISTYPLDTFEYGCSI 751
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 752 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 811
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKWMLRDDGTRELQSE GHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 812 NGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGHILALYVQTRGDFIVVGDLMKSISL 871
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN+FTVRKNSEGATDEERGRL
Sbjct: 872 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNIFTVRKNSEGATDEERGRL 931
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QY+FLEK
Sbjct: 932 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEK 991
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ NLRKVIKGVGGL+HEQWRSFNNEKKTVDAKNFLDGDLIE+FLDL+RTRMDEISK M
Sbjct: 992 LQANLRKVIKGVGGLSHEQWRSFNNEKKTVDAKNFLDGDLIETFLDLNRTRMDEISKAMA 1051
Query: 1020 VSVEELCKRVEELTRLH 1036
VSVEELCKRVEELTRLH
Sbjct: 1052 VSVEELCKRVEELTRLH 1068
|
Source: Vitis vinifera Species: Vitis vinifera Genus: Vitis Family: Vitaceae Order: Vitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana] gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName: Full=UV-damaged DNA-binding protein 1a; Short=DDB1a gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana] gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana] gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Score = 1974 bits (5114), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 952/1036 (91%), Positives = 1005/1036 (97%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDPESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQN+LDNGADLLIPVPPPLCGVLII
Sbjct: 173 AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSITKAYGRVD DGSRYLLGDHAG++HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSASAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGMIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473 SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+AAVGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLAAVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF ++ TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFQMDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTF ICSL NQS +EESEMHFVRLLDDQTFEF+STYPLD+FEYGCSIL
Sbjct: 713 EHARRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF++D NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFN
Sbjct: 773 SCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
+YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893 LYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLE 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
|
Source: Arabidopsis thaliana Species: Arabidopsis thaliana Genus: Arabidopsis Family: Brassicaceae Order: Brassicales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum] gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum] gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High pigmentation protein 1; AltName: Full=UV-damaged DNA-binding protein 1 gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum] gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum] | Back alignment and taxonomy information |
|---|
Score = 1967 bits (5095), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 959/1038 (92%), Positives = 1004/1038 (96%), Gaps = 2/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGE QD LFIATERYKFCVLQWD E+SE+ITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKDARHVKTYEV+LKDKDF+EGPW+QNNLDNGA LLIPVPPPLCGVLII
Sbjct: 173 PKPTIVVLYQDNKDARHVKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSIT+AYGRVDADGSRYLLGDH GLLHLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSASAFKAIPIRPSITRAYGRVDADGSRYLLGDHNGLLHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLQPDTKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS+TDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSATDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+DTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 YDTFLVVSFISETRVLAMNLEDELEETEIEGFNSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SSTSR+L+NEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDG+L EVK+A+L+Y+I
Sbjct: 473 SSTSRDLKNEWFAPVGYSVNVATANATQVLLATGGGHLVYLEIGDGVLNEVKYAKLDYDI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP+YS IAAVGMWTDISVRI+SLPDLNLITKE LGGEIIPRSVL+C+FEG
Sbjct: 533 SCLDINPIGENPNYSNIAAVGMWTDISVRIYSLPDLNLITKEQLGGEIIPRSVLMCSFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+L+M TGELTDRKKVSLGTQPITLRTFSSK+TTHVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFVLSMSTGELTDRKKVSLGTQPITLRTFSSKDTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFN AAFPDSLAIAKEGELTIGTID+IQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNVAAFPDSLAIAKEGELTIGTIDEIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRI HQEQ+RTFA+CS+K QS A++ EMHFVRLLDDQTFEFISTYPLD FEYGCSI
Sbjct: 713 EHARRISHQEQTRTFALCSVKYTQSNADDPEMHFVRLLDDQTFEFISTYPLDQFEYGCSI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYC+GTAYV+PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 773 LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
NGKLLAAINQKIQLYKW R+D G+RELQ+ECGHHGHILALYVQTRGDFIVVGDLMKSIS
Sbjct: 833 NGKLLAAINQKIQLYKWASREDGGSRELQTECGHHGHILALYVQTRGDFIVVGDLMKSIS 892
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LLI+KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER R
Sbjct: 893 LLIFKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERSR 952
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QYLFLE
Sbjct: 953 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLE 1012
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQTNLRKVIKGVGGL+HEQWRSF NEKKTVDAKNFLDGDLIESFLDLSR RM+EISK M
Sbjct: 1013 KLQTNLRKVIKGVGGLSHEQWRSFYNEKKTVDAKNFLDGDLIESFLDLSRNRMEEISKAM 1072
Query: 1019 NVSVEELCKRVEELTRLH 1036
+V VEEL KRVEELTRLH
Sbjct: 1073 SVPVEELMKRVEELTRLH 1090
|
Source: Solanum lycopersicum Species: Solanum lycopersicum Genus: Solanum Family: Solanaceae Order: Solanales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName: Full=UV-damaged DNA-binding protein 1 gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae] | Back alignment and taxonomy information |
|---|
Score = 1964 bits (5088), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 959/1038 (92%), Positives = 1004/1038 (96%), Gaps = 2/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGE QD LFIATERYKFCVLQWD E+SE+ITRAMGDVSDRIG
Sbjct: 58 MLDVPIYGRIATLELFRPHGETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSDRIG 117
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 118 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 177
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKDARHVKTYEV+LKDKDF+EGPW+QNNLDNGA LLIPVPPPLCGVLII
Sbjct: 178 PKPTIVVLYQDNKDARHVKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLII 237
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSIT+AYGRVDADGSRYLLGDH GLLHLLVITHEKEKVTGL
Sbjct: 238 GEETIVYCSASAFKAIPIRPSITRAYGRVDADGSRYLLGDHNGLLHLLVITHEKEKVTGL 297
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 298 KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLQPDTKGSYVEVLERYVNLGPIV 357
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS+TDDP
Sbjct: 358 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSATDDP 417
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+DTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 418 YDTFLVVSFISETRVLAMNLEDELEETEIEGFNSQVQTLFCHDAVYNQLVQVTSNSVRLV 477
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SSTSR+L+NEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDG+L EVK+A+L+Y+I
Sbjct: 478 SSTSRDLKNEWFAPVGYSVNVATANATQVLLATGGGHLVYLEIGDGVLNEVKYAKLDYDI 537
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP+YS IAAVGMWTDISVRI+SLPDLNLITKE LGGEIIPRSVL+C+FEG
Sbjct: 538 SCLDINPIGENPNYSNIAAVGMWTDISVRIYSLPDLNLITKEQLGGEIIPRSVLMCSFEG 597
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+L+M TGELTDRKKVSLGTQPITLRTFSSK+TTHVFAASDRPTV
Sbjct: 598 ISYLLCALGDGHLLNFVLSMSTGELTDRKKVSLGTQPITLRTFSSKDTTHVFAASDRPTV 657
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFN AAFPDSLAIAKEGELTIGTID+IQKLHIRSIPLG
Sbjct: 658 IYSSNKKLLYSNVNLKEVSHMCPFNVAAFPDSLAIAKEGELTIGTIDEIQKLHIRSIPLG 717
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRI HQEQ+RTFA+CS+K QS A++ EMHFVRLLDDQTFEFISTYPLD FEYGCSI
Sbjct: 718 EHARRISHQEQTRTFALCSVKYTQSNADDPEMHFVRLLDDQTFEFISTYPLDQFEYGCSI 777
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYC+GTAYV+PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 778 LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 837
Query: 780 NGKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
NGKLLAAINQKIQLYKW R+D G+RELQ+ECGHHGHILALYVQTRGDFIVVGDLMKSIS
Sbjct: 838 NGKLLAAINQKIQLYKWASREDGGSRELQTECGHHGHILALYVQTRGDFIVVGDLMKSIS 897
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LLI+KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER R
Sbjct: 898 LLIFKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERSR 957
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QYLFLE
Sbjct: 958 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLE 1017
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQTNLRKVIKGVGGL+HEQWRSF NEKKTVDAKNFLDGDLIESFLDLSR RM+EISK M
Sbjct: 1018 KLQTNLRKVIKGVGGLSHEQWRSFYNEKKTVDAKNFLDGDLIESFLDLSRNRMEEISKAM 1077
Query: 1019 NVSVEELCKRVEELTRLH 1036
+V VEEL KRVEELTRLH
Sbjct: 1078 SVPVEELMKRVEELTRLH 1095
|
Source: Solanum cheesmaniae Species: Solanum cheesmaniae Genus: Solanum Family: Solanaceae Order: Solanales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 1036 | ||||||
| TAIR|locus:2115909 | 1088 | DDB1A "damaged DNA binding pro | 1.0 | 0.952 | 0.899 | 0.0 | |
| TAIR|locus:2127368 | 1088 | DDB1B "damaged DNA binding pro | 1.0 | 0.952 | 0.872 | 0.0 | |
| UNIPROTKB|E2R9E3 | 1140 | DDB1 "Uncharacterized protein" | 0.650 | 0.591 | 0.550 | 3.5e-297 | |
| UNIPROTKB|A1A4K3 | 1140 | DDB1 "DNA damage-binding prote | 0.650 | 0.591 | 0.550 | 5.7e-297 | |
| MGI|MGI:1202384 | 1140 | Ddb1 "damage specific DNA bind | 0.650 | 0.591 | 0.550 | 5.7e-297 | |
| UNIPROTKB|F1RIE2 | 1140 | DDB1 "Uncharacterized protein" | 0.650 | 0.591 | 0.548 | 1.2e-296 | |
| UNIPROTKB|Q16531 | 1140 | DDB1 "DNA damage-binding prote | 0.650 | 0.591 | 0.548 | 1.5e-296 | |
| UNIPROTKB|P33194 | 1140 | DDB1 "DNA damage-binding prote | 0.650 | 0.591 | 0.548 | 1.5e-296 | |
| UNIPROTKB|Q5R649 | 1140 | DDB1 "DNA damage-binding prote | 0.650 | 0.591 | 0.548 | 4e-296 | |
| UNIPROTKB|F1P4I8 | 1120 | DDB1 "DNA damage-binding prote | 0.650 | 0.601 | 0.544 | 1.1e-295 |
| TAIR|locus:2115909 DDB1A "damaged DNA binding protein 1A" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 4875 (1721.1 bits), Expect = 0., P = 0.
Identities = 932/1036 (89%), Positives = 983/1036 (94%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDPESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQN+LDNGADLLIPVPPPLCGVLII
Sbjct: 173 AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSITKAYGRVD DGSRY VITHEKEKVTGL
Sbjct: 233 GEETIVYCSASAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGMIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNL GF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473 SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+AAVGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLAAVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF ++ TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFQMDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTF ICSL NQS +EESEMHFVRLLDDQTFEF+STYPLD+FEYGCSIL
Sbjct: 713 EHARRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF++D NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFN
Sbjct: 773 SCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
+YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893 LYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLE 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
|
|
| TAIR|locus:2127368 DDB1B "damaged DNA binding protein 1B" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 4744 (1675.0 bits), Expect = 0., P = 0.
Identities = 904/1036 (87%), Positives = 967/1036 (93%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVP+YGRIAT+ELFRPHGEAQDFLF+ATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 ILDVPLYGRIATMELFRPHGEAQDFLFVATERYKFCVLQWDYESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCR+IGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRVIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTI VLYQDNKDARHVKTYEV+LKDK+FVEGPWSQNNLDNGADLLIPVP PLCGVLII
Sbjct: 173 TKPTIAVLYQDNKDARHVKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRY VITHEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDAKGSYVE+LE+YVNLGPIV
Sbjct: 293 KIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKGSYVEILEKYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMN+ GF S+ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNIEDELEETEIEGFLSEVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELRN+W +P G+SVNVATANASQVLLATGGGHLVYLEIGDG LTEVKH LEYE+
Sbjct: 473 SSTTRELRNKWDAPAGFSVNVATANASQVLLATGGGHLVYLEIGDGTLTEVKHVLLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+AAVGMWTDISVRIF LPDL LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLAAVGMWTDISVRIFVLPDLTLITKEELGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF L+ G+L DRKKVSLGT+PITLRTFSSK+ THVFAASDRP V
Sbjct: 593 ISYLLCALGDGHLLNFQLDTSCGKLRDRKKVSLGTRPITLRTFSSKSATHVFAASDRPAV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYS+NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IP+G
Sbjct: 653 IYSNNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPIG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTFAI L+N+ AEESE HFVRLLD Q+FEF+S+YPLD FE GCSIL
Sbjct: 713 EHARRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQSFEFLSSYPLDAFECGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI EKETKGAVYSLNAFN
Sbjct: 773 SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLA+INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI VGDLMKSISLL
Sbjct: 833 GKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIAVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
IYKHEEGAIEERARDYNANWM+AVEIL+DDIYLG +N FN+FTV+KN+EGATDEER R+E
Sbjct: 893 IYKHEEGAIEERARDYNANWMTAVEILNDDIYLGTDNCFNIFTVKKNNEGATDEERARME 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYH+GEFVNRFRHGSLVM+LPDSD+GQIPTVIFGTV+G+IGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIPTVIFGTVSGMIGVIASLPQEQYAFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
QT+LRKVIKGVGGL+HEQWRSFNNEK+T +AK +LDGDLIESFLDLSR +M+EISK M+V
Sbjct: 1013 QTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESFLDLSRGKMEEISKGMDV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
|
|
| UNIPROTKB|E2R9E3 DDB1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] | Back alignment and assigned GO terms |
|---|
Score = 1937 (686.9 bits), Expect = 3.5e-297, Sum P(2) = 3.5e-297
Identities = 378/687 (55%), Positives = 498/687 (72%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK- 236
G+E+I Y + + + AI PI + S + RVD +GSRY ++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 237 --VT--GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
VT L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICS 679
HIR++PL E PR+IC+QE S+ F + S
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLS 737
|
|
| UNIPROTKB|A1A4K3 DDB1 "DNA damage-binding protein 1" [Bos taurus (taxid:9913)] | Back alignment and assigned GO terms |
|---|
Score = 1935 (686.2 bits), Expect = 5.7e-297, Sum P(2) = 5.7e-297
Identities = 378/687 (55%), Positives = 498/687 (72%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK- 236
G+E+I Y + + + AI PI + S + RVD +GSRY ++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 237 --VT--GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
VT L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARIAKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICS 679
HIR++PL E PR+IC+QE S+ F + S
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLS 737
|
|
| MGI|MGI:1202384 Ddb1 "damage specific DNA binding protein 1" [Mus musculus (taxid:10090)] | Back alignment and assigned GO terms |
|---|
Score = 1935 (686.2 bits), Expect = 5.7e-297, Sum P(2) = 5.7e-297
Identities = 378/687 (55%), Positives = 497/687 (72%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK- 236
G+E+I Y + + + AI PI + S + RVD +GSRY ++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 237 --VT--GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
VT L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N GF QT FC + + QL+Q+
Sbjct: 413 LRSDPGRETDDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICS 679
HIR++PL E PR+IC+QE S+ F + S
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLS 737
|
|
| UNIPROTKB|F1RIE2 DDB1 "Uncharacterized protein" [Sus scrofa (taxid:9823)] | Back alignment and assigned GO terms |
|---|
Score = 1932 (685.2 bits), Expect = 1.2e-296, Sum P(2) = 1.2e-296
Identities = 377/687 (54%), Positives = 498/687 (72%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK- 236
G+E+I Y + + + AI PI + S + RVD +GSRY ++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 237 --VT--GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
VT L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N++QV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSNQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARISKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICS 679
HIR++PL E PR+IC+QE S+ F + S
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLS 737
|
|
| UNIPROTKB|Q16531 DDB1 "DNA damage-binding protein 1" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
Score = 1931 (684.8 bits), Expect = 1.5e-296, Sum P(2) = 1.5e-296
Identities = 377/687 (54%), Positives = 497/687 (72%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK- 236
G+E+I Y + + + AI PI + S + RVD +GSRY ++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 237 --VT--GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
VT L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICS 679
HIR++PL E PR+IC+QE S+ F + S
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLS 737
|
|
| UNIPROTKB|P33194 DDB1 "DNA damage-binding protein 1" [Chlorocebus aethiops (taxid:9534)] | Back alignment and assigned GO terms |
|---|
Score = 1931 (684.8 bits), Expect = 1.5e-296, Sum P(2) = 1.5e-296
Identities = 377/687 (54%), Positives = 497/687 (72%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK- 236
G+E+I Y + + + AI PI + S + RVD +GSRY ++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 237 --VT--GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
VT L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICS 679
HIR++PL E PR+IC+QE S+ F + S
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLS 737
|
|
| UNIPROTKB|Q5R649 DDB1 "DNA damage-binding protein 1" [Pongo abelii (taxid:9601)] | Back alignment and assigned GO terms |
|---|
Score = 1932 (685.2 bits), Expect = 4.0e-296, Sum P(2) = 4.0e-296
Identities = 377/687 (54%), Positives = 497/687 (72%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNVCILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK- 236
G+E+I Y + + + AI PI + S + RVD +GSRY ++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 237 --VT--GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
VT L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICS 679
HIR++PL E PR+IC+QE S+ F + S
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLS 737
|
|
| UNIPROTKB|F1P4I8 DDB1 "DNA damage-binding protein 1" [Gallus gallus (taxid:9031)] | Back alignment and assigned GO terms |
|---|
Score = 1923 (682.0 bits), Expect = 1.1e-295, Sum P(2) = 1.1e-295
Identities = 374/687 (54%), Positives = 498/687 (72%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+ A +ELFRP GE++D LFI T +Y C+L++ + ++ ++ITRA G+V DRIG
Sbjct: 34 EVGMYGKTAVMELFRPKGESKDLLFILTAKYNACILEYKQNGDNIDIITRAHGNVQDRIG 93
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + + +AFNIRLEELQV+D+KFLYGC
Sbjct: 94 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRENKELKAFNIRLEELQVIDVKFLYGC 153
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 154 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 212
Query: 181 GEETIVYCSANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK- 236
G+E+I Y + + + AI PI + S + RVD +GSRY ++ E++
Sbjct: 213 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 272
Query: 237 --VT--GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
VT L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 273 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 332
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 333 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 392
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N GF QT FC + + QL+Q+
Sbjct: 393 LRSDSHREMDNMLVLSFVGQTRVLMLN-GEEVEETELTGFVDDQQTFFCGNVAHQQLIQI 451
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N++QV++A G L YLEI L ++
Sbjct: 452 TSASVRLVSQEPKALVSEWKEPNGKNISVASCNSNQVVVAVGRA-LYYLEIRPQELRQIN 510
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLDI P+G+ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 511 CTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 570
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F L+++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 571 ILMTTFESSHYLLCALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 630
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 631 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 690
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICS 679
HIR++PL E PR+IC+QE S+ F + S
Sbjct: 691 HIRTVPLYESPRKICYQEVSQCFGVLS 717
|
|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Annotation Transfer from SWISS-PROT Entries 
Original result of BLAST against SWISS-PROT
ID ![]() | Name ![]() | Annotated EC number ![]() | Identity ![]() | Query coverage ![]() | Hit coverage ![]() | RBH(Q2H) ![]() | RBH(H2Q) ![]() |
| Q9ESW0 | DDB1_RAT | No assigned EC number | 0.5219 | 0.9913 | 0.9008 | yes | no |
| Q6E7D1 | DDB1_SOLCE | No assigned EC number | 0.9238 | 1.0 | 0.9461 | N/A | no |
| Q3U1J4 | DDB1_MOUSE | No assigned EC number | 0.5292 | 0.9913 | 0.9008 | yes | no |
| Q21554 | DDB1_CAEEL | No assigned EC number | 0.3351 | 0.9816 | 0.8968 | yes | no |
| Q6P6Z0 | DDB1_XENLA | No assigned EC number | 0.5292 | 0.9913 | 0.9008 | N/A | no |
| Q5R649 | DDB1_PONAB | No assigned EC number | 0.5274 | 0.9913 | 0.9008 | yes | no |
| Q9XYZ5 | DDB1_DROME | No assigned EC number | 0.4631 | 0.9864 | 0.8964 | yes | no |
| Q805F9 | DDB1_CHICK | No assigned EC number | 0.5265 | 0.9913 | 0.9008 | yes | no |
| B0M0P5 | DDB1_DICDI | No assigned EC number | 0.4384 | 0.9884 | 0.8670 | yes | no |
| Q6QNU4 | DDB1_SOLLC | No assigned EC number | 0.9238 | 1.0 | 0.9504 | N/A | no |
| Q9M0V3 | DDB1A_ARATH | No assigned EC number | 0.9189 | 1.0 | 0.9522 | yes | no |
| P33194 | DDB1_CHLAE | No assigned EC number | 0.5283 | 0.9913 | 0.9008 | N/A | no |
| O49552 | DDB1B_ARATH | No assigned EC number | 0.8928 | 1.0 | 0.9522 | no | no |
| Q16531 | DDB1_HUMAN | No assigned EC number | 0.5283 | 0.9913 | 0.9008 | yes | no |
| A1A4K3 | DDB1_BOVIN | No assigned EC number | 0.5292 | 0.9913 | 0.9008 | yes | no |
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| Your Input: | |||||||||||
| GSVIVG00028385001 | SubName- Full=Chromosome chr10 scaffold_43, whole genome shotgun sequence; (1089 aa) | ||||||||||
(Vitis vinifera) | |||||||||||
Predicted Functional Partners: | |||||||||||
| GSVIVG00024110001 | SubName- Full=Putative uncharacterized protein (Chromosome chr6 scaffold_3, whole genome shotgu [...] (559 aa) | • | • | • | 0.804 | ||||||
| GSVIVG00014959001 | SubName- Full=Chromosome chr18 scaffold_1, whole genome shotgun sequence; (462 aa) | • | • | 0.741 | |||||||
| cul4 | SubName- Full=Chromosome chr10 scaffold_43, whole genome shotgun sequence; (802 aa) | • | • | 0.735 | |||||||
| GSVIVG00002569001 | SubName- Full=Chromosome chr2 scaffold_132, whole genome shotgun sequence; (779 aa) | • | • | • | 0.702 | ||||||
| GSVIVG00023628001 | SubName- Full=Chromosome chr7 scaffold_31, whole genome shotgun sequence; (467 aa) | • | • | 0.700 | |||||||
| GSVIVG00023341001 | SubName- Full=Chromosome chr8 scaffold_29, whole genome shotgun sequence; (433 aa) | • | • | • | 0.668 | ||||||
| GSVIVG00023710001 | SubName- Full=Chromosome chr7 scaffold_31, whole genome shotgun sequence; (758 aa) | • | • | • | 0.651 | ||||||
| GSVIVG00022868001 | SubName- Full=Chromosome chr14 scaffold_27, whole genome shotgun sequence; (379 aa) | • | • | • | 0.624 | ||||||
| GSVIVG00016529001 | SubName- Full=Chromosome chr11 scaffold_13, whole genome shotgun sequence; (372 aa) | • | • | 0.611 | |||||||
| GSVIVG00024178001 | RecName- Full=Proliferating cell nuclear antigen;; This protein is an auxiliary protein of DNA [...] (266 aa) | • | • | • | • | 0.596 | |||||
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 1036 | |||
| pfam10433 | 513 | pfam10433, MMS1_N, Mono-functional DNA-alkylating | 1e-106 | |
| pfam03178 | 318 | pfam03178, CPSF_A, CPSF A subunit region | 6e-89 | |
| COG5161 | 1319 | COG5161, SFT1, Pre-mRNA cleavage and polyadenylati | 5e-12 |
| >gnl|CDD|220751 pfam10433, MMS1_N, Mono-functional DNA-alkylating methyl methanesulfonate N-term | Back alignment and domain information |
|---|
Score = 340 bits (874), Expect = e-106
Identities = 142/528 (26%), Positives = 235/528 (44%), Gaps = 78/528 (14%)
Query: 24 DFLFIATERYKFCVLQWDAESSELITRAM---GDVSDRIGRPTDNGQIGIIDPDCRLIGL 80
D L ++ + + L +D E +L D+ + R +G+ +DP R + +
Sbjct: 1 DILVLSLDSGELLFLYYDEEPRQLFPEFYHSRRDLPSDVSRLEQSGRHIAVDPSGRAMAV 60
Query: 81 HLYDGLFKVIPFDNKGQLKEA--------------FNIRLEELQVLDIKFLYGCA----K 122
Y G F V P + +E RL + +LD+ FLY +
Sbjct: 61 SAYSGFFGVYPLQRDNKTQELQSQMAEGPLSPIEEERFRLVDGDILDMCFLYPKSEDDDH 120
Query: 123 PTIVVLYQDNKDARHVKTYEVALKD-------KDFVEGPWSQNNLDNGADLLIPVPPPLC 175
+++L D + H+K YE + + K+ ++ L + LLIP+P P
Sbjct: 121 VILLLLVFDEQRRLHLKLYEWSAGEGLNQVISKETRLPLPNEFELPS---LLIPLPKP-G 176
Query: 176 GVLIIGEETIVYCSAN-------AFKAIPI--RPSITKAYGRVDADGS------RYLLGD 220
L++G ++VY + + F + I SI A+ R D + S LL
Sbjct: 177 SFLVVGPTSLVYYTVHLIESAEYPFLRLEIGSDKSIWTAWARPDRNWSYNEKHDDILLAR 236
Query: 221 HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN----AVVYIGSSYGDSQLIKL 276
G ++ L I+++ E V + I +G+ S+A + LD V+++ S GD L K+
Sbjct: 237 EDGRIYYLEISNDGE-VESITILGIGKCSVAIAFAILDKGNEGDVLFVASDTGDGGLFKI 295
Query: 277 NLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQG-------QVVTCSGAYKDGSLRI 329
+ + + ++ + N PI+DF VVD ++ TCSG K GSLR
Sbjct: 296 SARDN-----PRCVQDFPNWAPILDFVVVDSGNSKNESQDPSQDRIFTCSGVGKRGSLRE 350
Query: 330 VRNGIGINEQASVE--LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLE--DELE 385
+R+GI +E L+GI+G+W L S ++ +L+VSF ETR+L + ++ +E+E
Sbjct: 351 LRHGIEAEIGLVIEEELRGIRGLWLLPSGSNG--GYYLLVSFPFETRVLQLPVDPGEEVE 408
Query: 386 ETEIE-GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATA 444
E + + G QTL + ++QVT S+RL ++ +EW + A+
Sbjct: 409 EVDEDSGLDLSVQTLAAGNTSDGVIIQVTENSIRLSDLELGKITDEW---SDEIITAASV 465
Query: 445 NASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY----EISCLDINPI 488
N S VL+A G +LV L I G+L L EI CLD PI
Sbjct: 466 NGSLVLVAVAGRNLVSLRIYLGLLVIEDDGSLTLLDLGEILCLDYEPI 513
|
MMS1 is a protein that protects against replication-dependent DNA damage in Saccharomyces cerevisiae. MMS1 belongs to the DDB1 family of cullin 4 adaptors and the two proteins are homologous. MMS1 bridges the interaction of MMS22 and Crt10 with Cul8/Rtt101. Cul8/Rtt101 is a cullin protein involved in the regulation of DNA replication subsequent to DNA damage. The N-terminal region of MMS1 and the C-terminal of MMS22 are required for the the MMS1-MMS22 interaction. The human HIV-1 virion-associated protein Vpr assembles with DDB1 through interaction with DCAF1 (chromatin assembly factor) to form an E3 ubiquitin ligase that targets cellular substrates for proteasome-mediated degradation and subsequent G2 arrest. Length = 513 |
| >gnl|CDD|217409 pfam03178, CPSF_A, CPSF A subunit region | Back alignment and domain information |
|---|
Score = 287 bits (737), Expect = 6e-89
Identities = 110/324 (33%), Positives = 169/324 (52%), Gaps = 17/324 (5%)
Query: 692 HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS--DDSNVYYCVGTAYVLPEENEPTKG 749
+RL+D T+E I T L+ E S+ S + + Y VGTA+ L E+ G
Sbjct: 2 SCIRLVDPITWEVIDTLELEENEAVLSVKSVNLEDSEGRKEYLVVGTAFDLGEDPAARSG 61
Query: 750 RILVF----IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRE 805
RI VF + KL+L+ + E KGAV +L F G+LLA QK+++Y G +
Sbjct: 62 RIYVFEIIEPETNRKLKLVHKTEVKGAVTALCEFQGRLLAGQGQKLRVYDL-----GKDK 116
Query: 806 LQSECGHHGH-ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV 864
L + + ++ G+ I+VGDLMKS++ L Y E + ARD W++A
Sbjct: 117 LLPKAFLDTPITYVVSLKVFGNRIIVGDLMKSVTFLGYDEEPYRLILFARDTQPRWVTAA 176
Query: 865 EIL-DDDIYLGAENNFNLFTVRKNSEGA-TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR 922
E L D D LGA+ NL +R + E + + RL E+HLG+ V F+ GSLV +
Sbjct: 177 EFLVDYDTILGADKFGNLHVLRYDPEAPESLDGDPRLLHRAEFHLGDIVTSFQKGSLVPK 236
Query: 923 LPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS 981
++ P +++GT++G IG ++ + E+Y L+ LQ LR + + GL+ +RS
Sbjct: 237 TGGAESTSSPQILYGTLDGSIGLLVPFISEEEYRRLQHLQQQLRDELPHLCGLDPRAFRS 296
Query: 982 FNNEKKTVDAKNFLDGDLIESFLD 1005
+ + V KN +DGDL+E FLD
Sbjct: 297 YYSRSPPV--KNVIDGDLLERFLD 318
|
This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs. The function of the aligned region is unknown but may be involved in RNA/DNA binding. Length = 318 |
| >gnl|CDD|227490 COG5161, SFT1, Pre-mRNA cleavage and polyadenylation specificity factor [RNA processing and modification] | Back alignment and domain information |
|---|
Score = 70.4 bits (172), Expect = 5e-12
Identities = 85/457 (18%), Positives = 170/457 (37%), Gaps = 57/457 (12%)
Query: 603 SSNKKLLYSNVNLKEVSHMCPFNSAAF--PDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
SS+K N+ L V P + + D++ + + K ++ P
Sbjct: 852 SSSKAFHRGNIPLVSVI---PLSKRGYLMVDNVLGVRASQYVFDNGYVGNKNPVKRTPKH 908
Query: 661 EHPRRICHQEQSRTFAICSLK------------------NQSCAEESEMHFVRLLDDQTF 702
+ +++ + R + S + E +V L +++
Sbjct: 909 KTLQKLVYHCAGRYMVVGSCEEAGFSPKGEDGESGIPVDTNVPHAEGYRFYVDLYSPKSW 968
Query: 703 EFISTYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IV 756
E I TY D EY I D Y VGT ++ E+ P +GR+ V I+
Sbjct: 969 EVIDTYEFDENEYVFHIKYLILDDMQGTKGKSPYILVGTTFI-EGEDRPARGRLHVLEII 1027
Query: 757 E----------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
D KL+++ +ETKG V + GK+ QK+ + K + R G +
Sbjct: 1028 SVVPSPGSPFTDCKLKVLGIEETKGTVVRVCEVRGKIALCQGQKVMVRK-IDRSSGIIPV 1086
Query: 807 QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
+ HI ++ + ++ GD+ + +S ++ E + + ++ E
Sbjct: 1087 GF---YDLHIFTSSIKVVKNLLLAGDIYQGLSFFGFQSEPYRMHLISSSEPLRNATSTEF 1143
Query: 867 L--DDDIYL---GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVM 921
L +++Y A+ N + T N+ + RL + L + +L+
Sbjct: 1144 LVTGNELYFLCCDAKGNIHGLTYSPNNPISMSGA--RLVKRSSFTLHSAEIKM---NLLP 1198
Query: 922 RLPDSDVGQIPTVI--FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
R + G I + +G++ + + Y L +QT + +K VGGLN +
Sbjct: 1199 RNSEFGAGFKKNFIMVYSRSDGMLIHVVPISDAHYRRLLGIQTAIMARLKSVGGLNPRDY 1258
Query: 980 RSFNN-EKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
R ++ ++ ++ LD +I F + + ++
Sbjct: 1259 RLNSDIHLHSLSLRSPLDLHIINLFSYFDMSTRESVA 1295
|
Length = 1319 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 1036 | |||
| KOG1897 | 1096 | consensus Damage-specific DNA binding complex, sub | 100.0 | |
| KOG1896 | 1366 | consensus mRNA cleavage and polyadenylation factor | 100.0 | |
| KOG1898 | 1205 | consensus Splicing factor 3b, subunit 3 [RNA proce | 100.0 | |
| COG5161 | 1319 | SFT1 Pre-mRNA cleavage and polyadenylation specifi | 100.0 | |
| PF10433 | 504 | MMS1_N: Mono-functional DNA-alkylating methyl meth | 100.0 | |
| PF03178 | 321 | CPSF_A: CPSF A subunit region; InterPro: IPR004871 | 100.0 | |
| KOG0318 | 603 | consensus WD40 repeat stress protein/actin interac | 98.91 | |
| KOG2048 | 691 | consensus WD40 repeat protein [General function pr | 98.89 | |
| KOG1036 | 323 | consensus Mitotic spindle checkpoint protein BUB3, | 98.13 | |
| KOG0291 | 893 | consensus WD40-repeat-containing subunit of the 18 | 98.04 | |
| PRK11028 | 330 | 6-phosphogluconolactonase; Provisional | 98.04 | |
| KOG0319 | 775 | consensus WD40-repeat-containing subunit of the 18 | 98.03 | |
| PF10282 | 345 | Lactonase: Lactonase, 7-bladed beta-propeller; Int | 97.87 | |
| cd00200 | 289 | WD40 WD40 domain, found in a number of eukaryotic | 97.83 | |
| KOG0318 | 603 | consensus WD40 repeat stress protein/actin interac | 97.82 | |
| KOG1446 | 311 | consensus Histone H3 (Lys4) methyltransferase comp | 97.82 | |
| PF03178 | 321 | CPSF_A: CPSF A subunit region; InterPro: IPR004871 | 97.81 | |
| KOG2106 | 626 | consensus Uncharacterized conserved protein, conta | 97.8 | |
| KOG0306 | 888 | consensus WD40-repeat-containing subunit of the 18 | 97.78 | |
| cd00200 | 289 | WD40 WD40 domain, found in a number of eukaryotic | 97.6 | |
| PF10282 | 345 | Lactonase: Lactonase, 7-bladed beta-propeller; Int | 97.59 | |
| KOG1539 | 910 | consensus WD repeat protein [General function pred | 97.55 | |
| PRK11028 | 330 | 6-phosphogluconolactonase; Provisional | 97.45 | |
| KOG0647 | 347 | consensus mRNA export protein (contains WD40 repea | 97.44 | |
| KOG2055 | 514 | consensus WD40 repeat protein [General function pr | 97.4 | |
| PF02239 | 369 | Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO | 97.39 | |
| KOG0319 | 775 | consensus WD40-repeat-containing subunit of the 18 | 97.36 | |
| KOG1408 | 1080 | consensus WD40 repeat protein [Function unknown] | 97.29 | |
| KOG0291 | 893 | consensus WD40-repeat-containing subunit of the 18 | 97.25 | |
| KOG2106 | 626 | consensus Uncharacterized conserved protein, conta | 97.16 | |
| KOG1274 | 933 | consensus WD40 repeat protein [General function pr | 97.09 | |
| KOG0310 | 487 | consensus Conserved WD40 repeat-containing protein | 97.06 | |
| KOG1273 | 405 | consensus WD40 repeat protein [General function pr | 96.8 | |
| KOG2055 | 514 | consensus WD40 repeat protein [General function pr | 96.74 | |
| COG2706 | 346 | 3-carboxymuconate cyclase [Carbohydrate transport | 96.59 | |
| KOG0315 | 311 | consensus G-protein beta subunit-like protein (con | 96.58 | |
| KOG0296 | 399 | consensus Angio-associated migratory cell protein | 96.54 | |
| PLN00181 | 793 | protein SPA1-RELATED; Provisional | 96.48 | |
| TIGR02658 | 352 | TTQ_MADH_Hv methylamine dehydrogenase heavy chain. | 96.43 | |
| TIGR03866 | 300 | PQQ_ABC_repeats PQQ-dependent catabolism-associate | 96.34 | |
| KOG0647 | 347 | consensus mRNA export protein (contains WD40 repea | 96.28 | |
| KOG0285 | 460 | consensus Pleiotropic regulator 1 [RNA processing | 96.27 | |
| KOG1036 | 323 | consensus Mitotic spindle checkpoint protein BUB3, | 96.24 | |
| PF08596 | 395 | Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; | 96.19 | |
| KOG1539 | 910 | consensus WD repeat protein [General function pred | 95.99 | |
| KOG1407 | 313 | consensus WD40 repeat protein [Function unknown] | 95.64 | |
| KOG2110 | 391 | consensus Uncharacterized conserved protein, conta | 95.63 | |
| KOG0263 | 707 | consensus Transcription initiation factor TFIID, s | 95.37 | |
| TIGR03866 | 300 | PQQ_ABC_repeats PQQ-dependent catabolism-associate | 95.35 | |
| KOG2111 | 346 | consensus Uncharacterized conserved protein, conta | 95.29 | |
| KOG2048 | 691 | consensus WD40 repeat protein [General function pr | 95.26 | |
| KOG0282 | 503 | consensus mRNA splicing factor [Function unknown] | 95.24 | |
| KOG0283 | 712 | consensus WD40 repeat-containing protein [Function | 95.2 | |
| PF00780 | 275 | CNH: CNH domain; InterPro: IPR001180 Based on sequ | 95.15 | |
| KOG2110 | 391 | consensus Uncharacterized conserved protein, conta | 95.03 | |
| KOG0276 | 794 | consensus Vesicle coat complex COPI, beta' subunit | 94.94 | |
| KOG0310 | 487 | consensus Conserved WD40 repeat-containing protein | 94.8 | |
| KOG4378 | 673 | consensus Nuclear protein COP1 [Signal transductio | 94.7 | |
| KOG0282 | 503 | consensus mRNA splicing factor [Function unknown] | 94.7 | |
| KOG0306 | 888 | consensus WD40-repeat-containing subunit of the 18 | 94.69 | |
| KOG0278 | 334 | consensus Serine/threonine kinase receptor-associa | 94.68 | |
| PF00780 | 275 | CNH: CNH domain; InterPro: IPR001180 Based on sequ | 94.68 | |
| PF14727 | 418 | PHTB1_N: PTHB1 N-terminus | 94.63 | |
| TIGR02658 | 352 | TTQ_MADH_Hv methylamine dehydrogenase heavy chain. | 94.62 | |
| KOG0283 | 712 | consensus WD40 repeat-containing protein [Function | 94.44 | |
| KOG0294 | 362 | consensus WD40 repeat-containing protein [Function | 94.43 | |
| KOG1273 | 405 | consensus WD40 repeat protein [General function pr | 94.35 | |
| KOG0316 | 307 | consensus Conserved WD40 repeat-containing protein | 94.31 | |
| TIGR03300 | 377 | assembly_YfgL outer membrane assembly lipoprotein | 94.17 | |
| PLN00181 | 793 | protein SPA1-RELATED; Provisional | 93.8 | |
| KOG0275 | 508 | consensus Conserved WD40 repeat-containing protein | 93.67 | |
| PF14727 | 418 | PHTB1_N: PTHB1 N-terminus | 93.47 | |
| KOG4378 | 673 | consensus Nuclear protein COP1 [Signal transductio | 93.47 | |
| KOG0299 | 479 | consensus U3 snoRNP-associated protein (contains W | 93.35 | |
| KOG2321 | 703 | consensus WD40 repeat protein [General function pr | 93.33 | |
| KOG0649 | 325 | consensus WD40 repeat protein [General function pr | 93.33 | |
| KOG0289 | 506 | consensus mRNA splicing factor [General function p | 93.21 | |
| PTZ00420 | 568 | coronin; Provisional | 93.16 | |
| KOG0278 | 334 | consensus Serine/threonine kinase receptor-associa | 93.05 | |
| KOG2445 | 361 | consensus Nuclear pore complex component (sc Seh1) | 93.03 | |
| PRK11138 | 394 | outer membrane biogenesis protein BamB; Provisiona | 92.9 | |
| KOG1645 | 463 | consensus RING-finger-containing E3 ubiquitin liga | 92.8 | |
| KOG0316 | 307 | consensus Conserved WD40 repeat-containing protein | 92.75 | |
| KOG0289 | 506 | consensus mRNA splicing factor [General function p | 92.69 | |
| KOG0646 | 476 | consensus WD40 repeat protein [General function pr | 92.29 | |
| KOG0645 | 312 | consensus WD40 repeat protein [General function pr | 92.06 | |
| KOG0277 | 311 | consensus Peroxisomal targeting signal type 2 rece | 91.62 | |
| KOG0650 | 733 | consensus WD40 repeat nucleolar protein Bop1, invo | 91.4 | |
| KOG2321 | 703 | consensus WD40 repeat protein [General function pr | 91.31 | |
| KOG0772 | 641 | consensus Uncharacterized conserved protein, conta | 91.28 | |
| PF02239 | 369 | Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO | 91.27 | |
| KOG0640 | 430 | consensus mRNA cleavage stimulating factor complex | 91.15 | |
| KOG1897 | 1096 | consensus Damage-specific DNA binding complex, sub | 91.14 | |
| KOG0646 | 476 | consensus WD40 repeat protein [General function pr | 90.72 | |
| KOG0315 | 311 | consensus G-protein beta subunit-like protein (con | 90.22 | |
| TIGR03300 | 377 | assembly_YfgL outer membrane assembly lipoprotein | 90.21 | |
| COG2706 | 346 | 3-carboxymuconate cyclase [Carbohydrate transport | 89.69 | |
| KOG4649 | 354 | consensus PQQ (pyrrolo-quinoline quinone) repeat p | 89.68 | |
| KOG0294 | 362 | consensus WD40 repeat-containing protein [Function | 89.62 | |
| PRK11138 | 394 | outer membrane biogenesis protein BamB; Provisiona | 88.82 | |
| PF04053 | 443 | Coatomer_WDAD: Coatomer WD associated region ; Int | 88.71 | |
| KOG1274 | 933 | consensus WD40 repeat protein [General function pr | 88.6 | |
| KOG0266 | 456 | consensus WD40 repeat-containing protein [General | 88.5 | |
| KOG0650 | 733 | consensus WD40 repeat nucleolar protein Bop1, invo | 88.46 | |
| KOG0276 | 794 | consensus Vesicle coat complex COPI, beta' subunit | 88.43 | |
| KOG1538 | 1081 | consensus Uncharacterized conserved protein WDR10, | 87.91 | |
| PTZ00421 | 493 | coronin; Provisional | 87.54 | |
| KOG0266 | 456 | consensus WD40 repeat-containing protein [General | 87.49 | |
| KOG1034 | 385 | consensus Transcriptional repressor EED/ESC/FIE, r | 87.33 | |
| PF13360 | 238 | PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A | 86.44 | |
| KOG2096 | 420 | consensus WD40 repeat protein [General function pr | 86.19 | |
| KOG2111 | 346 | consensus Uncharacterized conserved protein, conta | 85.96 | |
| KOG3881 | 412 | consensus Uncharacterized conserved protein [Funct | 85.39 | |
| KOG4441 | 571 | consensus Proteins containing BTB/POZ and Kelch do | 84.52 | |
| KOG0772 | 641 | consensus Uncharacterized conserved protein, conta | 84.43 | |
| KOG0288 | 459 | consensus WD40 repeat protein TipD [General functi | 84.35 | |
| KOG0305 | 484 | consensus Anaphase promoting complex, Cdc20, Cdh1, | 84.14 | |
| KOG0649 | 325 | consensus WD40 repeat protein [General function pr | 83.94 | |
| KOG1587 | 555 | consensus Cytoplasmic dynein intermediate chain [C | 83.47 | |
| KOG1517 | 1387 | consensus Guanine nucleotide binding protein MIP1 | 83.46 | |
| PTZ00421 | 493 | coronin; Provisional | 83.36 | |
| KOG0284 | 464 | consensus Polyadenylation factor I complex, subuni | 83.23 | |
| PF06433 | 342 | Me-amine-dh_H: Methylamine dehydrogenase heavy cha | 82.87 | |
| KOG0268 | 433 | consensus Sof1-like rRNA processing protein (conta | 82.33 | |
| KOG2114 | 933 | consensus Vacuolar assembly/sorting protein PEP5/V | 82.23 | |
| KOG1517 | 1387 | consensus Guanine nucleotide binding protein MIP1 | 82.16 | |
| KOG0299 | 479 | consensus U3 snoRNP-associated protein (contains W | 82.09 | |
| KOG1188 | 376 | consensus WD40 repeat protein [General function pr | 82.03 | |
| PF12894 | 47 | Apc4_WD40: Anaphase-promoting complex subunit 4 WD | 81.95 | |
| PF14761 | 215 | HPS3_N: Hermansky-Pudlak syndrome 3 | 81.87 | |
| PHA02713 | 557 | hypothetical protein; Provisional | 81.31 | |
| KOG1896 | 1366 | consensus mRNA cleavage and polyadenylation factor | 81.28 | |
| KOG4441 | 571 | consensus Proteins containing BTB/POZ and Kelch do | 81.22 | |
| KOG0272 | 459 | consensus U4/U6 small nuclear ribonucleoprotein Pr | 81.15 | |
| PF08553 | 794 | VID27: VID27 cytoplasmic protein; InterPro: IPR013 | 80.24 |
| >KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair] | Back alignment and domain information |
|---|
Probab=100.00 E-value=3.8e-178 Score=1488.77 Aligned_cols=1025 Identities=63% Similarity=1.036 Sum_probs=973.5
Q ss_pred CeeEEEeEEEeEeeeeccCCCCccEEEEEeCcCcEEEEEEeCCCCcEEEEEeeeccCCCCccCCCCCeEEECCCCCEEEE
Q 001655 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGL 80 (1036)
Q Consensus 1 ~~~~~l~g~i~~i~~~r~~~~~~D~Llv~~~~~~l~il~~~~~~~~~~t~s~~~~~~~~~~~~~~~~~l~vdp~~r~~~~ 80 (1036)
++++++||+|..|+.|||+++++|+|+|+|+++++++|+||....+.+|...+...++.+|+.++|++++|||.+|.+++
T Consensus 51 i~sv~ifg~I~~i~~fRp~g~~kD~LfV~t~~~~~~iL~~d~~~~~vv~~a~~~v~dr~gr~s~~g~~~~VDp~~R~Igl 130 (1096)
T KOG1897|consen 51 ITSVPIFGTIATIALFRPPGSDKDYLFVATDSYRYFILEWDEESIQVVTRAHGDVSDRSGRPSDNGQILLVDPKGRVIGL 130 (1096)
T ss_pred eEeeccceeEEEEEeecCCCCCcceEEEEECcceEEEEEEccccceEEEEecccccccccccCCCceEEEECCCCcEEEE
Confidence 57899999999999999999999999999999999999999877889999999999988999999999999999999999
Q ss_pred EecCCeEEEEeCCCC-----CccccceecccCCceEEEEEeecCCCCCEEEEEEEecCCcceEEEEEEeccCCceeeccc
Q 001655 81 HLYDGLFKVIPFDNK-----GQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPW 155 (1036)
Q Consensus 81 ~~~~~~~~ilp~~~~-----~~~~~~~~~~l~~~~i~d~~FL~~~~~ptlavL~~~~~~~~~l~~~~~~~~~~~~~~~~~ 155 (1036)
++|+|+++|+|+.+. ..+...|++|+++.+|.||+||++...||+|+||.|. ..+|+++|+++++++++.++.|
T Consensus 131 ~~yqgl~~vIp~d~~~sht~~s~l~~fn~rfdel~v~Di~fly~~s~pt~~vly~Ds-~~~Hv~~yelnl~~ke~~~~~w 209 (1096)
T KOG1897|consen 131 HLYQGLFKVIPIDSDESHTGGSLLKAFNVRFDELNVYDIKFLYGCSDPTLAVLYKDS-DGRHVKTYELNLRDKEFVKGPW 209 (1096)
T ss_pred EeecCeEEEEEecccccccCcccccccccccCcceEEEEEEEcCCCCCceEEEEEcC-CCcEEEEEEeccchhhcccccc
Confidence 999999999999886 5566789999999999999999999999999999998 8899999999999999999999
Q ss_pred cccccCCCCCEEEecCCCCcceEEEeCceEEEEcCCceeeec---cCcceeeEEEEEecCCCEEEEEcCccCEEEEEEEe
Q 001655 156 SQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITH 232 (1036)
Q Consensus 156 ~~~~lp~~~~~lip~p~~~~G~lv~~~~~i~y~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~ll~~~~G~l~~l~l~~ 232 (1036)
+ +++..++..+||+|.|.||+||+|++.|+|+++..+.+++ .+++.+.||++++.++.+||++|++|+||++.+..
T Consensus 210 ~-~~v~~~a~~li~VP~~~gGvlV~ge~~I~Y~~~~~~~ai~p~~~~~~t~~~~~~v~~~~~~yLl~d~~G~Lf~l~l~~ 288 (1096)
T KOG1897|consen 210 S-NNVDNGASMLIPVPSPIGGVLVIGEEFIVYMSGDNFVAIAPLTAEQSTIVCYGRVDLQGSRYLLGDEDGMLFKLLLSH 288 (1096)
T ss_pred c-cccccCCceeeecCCCCceEEEEeeeEEEEeeCCceeEecccccCCceEEEcccccCCccEEEEecCCCcEEEEEeec
Confidence 9 9999999999999999999999999999999998877765 67788999999999999999999999999999998
Q ss_pred cCcEEee--EEEEEcCccccceEEEEecCCeEEEEEeecCeeEEEEeeCCCCCCCcEEEeeecccCCceeeeEEeccCCC
Q 001655 233 EKEKVTG--LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ 310 (1036)
Q Consensus 233 ~~~~v~~--l~i~~~~~~~~~s~l~~l~~~~lF~gS~~gds~l~~~~~~~~~~~~~~~~~~~l~n~~Pi~D~~~~~~~~~ 310 (1036)
.+.++.+ |+++++|++++|+||++|++|+||+||++|||+|+++..++|. ++|..++++++|+|||.||++.+.+++
T Consensus 289 ~~e~~s~~~lkve~lge~siassi~~L~ng~lFvGS~~gdSqLi~L~~e~d~-gsy~~ilet~~NLgPI~Dm~Vvd~d~q 367 (1096)
T KOG1897|consen 289 TGETVSGLDLKVEYLGETSIASSINYLDNGVLFVGSRFGDSQLIKLNTEPDV-GSYVVILETFVNLGPIVDMCVVDLDRQ 367 (1096)
T ss_pred ccccccceEEEEEecCCcchhhhhhcccCceEEEeccCCceeeEEccccCCC-CchhhhhhhcccccceeeEEEEecccc
Confidence 8888887 8999999999999999999999999999999999999998876 888999999999999999999999889
Q ss_pred CCceEEEEeeCCCCccEEEEecCCceeEEEeeeCCCcceEEEeecCCCCCcccEEEEEecCceEEEEecCCCCeeeeeec
Q 001655 311 GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE 390 (1036)
Q Consensus 311 ~~~~lv~~sG~g~~g~l~~lr~gi~~~~~~~~~l~~~~~iw~l~~~~~~~~~~~lv~S~~~~T~vl~~~~~~~~~e~~~~ 390 (1036)
+++++++|||++++|+||++|+|+++++.+..++||++++|+++....++++.||++||.++|++|.+++ ++||++++
T Consensus 368 ~q~qivtCsGa~kdgSLRiiRngi~I~e~A~i~l~Gikg~w~lk~~v~~~~d~ylvlsf~~eTrvl~i~~--e~ee~~~~ 445 (1096)
T KOG1897|consen 368 GQGQIVTCSGAFKDGSLRIIRNGIGIDELASIDLPGIKGMWSLKSMVDENYDNYLVLSFISETRVLNISE--EVEETEDP 445 (1096)
T ss_pred CCceEEEEeCCCCCCcEEEEecccccceeeEeecCCccceeEeeccccccCCcEEEEEeccceEEEEEcc--ceEEeccc
Confidence 9999999999999999999999999999999999999999999976667888999999999999999986 49999999
Q ss_pred CeeccccceeeeecCCCeEEEEecCcEEEEecCCccceeeeeCCCCCcEEEEEeCCCEEEEEEcCCeEEEEEEcCcEEEE
Q 001655 391 GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470 (1036)
Q Consensus 391 ~f~~~~~Tl~~~~~~~~~~vQVt~~~i~l~~~~~~~~~~~w~~~~~~~I~~as~~~~~vvv~~~~~~~~~l~~~~~~l~~ 470 (1036)
||.++++||+|+.+.++.++|||+++||+++.+ .+.++|.+|.+..|..+++|..+++||.+++.+.|++++.+.+.+
T Consensus 446 gf~~~~~Tif~S~i~g~~lvQvTs~~iRl~ss~--~~~~~W~~p~~~ti~~~~~n~sqVvvA~~~~~l~y~~i~~~~l~e 523 (1096)
T KOG1897|consen 446 GFSTDEQTIFCSTINGNQLVQVTSNSIRLVSSA--GLRSEWRPPGKITIGVVSANASQVVVAGGGLALFYLEIEDGGLRE 523 (1096)
T ss_pred cccccCceEEEEccCCceEEEEecccEEEEcch--hhhhcccCCCceEEEEEeecceEEEEecCccEEEEEEeeccceee
Confidence 999999999999998889999999999999876 388999999988888888998999999988999999998877999
Q ss_pred EEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecC-eEEEEEEeC
Q 001655 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG-ISYLLCALG 549 (1036)
Q Consensus 471 ~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~-~~~L~vgl~ 549 (1036)
+.+.+++.||+|+|++|+++++.++.+++||+|.+-.+.+..+|++.+++.+.++.+.+|+|+++..+++ ..||+|+++
T Consensus 524 ~~~~~~e~evaCLDisp~~d~~~~s~~~aVG~Ws~~~~~l~~~pd~~~~~~~~l~~~~iPRSIl~~~~e~d~~yLlvalg 603 (1096)
T KOG1897|consen 524 VSHKEFEYEVACLDISPLGDAPNKSRLLAVGLWSDISMILTFLPDLILITHEQLSGEIIPRSILLTTFEGDIHYLLVALG 603 (1096)
T ss_pred eeeheecceeEEEecccCCCCCCcceEEEEEeecceEEEEEECCCcceeeeeccCCCccchheeeEEeeccceEEEEEcC
Confidence 9999999999999999998877788899999997778888889999998888888889999999999986 899999999
Q ss_pred CCcEEEEEEecCCCccccceeeecCccceEEEEEEeCCceEEEEEcCCceEEEecCCcEEEeeeccccceeeecccCCCC
Q 001655 550 DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAF 629 (1036)
Q Consensus 550 ~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~~~~~~~~~~~~~i~~~~~f~~~~~ 629 (1036)
||.++.|.++..++.++++|++++|++|+.|+.|.+.+.+++||+|+|||++|..++++.|+|++.+.+..+|||++.++
T Consensus 604 dG~l~~fv~d~~tg~lsd~Kk~~lGt~P~~Lr~f~sk~~t~vfa~sdrP~viY~~n~kLv~spls~kev~~~c~f~s~a~ 683 (1096)
T KOG1897|consen 604 DGALLYFVLDINTGQLSDRKKVTLGTQPISLRTFSSKSRTAVFALSDRPTVIYSSNGKLVYSPLSLKEVNHMCPFNSDAY 683 (1096)
T ss_pred CceEEEEEEEcccceEccccccccCCCCcEEEEEeeCCceEEEEeCCCCEEEEecCCcEEEeccchHHhhhhcccccccC
Confidence 99999999999999999999999999999999999889999999999999999999999999999999999999999999
Q ss_pred CCeEEEEeCCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeecCC----CCCceeeEEEEEEeCCCceEE
Q 001655 630 PDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQS----CAEESEMHFVRLLDDQTFEFI 705 (1036)
Q Consensus 630 ~~~l~~~~~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~----~~~~~~~~~l~l~d~~t~~~i 705 (1036)
++++++++++.|+|+++++.+++++|++|++++||||+|++.+.+|+|+|.+.+. .+++.+.++++++|++|++++
T Consensus 684 ~d~l~~~~~~~l~i~tid~iqkl~irtvpl~~~prrI~~q~~sl~~~v~s~r~e~~~~~~~ee~~~s~l~vlD~nTf~vl 763 (1096)
T KOG1897|consen 684 PDSLASANGGALTIGTIDEIQKLHIRTVPLGESPRRICYQESSLTFGVLSNRIESSAEYYGEEYEVSFLRVLDQNTFEVL 763 (1096)
T ss_pred CceEEEecCCceEEEEecchhhcceeeecCCCChhheEecccceEEEEEecccccchhhcCCcceEEEEEEecCCceeEE
Confidence 9999999999999999999999999999999999999999999999999988753 355788999999999999999
Q ss_pred EEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeC-CeEEEEEEEEecCceeEeccccCeEE
Q 001655 706 STYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED-GKLQLIAEKETKGAVYSLNAFNGKLL 784 (1036)
Q Consensus 706 ~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~-~kl~~~~~~~~~g~v~~i~~~~g~Lv 784 (1036)
+.++|+++|.++|+++++|.++...|++|||++..|+|.+|..|||.+|++.+ ++|+++++++++|+|++++.|||+|+
T Consensus 764 ~~hef~~~E~~~Si~s~~~~~d~~t~~vVGT~~v~Pde~ep~~GRIivfe~~e~~~L~~v~e~~v~Gav~aL~~fngkll 843 (1096)
T KOG1897|consen 764 SSHEFERNETALSIISCKFTDDPNTYYVVGTGLVYPDENEPVNGRIIVFEFEELNSLELVAETVVKGAVYALVEFNGKLL 843 (1096)
T ss_pred eeccccccceeeeeeeeeecCCCceEEEEEEEeeccCCCCcccceEEEEEEecCCceeeeeeeeeccceeehhhhCCeEE
Confidence 99999999999999999999988999999999999999999999999999997 89999999999999999999999999
Q ss_pred EEECCEEEEEEeeecCCCccchhcccccccceEEEEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEE
Q 001655 785 AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV 864 (1036)
Q Consensus 785 ~~~g~~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~ 864 (1036)
+|+|++|++|+|. .+++|+..|....+..++++++.+|+|+|||+|+|+++++|+.+++.|+++|||+.|+|++++
T Consensus 844 A~In~~vrLye~t----~~~eLr~e~~~~~~~~aL~l~v~gdeI~VgDlm~Sitll~y~~~eg~f~evArD~~p~Wmtav 919 (1096)
T KOG1897|consen 844 AGINQSVRLYEWT----TERELRIECNISNPIIALDLQVKGDEIAVGDLMRSITLLQYKGDEGNFEEVARDYNPNWMTAV 919 (1096)
T ss_pred EecCcEEEEEEcc----ccceehhhhcccCCeEEEEEEecCcEEEEeeccceEEEEEEeccCCceEEeehhhCccceeeE
Confidence 9999999999995 246787788888889999999999999999999999999999999999999999999999999
Q ss_pred EEeeCceEEEEccCCcEEEEeeCCCCCCcccccceeEEEEEEcCCccceEEeeeeeecCCCCCCCCcceEEEEccccceE
Q 001655 865 EILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944 (1036)
Q Consensus 865 ~~l~~~~~l~~D~~gnl~~l~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~l~~~~~~~~~~~~~~il~~t~~GsIg 944 (1036)
+++++++++++|++||+++++++.++.++++++.|+..+.||+|+.|++|+.++++++.+++.....++++|||.+|+||
T Consensus 920 eil~~d~ylgae~~gNlf~v~~d~~~~td~eR~~l~~~~~~hlGelvn~f~hg~lv~~~~~s~~~~~~~vlfgTv~GsIG 999 (1096)
T KOG1897|consen 920 EILDDDTYLGAENSGNLFTVRKDSDATTDEERQILEEVGKFHLGELVNKFRHGSLVMQLGDSMIPLEPKVLFGTVNGSIG 999 (1096)
T ss_pred EEecCceEEeecccccEEEEEecCCCCchhhhhcccceeeEEeccceeeeeecceEeeccccccCCCCcEEEEEccceEE
Confidence 99999999999999999999999999999999999999999999999999999999885555555678899999999999
Q ss_pred EEEecChhHHHHHHHHHHHHHhcccCCCCCCcccccccccccccCCCCcEEehhHHHHHhcCCHHHHHHHHHhcCCC---
Q 001655 945 VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS--- 1021 (1036)
Q Consensus 945 ~l~~l~~~~~~~L~~lq~~l~~~~~~~~gl~~~~~R~~~~~~~~~~~~~~iDGdll~~fl~l~~~~q~~i~~~~~~~--- 1021 (1036)
++..++++.+.+|..||++|++..+++||++|..||+++.+.+..|++|||||||+|+|++|+.+++.+|++++..+
T Consensus 1000 ~i~sl~~d~~~fL~~Lq~~irk~i~s~gglsH~~yrsf~~e~~~~P~~gfIDGDLiEsfl~l~~~~~~~i~~~~~~~~~~ 1079 (1096)
T KOG1897|consen 1000 IIVSLPQDWYDFLEELQRRIRKVIKSVGGLSHMDYRSFEFEKRTSPVKGFIDGDLIESFLDLSRSKMREIVRGLEHTESL 1079 (1096)
T ss_pred EEEecCcchhHHHHHHHHHHHHhhcccCCcchhhHhhhhcccccCCCcCcccchHHHhhhccCHHHHHHHHhhccccccc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999766
Q ss_pred --HHHHHHHHHHHhccC
Q 001655 1022 --VEELCKRVEELTRLH 1036 (1036)
Q Consensus 1022 --~~~i~~~l~~l~~~h 1036 (1036)
++||.|.+|||+|||
T Consensus 1080 ~s~~el~k~vEel~rlh 1096 (1096)
T KOG1897|consen 1080 ASVQELLKIVEELTRLH 1096 (1096)
T ss_pred CCHHHHHHHHHHHHhcC
Confidence 999999999999999
|
|
| >KOG1896 consensus mRNA cleavage and polyadenylation factor II complex, subunit CFT1 (CPSF subunit) [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=100.00 E-value=7.7e-134 Score=1148.11 Aligned_cols=1024 Identities=22% Similarity=0.342 Sum_probs=828.6
Q ss_pred CeeEEEeEEEeEeeeeccCCCCccEEEEEeCcCcEEEEEEeCCCCcEEEEEeeeccCCCCc----cCCCCCeEEECCCCC
Q 001655 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR----PTDNGQIGIIDPDCR 76 (1036)
Q Consensus 1 ~~~~~l~g~i~~i~~~r~~~~~~D~Llv~~~~~~l~il~~~~~~~~~~t~s~~~~~~~~~~----~~~~~~~l~vdp~~r 76 (1036)
+++|++||+|++|++++..+..+|+|+++|++||+|+++||+++++|.|.|+|+||++..+ ....+|.++|||++|
T Consensus 69 v~~~~l~GnV~si~~~~~~gs~rD~LlL~f~~AKiSvlefD~~t~sl~TlSLHyfE~~~~~~~~~~~~~~p~vrvDPdsr 148 (1366)
T KOG1896|consen 69 VAEFKLFGNVTSIAKLPLKGSNRDALLLLFKDAKISVLEFDPQTNSLRTLSLHYFEGPEFRKGLVGRAKIPTVRVDPDSR 148 (1366)
T ss_pred EEEEEeecceeeEEEeecCCCCcceEEEEeccceEEEEEecCCccceeeeeeEEeccccccccccccccCceEEECCCCC
Confidence 4799999999999999999999999999999999999999999999999999999987543 345678999999999
Q ss_pred EEEEEecCCeEEEEeCCCCCcc----------------ccceeccc---CC--ceEEEEEeecCCCCCEEEEEEEecCCc
Q 001655 77 LIGLHLYDGLFKVIPFDNKGQL----------------KEAFNIRL---EE--LQVLDIKFLYGCAKPTIVVLYQDNKDA 135 (1036)
Q Consensus 77 ~~~~~~~~~~~~ilp~~~~~~~----------------~~~~~~~l---~~--~~i~d~~FL~~~~~ptlavL~~~~~~~ 135 (1036)
|+++..|+..++++|+...+.+ ..+|.+.+ ++ .||+|++|||||.+||+|+||++.++|
T Consensus 149 Ca~llvyg~~m~iLpf~~~e~~~~~~~~~~~~~~ss~~~pSyvi~~reLdeki~niiD~qFLhgY~ePTl~ILyep~~tw 228 (1366)
T KOG1896|consen 149 CALLLVYGLRMAILPFRVNEHLDDEELFPSGFSKSSFTAPSYVIALRELDEKIKNIIDFQFLHGYYEPTLAILYEPEQTW 228 (1366)
T ss_pred eEEEEEecceEEEeeccccccccccccccccccccccccceeEEEhhhhhhhhccceeEEeecCcccceEEEEecccccc
Confidence 9999999999999999664322 22454443 33 589999999999999999999999887
Q ss_pred c----------eEEEEEEeccCCceeeccccccccCCCCCEEEecCCCCcceEEEeCceEEEEcCCc-eeeeccCcceee
Q 001655 136 R----------HVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANA-FKAIPIRPSITK 204 (1036)
Q Consensus 136 ~----------~l~~~~~~~~~~~~~~~~~~~~~lp~~~~~lip~p~~~~G~lv~~~~~i~y~~~~~-~~~~~~~~~~~~ 204 (1036)
. .+...+++++.+ -++.+|+..+||+|++...++|.|+||+||++.|.++|.+++. +.++++|.....
T Consensus 229 ~grv~~r~dt~~~vaisLni~q~-~hpVI~sv~sLP~D~~~~~~vp~piGgvLv~~~n~~iy~nqsv~~~gv~LNs~a~~ 307 (1366)
T KOG1896|consen 229 AGRVILRKDTCVLVAISLNITQK-VHPVIWSVLSLPFDCYQATAVPTPIGGVLVFTVNNLIYLNQSVSPYGVALNSYASK 307 (1366)
T ss_pred cceEEEecCcEEEEEEEcCcccc-ccceEeeeccCChhhhhceeecccCccEEEEeeeeEEEEccCCCceeEEecchhhc
Confidence 3 222344444432 3457899999999999999999999999999999999999987 466776642211
Q ss_pred -----------------EEEEEecCCCEEEEEcCccCEEEEEEEec-CcEEeeEEEEEcCccccceEEEEecCCeEEEEE
Q 001655 205 -----------------AYGRVDADGSRYLLGDHAGLLHLLVITHE-KEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266 (1036)
Q Consensus 205 -----------------~~~~~~~~~~~~ll~~~~G~l~~l~l~~~-~~~v~~l~i~~~~~~~~~s~l~~l~~~~lF~gS 266 (1036)
|....-.+.+.++++..+|++|+|++..| ++.|+.+.+.........+|++...++++|+||
T Consensus 308 ~t~fpl~~qs~v~i~ld~a~~t~i~~dk~vis~~~Gd~y~Ltl~~D~~r~V~~~~f~k~~asvl~t~~v~~~n~llFlGS 387 (1366)
T KOG1896|consen 308 YTAFPLIPQSGVRIELDCANATWISNDKCVISLKNGDLYLLTLILDIGRSVQLLHFDKFKASVLATSIVGHGNNLLFLGS 387 (1366)
T ss_pred ccCCccccccceEEEEeeccceeecCCeEEEecCCCcEEEEEEEeccccchhhhhhhhhhcccceeeeeccCCccEEEEe
Confidence 11000124577999999999999999999 677887777666666678899999999999999
Q ss_pred eecCeeEEEEeeCC------------C---------------------C-----------CCC----------c-EEEee
Q 001655 267 SYGDSQLIKLNLQP------------D---------------------A-----------KGS----------Y-VEVLE 291 (1036)
Q Consensus 267 ~~gds~l~~~~~~~------------~---------------------~-----------~~~----------~-~~~~~ 291 (1036)
+.|||.|+++.... | + .+. + ++++|
T Consensus 388 rlgnSlll~~s~~~~~~~e~~~re~~d~~~~~~~~~~~d~~~d~~~~d~~~~~~~~~g~~~~~g~~a~~t~~~f~fevcD 467 (1366)
T KOG1896|consen 388 RLGNSLLLRFSELLQRASEGVRREEGDTESDGYSKKRVDDTQDVRRDDEKSAELFEAGSEENYGSGAQETVQPFSFEVCD 467 (1366)
T ss_pred cCCCEEEEEehhccccCCccccccccCCcCCcchhhcccchhhhhhhhhhccchhhccccccCCcccceeeeeeEEeehh
Confidence 99999999998521 0 0 000 1 79999
Q ss_pred ecccCCceeeeEEeccCC----------CC-CceEEEEeeCCCCccEEEEecCCceeEEEeeeCCCcceEEEeecCC---
Q 001655 292 RYVNLGPIVDFCVVDLER----------QG-QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST--- 357 (1036)
Q Consensus 292 ~l~n~~Pi~D~~~~~~~~----------~~-~~~lv~~sG~g~~g~l~~lr~gi~~~~~~~~~l~~~~~iw~l~~~~--- 357 (1036)
+++|+|||.||++++... ++ ..++|+|+|+|++|+|+++|+.|+|++..+++++||.++|++....
T Consensus 468 sL~NIGPi~~~avG~~~~~~~~~~gl~~~~~~~elV~~sGhgkngaL~V~r~sI~P~i~t~fel~Gc~~iWtV~~~~~~~ 547 (1366)
T KOG1896|consen 468 SLPNIGPITDFAVGKRSSASEAVEGLSPHNKCLELVATSGHGKNGALSVIRRSIRPEIATEFELPGCVDIWTVFIKGRKR 547 (1366)
T ss_pred ccccccccccceeccccchhhhccCCCCCCCeEEEEEeccCCCCcceEEEeecccceeeEEEEecCeeeEEEEEEecccc
Confidence 999999999999986431 12 2689999999999999999999999999999999999999997532
Q ss_pred --CCCcccEEEEEecCceEEEEecCCCCeeeeeecCeeccccceeeeecCCC-eEEEEecCcEEEEecCCccceeeeeCC
Q 001655 358 --DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN-QLVQVTSGSVRLVSSTSRELRNEWKSP 434 (1036)
Q Consensus 358 --~~~~~~~lv~S~~~~T~vl~~~~~~~~~e~~~~~f~~~~~Tl~~~~~~~~-~~vQVt~~~i~l~~~~~~~~~~~w~~~ 434 (1036)
.+..|.|+++|..++|+||+.+ +++.|++.++|..+.+||++|+++++ ++|||||+++|+++.+. ..++.....
T Consensus 548 ~~~~~~h~~lilS~e~~t~il~tg--e~~~Ev~~s~f~~~~~Tl~~gnlg~~rriVQVtp~~~rllDg~~-r~lq~i~fd 624 (1366)
T KOG1896|consen 548 EEDNTQHLYLILSTESRTMILETG--EELLEVSGSGFTRDGPTLFAGNLGNERRIVQVTPSGLRLLDGDL-RMLQRIPFD 624 (1366)
T ss_pred ccccCcceEEEeecccchhhhhcc--chhhhcccceeEeccceEEEEecCCceEEEEEccceeEEecCcc-hheeEeccc
Confidence 2234789999999999999665 46999999999999999999999876 99999999999999875 467777776
Q ss_pred CCCcEEEEEeCCCEEEEEEcCCeEEEEEEcCcE--EEE--------E---EeccCCceeEEEE-----------------
Q 001655 435 PGYSVNVATANASQVLLATGGGHLVYLEIGDGI--LTE--------V---KHAQLEYEISCLD----------------- 484 (1036)
Q Consensus 435 ~~~~I~~as~~~~~vvv~~~~~~~~~l~~~~~~--l~~--------~---~~~~l~~~is~l~----------------- 484 (1036)
.+..++..++.+.++++...++.+.+|+++... +.. + ....+..-+.-+.
T Consensus 625 ~~~~vv~~sv~dpyv~v~~~~g~i~~~~l~~~s~rl~~~~~~s~~~~sv~~~~dlsg~f~~~s~l~~k~~~~~gr~~~~~ 704 (1366)
T KOG1896|consen 625 SGAIVVQTSVADPYVAVRSSEGRITLYDLEEKSHRLALHDPMSFKVVSVSLPADLSGMFTTLSDLSLKGNEANGRSSEAE 704 (1366)
T ss_pred cCCcEEEEeccCceEEEEEcCCceEEEEeccccchhhccCcccceeEEEechhhhccceEEEeeecccCccccccccccc
Confidence 777888888887777777778888888875421 100 0 0001111110000
Q ss_pred ----cC-CCCCCC----CcccEEEEEEecCCEEEEEECCCCCeeEEecCCC----------------c-cCce----eEE
Q 001655 485 ----IN-PIGENP----SYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG----------------E-IIPR----SVL 534 (1036)
Q Consensus 485 ----~~-~~~~~~----~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~----------------~-~~p~----si~ 534 (1036)
.. .+++.+ ....+.++-...+|.++||++|++..++...... . ..+. .+.
T Consensus 705 ~~~~~~~kv~~~egg~~~~~~~~~~~~~e~g~leiy~~pd~~lVf~v~~f~~~~~~L~~~~~~~~~~~~~s~~~~l~q~~ 784 (1366)
T KOG1896|consen 705 GLQSLPCKVDDEEGGSPEQEPYWCVFVTESGTLEIYALPDFDLVFEVDMFDTGNRVLMDSRLRGPTTNKESEDLELKQLF 784 (1366)
T ss_pred ccccCCccccCCCCCCcccCceEEEEEcCCCceEEEccCCcceEEEeeccCCCcceEEeecccCccccccccchHHHHhh
Confidence 00 000000 0111444445559999999999988776421100 0 0011 111
Q ss_pred eeee-----cCeEEEEEEeCCCcEEEEEEec--CCCc--ccccee----------------e----------ecCccceE
Q 001655 535 LCAF-----EGISYLLCALGDGHLLNFLLNM--KTGE--LTDRKK----------------V----------SLGTQPIT 579 (1036)
Q Consensus 535 ~~~~-----~~~~~L~vgl~~G~l~~y~~~~--~~~~--l~~~~~----------------~----------~lG~~pv~ 579 (1036)
+.++ ..+++|+.-+.+|.+..|+..+ ..+. +..++. . ..+..-.+
T Consensus 785 ~~~L~~e~~~~e~~L~lv~~~~eil~Ykaf~~~~~~~~~~~f~kvp~~~~~~~~~p~~~~~~~~~~~~e~~~~~~~~~~~ 864 (1366)
T KOG1896|consen 785 VNPLGSEIVFKEPHLFLVVSDNEILIYKAFPQLSQGNLKVFFKKVPHNLNIRTDKPHFLCKKREGGGAEEGASVSVIVQR 864 (1366)
T ss_pred ccccchhhhccCCceEEEEeCceEEEEeeccccCccchhhhhhhCCHhhcccccCCcccchhhccccccccccccceeee
Confidence 1222 2468999999999999999775 2221 111110 0 01112236
Q ss_pred EEEE-EeCCceEEEEEcCCceEEEe-cCCcEEEeeecc-ccceeeecccCCCCCCeEEEEe-CCeEEEEEEcCC----CC
Q 001655 580 LRTF-SSKNTTHVFAASDRPTVIYS-SNKKLLYSNVNL-KEVSHMCPFNSAAFPDSLAIAK-EGELTIGTIDDI----QK 651 (1036)
Q Consensus 580 l~~~-~~~~~~~v~~~~~~p~~i~~-~~~~~~~~~~~~-~~i~~~~~f~~~~~~~~l~~~~-~~~l~i~~l~~~----~~ 651 (1036)
+++| .+++++++|++|.+|+|+.. -++.+.++++.. .++.+++||++.+||++++|++ ++.++||.++.. ++
T Consensus 865 m~~f~~i~ghsgvfv~Gs~P~~il~t~rg~lr~h~~~gngpv~sfapfhnvn~p~gfiyvd~~~~l~i~~lp~~~~Ydn~ 944 (1366)
T KOG1896|consen 865 MTYFEDIGGHSGVFVTGSKPYLILLTFRGVLRFHPVFGNGPVGSFAPFHNVNCPRGFIYVDRQGELVICVLPEALSYDNK 944 (1366)
T ss_pred EEeeccccCeeEEEEecCCceEEEEEcccccceeeeecCCcceeeeeeeccCCCcceEEECCCceEEEEEcchhcccCCC
Confidence 7778 46789999999999966654 578888777764 6899999999999999999987 689999999885 48
Q ss_pred eEEEEEeCCCccceEEEecCCCEEEEEEeecC------CCCCc-------------eeeEEEEEEeCCCceEEEEEECCC
Q 001655 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ------SCAEE-------------SEMHFVRLLDDQTFEFISTYPLDT 712 (1036)
Q Consensus 652 ~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~------~~~~~-------------~~~~~l~l~d~~t~~~i~~~~~~~ 712 (1036)
|+.|+|||+.||+.++||++.+.|+|.++.+. .++++ .+++.++|++|.+|+.++.++|++
T Consensus 945 wPvkkIpl~~T~~~vvYh~e~~vy~v~t~~~~~~~~~~~d~~e~~~~~de~~~~p~~~~f~i~LisP~sw~vi~~iefq~ 1024 (1366)
T KOG1896|consen 945 WPVKKIPLRKTPHQVVYHYEKKVYAVITSTPVPYERLGEDGEEEVISRDENVIHPEGEQFSIQLISPESWEVIDKIEFQE 1024 (1366)
T ss_pred CcccccccccchhheeeeccceEEEEEEeccceeeecccccccccccccccccccccccceeEEecCCccccccccccCc
Confidence 99999999999999999999999999996541 11111 257999999999999999999999
Q ss_pred CCeEeEEEEEEEcC-----CCceEEEEEeeeeCCCCCCCcceEEEEEEEe-----------CCeEEEEEEEEecCceeEe
Q 001655 713 FEYGCSILSCSFSD-----DSNVYYCVGTAYVLPEENEPTKGRILVFIVE-----------DGKLQLIAEKETKGAVYSL 776 (1036)
Q Consensus 713 ~E~v~si~~~~l~~-----~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~-----------~~kl~~~~~~~~~g~v~~i 776 (1036)
+|++++++.+.+.. +.++|++|||++.. +|+.+++||+++|++. ++|+|+++.++++|+|.++
T Consensus 1025 ~E~v~~~k~v~L~~~~t~~~~k~ylavGT~~~~-gEDv~~RGr~hi~diIeVVPepgkP~t~~KlKel~~eE~KGtVsav 1103 (1366)
T KOG1896|consen 1025 NEHVLHMKYVILDDEETTKGKKPYLAVGTAFIQ-GEDVPARGRIHIFDIIEVVPEPGKPFTKNKLKELYIEEQKGTVSAV 1103 (1366)
T ss_pred cceeeEEEEEEEEecccccCCcceEEEEEeecc-cccccCcccEEEEEEEEecCCCCCCcccceeeeeehhhcccceEEE
Confidence 99999999999965 24899999999998 8999999999999999 4599999999999999999
Q ss_pred ccccCeEEEEECCEEEEEEeeecCCCccchhcccccccceEEEEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccC
Q 001655 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856 (1036)
Q Consensus 777 ~~~~g~Lv~~~g~~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~ 856 (1036)
|.++|+|+.|.|+||++|+|+. +-.|.+.+|++.+.++.++++.+|+|++||+|+|++|++|++++.+|++++||.
T Consensus 1104 ceV~G~l~~~~GqKI~v~~l~r----~~~ligVaFiD~~~yv~s~~~vknlIl~gDV~ksisfl~fqeep~rlsL~srd~ 1179 (1366)
T KOG1896|consen 1104 CEVRGHLLSSQGQKIIVRKLDR----DSELIGVAFIDLPLYVHSMKVVKNLILAGDVMKSISFLGFQEEPYRLSLLSRDF 1179 (1366)
T ss_pred EEeccEEEEccCcEEEEEEecc----CCcceeeEEeccceeEEehhhhhhheehhhhhhceEEEEEccCceEEEEeecCC
Confidence 9999999999999999999942 235999999999999999999999999999999999999999999999999999
Q ss_pred CcceeEEEEEeeCce---EEEEccCCcEEEEeeCCCCCCcccccceeEEEEEEcCCccceEEeeeeeecCCCCCCCCcce
Q 001655 857 NANWMSAVEILDDDI---YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933 (1036)
Q Consensus 857 ~~~~~~~~~~l~~~~---~l~~D~~gnl~~l~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~l~~~~~~~~~~~~~~ 933 (1036)
.++.+++++||.++. ++++|.++||++|.|.|+++++..|+||..+++||+|..+++|.+-. .+..+.-..+.+.
T Consensus 1180 ~~l~v~s~EFLVdg~~L~flvsDa~rNi~vy~Y~Pe~~eS~~G~RLv~radfhvg~~vs~m~~lp--~~~~~e~~~~~~~ 1257 (1366)
T KOG1896|consen 1180 EPLNVYSTEFLVDGSNLSFLVSDADRNIHVYMYAPENIESLSGQRLVRRADFHVGAHVSTMFRLP--CHQNAEFGSNSPM 1257 (1366)
T ss_pred chhhceeeeeEEcCCeeEEEEEcCCCcEEEEEeCCCCccccCcceeeeeeeeEeccceeeeEecc--ccccchhccCCch
Confidence 999999999998863 89999999999999999999999999999999999999999997633 1111111122333
Q ss_pred E--EEEccccceEEEEecChhHHHHHHHHHHHHHhcccCCCCCCccccccccccc-ccCCCCcEEehhHHHHHhcCCHHH
Q 001655 934 V--IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK-KTVDAKNFLDGDLIESFLDLSRTR 1010 (1036)
Q Consensus 934 i--l~~t~~GsIg~l~~l~~~~~~~L~~lq~~l~~~~~~~~gl~~~~~R~~~~~~-~~~~~~~~iDGdll~~fl~l~~~~ 1010 (1036)
. +|||+||++|++.|++|+.||+|..||++|...++|+|||||++||...... ...+.+.+|||++|.+|..|+-++
T Consensus 1258 ~~~v~gtlDG~l~~~~Pl~e~~YRRL~~lQn~L~~~~~hv~GLNPr~yR~~~s~~~~~n~~r~ilDg~ll~~f~yl~~~e 1337 (1366)
T KOG1896|consen 1258 FYEVFGTLDGGLGHLVPLDEKTYRRLLMLQNALMDRLPHVGGLNPRAYRLLDSSLQLSNSLRSILDGELLNRFSYLSMSE 1337 (1366)
T ss_pred hhhhhcccCCceeEEecCCHHHHHHHHHHHHHHHHhhhhhcCCCHHHhhhccchhhhcCCCcccchHhHHHHhhccchhh
Confidence 4 4899999999999999999999999999999999999999999999988766 557889999999999999999999
Q ss_pred HHHHHHhcCCCHHHHHHHHHHHhcc
Q 001655 1011 MDEISKTMNVSVEELCKRVEELTRL 1035 (1036)
Q Consensus 1011 q~~i~~~~~~~~~~i~~~l~~l~~~ 1035 (1036)
|+++|+++|+++.+|+++|-||.|+
T Consensus 1338 r~elA~kiGt~~~eIl~DLvel~~~ 1362 (1366)
T KOG1896|consen 1338 REELAHKIGTTRKEILDDLVELDRL 1362 (1366)
T ss_pred HHHHHHhcCCCHHHHHHHHHHHHHH
Confidence 9999999999999999999999875
|
|
| >KOG1898 consensus Splicing factor 3b, subunit 3 [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=100.00 E-value=3.1e-126 Score=1072.73 Aligned_cols=1005 Identities=25% Similarity=0.421 Sum_probs=880.4
Q ss_pred CeeEEEeEEEeEeeeeccCCCCccEEEEEeCcCcEEEEEEeCCCCcEEEEEeeeccCCCCccCCCCCeEEECCCCCEEEE
Q 001655 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGL 80 (1036)
Q Consensus 1 ~~~~~l~g~i~~i~~~r~~~~~~D~Llv~~~~~~l~il~~~~~~~~~~t~s~~~~~~~~~~~~~~~~~l~vdp~~r~~~~ 80 (1036)
|+++.+||+|++++.+|.++..+|+|+|+++++++++++|+.+.+.++++..+.+.+++.|+..+|.++++||.|||+++
T Consensus 53 i~~~~vFg~Irsla~~~lt~~~kD~LaV~SDSGri~il~y~~ek~~~~~~~qetfGks~~rrivpG~y~~idp~Gra~mi 132 (1205)
T KOG1898|consen 53 ICRQEVFGTIRSLAAFRLTGGTKDYLAVGSDSGRISILEYNNEKNHFEKLHQETFGKSGCRRIVPGQYLAIDPKGRAVMI 132 (1205)
T ss_pred EEEEeehhhhhhhhccccCCCCccEEEEEcCCceEEEEEechhhhccccccccccCcccceEeccccEEEEcCCccceee
Confidence 57899999999999999999999999999999999999999988889999999999999999999999999999999887
Q ss_pred Eec--CCeEEEEeCCCCCccccceecccCC----ceEEEEEeec-CCCCCEEEEEEEecCC-------------cceEEE
Q 001655 81 HLY--DGLFKVIPFDNKGQLKEAFNIRLEE----LQVLDIKFLY-GCAKPTIVVLYQDNKD-------------ARHVKT 140 (1036)
Q Consensus 81 ~~~--~~~~~ilp~~~~~~~~~~~~~~l~~----~~i~d~~FL~-~~~~ptlavL~~~~~~-------------~~~l~~ 140 (1036)
.+- +.+.+|+..+..+.++.+ .|++. ..+.||+.+. ||.+|+||.|..|+.. .+++++
T Consensus 133 save~~kLvyvlnrD~~a~ltis--Spleahk~~sic~~l~~Vd~gf~np~fa~LE~dy~~a~~d~tgeaa~~~~~~l~f 210 (1205)
T KOG1898|consen 133 SAVEKQKLVYVLNRDGAARLTIS--SPLEAHKAHSICLDLVGVDVGFENPIFAALERDYSEADNDPTGEAATMTQKVLTF 210 (1205)
T ss_pred ehhhcCcEEEEEccchhhhceec--CchhhccCCcEEEEEEEEeccCCCceEEEEeechhhcccCchhhhhhccccceeE
Confidence 774 455555544444444332 24443 3578888885 7999999999999522 368999
Q ss_pred EEEeccCCceeeccccccccCCCCCEEEecCCC---CcceEEEeCceEEEEcCCc--eeeeccCc-----------ceee
Q 001655 141 YEVALKDKDFVEGPWSQNNLDNGADLLIPVPPP---LCGVLIIGEETIVYCSANA--FKAIPIRP-----------SITK 204 (1036)
Q Consensus 141 ~~~~~~~~~~~~~~~~~~~lp~~~~~lip~p~~---~~G~lv~~~~~i~y~~~~~--~~~~~~~~-----------~~~~ 204 (1036)
|+++++. +++.+.|+ ..+++...++++||.. +.|++|++.|++.|.+... ...+|++. .++.
T Consensus 211 Yeldlgl-nhvvrk~s-~p~~~~~n~l~~VP~G~D~ps~v~vc~~n~~~y~~~~d~p~~ri~~~rr~~~L~~~~~~vliv 288 (1205)
T KOG1898|consen 211 YELDLGL-NHVVRKAS-EPVNHFGNFLLTVPGGSDGPSGVLVCAENYLLYRNLGDHPDVRIPIERRINELSDAEDGVLIV 288 (1205)
T ss_pred EEEeccc-ceeEEEcc-cccCCCceEEEEecCCCCCCcceEEecCceeeccccccCCCEEeccccccccCCccccccEEE
Confidence 9999996 77777787 4667778899999964 2479999999999999763 33444321 2222
Q ss_pred -EEEEEecCCCEEEEEcCccCEEEEEEEecCcEEeeEEEEEcCccccceEEEEecCCeEEEEEeecCeeEEEEeeCC---
Q 001655 205 -AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP--- 280 (1036)
Q Consensus 205 -~~~~~~~~~~~~ll~~~~G~l~~l~l~~~~~~v~~l~i~~~~~~~~~s~l~~l~~~~lF~gS~~gds~l~~~~~~~--- 280 (1036)
+|..-...+-++++++++|++|++++..|++.+..++++|+++++.+..|+.+++|+||++|++||+.||++..-+
T Consensus 289 ~s~~hk~k~~ff~llqt~~GD~fk~tl~~d~d~v~el~lkYfDtvp~a~~L~I~k~GfLf~~sE~~n~~lyq~~~LG~~~ 368 (1205)
T KOG1898|consen 289 SSAEHKTKSMFFFLLQTEYGDLFKLTLEHDGDNVVELRLKYFDTVPCALQLCILKTGFLFVASEFGNHRLYQFEKLGEED 368 (1205)
T ss_pred EeecccccCCeEEEEEecCCceEEEEEecCCCcceeeeeehhcCCccceEEEEeccceEEEhhhccCcceeehhhcCCCc
Confidence 2222223557889999999999999999999889999999999999999999999999999999999999998632
Q ss_pred CC---------C---------CCcEEEeeecccCCceeeeEEeccCCCCCceEEEEeeCCCCccEEEEecCCceeEEEee
Q 001655 281 DA---------K---------GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342 (1036)
Q Consensus 281 ~~---------~---------~~~~~~~~~l~n~~Pi~D~~~~~~~~~~~~~lv~~sG~g~~g~l~~lr~gi~~~~~~~~ 342 (1036)
|+ . -+++..+++++|+.|++|+.+++.++.+.+||++|||+|.+++|+++|+|+...+++..
T Consensus 369 ~~~s~~~~~~~~~~~~f~p~~l~nL~~~~~i~sl~p~~d~~I~~~~ne~~~qi~~~cg~~~~sslr~lR~gle~sel~~t 448 (1205)
T KOG1898|consen 369 DDFSNAMTSEEGKSVFFEPRILKNLSPVSSVESLSPLLDISIGDDSNEDTPQIYSACGRGPRSSLRILRNGLEVSELLVT 448 (1205)
T ss_pred cchhhhcccccCcceeccccccccccchhhhhccCccceeEeeccCcccchhhhhhhCcCccccchhhccccchHHHhhh
Confidence 11 0 01367889999999999999999888888999999999999999999999999999998
Q ss_pred eCC-CcceEEEeecCCCCCcccEEEEEecCceEEEEecCCCCeeeeeecCeeccccceeeeecCCCeEEEEecCcEEEEe
Q 001655 343 ELQ-GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421 (1036)
Q Consensus 343 ~l~-~~~~iw~l~~~~~~~~~~~lv~S~~~~T~vl~~~~~~~~~e~~~~~f~~~~~Tl~~~~~~~~~~vQVt~~~i~l~~ 421 (1036)
++| +.+++|+++....+.+|.||++||.+.|+|++++ +.++|++++||....+|++|+.++++.+|||++++||++-
T Consensus 449 ~lp~~~ta~WTvk~~~td~ydsyivvsF~n~TlVLsIg--esveEvtdsgFls~~~Tl~~~l~Gd~slVQi~~d~iRhi~ 526 (1205)
T KOG1898|consen 449 ELPGNPTATWTVKKNITDVYDSYIVVSFVNGTLVLSIG--ESVEEVTDSGFLSTTPTLACSLMGDDSLVQIHPDGIRHIR 526 (1205)
T ss_pred ccCCCCceEEEEcCccccccceEEEEEeeccEEEEEcc--hhHHHhhhcccccCCceEEEEEecCCcEEEEchhhhhhcc
Confidence 888 6789999998777899999999999999999998 4699999999999999999999999999999999999997
Q ss_pred cCCccceeeeeCCCCCcEEEEEeCCCEEEEEEcCCeEEEEEEcC-cEEEEE-EeccCCceeEEEEcCCCCCCCCcccEEE
Q 001655 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEV-KHAQLEYEISCLDINPIGENPSYSQIAA 499 (1036)
Q Consensus 422 ~~~~~~~~~w~~~~~~~I~~as~~~~~vvv~~~~~~~~~l~~~~-~~l~~~-~~~~l~~~is~l~~~~~~~~~~~~~~~~ 499 (1036)
... ++.+|.+|++.+|+.++++..++++++++++++||+++. +++.+. ++.+++++++|+++.+.+++++.+.+++
T Consensus 527 ~~~--r~~ew~~P~~~~Iv~~avnr~qiVvalSngelvyfe~d~sgql~E~~er~tl~~~vac~ai~~~~~g~krsrfla 604 (1205)
T KOG1898|consen 527 PTK--RINEWKTPERVRIVKCAVNRRQIVVALSNGELVYFEGDVSGQLNEFTERVTLSTDVACLAIGQDPEGEKRSRFLA 604 (1205)
T ss_pred ccc--ccccccCCCceEEEEEeecceEEEEEccCCeEEEEEeccCccceeeeeeeeeceeehhhccCCCCcchhhcceee
Confidence 764 788999999999999999999999999999999999976 778775 7789999999999998887778899999
Q ss_pred EEEecCCEEEEEECCC---CCeeEEecCCCccCceeEEeeeec-------CeEEEEEEeCCCcEEEEEEecCCCccccce
Q 001655 500 VGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLCAFE-------GISYLLCALGDGHLLNFLLNMKTGELTDRK 569 (1036)
Q Consensus 500 vg~w~~~~i~i~~l~~---l~~v~~~~l~~~~~p~si~~~~~~-------~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~ 569 (1036)
+|+. ++++++++|.- +..+..|.++ ..|.++++.+.+ +..||++|++||.++++.+|.-.|.|.+.|
T Consensus 605 ~a~~-d~~vriisL~p~d~l~~ls~q~l~--~~~~s~~iv~~~~~~~~~~~~L~l~~GL~NGvllR~~id~v~G~l~d~r 681 (1205)
T KOG1898|consen 605 LASV-DNMVRIISLDPSDCLQPLSVQGLS--SPPESLCIVEMEATGGTDVAQLYLLIGLRNGVLLRFVIDTVTGQLLDIR 681 (1205)
T ss_pred eecc-ccceeEEEecCcceEEEccccccC--CCccceEEEEecccCCccceeEEEEecccccEEEEEEecccccceeeeh
Confidence 9999 99999999852 3444556666 578888888764 267899999999999999999999999999
Q ss_pred eeecCccceEEEEEEeCCceEEEEEcCCceEEEecCCcEEEeeeccccceeeecccCCCCCCeEEEEeCCeEEEEEEcCC
Q 001655 570 KVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649 (1036)
Q Consensus 570 ~~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~~~~~~~~~~~~~i~~~~~f~~~~~~~~l~~~~~~~l~i~~l~~~ 649 (1036)
++++|.+||+|.++...+.+.+++.+++||+.|..++++..+|++..++..+++|.+..||++++++..+.++|-.++..
T Consensus 682 tR~lG~~pvkLf~~~~~~~s~vL~lSsr~wl~y~~~~~~h~t~Isy~~l~~as~~~S~qcpeGiv~i~~n~l~i~~~~~~ 761 (1205)
T KOG1898|consen 682 TRFLGLRPVKLFPISMRGQSDVLALSSRPWLLYTYQQEFHLTPISYSTLEHASPFCSEQCPEGIVAISKNTLRIIALDKL 761 (1205)
T ss_pred eeeeccccceEEEEeecCcceeEEecCChhhhhhhcceeeeecccccchhccccccccCCCcchhhhhhhhhheeeehhh
Confidence 99999999999999888999999999999999999999999999999999999999999999999999999999999988
Q ss_pred C-CeEEEEEeCCCccceEEEecCCCEEEEEEeecC------------------CCCCc----------------------
Q 001655 650 Q-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ------------------SCAEE---------------------- 688 (1036)
Q Consensus 650 ~-~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~------------------~~~~~---------------------- 688 (1036)
. .++.+.+|++.|||++++||+++++++++.+.. ..+++
T Consensus 762 g~~~n~~~~~l~~tprkvv~h~es~lLii~~td~~~~~~~~a~~~~~~~g~v~~s~~~~e~e~g~em~~~~~~~~~~~~v 841 (1205)
T KOG1898|consen 762 GKVLNVDGFPLAYTPRKVVIHPESGLLIIGRTDHNATLTKDARKNQMEAGGVLESGEEKEDEMGGEMEIIGREEVLPENV 841 (1205)
T ss_pred cccccccccccccCcceEEEecCCCeEEEEEecccchhhHHHhhhhhhcccccccccccchhhccchhhhcccccccccc
Confidence 5 579999999999999999999999999987641 00000
Q ss_pred --------eeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCc-eEEEEEeeeeCCCC-CCCcceEEEEEEEe-
Q 001655 689 --------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN-VYYCVGTAYVLPEE-NEPTKGRILVFIVE- 757 (1036)
Q Consensus 689 --------~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~-~~i~VGT~~~~~~e-~~~~~Gri~v~~i~- 757 (1036)
.|.++++++|+.+.+.+..+++..||...|++.+.|++... .+.+||++....-+ ..-.+|++|.|++.
T Consensus 842 ~~~p~a~~~w~s~I~~~d~~s~~~~~~~~l~~ne~a~~v~~~~fs~~~~~~~~~v~~~~~~~l~~~~~~~g~~ytyk~~~ 921 (1205)
T KOG1898|consen 842 YGSPRAGNGWVSSIRVFDPKSGKIICLVELGQNEAAFSVCAVDFSSSEYQPFVAVGVATTEQLDSKSISSGFVYTYKFVR 921 (1205)
T ss_pred ccCcccccCccceEEEEcCCCCceEEEEeecCCcchhheeeeeeccCCCceEEEEEeeccccccccccCCCceEEEEEEe
Confidence 15679999999999999999999999999999999987444 59999998754211 12267999999988
Q ss_pred -CCeEEEEEEEEecCceeEeccccCeEEEEECCEEEEEEeeecCCCccchhccccccc-ceEEEEEEEeCCEEEEEeccc
Q 001655 758 -DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIVVGDLMK 835 (1036)
Q Consensus 758 -~~kl~~~~~~~~~g~v~~i~~~~g~Lv~~~g~~l~v~~~~~~~~~~~~L~~~~~~~~-~~~~~~l~~~~~~I~vgD~~~ 835 (1036)
+.+|+++|+++++|+|.|||+|+|++++|+|+.+++|++ |+++|.++|.... +..++++++.+.+|+|||+.+
T Consensus 922 ~g~~lellh~T~~~~~v~Ai~~f~~~~LagvG~~l~~Ydl-----G~K~lLRk~e~k~~p~~Is~iqt~~~RI~VgD~qe 996 (1205)
T KOG1898|consen 922 NGDKLELLHKTEIPGPVGAICPFQGRVLAGVGRFLRLYDL-----GKKKLLRKCELKFIPNRISSIQTYGARIVVGDIQE 996 (1205)
T ss_pred cCceeeeeeccCCCccceEEeccCCEEEEecccEEEEeeC-----ChHHHHhhhhhccCceEEEEEeecceEEEEeeccc
Confidence 448999999999999999999999999999999999996 8899999988776 457888999999999999999
Q ss_pred cEEEEEEeccCCeEEEEeccCCcceeEEEEEeeCceEEEEccCCcEEEEeeCCCCCCcc------------------ccc
Q 001655 836 SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE------------------ERG 897 (1036)
Q Consensus 836 Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~~~~l~~D~~gnl~~l~~~~~~~~~~------------------~~~ 897 (1036)
|+.+++|++++++|+.+|.|+.|||+++++++|+++++++|++||+++++.+++.+... ..+
T Consensus 997 SV~~~~y~~~~n~l~~fadD~~pR~Vt~~~~lD~~tvagaDrfGNi~~vR~P~d~~e~~~edpt~~k~~~~~g~lN~~~~ 1076 (1205)
T KOG1898|consen 997 SVHFVRYRREDNQLIVFADDPVPRHVTALELLDYDTVAGADRFGNIAVVRIPPDVSEEASEDPTELKIAWEQGFLNDAPQ 1076 (1205)
T ss_pred eEEEEEEecCCCeEEEEeCCCccceeeEEEEecCCceeeccccCcEEEEECCCcchhhhccCCccccceecccccccccH
Confidence 99999999999999999999999999999999999999999999999999987654321 125
Q ss_pred ceeEEEEEEcCCccceEEeeeeeecCCCCCCCCcceEEEEccccceEEEEec-ChhHHHHHHHHHHHHHhcccCCCCCCc
Q 001655 898 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNH 976 (1036)
Q Consensus 898 ~L~~~~~~~lg~~v~~~~~~~l~~~~~~~~~~~~~~il~~t~~GsIg~l~~l-~~~~~~~L~~lq~~l~~~~~~~~gl~~ 976 (1036)
|.+...+|++|+.++++.+.++. ...++.++|+|..|+||+|+|+ +++++++++.+|..|++..++++|.+|
T Consensus 1077 K~~~i~~f~v~Dvits~q~~~~i-------~~a~e~~iy~tl~GtiG~f~p~~s~~d~~Ff~~~e~~~r~e~ppl~GrDH 1149 (1205)
T KOG1898|consen 1077 KVQLISQFFVGDVITSLQKVSSI-------PGARESLIYTTLLGTIGVFAPFLSREDVDFFQHLEMHMRKEYPPLLGRDH 1149 (1205)
T ss_pred hhhhhhhccccCeeeeceeeeec-------cCCcceeeeeeccccceEEeecccccchHHHHHHHHhccccCCcccCcch
Confidence 67888999999999999887654 3457899999999999999999 566899999999999999999999999
Q ss_pred ccccccccccccCCCCcEEehhHHHHHhcCCHHHHHHHHHhcCCCHHHHHHHHHHHh
Q 001655 977 EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033 (1036)
Q Consensus 977 ~~~R~~~~~~~~~~~~~~iDGdll~~fl~l~~~~q~~i~~~~~~~~~~i~~~l~~l~ 1033 (1036)
.+||++++ |.|.+|||||||+|+.|+..+|++||++++++++||.|+||+++
T Consensus 1150 ~~yRsyy~-----Pvk~VIDGDlceqF~~L~~~~Qe~va~el~~ti~eI~kkledir 1201 (1205)
T KOG1898|consen 1150 LEYRSYYA-----PVKKVIDGDLCEQFLRLEENQQEEVAEELDRTIEEISKKLEDIR 1201 (1205)
T ss_pred hhhhhhcc-----chhhcccHHHHHHHhhCCHHHHHHHHhcccCCHHHHHHHHHHHH
Confidence 99999999 89999999999999999999999999999999999999999987
|
|
| >COG5161 SFT1 Pre-mRNA cleavage and polyadenylation specificity factor [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=100.00 E-value=2.3e-91 Score=768.78 Aligned_cols=1006 Identities=16% Similarity=0.234 Sum_probs=754.7
Q ss_pred eEEEeEEEeEeeeeccCCCCccEEEEEeCcCcEEEEEEeCCCCcEEEEEeeeccCCCCcc----CCCCCeEEECCCCCEE
Q 001655 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP----TDNGQIGIIDPDCRLI 78 (1036)
Q Consensus 3 ~~~l~g~i~~i~~~r~~~~~~D~Llv~~~~~~l~il~~~~~~~~~~t~s~~~~~~~~~~~----~~~~~~l~vdp~~r~~ 78 (1036)
++.++|.|++|..+....+++|.|+++|..+|+++++||.+.+.|.|+|+|+++..+.-. ...-..+.-||.+-|
T Consensus 54 e~~~~e~~t~I~~~pq~~se~~~lll~t~~akis~lrf~sq~n~f~TislhyyeGKfkgksLvelak~stle~D~~ssc- 132 (1319)
T COG5161 54 EHMLLEKVTQIEKYPQISSEQDGLLLLTHRAKISLLRFDSQANEFRTISLHYYEGKFKGKSLVELAKFSTLEFDIRSSC- 132 (1319)
T ss_pred HHhhhhhhhhhhhcccccCccceEEEEeccceEEEEEehhhcccceeEEEeeeccccCCchhhhhhhhhheeeccCccc-
Confidence 567889999999987778899999999999999999999999999999999998875321 222246788888866
Q ss_pred EEEecCCeEEEEeCCCCCc-------------------------------cc-------------cceec---ccC--Cc
Q 001655 79 GLHLYDGLFKVIPFDNKGQ-------------------------------LK-------------EAFNI---RLE--EL 109 (1036)
Q Consensus 79 ~~~~~~~~~~ilp~~~~~~-------------------------------~~-------------~~~~~---~l~--~~ 109 (1036)
+++.+++....+||....+ +. .++.+ .+. ..
T Consensus 133 aLlfneDi~~flpfhvnkndddev~~d~D~~~~~~~~~h~~i~psqgtntfnkrkrt~~~~kfsaPs~Vl~~seld~~ik 212 (1319)
T COG5161 133 ALLFNEDIGNFLPFHVNKNDDDEVRIDVDLGMFQMSKRHFSIFPSQGTNTFNKRKRTLFPGKFSAPSKVLKFSELDGKIK 212 (1319)
T ss_pred hhhhhhhhhhcccccccCCccccccccccccHHHHHHHHhhcCCCCCccccchhhhhhcCCcccCceeEEEehhhhcccc
Confidence 6678888888888753211 00 01111 122 26
Q ss_pred eEEEEEeecCCCCCEEEEEEEecCCcce----------EEEEEEeccCCceeeccccccccCCCCCEEEecCCCCcceEE
Q 001655 110 QVLDIKFLYGCAKPTIVVLYQDNKDARH----------VKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179 (1036)
Q Consensus 110 ~i~d~~FL~~~~~ptlavL~~~~~~~~~----------l~~~~~~~~~~~~~~~~~~~~~lp~~~~~lip~p~~~~G~lv 179 (1036)
+|+|++||.+|.+||+|+||++...|.+ ...+.+|++-.... .+-....||+|-+..+|+| .|.|+
T Consensus 213 niiD~~FL~ny~~PTvallY~Pkl~~~~~~ti~k~p~~~~v~Tldl~~~~sa-VI~~~~~lP~d~~~~v~~p---~Gall 288 (1319)
T COG5161 213 NIIDFVFLENYSIPTVALLYDPKLSLPRKYTILKNPYNAIVFTLDLGAGRSA-VIDEFLVLPRDFRVTVAGP---VGALL 288 (1319)
T ss_pred ccEEEEeeccCCCceEEEEecccccccceeEeecCceeEEEEEEecCcchhh-hhHhHhcCCceEEEEEecc---cceEE
Confidence 9999999999999999999998876631 22356666543322 2334568999999999988 49999
Q ss_pred EeCceEEEEcCCc-eeeeccCcce----------------eeEEE------EEe---cCCCEEEEEcCccCEEEEEEEec
Q 001655 180 IGEETIVYCSANA-FKAIPIRPSI----------------TKAYG------RVD---ADGSRYLLGDHAGLLHLLVITHE 233 (1036)
Q Consensus 180 ~~~~~i~y~~~~~-~~~~~~~~~~----------------~~~~~------~~~---~~~~~~ll~~~~G~l~~l~l~~~ 233 (1036)
+|.|.++|++..+ ..++|+|... +.|.. ++. .....+++++-+|+.|+|.+..|
T Consensus 289 ~g~neli~idstg~~~~I~lNs~~~k~~~~~~v~d~s~~d~n~~~~gttsIwipsSK~~~etl~l~dl~g~~yyl~~~~d 368 (1319)
T COG5161 289 FGSNELILIDSTGSSYTIPLNSMSEKYGGNKIVEDISLSDVNCFSRGTTSIWIPSSKCLIETLFLGDLNGDRYYLRISMD 368 (1319)
T ss_pred EecccEEEEecCCcEEEeechhhHHHhcCCceEeecccceeeEeecCceeeeccCcccccceEEEEecCCCEEEEEEEec
Confidence 9999999999887 4557766411 11111 111 12256799999999999999999
Q ss_pred CcEEeeEEEEEcC-------ccccceEEEEecCCeEEEEEeecCeeEEEEeeCC------------------C-------
Q 001655 234 KEKVTGLKIELLG-------ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP------------------D------- 281 (1036)
Q Consensus 234 ~~~v~~l~i~~~~-------~~~~~s~l~~l~~~~lF~gS~~gds~l~~~~~~~------------------~------- 281 (1036)
|+.+.++++..+. ..+.++|+..+++.++|.|+..+||.+++|.... |
T Consensus 369 gk~iigfdi~~L~~e~dllk~~s~~~Cv~~~n~~l~f~g~g~~ns~vlr~~~l~~tiEtR~~eG~~~l~g~nDeEmdD~y 448 (1319)
T COG5161 369 GKRIIGFDIASLEFEGDLLKKGSAVSCVGHVNNLLFFGGVGDSNSRVLRIKSLLPTIETRASEGVGPLEGGNDEEMDDEY 448 (1319)
T ss_pred cceeeccceeeeeeeccccccCCCCeeEEEcCceEEEEEecCCceEEEEecccCCchhhhhhcCCCcccCCChhhhhhhh
Confidence 9998887666542 4689999999999999999999999999998510 0
Q ss_pred ---C--------------CCCc-EEEeeecccCCceeeeEEeccC--------CCCCceEEEEeeCCCCccEEEEecCCc
Q 001655 282 ---A--------------KGSY-VEVLERYVNLGPIVDFCVVDLE--------RQGQGQVVTCSGAYKDGSLRIVRNGIG 335 (1036)
Q Consensus 282 ---~--------------~~~~-~~~~~~l~n~~Pi~D~~~~~~~--------~~~~~~lv~~sG~g~~g~l~~lr~gi~ 335 (1036)
| .++| ++.++.+.|.|||+||++++.+ +.+...+|+.+|.+..|+|.+++..+.
T Consensus 449 ~apEn~l~~n~~~~v~~~~~p~d~el~~~l~n~gpitdfavgkv~v~kglP~pN~g~l~lV~t~G~ds~~~l~V~~ts~~ 528 (1319)
T COG5161 449 SAPENKLFGNKEQEVRRQDEPYDAELFNALSNAGPITDFAVGKVDVEKGLPIPNIGLLNLVVTKGSDSEAALAVEGTSLE 528 (1319)
T ss_pred cccccccccCcccceeeccCcchhHHhhhhccCCcccceeeeeccceecCCCCCccceeeEEeccCCCcceEEEEecccc
Confidence 0 1123 6789999999999999998754 236678999999999999999999999
Q ss_pred eeEEEeeeCCCcceEEEeecCCC---CCcccEEEEEecCceEEEEecCCCCeeeeeecCeeccccceeeeecCC-CeEEE
Q 001655 336 INEQASVELQGIKGMWSLRSSTD---DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY-NQLVQ 411 (1036)
Q Consensus 336 ~~~~~~~~l~~~~~iw~l~~~~~---~~~~~~lv~S~~~~T~vl~~~~~~~~~e~~~~~f~~~~~Tl~~~~~~~-~~~vQ 411 (1036)
|++...+.+.+++.+|+++.... ..-..|+++|...+|.+|+.++ ++.+.....|..+..|++.+.+++ +++||
T Consensus 529 P~I~~~~~fi~~e~vw~~kI~g~lr~~~~~~~~~ls~~s~S~If~~~e--~f~l~~~g~~~rd~~Tl~~~~fgee~rvVQ 606 (1319)
T COG5161 529 PCICTVSSFIPLEIVWSQKIRGYLRCSRALDFYILSRVSDSRIFRWSE--EFLLEVSGEYTRDVNTLLFVEFGEENRVVQ 606 (1319)
T ss_pred ceeeehccccchhheeehhccceehhcceeeEEEeecccccceeeccc--cceeeecceeeccccEEEeeeccCcceEEE
Confidence 99999999999999999986431 1124689999999999998864 577777789999999999999985 59999
Q ss_pred EecCcEEEEecCCccceeeeeCCCCCcEEEEEeCCCEEEEEEcCCeEEEEEEcC--cEEEEEEeccC--CceeEEEEcCC
Q 001655 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD--GILTEVKHAQL--EYEISCLDINP 487 (1036)
Q Consensus 412 Vt~~~i~l~~~~~~~~~~~w~~~~~~~I~~as~~~~~vvv~~~~~~~~~l~~~~--~~l~~~~~~~l--~~~is~l~~~~ 487 (1036)
|||+.+++++.+- +..+...+.. +.|.+.++.+..+++..+++.+..|+.+. .++..+...+. +.....+-+..
T Consensus 607 vtp~~l~~yD~~l-R~l~~~~F~~-~~V~~~Sv~Dp~ilvv~~~g~i~~f~~~ekn~rL~k~dl~~~l~d~k~~s~v~~d 684 (1319)
T COG5161 607 VTPSYLLRYDQDL-RMLGRVEFAS-RAVEARSVRDPLILVVRDSGKILTFYDREKNMRLFKIDLVTCLADAKNKSFVLSD 684 (1319)
T ss_pred ecchHhhhhcccc-eeeeeEeece-eeeEEEeccCCEEEEEEecCceEEEEehhhhchhccCChHHHHHhhhhheEeccC
Confidence 9999999998774 4555555432 24777776655555555455555556543 12222111000 00001111100
Q ss_pred C--------CC-CCCcccEEEEEEecCCEEEEEECCCCCeeEE---------------ecCCCcc--Cc--eeEEeeeec
Q 001655 488 I--------GE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITK---------------EHLGGEI--IP--RSVLLCAFE 539 (1036)
Q Consensus 488 ~--------~~-~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~---------------~~l~~~~--~p--~si~~~~~~ 539 (1036)
. +- ..+....+..++..+..+....-+.+..+.. ..+..+. .| ..++++.++
T Consensus 685 sN~~g~f~ig~~~Sq~e~~l~~~~~~~~q~~~~~s~~~D~~~e~dg~dQlte~~~~~tynl~d~~f~lpsi~~~mVa~lg 764 (1319)
T COG5161 685 SNSLGIFDIGKRISQLEPCLVKGLPYAIQFSPEASPAMDLAGEEDGDDQLTEISMSLTYNLIDMLFRLPSIGNYMVAYLG 764 (1319)
T ss_pred cccccceecccchhhhchhhhhcCcccceeccccCcchhhccccccchhhhhHHHHHHHhhhhhhccChhhhhhhhHhhc
Confidence 0 00 0011112222222111111111111111111 1111111 12 122333333
Q ss_pred ---CeEEEEEEeCCCcEEEEEEecCCCc---cccce-eeec-C---------ccc---eEEEEEEeCCceEEEEEcCCce
Q 001655 540 ---GISYLLCALGDGHLLNFLLNMKTGE---LTDRK-KVSL-G---------TQP---ITLRTFSSKNTTHVFAASDRPT 599 (1036)
Q Consensus 540 ---~~~~L~vgl~~G~l~~y~~~~~~~~---l~~~~-~~~l-G---------~~p---v~l~~~~~~~~~~v~~~~~~p~ 599 (1036)
.++||+..+..|.++.|+-...... +.-.| ...+ | ... +.+...+..+++.+|++|..|+
T Consensus 765 ~D~keeyLf~~s~~~EI~~yk~~l~r~~~f~~nvTRndlAitGaPdna~~Ka~sSV~ri~m~f~~~vghs~~fvTg~~pf 844 (1319)
T COG5161 765 LDLKEEYLFDNSLSSEIVFYKTHLPRHVSFNLNVTRNDLAITGAPDNADIKAFSSVGRIDMVFIKAVGHSFMFVTGKGPF 844 (1319)
T ss_pred ccccchheehhhcCceEEEEeecccccchhhhhcchhhhhccCCCcchhhhhcccccceeEEEeeccCeEEEEEcCCccE
Confidence 4689999999999999986543211 00000 0001 1 111 2333335668999999999999
Q ss_pred EEEec-CCcEEEeeeccccceeeecccCCCCCCeEEEEe-CCeEEEEEEcCC-----CCeEEEEEeCCCccceEEEecCC
Q 001655 600 VIYSS-NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK-EGELTIGTIDDI-----QKLHIRSIPLGEHPRRICHQEQS 672 (1036)
Q Consensus 600 ~i~~~-~~~~~~~~~~~~~i~~~~~f~~~~~~~~l~~~~-~~~l~i~~l~~~-----~~~~~~~i~l~~tp~~i~y~~~~ 672 (1036)
++.+. +..-.+.+...-++.+++||+. .+.++++ ...+++|+.... +||+.++++++.|..+++||+..
T Consensus 845 l~~s~~~s~~k~f~~gNIPlvsv~p~s~----rgy~~Vd~~~~vr~~~~~~dn~y~gnK~p~k~~~~~Ktlqklvyh~~~ 920 (1319)
T COG5161 845 LCRSRYTSSSKAFHRGNIPLVSVIPLSK----RGYLMVDNVLGVRASQYVFDNGYVGNKNPVKRTPKHKTLQKLVYHCAG 920 (1319)
T ss_pred EEEEeccCCcceeecCCCceeeeeeccc----ccEEEEecccceeEEEEEeccceecccCceeeccccccccceeeeccc
Confidence 99764 2222333444457889999986 5777776 578899998765 48999999999999999999999
Q ss_pred CEEEEEEeec---CCCCCc---------------eeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcC-----CCc
Q 001655 673 RTFAICSLKN---QSCAEE---------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-----DSN 729 (1036)
Q Consensus 673 ~~~~v~~~~~---~~~~~~---------------~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~-----~~~ 729 (1036)
..|+|.++++ ++.+|+ ..++.+.|++|++|++||+|+|+++|.+.+++...+.. .++
T Consensus 921 ~~~~Vgsc~~~~f~~~gEdgE~~i~~D~Nvphaeg~~~~vdL~spksw~vID~yef~~ne~v~~i~~~~l~~~~~tk~k~ 1000 (1319)
T COG5161 921 RYMVVGSCEEAGFSPKGEDGESGIPVDTNVPHAEGYRFYVDLYSPKSWEVIDTYEFDENEYVFHIKYLILDDMQGTKGKS 1000 (1319)
T ss_pred eEEEEEeeeecCccccCCCCCccCccCCCCcccccceeeEEEecCcceeEeeeeecccceeeeeeeeeeeeccccccCCC
Confidence 9999998865 233332 25799999999999999999999999999999988764 348
Q ss_pred eEEEEEeeeeCCCCCCCcceEEEEEEEe-----------CCeEEEEEEEEecCceeEeccccCeEEEEECCEEEEEEeee
Q 001655 730 VYYCVGTAYVLPEENEPTKGRILVFIVE-----------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798 (1036)
Q Consensus 730 ~~i~VGT~~~~~~e~~~~~Gri~v~~i~-----------~~kl~~~~~~~~~g~v~~i~~~~g~Lv~~~g~~l~v~~~~~ 798 (1036)
+||+|||++.. +|+.|.+||+++|++. +.|||++..++++|.|..+|+++|+++.|+|+|+.+++++.
T Consensus 1001 pyi~vgtt~~~-gED~p~rG~~hv~eII~VVP~pg~P~t~~KLK~~~~Ee~kGTV~~vcEV~G~~~~~qgqKV~Vr~i~~ 1079 (1319)
T COG5161 1001 PYILVGTTFIE-GEDRPARGRLHVLEIISVVPSPGSPFTDCKLKVLGIEETKGTVVRVCEVRGKIALCQGQKVMVRKIDR 1079 (1319)
T ss_pred ceEEEEeeecc-cCccCCcCceEEEEEEEecCCCCCCcccceeeEEehhhcccEEEEEEEEccEEEeccCcEEEEEEecc
Confidence 99999999988 8999999999999999 46999999999999999999999999999999999999853
Q ss_pred cCCCccchhcccccccceEEEEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEEEEeeCc---eEEEE
Q 001655 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD---IYLGA 875 (1036)
Q Consensus 799 ~~~~~~~L~~~~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~~---~~l~~ 875 (1036)
+++ +.+.++++.+++..++++.+|++++||+|++++|+.|+.++.+|+.+++....+.+++.+||..+ .++++
T Consensus 1080 -~~~---iipV~F~Dl~~ft~s~k~~~Nlll~gD~~qg~~F~gF~~ePyRm~l~s~s~~~~n~~s~efLv~G~~lyf~~~ 1155 (1319)
T COG5161 1080 -SSG---IIPVGFYDLHIFTSSIKVVKNLLLAGDIYQGLSFFGFQSEPYRMHLISSSEPLRNATSTEFLVTGNELYFLCC 1155 (1319)
T ss_pred -cCC---cceeEEEeeeeeeehhhhhhheeehhhhhcCcEEEEecCCcEEEEEecCCchhhcchhhHhhccCCeeEEEEE
Confidence 334 88999999999999999999999999999999999999999999999999999999999999875 37899
Q ss_pred ccCCcEEEEeeCCCCCCcccccceeEEEEEEcCCccceEEeeeeeecCCC--CCCCCcceEEEEccccceEEEEecChhH
Q 001655 876 ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD--SDVGQIPTVIFGTVNGVIGVIASLPHEQ 953 (1036)
Q Consensus 876 D~~gnl~~l~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~l~~~~~~--~~~~~~~~il~~t~~GsIg~l~~l~~~~ 953 (1036)
|.+||++++.|.|+.+.+..|+||..++.||+|...++|.-. |+..+ +......+.+|+-+||++.+++||+++.
T Consensus 1156 Da~gnih~l~Y~P~np~S~sG~RLV~rssFtlhs~~~~m~ll---Prn~efG~~~~~~f~~v~~~sdG~l~~vvpisd~~ 1232 (1319)
T COG5161 1156 DAKGNIHGLTYSPNNPISMSGARLVKRSSFTLHSAEIKMNLL---PRNSEFGAGFKKNFIMVYSRSDGMLIHVVPISDAH 1232 (1319)
T ss_pred cCCCCEEEEecCCCCccccCcceeEeeccccccchhhhhhhc---cchhhhCCCCCCceeEEEEccCCcEEEEeccCHHH
Confidence 999999999999999999999999999999999999887432 22211 1233456899999999999999999999
Q ss_pred HHHHHHHHHHHHhcccCCCCCCccccccccccc-ccCCCCcEEehhHHHHHhcCCHHHHHHHHHhcCCCH-HHHHHHH
Q 001655 954 YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK-KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSV-EELCKRV 1029 (1036)
Q Consensus 954 ~~~L~~lq~~l~~~~~~~~gl~~~~~R~~~~~~-~~~~~~~~iDGdll~~fl~l~~~~q~~i~~~~~~~~-~~i~~~l 1029 (1036)
|++|..+|+++...+.++|||||++||-....+ ...+.+..+|+.++..|-.|+-+.|++||+++|+-. .+++.+|
T Consensus 1233 YrrL~~IQ~~i~~r~~~vgGLNpr~yRL~~d~~~~~~s~r~~ld~~ii~~F~y~~~~~r~sva~kaGr~~~~e~~D~i 1310 (1319)
T COG5161 1233 YRRLLGIQTAIMARLKSVGGLNPRDYRLNSDIHLHSLSLRSPLDLHIINLFSYFDMSTRESVASKAGRIDRKEISDMI 1310 (1319)
T ss_pred HHHHHHHHHHHHHHHHhhcCCChhhhhhccCHHHhcCCcccchhhhhhhhhhhcchhhhhHHHhhcCCchHHHHHHHH
Confidence 999999999999999999999999999865433 345778999999999999999999999999999854 4555443
|
|
| >PF10433 MMS1_N: Mono-functional DNA-alkylating methyl methanesulfonate N-term; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 2B5N_C 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A | Back alignment and domain information |
|---|
Probab=100.00 E-value=2.3e-62 Score=578.12 Aligned_cols=450 Identities=40% Similarity=0.665 Sum_probs=335.7
Q ss_pred cEEEEEeCcCcEEEEEEeCCCCcEEE--E-EeeeccCCCCccCCCCCeEEECCCCCEEEEEecCCeEEEEeCCCC----C
Q 001655 24 DFLFIATERYKFCVLQWDAESSELIT--R-AMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK----G 96 (1036)
Q Consensus 24 D~Llv~~~~~~l~il~~~~~~~~~~t--~-s~~~~~~~~~~~~~~~~~l~vdp~~r~~~~~~~~~~~~ilp~~~~----~ 96 (1036)
|+|+|+|+++++++++|+++++++.+ + ....+.+++.|+..+|++++|||+|||+|+++|+|.+.|+|+++. .
T Consensus 1 D~L~v~tdsg~l~~l~~~~~~~~~~~~~v~~~~~~~~~~~r~~~~G~~l~vDP~~R~i~v~a~e~~~~v~~l~~~~~~~~ 80 (504)
T PF10433_consen 1 DSLVVTTDSGKLSILEYDPSTHGFFKEFVHQWEPLSKSGSRLSQPGQYLAVDPSGRCIAVSAYEGNFLVYPLNRSLDSDI 80 (504)
T ss_dssp -EEEEEETTTEEEEEEEEEETTEE-E-EEEEEEE---SSSEB-TT--EEEE-TTSSEEEEEEBTTEEEEEE-SS----T-
T ss_pred CEEEEEECCCCEEEEEEECCCCccceeeEEEeEecCCCCCChhcCCcEEEECCcCCEEEEEecCCeEEEEEecccccccc
Confidence 89999999999999999998887743 2 235677888899999999999999999999999999999999881 1
Q ss_pred ccccceeccc-CCceEEEEEeec---CCCCCEEEEEEEecCCcceEEEEEEeccCC-ceee-----c--cccccccCCCC
Q 001655 97 QLKEAFNIRL-EELQVLDIKFLY---GCAKPTIVVLYQDNKDARHVKTYEVALKDK-DFVE-----G--PWSQNNLDNGA 164 (1036)
Q Consensus 97 ~~~~~~~~~l-~~~~i~d~~FL~---~~~~ptlavL~~~~~~~~~l~~~~~~~~~~-~~~~-----~--~~~~~~lp~~~ 164 (1036)
.....++.|+ ++.+|+|||||+ ++++|+||+||.+...+.++.+|+|+..+. +.+. . .+...++ +
T Consensus 81 ~~~~~~~~pi~s~~~i~~~~FL~~~~~~~~p~la~L~~~~~~~~~~~~y~w~~~~~l~~~~~~~~~~~~l~~~~~~---p 157 (504)
T PF10433_consen 81 AFSPHINSPIKSEGNILDMCFLHPSVGYDNPTLAILYVDSQRRTHLVTYEWSLDDGLNHVISKSTLPIRLPNEDEL---P 157 (504)
T ss_dssp TT---EEEE--S-SEEEEEEEES---S-SS-EEEEEEEETT-EEEEEEEE--------EETTTTEEEE--EEEE-T---T
T ss_pred cccccccccccCCceEEEEEEEecccCCCCceEEEEEEEecccceeEEEeeecccccceeeeeccccccccccCCC---c
Confidence 2223345666 678999999999 899999999999988888999999877641 1111 1 1222233 4
Q ss_pred CEEEecCCCCcceEEEeCceEEEEcCCc------eeeeccC----cceeeEEEE------EecCCCEEEEEcCccCEEEE
Q 001655 165 DLLIPVPPPLCGVLIIGEETIVYCSANA------FKAIPIR----PSITKAYGR------VDADGSRYLLGDHAGLLHLL 228 (1036)
Q Consensus 165 ~~lip~p~~~~G~lv~~~~~i~y~~~~~------~~~~~~~----~~~~~~~~~------~~~~~~~~ll~~~~G~l~~l 228 (1036)
.+|||+|.|.||+||++++.++|.++.. ...++.+ ...+++|++ ...+++++||++++|+||++
T Consensus 158 ~~LIPlp~~~ggllV~~~~~i~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~lL~~e~G~l~~l 237 (504)
T PF10433_consen 158 SFLIPLPNPPGGLLVGGENIIIYKNHLIGSGDYSFLSIPSPPSSSSSLWTSWARPERNISYDKDGDRILLQDEDGDLYLL 237 (504)
T ss_dssp EEEEEE-TTT-SEEEEESSEEEEEE------TTEEEEE--H-HHHTS-EEEEEE------SSTTSSEEEEEETTSEEEEE
T ss_pred cEEEEcCCCCcEEEEECCEEEEEecccccccccccccccCCccCCCceEEEEEeccccceecCCCCEEEEEeCCCeEEEE
Confidence 9999999998999999999999997532 2344443 467888876 34567899999999999999
Q ss_pred EEEecCcEEeeEEEEEcCc-cccceEEEEecCC--eEEEEEeecCeeEEEEeeCCCCCCCcEEEeeecccCCceeeeEEe
Q 001655 229 VITHEKEKVTGLKIELLGE-TSIASTISYLDNA--VVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVV 305 (1036)
Q Consensus 229 ~l~~~~~~v~~l~i~~~~~-~~~~s~l~~l~~~--~lF~gS~~gds~l~~~~~~~~~~~~~~~~~~~l~n~~Pi~D~~~~ 305 (1036)
.+..+++ ++++.++|+ .++|+++++++++ +||+||++|||+++++... +++++++++|||||+||+++
T Consensus 238 ~l~~~~~---~i~i~~~g~~~~~~s~l~~l~~g~d~lf~gs~~gds~l~~~~~~------~l~~~~~~~N~~Pi~D~~v~ 308 (504)
T PF10433_consen 238 TLDNDGG---SISITYLGTLCSIASSLTYLKNGGDYLFVGSEFGDSQLLQISLS------NLEVLDSLPNWGPIVDFCVV 308 (504)
T ss_dssp EEEEEEE---EEEEEEEEE--S-ESEEEEESTT--EEEEEESSS-EEEEEEESE------SEEEEEEE----SEEEEEEE
T ss_pred EEEECCC---eEEEEEcCCcCChhheEEEEcCCCEEEEEEEecCCcEEEEEeCC------CcEEEEeccCcCCccceEEe
Confidence 9998875 789999999 8999999999999 9999999999999999853 38999999999999999999
Q ss_pred ccCCCCCc------eEEEEeeCCCCccEEEEecCCcee--EEEeeeCCCcceEEEeecCCCCCcccEEEEEecCceEEEE
Q 001655 306 DLERQGQG------QVVTCSGAYKDGSLRIVRNGIGIN--EQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILA 377 (1036)
Q Consensus 306 ~~~~~~~~------~lv~~sG~g~~g~l~~lr~gi~~~--~~~~~~l~~~~~iw~l~~~~~~~~~~~lv~S~~~~T~vl~ 377 (1036)
+...+++. +||+|||.|++|+|+++|+|++++ ..+..++++++++|+++....+ +.||++|++++|+||+
T Consensus 309 ~~~~~~~~~~~~~~~lv~~sG~g~~gsL~~lr~Gi~~~~~~~~~~~l~~v~~iW~l~~~~~~--~~~lv~S~~~~T~vl~ 386 (504)
T PF10433_consen 309 DSSNSGQPSNPSSDQLVACSGAGKRGSLRILRNGIGIEGLELASSELPGVTGIWTLKLSSSD--HSYLVLSFPNETRVLQ 386 (504)
T ss_dssp -TSSSSS-------EEEEEESSGGG-EEEEEEESBEEE--EEEEEEESTEEEEEEE-SSSSS--BSEEEEEESSEEEEEE
T ss_pred ccccCCCCcccccceEEEEECcCCCCcEEEEeccCCceeeeeeccCCCCceEEEEeeecCCC--ceEEEEEcCCceEEEE
Confidence 87655555 999999999999999999999999 8888999999999999875432 8999999999999999
Q ss_pred ecC---CCCeeeeeecCeeccccceeeeecCCCeEEEEecCcEEEEecCCccceeeeeCCCCCcEEEEEeCCCEEEEEEc
Q 001655 378 MNL---EDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATG 454 (1036)
Q Consensus 378 ~~~---~~~~~e~~~~~f~~~~~Tl~~~~~~~~~~vQVt~~~i~l~~~~~~~~~~~w~~~~~~~I~~as~~~~~vvv~~~ 454 (1036)
++. .++++|++..+|.++++||+||+++++.+||||+++||+++.+..+...+|.+|.+..|++|++++.+++++..
T Consensus 387 ~~~~d~~e~~~e~~~~~f~~~~~Tl~~~~~~~~~ivQVt~~~i~l~~~~~~~~~~~w~~~~~~~I~~a~~~~~~v~v~~~ 466 (504)
T PF10433_consen 387 ISEGDDGEEVEEVEEDGFDTDEPTLAAGNVGDGRIVQVTPKGIRLIDLEDGKLTQEWKPPAGSIIVAASINDPQVLVALS 466 (504)
T ss_dssp ES----SSEEEEE---TS-SSS-EEEEEEETTTEEEEEESSEEEEEESSSTSEEEEEE-TTS---SEEEESSSEEEEEE-
T ss_pred EecccCCcchhhhhhccCCCCCCCeEEEEcCCCeEEEEecCeEEEEECCCCeEEEEEeCCCCCeEEEEEECCCEEEEEEe
Confidence 962 34677774459999999999999999999999999999998766678889999999999999999999999999
Q ss_pred CCeEEEEEEcCcEEEEEEe----ccCCceeEEEEcCC
Q 001655 455 GGHLVYLEIGDGILTEVKH----AQLEYEISCLDINP 487 (1036)
Q Consensus 455 ~~~~~~l~~~~~~l~~~~~----~~l~~~is~l~~~~ 487 (1036)
++.+++|+++...+..... .+++.+|+|++++|
T Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~eis~l~i~p 503 (504)
T PF10433_consen 467 GGELVYFELDDNKISVSDNDETILELDNEISCLSIEP 503 (504)
T ss_dssp TTEEEEEEEETTEEEEEEE----EE-SS-EEEEE---
T ss_pred CCcEEEEEEECCceeeeeeccccccCCCceEEEEeCC
Confidence 9999999998765543322 24789999999876
|
... |
| >PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit | Back alignment and domain information |
|---|
Probab=100.00 E-value=1.1e-50 Score=451.71 Aligned_cols=310 Identities=38% Similarity=0.653 Sum_probs=257.8
Q ss_pred eEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCC---CceEEEEEeeeeCCCCCCCcceEEEEEEEeC-----CeEE
Q 001655 691 MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDD---SNVYYCVGTAYVLPEENEPTKGRILVFIVED-----GKLQ 762 (1036)
Q Consensus 691 ~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~---~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~-----~kl~ 762 (1036)
+|+|+++|+.+|+++++++|+++|+++|++.|+|..+ .++||+|||++..+++..+.+|||++|++.+ .+++
T Consensus 1 ~s~i~l~d~~~~~~~~~~~l~~~E~~~s~~~~~l~~~~~~~~~~ivVGT~~~~~~~~~~~~Gri~v~~i~~~~~~~~~l~ 80 (321)
T PF03178_consen 1 ASSIRLVDPTTFEVLDSFELEPNEHVTSLCSVKLKGDSTGKKEYIVVGTAFNYGEDPEPSSGRILVFEISESPENNFKLK 80 (321)
T ss_dssp --EEEEEETTTSSEEEEEEEETTEEEEEEEEEEETTS---SSEEEEEEEEE--TTSSS-S-EEEEEEEECSS-----EEE
T ss_pred CcEEEEEeCCCCeEEEEEECCCCceEEEEEEEEEcCccccccCEEEEEecccccccccccCcEEEEEEEEcccccceEEE
Confidence 3789999999999999999999999999999999875 4899999999998544444559999999997 3999
Q ss_pred EEEEEEecCceeEeccccCeEEEEECCEEEEEEeeecCCCccchhcccccccceEEEEEEEeCCEEEEEeccccEEEEEE
Q 001655 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842 (1036)
Q Consensus 763 ~~~~~~~~g~v~~i~~~~g~Lv~~~g~~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~ 842 (1036)
++++++++|||++|+.++|+|++|+|++|++|+|+.+ .+|.+.|+++.+.+++++.+.+|+|++||+++|+++++|
T Consensus 81 ~i~~~~~~g~V~ai~~~~~~lv~~~g~~l~v~~l~~~----~~l~~~~~~~~~~~i~sl~~~~~~I~vgD~~~sv~~~~~ 156 (321)
T PF03178_consen 81 LIHSTEVKGPVTAICSFNGRLVVAVGNKLYVYDLDNS----KTLLKKAFYDSPFYITSLSVFKNYILVGDAMKSVSLLRY 156 (321)
T ss_dssp EEEEEEESS-EEEEEEETTEEEEEETTEEEEEEEETT----SSEEEEEEE-BSSSEEEEEEETTEEEEEESSSSEEEEEE
T ss_pred EEEEEeecCcceEhhhhCCEEEEeecCEEEEEEccCc----ccchhhheecceEEEEEEeccccEEEEEEcccCEEEEEE
Confidence 9999999999999999999999999999999999632 258899999999999999999999999999999999999
Q ss_pred eccCCeEEEEeccCCcceeEEEEEe-eCceEEEEccCCcEEEEeeCCCCCCccccc-ceeEEEEEEcCCccceEEeeeee
Q 001655 843 KHEEGAIEERARDYNANWMSAVEIL-DDDIYLGAENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLV 920 (1036)
Q Consensus 843 ~~~~~~l~~~a~D~~~~~~~~~~~l-~~~~~l~~D~~gnl~~l~~~~~~~~~~~~~-~L~~~~~~~lg~~v~~~~~~~l~ 920 (1036)
++++.+|.++|||+.++|+++++++ +++.++++|++||+++|+++++..++.++. +|+..++||+|+.|++++++++.
T Consensus 157 ~~~~~~l~~va~d~~~~~v~~~~~l~d~~~~i~~D~~gnl~~l~~~~~~~~~~~~~~~L~~~~~f~lg~~v~~~~~~~l~ 236 (321)
T PF03178_consen 157 DEENNKLILVARDYQPRWVTAAEFLVDEDTIIVGDKDGNLFVLRYNPEIPNSRDGDPKLERISSFHLGDIVNSFRRGSLI 236 (321)
T ss_dssp ETTTE-EEEEEEESS-BEEEEEEEE-SSSEEEEEETTSEEEEEEE-SS-SSTTTTTTBEEEEEEEE-SS-EEEEEE--SS
T ss_pred EccCCEEEEEEecCCCccEEEEEEecCCcEEEEEcCCCeEEEEEECCCCcccccccccceeEEEEECCCccceEEEEEee
Confidence 9988889999999999999999999 667899999999999999998888777788 99999999999999999999876
Q ss_pred ecCCCCCCCCcceEEEEccccceEEEEe-cChhHHHHHHHHHHHHHhcccCCCCCCcccccccccccccCCCCcEEehhH
Q 001655 921 MRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDL 999 (1036)
Q Consensus 921 ~~~~~~~~~~~~~il~~t~~GsIg~l~~-l~~~~~~~L~~lq~~l~~~~~~~~gl~~~~~R~~~~~~~~~~~~~~iDGdl 999 (1036)
++..+.+....+.++|+|.+|+||.+.| +++++|++|+.||+.|.+..+++||++|++||+++++ +...+++||||||
T Consensus 237 ~~~~~~~~~~~~~i~~~T~~G~Ig~l~p~l~~~~~~~L~~lQ~~l~~~~~~~~gl~~~~~R~~~~~-~~~~~~~~iDgdl 315 (321)
T PF03178_consen 237 PRSGSSESPNRPQILYGTVDGSIGVLIPFLSEEEYRFLQALQNNLRKHIPSLGGLNPRSFRSYKNP-RMKRSKGFIDGDL 315 (321)
T ss_dssp --SSSS-TTEEEEEEEEETTS-EEEEEE-E-HHHHHHHHHHHHHHHHHS--TTS--HHHHTSEEES-EEE--BSEEEHHH
T ss_pred ecCCCCcccccceEEEEecCCEEEEEEecCCHHHHHHHHHHHHHHHhhCCCCccCChHHhccccCc-cccCCCccCcHHH
Confidence 5411222224678999999999999999 8999999999999999999999999999999999986 3333899999999
Q ss_pred HHHHhc
Q 001655 1000 IESFLD 1005 (1036)
Q Consensus 1000 l~~fl~ 1005 (1036)
|++|++
T Consensus 316 l~~fl~ 321 (321)
T PF03178_consen 316 LEQFLE 321 (321)
T ss_dssp HHGGGG
T ss_pred HHHhhC
Confidence 999985
|
CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A .... |
| >KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton] | Back alignment and domain information |
|---|
Probab=98.91 E-value=2.5e-05 Score=85.49 Aligned_cols=215 Identities=16% Similarity=0.207 Sum_probs=138.2
Q ss_pred CCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEe
Q 001655 638 EGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGC 717 (1036)
Q Consensus 638 ~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~ 717 (1036)
++.|+.-.+....-.....++++.+|+-++-.++..+.+|+|.. .|.++...+.-..-...|+ +.
T Consensus 383 Dd~l~~~~~~~~~~t~~~~~~lg~QP~~lav~~d~~~avv~~~~-----------~iv~l~~~~~~~~~~~~y~----~s 447 (603)
T KOG0318|consen 383 DDTLRVISLKDNGYTKSEVVKLGSQPKGLAVLSDGGTAVVACIS-----------DIVLLQDQTKVSSIPIGYE----SS 447 (603)
T ss_pred CCeEEEEecccCcccccceeecCCCceeEEEcCCCCEEEEEecC-----------cEEEEecCCcceeeccccc----cc
Confidence 55565555433222223347999999999999888888888874 4666653332111112222 23
Q ss_pred EEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeE-EEEEEEEecCceeEeccc-cCeEEEE--ECCEEEE
Q 001655 718 SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKL-QLIAEKETKGAVYSLNAF-NGKLLAA--INQKIQL 793 (1036)
Q Consensus 718 si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl-~~~~~~~~~g~v~~i~~~-~g~Lv~~--~g~~l~v 793 (1036)
|++. ...+.+++||. .-|.+++|.+.+..+ +.....+.++++++|+.- ++..+++ ...++.+
T Consensus 448 ~vAv----~~~~~~vaVGG----------~Dgkvhvysl~g~~l~ee~~~~~h~a~iT~vaySpd~~yla~~Da~rkvv~ 513 (603)
T KOG0318|consen 448 AVAV----SPDGSEVAVGG----------QDGKVHVYSLSGDELKEEAKLLEHRAAITDVAYSPDGAYLAAGDASRKVVL 513 (603)
T ss_pred eEEE----cCCCCEEEEec----------ccceEEEEEecCCcccceeeeecccCCceEEEECCCCcEEEEeccCCcEEE
Confidence 3332 12478899995 237799999987654 445667899999999966 4444444 3468889
Q ss_pred EEeeecCCCccchhcccccccceEEEEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEEEEeeCceEE
Q 001655 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873 (1036)
Q Consensus 794 ~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~~~~l 873 (1036)
|+.+.+ ..+..+.+++...+..++-+....++..|-+-..|.++..+. |.+- ...++.++..++.+.++++.+++
T Consensus 514 yd~~s~---~~~~~~w~FHtakI~~~aWsP~n~~vATGSlDt~Viiysv~k-P~~~-i~iknAH~~gVn~v~wlde~tvv 588 (603)
T KOG0318|consen 514 YDVASR---EVKTNRWAFHTAKINCVAWSPNNKLVATGSLDTNVIIYSVKK-PAKH-IIIKNAHLGGVNSVAWLDESTVV 588 (603)
T ss_pred EEcccC---ceecceeeeeeeeEEEEEeCCCceEEEeccccceEEEEEccC-hhhh-eEeccccccCceeEEEecCceEE
Confidence 987532 224444555544443333332334666788888888776653 2222 67788889899999999999999
Q ss_pred EEccCCcEEEEee
Q 001655 874 GAENNFNLFTVRK 886 (1036)
Q Consensus 874 ~~D~~gnl~~l~~ 886 (1036)
.+-.+.|+.+...
T Consensus 589 SsG~Da~iK~W~v 601 (603)
T KOG0318|consen 589 SSGQDANIKVWNV 601 (603)
T ss_pred eccCcceeEEecc
Confidence 8888888877653
|
|
| >KOG2048 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=98.89 E-value=4.6e-05 Score=86.07 Aligned_cols=408 Identities=16% Similarity=0.149 Sum_probs=213.4
Q ss_pred CCCcEEEEEeC--CCEEEEEEcCCeEEEEEEcCcEEEEEEeccC-CceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEE
Q 001655 435 PGYSVNVATAN--ASQVLLATGGGHLVYLEIGDGILTEVKHAQL-EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511 (1036)
Q Consensus 435 ~~~~I~~as~~--~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l-~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~ 511 (1036)
.|..|=+++++ +..+.|.+++|.+.++.+.++.++....+.- ...+-|++..+ ....++.|+- ||-|.+|
T Consensus 109 ~gg~IWsiai~p~~~~l~IgcddGvl~~~s~~p~~I~~~r~l~rq~sRvLslsw~~------~~~~i~~Gs~-Dg~Iriw 181 (691)
T KOG2048|consen 109 NGGAIWSIAINPENTILAIGCDDGVLYDFSIGPDKITYKRSLMRQKSRVLSLSWNP------TGTKIAGGSI-DGVIRIW 181 (691)
T ss_pred CCcceeEEEeCCccceEEeecCCceEEEEecCCceEEEEeecccccceEEEEEecC------CccEEEeccc-CceEEEE
Confidence 45567667766 5688888888877777777776665433322 25677888864 2456888988 9999999
Q ss_pred ECCCCCeeEE-----ecCCC--ccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCccccceeeecCccceEEEEE-
Q 001655 512 SLPDLNLITK-----EHLGG--EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF- 583 (1036)
Q Consensus 512 ~l~~l~~v~~-----~~l~~--~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~- 583 (1036)
+......+.. ..+.. ..+.=|+++++ ...+..|-..|.+-.+... .+.|........+. | ..+
T Consensus 182 d~~~~~t~~~~~~~~d~l~k~~~~iVWSv~~Lr---d~tI~sgDS~G~V~FWd~~--~gTLiqS~~~h~ad--V--l~La 252 (691)
T KOG2048|consen 182 DVKSGQTLHIITMQLDRLSKREPTIVWSVLFLR---DSTIASGDSAGTVTFWDSI--FGTLIQSHSCHDAD--V--LALA 252 (691)
T ss_pred EcCCCceEEEeeecccccccCCceEEEEEEEee---cCcEEEecCCceEEEEccc--Ccchhhhhhhhhcc--e--eEEE
Confidence 9987555442 12221 12223444442 4457777778877765433 34333221111110 1 111
Q ss_pred EeCCceEEEEEcCCceEE-EecCCc---EE---EeeeccccceeeecccCCCCCCeEEEE--eCCeEEEEEEcCCCCe-E
Q 001655 584 SSKNTTHVFAASDRPTVI-YSSNKK---LL---YSNVNLKEVSHMCPFNSAAFPDSLAIA--KEGELTIGTIDDIQKL-H 653 (1036)
Q Consensus 584 ~~~~~~~v~~~~~~p~~i-~~~~~~---~~---~~~~~~~~i~~~~~f~~~~~~~~l~~~--~~~~l~i~~l~~~~~~-~ 653 (1036)
-.++++.+|+.|-.|-++ |...++ .. ....+...+.+++.+.+ +++. .+..|.++........ +
T Consensus 253 v~~~~d~vfsaGvd~~ii~~~~~~~~~~wv~~~~r~~h~hdvrs~av~~~------~l~sgG~d~~l~i~~s~~~~~~~h 326 (691)
T KOG2048|consen 253 VADNEDRVFSAGVDPKIIQYSLTTNKSEWVINSRRDLHAHDVRSMAVIEN------ALISGGRDFTLAICSSREFKNMDH 326 (691)
T ss_pred EcCCCCeEEEccCCCceEEEEecCCccceeeeccccCCcccceeeeeecc------eEEecceeeEEEEccccccCchhh
Confidence 123457888888555444 443222 11 11222345667776643 2332 2445566655543222 2
Q ss_pred EEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceee-EEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEE
Q 001655 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM-HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732 (1036)
Q Consensus 654 ~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~-~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i 732 (1036)
.++.+....+ ++.-.++.++++ .-... ..+.|+ ++.-.=-...+.-+..+..++.|.+.|.+.. ...+.|
T Consensus 327 ~~~~~~p~~~-~v~~a~~~~L~~-~w~~h---~v~lwrlGS~~~~g~~~~~~Llkl~~k~~~nIs~~aiS----Pdg~~I 397 (691)
T KOG2048|consen 327 RQKNLFPASD-RVSVAPENRLLV-LWKAH---GVDLWRLGSVILQGEYNYIHLLKLFTKEKENISCAAIS----PDGNLI 397 (691)
T ss_pred hccccccccc-eeecCccceEEE-Eeccc---cccceeccCcccccccChhhheeeecCCccceeeeccC----CCCCEE
Confidence 2222222221 122222222222 21111 000010 1110000123344556777888888776553 236899
Q ss_pred EEEeeeeCCCCCCCcceEEEEEEEe-CCeEEEEEEEEec---CceeEeccc--cCeEEEEECCEEEEEEeeecCCCccch
Q 001655 733 CVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIAEKETK---GAVYSLNAF--NGKLLAAINQKIQLYKWMLRDDGTREL 806 (1036)
Q Consensus 733 ~VGT~~~~~~e~~~~~Gri~v~~i~-~~kl~~~~~~~~~---g~v~~i~~~--~g~Lv~~~g~~l~v~~~~~~~~~~~~L 806 (1036)
++||.. |+.+|++. .+++++....+.+ -++.++.-. +++++++..+...+..++......+++
T Consensus 398 a~st~~-----------~~~iy~L~~~~~vk~~~v~~~~~~~~~a~~i~ftid~~k~~~~s~~~~~le~~el~~ps~kel 466 (691)
T KOG2048|consen 398 AISTVS-----------RTKIYRLQPDPNVKVINVDDVPLALLDASAISFTIDKNKLFLVSKNIFSLEEFELETPSFKEL 466 (691)
T ss_pred EEeecc-----------ceEEEEeccCcceeEEEeccchhhhccceeeEEEecCceEEEEecccceeEEEEecCcchhhh
Confidence 999963 44556555 2233333322221 123333322 456655554444444443333344666
Q ss_pred hcccccccceEE--EEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEEEEe--eCceEEEEccCCcEE
Q 001655 807 QSECGHHGHILA--LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL--DDDIYLGAENNFNLF 882 (1036)
Q Consensus 807 ~~~~~~~~~~~~--~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l--~~~~~l~~D~~gnl~ 882 (1036)
...........+ +-...+||+|.|.+....+.++ +-+..+..++.-+.. ..+|++.+- +.+++++++.++.++
T Consensus 467 ~~~~~~~~~~~I~~l~~SsdG~yiaa~~t~g~I~v~--nl~~~~~~~l~~rln-~~vTa~~~~~~~~~~lvvats~nQv~ 543 (691)
T KOG2048|consen 467 KSIQSQAKCPSISRLVVSSDGNYIAAISTRGQIFVY--NLETLESHLLKVRLN-IDVTAAAFSPFVRNRLVVATSNNQVF 543 (691)
T ss_pred hccccccCCCcceeEEEcCCCCEEEEEeccceEEEE--EcccceeecchhccC-cceeeeeccccccCcEEEEecCCeEE
Confidence 554332111222 2345689999999977777654 556666677775555 788888776 345799999999999
Q ss_pred EEeeC
Q 001655 883 TVRKN 887 (1036)
Q Consensus 883 ~l~~~ 887 (1036)
-|..+
T Consensus 544 efdi~ 548 (691)
T KOG2048|consen 544 EFDIE 548 (691)
T ss_pred EEecc
Confidence 99884
|
|
| >KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning] | Back alignment and domain information |
|---|
Probab=98.13 E-value=0.00055 Score=70.70 Aligned_cols=257 Identities=20% Similarity=0.255 Sum_probs=149.4
Q ss_pred ceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEE
Q 001655 478 YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFL 557 (1036)
Q Consensus 478 ~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~ 557 (1036)
..||.+.+++ .+..++|++| ||++.+|+.+..++....... . -++-+.+.+....+.|.-||.+..|.
T Consensus 14 d~IS~v~f~~------~~~~LLvssW-DgslrlYdv~~~~l~~~~~~~--~---plL~c~F~d~~~~~~G~~dg~vr~~D 81 (323)
T KOG1036|consen 14 DGISSVKFSP------SSSDLLVSSW-DGSLRLYDVPANSLKLKFKHG--A---PLLDCAFADESTIVTGGLDGQVRRYD 81 (323)
T ss_pred hceeeEEEcC------cCCcEEEEec-cCcEEEEeccchhhhhheecC--C---ceeeeeccCCceEEEeccCceEEEEE
Confidence 4578888864 2578999999 999999999875432211111 1 12223455567899999999999999
Q ss_pred EecCCCccccceeeecCc--cceEEEEEEeCCceEEEEEcCCceEEEecCCcEEEeeec----------cccceeeeccc
Q 001655 558 LNMKTGELTDRKKVSLGT--QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVN----------LKEVSHMCPFN 625 (1036)
Q Consensus 558 ~~~~~~~l~~~~~~~lG~--~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~~~~~~~~~~----------~~~i~~~~~f~ 625 (1036)
+...+ ..++|+ .|++..... -..+.+++|. .++++.+.... .+.+.++.
T Consensus 82 ln~~~-------~~~igth~~~i~ci~~~--~~~~~vIsgs-------WD~~ik~wD~R~~~~~~~~d~~kkVy~~~--- 142 (323)
T KOG1036|consen 82 LNTGN-------EDQIGTHDEGIRCIEYS--YEVGCVISGS-------WDKTIKFWDPRNKVVVGTFDQGKKVYCMD--- 142 (323)
T ss_pred ecCCc-------ceeeccCCCceEEEEee--ccCCeEEEcc-------cCccEEEEeccccccccccccCceEEEEe---
Confidence 87543 233443 455433221 1223334332 12223222111 11121111
Q ss_pred CCCCCCeEEE-EeCCeEEEEEEcCCC-CeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCC--
Q 001655 626 SAAFPDSLAI-AKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT-- 701 (1036)
Q Consensus 626 ~~~~~~~l~~-~~~~~l~i~~l~~~~-~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t-- 701 (1036)
-..+.|+. ..+.++.+..|.... .+..|.-++.+..|.|+..|+..-|++.+.+ .+-.++++|+..
T Consensus 143 --v~g~~LvVg~~~r~v~iyDLRn~~~~~q~reS~lkyqtR~v~~~pn~eGy~~sSie--------GRVavE~~d~s~~~ 212 (323)
T KOG1036|consen 143 --VSGNRLVVGTSDRKVLIYDLRNLDEPFQRRESSLKYQTRCVALVPNGEGYVVSSIE--------GRVAVEYFDDSEEA 212 (323)
T ss_pred --ccCCEEEEeecCceEEEEEcccccchhhhccccceeEEEEEEEecCCCceEEEeec--------ceEEEEccCCchHH
Confidence 12344555 557777778777664 3455666788888999999988888888874 344667776641
Q ss_pred ------ceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeC-CeEEEEEEEEecCcee
Q 001655 702 ------FEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED-GKLQLIAEKETKGAVY 774 (1036)
Q Consensus 702 ------~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~-~kl~~~~~~~~~g~v~ 774 (1036)
|+.. +..-+..|.+..+..+.|.. .+..++-|. .-|.+.+|+..+ ++|+.+++.+..=+..
T Consensus 213 ~skkyaFkCH-r~~~~~~~~~yPVNai~Fhp-~~~tfaTgG----------sDG~V~~Wd~~~rKrl~q~~~~~~SI~sl 280 (323)
T KOG1036|consen 213 QSKKYAFKCH-RLSEKDTEIIYPVNAIAFHP-IHGTFATGG----------SDGIVNIWDLFNRKRLKQLAKYETSISSL 280 (323)
T ss_pred hhhceeEEee-ecccCCceEEEEeceeEecc-ccceEEecC----------CCceEEEccCcchhhhhhccCCCCceEEE
Confidence 1111 11225557788888887753 234455443 459999998884 4688888764333333
Q ss_pred EeccccCeEEEEE
Q 001655 775 SLNAFNGKLLAAI 787 (1036)
Q Consensus 775 ~i~~~~g~Lv~~~ 787 (1036)
+.+.-...|.+|.
T Consensus 281 sfs~dG~~LAia~ 293 (323)
T KOG1036|consen 281 SFSMDGSLLAIAS 293 (323)
T ss_pred EeccCCCeEEEEe
Confidence 4333333555554
|
|
| >KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=98.04 E-value=0.054 Score=62.83 Aligned_cols=445 Identities=15% Similarity=0.113 Sum_probs=224.6
Q ss_pred ecCCCEEEEEcCccCEEEEEEEecCcEEeeEEEEEcCccccceEEEEecCCeEEEEEeecCeeEEEEeeCCC-CCCCcEE
Q 001655 210 DADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPD-AKGSYVE 288 (1036)
Q Consensus 210 ~~~~~~~ll~~~~G~l~~l~l~~~~~~v~~l~i~~~~~~~~~s~l~~l~~~~lF~gS~~gds~l~~~~~~~~-~~~~~~~ 288 (1036)
.+++..++..||.|+..++.+.. ...+.++.-.....|+..-.||-+|+.+....-++++.+..-. +-.. +.
T Consensus 64 Sp~g~lllavdE~g~~~lvs~~~------r~Vlh~f~fk~~v~~i~fSPng~~fav~~gn~lqiw~~P~~~~~~~~p-Fv 136 (893)
T KOG0291|consen 64 SPDGTLLLAVDERGRALLVSLLS------RSVLHRFNFKRGVGAIKFSPNGKFFAVGCGNLLQIWHAPGEIKNEFNP-FV 136 (893)
T ss_pred CCCceEEEEEcCCCcEEEEeccc------ceeeEEEeecCccceEEECCCCcEEEEEecceeEEEecCcchhcccCc-ce
Confidence 34666888899999999998853 2233333223456788888899999887666667776654321 1112 21
Q ss_pred EeeecccCCceeeeEEeccCCCCCceEEEEeeCCCCccEEEEe---------------------cCCceeEEEeeeCCCc
Q 001655 289 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVR---------------------NGIGINEQASVELQGI 347 (1036)
Q Consensus 289 ~~~~l~n~~Pi~D~~~~~~~~~~~~~lv~~sG~g~~g~l~~lr---------------------~gi~~~~~~~~~l~~~ 347 (1036)
.. =.+.|+--|++-++-. ...++++..+....-.|--++ -|..-..+....-.|.
T Consensus 137 l~--r~~~g~fddi~si~Ws--~DSr~l~~gsrD~s~rl~~v~~~k~~~~~~l~gHkd~VvacfF~~~~~~l~tvskdG~ 212 (893)
T KOG0291|consen 137 LH--RTYLGHFDDITSIDWS--DDSRLLVTGSRDLSARLFGVDGNKNLFTYALNGHKDYVVACFFGANSLDLYTVSKDGA 212 (893)
T ss_pred Ee--eeecCCccceeEEEec--cCCceEEeccccceEEEEEeccccccceEeccCCCcceEEEEeccCcceEEEEecCce
Confidence 11 2345566566554421 112222222211111111111 1111111111222344
Q ss_pred ceEEEeecCCCCCcccEEEEEecCceEEEEecCCCCeee---------eeecCeeccccceeeeecCCC--eEEEEecCc
Q 001655 348 KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE---------TEIEGFCSQTQTLFCHDAIYN--QLVQVTSGS 416 (1036)
Q Consensus 348 ~~iw~l~~~~~~~~~~~lv~S~~~~T~vl~~~~~~~~~e---------~~~~~f~~~~~Tl~~~~~~~~--~~vQVt~~~ 416 (1036)
--+|+....+...... =--....+.. ...+ +-|| .....|.-+..-+.|+.+..+ .++--..++
T Consensus 213 l~~W~~~~~P~~~~~~--~kd~eg~~d~-~~~~--~~Eek~~~~~~~k~~k~~ln~~~~kvtaa~fH~~t~~lvvgFssG 287 (893)
T KOG0291|consen 213 LFVWTCDLRPPELDKA--EKDEEGSDDE-EMDE--DGEEKTHKIFWYKTKKHYLNQNSSKVTAAAFHKGTNLLVVGFSSG 287 (893)
T ss_pred EEEEEecCCCcccccc--cccccccccc-cccc--cchhhhcceEEEEEEeeeecccccceeeeeccCCceEEEEEecCC
Confidence 4445443211000000 0000000000 0000 0111 011222223344555555333 333334555
Q ss_pred EE-EEecCCccceeeeeCCCCCcEEEEEeC--CCEEEEEEcC-CeEEEEEEcCcEEEEEEeccCCceeEEEEcCCCCCCC
Q 001655 417 VR-LVSSTSRELRNEWKSPPGYSVNVATAN--ASQVLLATGG-GHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENP 492 (1036)
Q Consensus 417 i~-l~~~~~~~~~~~w~~~~~~~I~~as~~--~~~vvv~~~~-~~~~~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~ 492 (1036)
+. ++.......++.... ...+|..++.| ++++.+..+. |.++..+-..... ..+..--...++|++.+|
T Consensus 288 ~f~LyelP~f~lih~LSi-s~~~I~t~~~N~tGDWiA~g~~klgQLlVweWqsEsY-VlKQQgH~~~i~~l~YSp----- 360 (893)
T KOG0291|consen 288 EFGLYELPDFNLIHSLSI-SDQKILTVSFNSTGDWIAFGCSKLGQLLVWEWQSESY-VLKQQGHSDRITSLAYSP----- 360 (893)
T ss_pred eeEEEecCCceEEEEeec-ccceeeEEEecccCCEEEEcCCccceEEEEEeeccce-eeeccccccceeeEEECC-----
Confidence 54 555544445555433 34689999988 9999998754 3444445533221 112122235689999865
Q ss_pred CcccEEEEEEecCCEEEEEECCC-CCee-EEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCcccccee
Q 001655 493 SYSQIAAVGMWTDISVRIFSLPD-LNLI-TKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKK 570 (1036)
Q Consensus 493 ~~~~~~~vg~w~~~~i~i~~l~~-l~~v-~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~ 570 (1036)
.+.+++-|.. ||.+.||+... +..+ +.++-+ ....+.+.. .+..|+...-||.+-.+.+..- ++- +
T Consensus 361 -Dgq~iaTG~e-DgKVKvWn~~SgfC~vTFteHts---~Vt~v~f~~--~g~~llssSLDGtVRAwDlkRY----rNf-R 428 (893)
T KOG0291|consen 361 -DGQLIATGAE-DGKVKVWNTQSGFCFVTFTEHTS---GVTAVQFTA--RGNVLLSSSLDGTVRAWDLKRY----RNF-R 428 (893)
T ss_pred -CCcEEEeccC-CCcEEEEeccCceEEEEeccCCC---ceEEEEEEe--cCCEEEEeecCCeEEeeeeccc----cee-e
Confidence 3678888888 99999999754 3433 233332 122222232 3466888888999988876531 111 1
Q ss_pred eecCccceEEEEEEeCCceEEEEEcCCc-eEEE-ec--CCcE-EEeeeccccceeeecccCCCCCCeEEEEe-CCeEEEE
Q 001655 571 VSLGTQPITLRTFSSKNTTHVFAASDRP-TVIY-SS--NKKL-LYSNVNLKEVSHMCPFNSAAFPDSLAIAK-EGELTIG 644 (1036)
Q Consensus 571 ~~lG~~pv~l~~~~~~~~~~v~~~~~~p-~~i~-~~--~~~~-~~~~~~~~~i~~~~~f~~~~~~~~l~~~~-~~~l~i~ 644 (1036)
.+-++.|+++..+..+...-+.+.|+.- +=|+ ++ .|++ .+..=+..++.+++ |+. ..+.|+..+ ++.+++=
T Consensus 429 Tft~P~p~QfscvavD~sGelV~AG~~d~F~IfvWS~qTGqllDiLsGHEgPVs~l~-f~~--~~~~LaS~SWDkTVRiW 505 (893)
T KOG0291|consen 429 TFTSPEPIQFSCVAVDPSGELVCAGAQDSFEIFVWSVQTGQLLDILSGHEGPVSGLS-FSP--DGSLLASGSWDKTVRIW 505 (893)
T ss_pred eecCCCceeeeEEEEcCCCCEEEeeccceEEEEEEEeecCeeeehhcCCCCcceeeE-Ecc--ccCeEEeccccceEEEE
Confidence 2356788888776544223355555443 3333 22 3443 23233345554422 221 112333334 6777776
Q ss_pred EEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCce
Q 001655 645 TIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFE 703 (1036)
Q Consensus 645 ~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~ 703 (1036)
.+-.... .+..+++......+.+.|+.+-++|++.+ +.|-++|.+...
T Consensus 506 ~if~s~~-~vEtl~i~sdvl~vsfrPdG~elaVaTld----------gqItf~d~~~~~ 553 (893)
T KOG0291|consen 506 DIFSSSG-TVETLEIRSDVLAVSFRPDGKELAVATLD----------GQITFFDIKEAV 553 (893)
T ss_pred EeeccCc-eeeeEeeccceeEEEEcCCCCeEEEEEec----------ceEEEEEhhhce
Confidence 6644322 45677777888889999999999999986 356666665443
|
|
| >PRK11028 6-phosphogluconolactonase; Provisional | Back alignment and domain information |
|---|
Probab=98.04 E-value=0.019 Score=64.42 Aligned_cols=185 Identities=13% Similarity=0.122 Sum_probs=102.7
Q ss_pred CCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEE-----EEECCCCCeEeEEEEEEEcCCCceEEEE
Q 001655 660 GEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIS-----TYPLDTFEYGCSILSCSFSDDSNVYYCV 734 (1036)
Q Consensus 660 ~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~-----~~~~~~~E~v~si~~~~l~~~~~~~i~V 734 (1036)
...|+.++++|+.+.+.|..... ..|.++|..+...+. ....+.++.+..+ .+..+ .+++.|
T Consensus 125 ~~~~~~~~~~p~g~~l~v~~~~~---------~~v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~---~~~pd-g~~lyv 191 (330)
T PRK11028 125 LEGCHSANIDPDNRTLWVPCLKE---------DRIRLFTLSDDGHLVAQEPAEVTTVEGAGPRHM---VFHPN-QQYAYC 191 (330)
T ss_pred CCcccEeEeCCCCCEEEEeeCCC---------CEEEEEEECCCCcccccCCCceecCCCCCCceE---EECCC-CCEEEE
Confidence 35688888999888777766432 468888765533221 2344555555443 34433 445555
Q ss_pred EeeeeCCCCCCCcceEEEEEEEeC--CeEEEEEEEEe-----cCcee--E--eccccCeEEEEEC--CEEEEEEeeecCC
Q 001655 735 GTAYVLPEENEPTKGRILVFIVED--GKLQLIAEKET-----KGAVY--S--LNAFNGKLLAAIN--QKIQLYKWMLRDD 801 (1036)
Q Consensus 735 GT~~~~~~e~~~~~Gri~v~~i~~--~kl~~~~~~~~-----~g~v~--~--i~~~~g~Lv~~~g--~~l~v~~~~~~~~ 801 (1036)
... ..+.|.+|++.. ++++.+.+... .++-. + +.+=+.+|+++.. +.|.+|+++.. .
T Consensus 192 ~~~---------~~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~-~ 261 (330)
T PRK11028 192 VNE---------LNSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVSED-G 261 (330)
T ss_pred Eec---------CCCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEeCC-C
Confidence 542 358999999983 46766554432 22222 2 2232346777643 68889888532 1
Q ss_pred CccchhcccccccceEEEEEEEeCCEEEEEec-cccEEEEEEeccCCeEEEEeccCCcceeEEEEEe
Q 001655 802 GTRELQSECGHHGHILALYVQTRGDFIVVGDL-MKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867 (1036)
Q Consensus 802 ~~~~L~~~~~~~~~~~~~~l~~~~~~I~vgD~-~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l 867 (1036)
+..++.........+..+.++..|.+++|+.. ...+.+++.+.+.+.+..+++=....+..++.+|
T Consensus 262 ~~~~~~~~~~~~~~p~~~~~~~dg~~l~va~~~~~~v~v~~~~~~~g~l~~~~~~~~g~~P~~~~~~ 328 (330)
T PRK11028 262 SVLSFEGHQPTETQPRGFNIDHSGKYLIAAGQKSHHISVYEIDGETGLLTELGRYAVGQGPMWVSVL 328 (330)
T ss_pred CeEEEeEEEeccccCCceEECCCCCEEEEEEccCCcEEEEEEcCCCCcEEEccccccCCCceEEEEE
Confidence 22233222111111123445556889999876 4568888877666777666543333444444443
|
|
| >KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=98.03 E-value=0.022 Score=65.86 Aligned_cols=359 Identities=16% Similarity=0.149 Sum_probs=193.2
Q ss_pred CcEEEEEeC--CCEEEEEEcCCeEEEEEEcCcEEEEEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECC
Q 001655 437 YSVNVATAN--ASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP 514 (1036)
Q Consensus 437 ~~I~~as~~--~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~ 514 (1036)
..|++-++. +..++.+..+..+-.++++.+++...=+..-+..+--+++.+ .+..++.|-- ++.+.+|++.
T Consensus 63 d~ita~~l~~d~~~L~~a~rs~llrv~~L~tgk~irswKa~He~Pvi~ma~~~------~g~LlAtgga-D~~v~VWdi~ 135 (775)
T KOG0319|consen 63 DEITALALTPDEEVLVTASRSQLLRVWSLPTGKLIRSWKAIHEAPVITMAFDP------TGTLLATGGA-DGRVKVWDIK 135 (775)
T ss_pred hhhheeeecCCccEEEEeeccceEEEEEcccchHhHhHhhccCCCeEEEEEcC------CCceEEeccc-cceEEEEEee
Confidence 467776655 334444444555555566655433221122244555667754 2456777766 9999999997
Q ss_pred CCCeeEE-ecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCccc-----cceeeecCccceEEEEEEeCCc
Q 001655 515 DLNLITK-EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELT-----DRKKVSLGTQPITLRTFSSKNT 588 (1036)
Q Consensus 515 ~l~~v~~-~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~-----~~~~~~lG~~pv~l~~~~~~~~ 588 (1036)
.-...+. ...+ .+..++++.+...+..|..|-.||.+..|.+......+. ......|+ +.. +.
T Consensus 136 ~~~~th~fkG~g--GvVssl~F~~~~~~~lL~sg~~D~~v~vwnl~~~~tcl~~~~~H~S~vtsL~--------~~~-d~ 204 (775)
T KOG0319|consen 136 NGYCTHSFKGHG--GVVSSLLFHPHWNRWLLASGATDGTVRVWNLNDKRTCLHTMILHKSAVTSLA--------FSE-DS 204 (775)
T ss_pred CCEEEEEecCCC--ceEEEEEeCCccchhheeecCCCceEEEEEcccCchHHHHHHhhhhheeeee--------ecc-CC
Confidence 6443322 2223 355666665544455578899999999999874321110 01111221 111 12
Q ss_pred eEEEEEcCCceEEEec---CCcEEEeeeccccceeeecccCCCCC-CeEEEEeCCeEEEEEEcCCC--CeEEEEEeCCCc
Q 001655 589 THVFAASDRPTVIYSS---NKKLLYSNVNLKEVSHMCPFNSAAFP-DSLAIAKEGELTIGTIDDIQ--KLHIRSIPLGEH 662 (1036)
Q Consensus 589 ~~v~~~~~~p~~i~~~---~~~~~~~~~~~~~i~~~~~f~~~~~~-~~l~~~~~~~l~i~~l~~~~--~~~~~~i~l~~t 662 (1036)
..++..|.--.++.++ ...+...|+. +.+.++.-.+.+.-. ..++++.++.=.+...+... -.-.++.+-++.
T Consensus 205 ~~~ls~~RDkvi~vwd~~~~~~l~~lp~y-e~~E~vv~l~~~~~~~~~~~~TaG~~g~~~~~d~es~~~~~~~~~~~~~e 283 (775)
T KOG0319|consen 205 LELLSVGRDKVIIVWDLVQYKKLKTLPLY-ESLESVVRLREELGGKGEYIITAGGSGVVQYWDSESGKCVYKQRQSDSEE 283 (775)
T ss_pred ceEEEeccCcEEEEeehhhhhhhheechh-hheeeEEEechhcCCcceEEEEecCCceEEEEecccchhhhhhccCCchh
Confidence 2344444333333332 1223444544 445555555442211 23555555555555555432 112233332444
Q ss_pred cceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCC
Q 001655 663 PRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE 742 (1036)
Q Consensus 663 p~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~ 742 (1036)
..+.++.+..+-++.++.+. .+.|+|..+.+....+ ..-+|++..|+.| .+..++++|.|.
T Consensus 284 ~~~~~~~~~~~~~l~vtaeQ----------nl~l~d~~~l~i~k~i-vG~ndEI~Dm~~l---G~e~~~laVATN----- 344 (775)
T KOG0319|consen 284 IDHLLAIESMSQLLLVTAEQ----------NLFLYDEDELTIVKQI-VGYNDEILDMKFL---GPEESHLAVATN----- 344 (775)
T ss_pred hhcceeccccCceEEEEccc----------eEEEEEccccEEehhh-cCCchhheeeeec---CCccceEEEEeC-----
Confidence 67788888888888888764 6888888877665443 3456777888766 344689999994
Q ss_pred CCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc-cCeEEEEEC--CEEEEEEeeecCCCccchhcccccccc---e
Q 001655 743 ENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF-NGKLLAAIN--QKIQLYKWMLRDDGTRELQSECGHHGH---I 816 (1036)
Q Consensus 743 e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~-~g~Lv~~~g--~~l~v~~~~~~~~~~~~L~~~~~~~~~---~ 816 (1036)
++++.+|+...-..+++. -+...|.++..+ .|-|+++.+ +++.+++++ ++..+....+...++ +
T Consensus 345 -----s~~lr~y~~~~~~c~ii~--GH~e~vlSL~~~~~g~llat~sKD~svilWr~~---~~~~~~~~~a~~~gH~~sv 414 (775)
T KOG0319|consen 345 -----SPELRLYTLPTSYCQIIP--GHTEAVLSLDVWSSGDLLATGSKDKSVILWRLN---NNCSKSLCVAQANGHTNSV 414 (775)
T ss_pred -----CCceEEEecCCCceEEEe--CchhheeeeeecccCcEEEEecCCceEEEEEec---CCcchhhhhhhhccccccc
Confidence 456777765544444333 456688888855 454665555 578888873 233333333323333 2
Q ss_pred EEEEEEEeC-CEEEEEeccccEEEEEEe
Q 001655 817 LALYVQTRG-DFIVVGDLMKSISLLIYK 843 (1036)
Q Consensus 817 ~~~~l~~~~-~~I~vgD~~~Sv~~~~~~ 843 (1036)
.++.+..-+ .+.+.+---..+.+..+.
T Consensus 415 gava~~~~~asffvsvS~D~tlK~W~l~ 442 (775)
T KOG0319|consen 415 GAVAGSKLGASFFVSVSQDCTLKLWDLP 442 (775)
T ss_pred ceeeecccCccEEEEecCCceEEEecCC
Confidence 344443333 244444444555555443
|
|
| >PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3 | Back alignment and domain information |
|---|
Probab=97.87 E-value=0.041 Score=62.02 Aligned_cols=274 Identities=18% Similarity=0.212 Sum_probs=152.8
Q ss_pred EEEEEecC---CEEEEEECCC--CCeeEEecCCCccCceeEEeeeecCeEEEEEEeC----CCcEEEEEEecCCCccccc
Q 001655 498 AAVGMWTD---ISVRIFSLPD--LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALG----DGHLLNFLLNMKTGELTDR 568 (1036)
Q Consensus 498 ~~vg~w~~---~~i~i~~l~~--l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~----~G~l~~y~~~~~~~~l~~~ 568 (1036)
++||+|.+ +.|..|++++ -++...+.......|.-+.+.+ ++.+|++... +|.+..|.++..++.|+..
T Consensus 2 ~~vgsy~~~~~~gI~~~~~d~~~g~l~~~~~~~~~~~Ps~l~~~~--~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~~ 79 (345)
T PF10282_consen 2 LYVGSYTNGKGGGIYVFRFDEETGTLTLVQTVAEGENPSWLAVSP--DGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTLL 79 (345)
T ss_dssp EEEEECCSSSSTEEEEEEEETTTTEEEEEEEEEESSSECCEEE-T--TSSEEEEEETTSSTTTEEEEEEEETTTTEEEEE
T ss_pred EEEEcCCCCCCCcEEEEEEcCCCCCceEeeeecCCCCCceEEEEe--CCCEEEEEEccccCCCCEEEEEECCCcceeEEe
Confidence 67899954 7899999932 2221111121123455555432 4567777766 6899999999877777765
Q ss_pred eeee-cCccceEEEEEEeCCceEEEEEc--CCceEEE-ec-CCcEEEee-ec-c---------ccc--eeeecccCCCCC
Q 001655 569 KKVS-LGTQPITLRTFSSKNTTHVFAAS--DRPTVIY-SS-NKKLLYSN-VN-L---------KEV--SHMCPFNSAAFP 630 (1036)
Q Consensus 569 ~~~~-lG~~pv~l~~~~~~~~~~v~~~~--~~p~~i~-~~-~~~~~~~~-~~-~---------~~i--~~~~~f~~~~~~ 630 (1036)
.+.. .|..|+.+.. ..+...++++. +....++ .+ ++++.-.. +. . +.- -+...| .|
T Consensus 80 ~~~~~~g~~p~~i~~--~~~g~~l~vany~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~----~p 153 (345)
T PF10282_consen 80 NSVPSGGSSPCHIAV--DPDGRFLYVANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVF----SP 153 (345)
T ss_dssp EEEEESSSCEEEEEE--CTTSSEEEEEETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-----T
T ss_pred eeeccCCCCcEEEEE--ecCCCEEEEEEccCCeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEE----CC
Confidence 5554 6888876643 23445666653 2222222 22 34442110 00 0 000 011111 23
Q ss_pred C-eEEEE-e--CCeEEEEEEcCCC-Ce---EEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeC---
Q 001655 631 D-SLAIA-K--EGELTIGTIDDIQ-KL---HIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD--- 699 (1036)
Q Consensus 631 ~-~l~~~-~--~~~l~i~~l~~~~-~~---~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~--- 699 (1036)
+ .++++ + .+.+.+..++... ++ ..-+++.|.-||+++++|+.+.+.|++... +.|..++.
T Consensus 154 dg~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s---------~~v~v~~~~~~ 224 (345)
T PF10282_consen 154 DGRFVYVPDLGADRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELS---------NTVSVFDYDPS 224 (345)
T ss_dssp TSSEEEEEETTTTEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTT---------TEEEEEEEETT
T ss_pred CCCEEEEEecCCCEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCC---------CcEEEEeeccc
Confidence 3 24444 3 6889999998765 23 334678889999999999988666665321 23333332
Q ss_pred -CCceEEEEEECC-CCCeE-eEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEe--CCeEEEEEEEEecCc-e
Q 001655 700 -QTFEFISTYPLD-TFEYG-CSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE--DGKLQLIAEKETKGA-V 773 (1036)
Q Consensus 700 -~t~~~i~~~~~~-~~E~v-~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~--~~kl~~~~~~~~~g~-v 773 (1036)
..++.+.....- .+-.. .....+.+..+ .+++.|... ....|.+|+++ .++|+.+......|. +
T Consensus 225 ~g~~~~~~~~~~~~~~~~~~~~~~~i~ispd-g~~lyvsnr---------~~~sI~vf~~d~~~g~l~~~~~~~~~G~~P 294 (345)
T PF10282_consen 225 DGSLTEIQTISTLPEGFTGENAPAEIAISPD-GRFLYVSNR---------GSNSISVFDLDPATGTLTLVQTVPTGGKFP 294 (345)
T ss_dssp TTEEEEEEEEESCETTSCSSSSEEEEEE-TT-SSEEEEEEC---------TTTEEEEEEECTTTTTEEEEEEEEESSSSE
T ss_pred CCceeEEEEeeeccccccccCCceeEEEecC-CCEEEEEec---------cCCEEEEEEEecCCCceEEEEEEeCCCCCc
Confidence 234445544422 11110 12233445443 456666542 25689999995 468999888888776 7
Q ss_pred eEecc--ccCeEEEEEC--CEEEEEEeee
Q 001655 774 YSLNA--FNGKLLAAIN--QKIQLYKWML 798 (1036)
Q Consensus 774 ~~i~~--~~g~Lv~~~g--~~l~v~~~~~ 798 (1036)
..+.. -+.+|+++.. +.|.+|+++.
T Consensus 295 r~~~~s~~g~~l~Va~~~s~~v~vf~~d~ 323 (345)
T PF10282_consen 295 RHFAFSPDGRYLYVANQDSNTVSVFDIDP 323 (345)
T ss_dssp EEEEE-TTSSEEEEEETTTTEEEEEEEET
T ss_pred cEEEEeCCCCEEEEEecCCCeEEEEEEeC
Confidence 77764 3446777654 5899999853
|
1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H. |
| >cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto | Back alignment and domain information |
|---|
Probab=97.83 E-value=0.043 Score=58.94 Aligned_cols=272 Identities=17% Similarity=0.181 Sum_probs=141.4
Q ss_pred CceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEE
Q 001655 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNF 556 (1036)
Q Consensus 477 ~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y 556 (1036)
...|+|+++.+ ...+++++.. +|.+.+|++.+.+........ .....++.+.+ ...+++++..+|.+..|
T Consensus 9 ~~~i~~~~~~~------~~~~l~~~~~-~g~i~i~~~~~~~~~~~~~~~-~~~i~~~~~~~--~~~~l~~~~~~~~i~i~ 78 (289)
T cd00200 9 TGGVTCVAFSP------DGKLLATGSG-DGTIKVWDLETGELLRTLKGH-TGPVRDVAASA--DGTYLASGSSDKTIRLW 78 (289)
T ss_pred CCCEEEEEEcC------CCCEEEEeec-CcEEEEEEeeCCCcEEEEecC-CcceeEEEECC--CCCEEEEEcCCCeEEEE
Confidence 35688998864 2568888887 999999999875543322111 11112333332 33578888899999999
Q ss_pred EEecCCCccccceeeecCccceEEEEEEeCCceEEEEEc-CCceEEEecC-CcEEE-eeeccccceeeecccCCCCCCeE
Q 001655 557 LLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS-DRPTVIYSSN-KKLLY-SNVNLKEVSHMCPFNSAAFPDSL 633 (1036)
Q Consensus 557 ~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~-~~p~~i~~~~-~~~~~-~~~~~~~i~~~~~f~~~~~~~~l 633 (1036)
.+.... ..........++.-..+. .....+++.+ +....++..+ ++... .......+.+++ |+.. ...+
T Consensus 79 ~~~~~~----~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~--~~~l 150 (289)
T cd00200 79 DLETGE----CVRTLTGHTSYVSSVAFS-PDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVA-FSPD--GTFV 150 (289)
T ss_pred EcCccc----ceEEEeccCCcEEEEEEc-CCCCEEEEecCCCeEEEEECCCcEEEEEeccCCCcEEEEE-EcCc--CCEE
Confidence 876421 111111111122211221 1222333333 3333333322 22211 111112233322 2211 2344
Q ss_pred EEEe-CCeEEEEEEcCCCCeEEEEEeCC-CccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECC
Q 001655 634 AIAK-EGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLD 711 (1036)
Q Consensus 634 ~~~~-~~~l~i~~l~~~~~~~~~~i~l~-~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~ 711 (1036)
+..+ ++.+.+..+..... .+.+... ..+..+.++++.+.+++++.. +.++++|..+++.+..+.-.
T Consensus 151 ~~~~~~~~i~i~d~~~~~~--~~~~~~~~~~i~~~~~~~~~~~l~~~~~~----------~~i~i~d~~~~~~~~~~~~~ 218 (289)
T cd00200 151 ASSSQDGTIKLWDLRTGKC--VATLTGHTGEVNSVAFSPDGEKLLSSSSD----------GTIKLWDLSTGKCLGTLRGH 218 (289)
T ss_pred EEEcCCCcEEEEEcccccc--ceeEecCccccceEEECCCcCEEEEecCC----------CcEEEEECCCCceecchhhc
Confidence 4445 78888887764332 2333333 367888999988777776642 47899998877766655322
Q ss_pred CCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEecccc-CeEEEEEC--
Q 001655 712 TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN-GKLLAAIN-- 788 (1036)
Q Consensus 712 ~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~-g~Lv~~~g-- 788 (1036)
...+.++ .+..+ ..+++.|. ..|++++|++..++...... ...+++++++-.. +.++++.+
T Consensus 219 -~~~i~~~---~~~~~-~~~~~~~~----------~~~~i~i~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~l~~~~~d 282 (289)
T cd00200 219 -ENGVNSV---AFSPD-GYLLASGS----------EDGTIRVWDLRTGECVQTLS-GHTNSVTSLAWSPDGKRLASGSAD 282 (289)
T ss_pred -CCceEEE---EEcCC-CcEEEEEc----------CCCcEEEEEcCCceeEEEcc-ccCCcEEEEEECCCCCEEEEecCC
Confidence 2234443 33332 45555443 35899999987554433222 4566788877553 23333333
Q ss_pred CEEEEE
Q 001655 789 QKIQLY 794 (1036)
Q Consensus 789 ~~l~v~ 794 (1036)
..+.+|
T Consensus 283 ~~i~iw 288 (289)
T cd00200 283 GTIRIW 288 (289)
T ss_pred CeEEec
Confidence 355554
|
|
| >KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton] | Back alignment and domain information |
|---|
Probab=97.82 E-value=0.094 Score=58.47 Aligned_cols=350 Identities=15% Similarity=0.197 Sum_probs=196.7
Q ss_pred ceeeeecC-CC-eEEEEe-cCcEEEEecCCccceeeeeCCCC---CcEEEEEeCCCEEEEEEcCCeEEEEEEcCcEEEEE
Q 001655 398 TLFCHDAI-YN-QLVQVT-SGSVRLVSSTSRELRNEWKSPPG---YSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471 (1036)
Q Consensus 398 Tl~~~~~~-~~-~~vQVt-~~~i~l~~~~~~~~~~~w~~~~~---~~I~~as~~~~~vvv~~~~~~~~~l~~~~~~l~~~ 471 (1036)
.|++-... ++ .++-+. .+.+++.+-.+...+.+|..+.. ..+-+.--++..+.|++ +|.+-||+.++.....+
T Consensus 237 sIfalsWsPDs~~~~T~SaDkt~KIWdVs~~slv~t~~~~~~v~dqqvG~lWqkd~lItVSl-~G~in~ln~~d~~~~~~ 315 (603)
T KOG0318|consen 237 SIFALSWSPDSTQFLTVSADKTIKIWDVSTNSLVSTWPMGSTVEDQQVGCLWQKDHLITVSL-SGTINYLNPSDPSVLKV 315 (603)
T ss_pred cEEEEEECCCCceEEEecCCceEEEEEeeccceEEEeecCCchhceEEEEEEeCCeEEEEEc-CcEEEEecccCCChhhe
Confidence 45555542 43 666665 46778888777778888865432 22222223344555555 77888887765432111
Q ss_pred EeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCC---ccCceeEEeeeecCeEEEEEEe
Q 001655 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG---EIIPRSVLLCAFEGISYLLCAL 548 (1036)
Q Consensus 472 ~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~---~~~p~si~~~~~~~~~~L~vgl 548 (1036)
..--...|+|++.++ ...++.-|.+ ||.|.=|++....- ..+.. ......+ ...+...+.-+|.
T Consensus 316 -i~GHnK~ITaLtv~~------d~~~i~Sgsy-DG~I~~W~~~~g~~---~~~~g~~h~nqI~~~--~~~~~~~~~t~g~ 382 (603)
T KOG0318|consen 316 -ISGHNKSITALTVSP------DGKTIYSGSY-DGHINSWDSGSGTS---DRLAGKGHTNQIKGM--AASESGELFTIGW 382 (603)
T ss_pred -ecccccceeEEEEcC------CCCEEEeecc-CceEEEEecCCccc---cccccccccceEEEE--eecCCCcEEEEec
Confidence 122346789998854 2467888888 99888888764221 11110 0111111 1112234444555
Q ss_pred CCCcEEEEEEecCCCccccceeeecCccceEEEEEEeCCceEEEEEcCCceEEEecCCcEEEeeeccc-cceeeecccCC
Q 001655 549 GDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLK-EVSHMCPFNSA 627 (1036)
Q Consensus 549 ~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~~~~~~~~~~~~-~i~~~~~f~~~ 627 (1036)
.| .|-+..+.. +.........+|.+|.-+-.. .++..+++++..-.+++.+.++..-.|+... ...+++|-.+
T Consensus 383 Dd-~l~~~~~~~--~~~t~~~~~~lg~QP~~lav~--~d~~~avv~~~~~iv~l~~~~~~~~~~~~y~~s~vAv~~~~~- 456 (603)
T KOG0318|consen 383 DD-TLRVISLKD--NGYTKSEVVKLGSQPKGLAVL--SDGGTAVVACISDIVLLQDQTKVSSIPIGYESSAVAVSPDGS- 456 (603)
T ss_pred CC-eEEEEeccc--CcccccceeecCCCceeEEEc--CCCCEEEEEecCcEEEEecCCcceeeccccccceEEEcCCCC-
Confidence 33 344433321 123333456899999866542 3444666777777777776666544444332 2223332211
Q ss_pred CCCCeEEEEeCCeEEEEEEcCCCC-eEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEE
Q 001655 628 AFPDSLAIAKEGELTIGTIDDIQK-LHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIS 706 (1036)
Q Consensus 628 ~~~~~l~~~~~~~l~i~~l~~~~~-~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~ 706 (1036)
+-.+--.++.+++.+|..... -..+.++....+.-|+|+|+...+|++-.. ..+-++|..+.+...
T Consensus 457 ---~vaVGG~Dgkvhvysl~g~~l~ee~~~~~h~a~iT~vaySpd~~yla~~Da~----------rkvv~yd~~s~~~~~ 523 (603)
T KOG0318|consen 457 ---EVAVGGQDGKVHVYSLSGDELKEEAKLLEHRAAITDVAYSPDGAYLAAGDAS----------RKVVLYDVASREVKT 523 (603)
T ss_pred ---EEEEecccceEEEEEecCCcccceeeeecccCCceEEEECCCCcEEEEeccC----------CcEEEEEcccCceec
Confidence 122334578899999987653 344777888899999999998766655332 246666766655421
Q ss_pred E-EECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCC-eEEEEEEEEecCceeEeccccCeEE
Q 001655 707 T-YPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLL 784 (1036)
Q Consensus 707 ~-~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~-kl~~~~~~~~~g~v~~i~~~~g~Lv 784 (1036)
. .-|- .-++.|+ .|.- +..+++-|. .. -.+++|++++- +- +..+..+.+.|+++.=++..=+
T Consensus 524 ~~w~FH-takI~~~-aWsP---~n~~vATGS--lD--------t~Viiysv~kP~~~-i~iknAH~~gVn~v~wlde~tv 587 (603)
T KOG0318|consen 524 NRWAFH-TAKINCV-AWSP---NNKLVATGS--LD--------TNVIIYSVKKPAKH-IIIKNAHLGGVNSVAWLDESTV 587 (603)
T ss_pred ceeeee-eeeEEEE-EeCC---CceEEEecc--cc--------ceEEEEEccChhhh-eEeccccccCceeEEEecCceE
Confidence 1 1111 1234554 3432 345555443 21 26899999853 22 4444456667999998888888
Q ss_pred EEECCEEEEEEe
Q 001655 785 AAINQKIQLYKW 796 (1036)
Q Consensus 785 ~~~g~~l~v~~~ 796 (1036)
++.|+.--+..|
T Consensus 588 vSsG~Da~iK~W 599 (603)
T KOG0318|consen 588 VSSGQDANIKVW 599 (603)
T ss_pred EeccCcceeEEe
Confidence 888876555455
|
|
| >KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones] | Back alignment and domain information |
|---|
Probab=97.82 E-value=0.063 Score=56.23 Aligned_cols=144 Identities=15% Similarity=0.229 Sum_probs=97.3
Q ss_pred CCeEEEEe--C-CeEEEEEEcCCCCeEEEEEeCCC----ccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCc
Q 001655 630 PDSLAIAK--E-GELTIGTIDDIQKLHIRSIPLGE----HPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702 (1036)
Q Consensus 630 ~~~l~~~~--~-~~l~i~~l~~~~~~~~~~i~l~~----tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~ 702 (1036)
|++++++. + +.+++..+-...+=|-+++.++. .-..|-|+++.+.+++.+.. +.+.++|.-++
T Consensus 150 p~GLifA~~~~~~~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~l~FS~dGK~iLlsT~~----------s~~~~lDAf~G 219 (311)
T KOG1446|consen 150 PEGLIFALANGSELIKLYDLRSFDKGPFTTFSITDNDEAEWTDLEFSPDGKSILLSTNA----------SFIYLLDAFDG 219 (311)
T ss_pred CCCcEEEEecCCCeEEEEEecccCCCCceeEccCCCCccceeeeEEcCCCCEEEEEeCC----------CcEEEEEccCC
Confidence 56766653 3 47888888777665666666663 34568899999977776653 57899999999
Q ss_pred eEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccccC-
Q 001655 703 EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNG- 781 (1036)
Q Consensus 703 ~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~g- 781 (1036)
..+.+++..+++.-..+. ..|.. ..+|++.|. ..|+|.+|.+..++........-.|++.++. |+-
T Consensus 220 ~~~~tfs~~~~~~~~~~~-a~ftP-ds~Fvl~gs----------~dg~i~vw~~~tg~~v~~~~~~~~~~~~~~~-fnP~ 286 (311)
T KOG1446|consen 220 TVKSTFSGYPNAGNLPLS-ATFTP-DSKFVLSGS----------DDGTIHVWNLETGKKVAVLRGPNGGPVSCVR-FNPR 286 (311)
T ss_pred cEeeeEeeccCCCCccee-EEECC-CCcEEEEec----------CCCcEEEEEcCCCcEeeEecCCCCCCccccc-cCCc
Confidence 999999988887744432 23422 256666664 4599999999766554444434567777777 765
Q ss_pred e-EEEEECCEEEEEEe
Q 001655 782 K-LLAAINQKIQLYKW 796 (1036)
Q Consensus 782 ~-Lv~~~g~~l~v~~~ 796 (1036)
+ ..++.+.++.+|-.
T Consensus 287 ~~mf~sa~s~l~fw~p 302 (311)
T KOG1446|consen 287 YAMFVSASSNLVFWLP 302 (311)
T ss_pred eeeeeecCceEEEEec
Confidence 3 45666777776543
|
|
| >PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit | Back alignment and domain information |
|---|
Probab=97.81 E-value=0.012 Score=65.59 Aligned_cols=193 Identities=14% Similarity=0.112 Sum_probs=120.2
Q ss_pred CCEEEEEEeecCCCCCceeeEEEEEEeCCC-------ceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCC
Q 001655 672 SRTFAICSLKNQSCAEESEMHFVRLLDDQT-------FEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEEN 744 (1036)
Q Consensus 672 ~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t-------~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~ 744 (1036)
...++|++.....++.....+.+.+++-.. ++.+...++ ...+++++.+ +.++++|.+
T Consensus 42 ~~~ivVGT~~~~~~~~~~~~Gri~v~~i~~~~~~~~~l~~i~~~~~--~g~V~ai~~~------~~~lv~~~g------- 106 (321)
T PF03178_consen 42 KEYIVVGTAFNYGEDPEPSSGRILVFEISESPENNFKLKLIHSTEV--KGPVTAICSF------NGRLVVAVG------- 106 (321)
T ss_dssp SEEEEEEEEE--TTSSS-S-EEEEEEEECSS-----EEEEEEEEEE--SS-EEEEEEE------TTEEEEEET-------
T ss_pred cCEEEEEecccccccccccCcEEEEEEEEcccccceEEEEEEEEee--cCcceEhhhh------CCEEEEeec-------
Confidence 455666666543222222338888887654 344455555 3557888776 345887763
Q ss_pred CCcceEEEEEEEeCCe-EEEEEEEEecCceeEeccccCeEEEEEC-CEEEEEEeeecCCCccchhcccccccceEEEEEE
Q 001655 745 EPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAIN-QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQ 822 (1036)
Q Consensus 745 ~~~~Gri~v~~i~~~k-l~~~~~~~~~g~v~~i~~~~g~Lv~~~g-~~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~ 822 (1036)
++|++|++.+.+ +...+....+..++++..++++|+++-- +.+.++.|+.. ..+|...+.-..+..++++.
T Consensus 107 ----~~l~v~~l~~~~~l~~~~~~~~~~~i~sl~~~~~~I~vgD~~~sv~~~~~~~~---~~~l~~va~d~~~~~v~~~~ 179 (321)
T PF03178_consen 107 ----NKLYVYDLDNSKTLLKKAFYDSPFYITSLSVFKNYILVGDAMKSVSLLRYDEE---NNKLILVARDYQPRWVTAAE 179 (321)
T ss_dssp ----TEEEEEEEETTSSEEEEEEE-BSSSEEEEEEETTEEEEEESSSSEEEEEEETT---TE-EEEEEEESS-BEEEEEE
T ss_pred ----CEEEEEEccCcccchhhheecceEEEEEEeccccEEEEEEcccCEEEEEEEcc---CCEEEEEEecCCCccEEEEE
Confidence 589999999887 9999999888999999999999988865 78889988642 23344333211233445544
Q ss_pred E--eCCEEEEEeccccEEEEEEecc------CC-eEEEEeccCCcceeEEE---EEee----Cc-----eEEEEccCCcE
Q 001655 823 T--RGDFIVVGDLMKSISLLIYKHE------EG-AIEERARDYNANWMSAV---EILD----DD-----IYLGAENNFNL 881 (1036)
Q Consensus 823 ~--~~~~I~vgD~~~Sv~~~~~~~~------~~-~l~~~a~D~~~~~~~~~---~~l~----~~-----~~l~~D~~gnl 881 (1036)
. .++.++++|....+.+++++++ .. +|...+.=.....+++. .+.. .+ .++.+-.+|.|
T Consensus 180 ~l~d~~~~i~~D~~gnl~~l~~~~~~~~~~~~~~~L~~~~~f~lg~~v~~~~~~~l~~~~~~~~~~~~~~i~~~T~~G~I 259 (321)
T PF03178_consen 180 FLVDEDTIIVGDKDGNLFVLRYNPEIPNSRDGDPKLERISSFHLGDIVNSFRRGSLIPRSGSSESPNRPQILYGTVDGSI 259 (321)
T ss_dssp EE-SSSEEEEEETTSEEEEEEE-SS-SSTTTTTTBEEEEEEEE-SS-EEEEEE--SS--SSSS-TTEEEEEEEEETTS-E
T ss_pred EecCCcEEEEEcCCCeEEEEEECCCCcccccccccceeEEEEECCCccceEEEEEeeecCCCCcccccceEEEEecCCEE
Confidence 4 3469999999999999999863 23 67777755444455554 2222 11 26777789999
Q ss_pred EEEee
Q 001655 882 FTVRK 886 (1036)
Q Consensus 882 ~~l~~ 886 (1036)
..+..
T Consensus 260 g~l~p 264 (321)
T PF03178_consen 260 GVLIP 264 (321)
T ss_dssp EEEEE
T ss_pred EEEEe
Confidence 96653
|
CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A .... |
| >KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown] | Back alignment and domain information |
|---|
Probab=97.80 E-value=0.099 Score=57.95 Aligned_cols=386 Identities=14% Similarity=0.207 Sum_probs=202.7
Q ss_pred cEEEEEeC--CCEEEEEEcCCeEEEEEEcCcEEEEE----EeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEE
Q 001655 438 SVNVATAN--ASQVLLATGGGHLVYLEIGDGILTEV----KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511 (1036)
Q Consensus 438 ~I~~as~~--~~~vvv~~~~~~~~~l~~~~~~l~~~----~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~ 511 (1036)
.+..|+.. +..++++...+.+.+.....+.|.+. ++.+- +-|.|+.+.+ ..-++-| ..+|.|.||
T Consensus 202 ~v~~a~FHPtd~nliit~Gk~H~~Fw~~~~~~l~k~~~~fek~ek-k~Vl~v~F~e-------ngdviTg-DS~G~i~Iw 272 (626)
T KOG2106|consen 202 VVFLATFHPTDPNLIITCGKGHLYFWTLRGGSLVKRQGIFEKREK-KFVLCVTFLE-------NGDVITG-DSGGNILIW 272 (626)
T ss_pred eEEEEEeccCCCcEEEEeCCceEEEEEccCCceEEEeeccccccc-eEEEEEEEcC-------CCCEEee-cCCceEEEE
Confidence 45556655 55778888778888876655545442 22222 5688999864 2233444 348999999
Q ss_pred ECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecC----CCccccceeeec--CccceEEEEEEe
Q 001655 512 SLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMK----TGELTDRKKVSL--GTQPITLRTFSS 585 (1036)
Q Consensus 512 ~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~----~~~l~~~~~~~l--G~~pv~l~~~~~ 585 (1036)
+-....+..... .. ++..+-++.++||.|+.=.-|.. +..++..+...+ ..-|++-..
T Consensus 273 ~~~~~~~~k~~~-aH------------~ggv~~L~~lr~GtllSGgKDRki~~Wd~~y~k~r~~elPe~~G~iRtv~--- 336 (626)
T KOG2106|consen 273 SKGTNRISKQVH-AH------------DGGVFSLCMLRDGTLLSGGKDRKIILWDDNYRKLRETELPEQFGPIRTVA--- 336 (626)
T ss_pred eCCCceEEeEee-ec------------CCceEEEEEecCccEeecCccceEEeccccccccccccCchhcCCeeEEe---
Confidence 975544322111 11 23456677777777765211100 000111111111 111333221
Q ss_pred CCceEEEEEcCCceEEEec-CCcEEEe-eeccccceeeecccCCCCCCeEEEE-eCCeEEEEEEcCCCCeEEEEEeCCCc
Q 001655 586 KNTTHVFAASDRPTVIYSS-NKKLLYS-NVNLKEVSHMCPFNSAAFPDSLAIA-KEGELTIGTIDDIQKLHIRSIPLGEH 662 (1036)
Q Consensus 586 ~~~~~v~~~~~~p~~i~~~-~~~~~~~-~~~~~~i~~~~~f~~~~~~~~l~~~-~~~~l~i~~l~~~~~~~~~~i~l~~t 662 (1036)
.+...+++-..|-.++..+ ++++... ....+....++.- ...+.++-+ .++.+++=+ ..+ ..-+..+++.
T Consensus 337 e~~~di~vGTtrN~iL~Gt~~~~f~~~v~gh~delwgla~h---ps~~q~~T~gqdk~v~lW~---~~k-~~wt~~~~d~ 409 (626)
T KOG2106|consen 337 EGKGDILVGTTRNFILQGTLENGFTLTVQGHGDELWGLATH---PSKNQLLTCGQDKHVRLWN---DHK-LEWTKIIEDP 409 (626)
T ss_pred cCCCcEEEeeccceEEEeeecCCceEEEEecccceeeEEcC---CChhheeeccCcceEEEcc---CCc-eeEEEEecCc
Confidence 2334466666777777654 3444322 2222333332211 111222222 244444444 223 2334456677
Q ss_pred cceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCC
Q 001655 663 PRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE 742 (1036)
Q Consensus 663 p~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~ 742 (1036)
.+...+||.. .+++.+.. +..-++|-.+-..+.- .-+ +|.+.++. +. ....+++||+
T Consensus 410 ~~~~~fhpsg-~va~Gt~~----------G~w~V~d~e~~~lv~~-~~d-~~~ls~v~---ys-p~G~~lAvgs------ 466 (626)
T KOG2106|consen 410 AECADFHPSG-VVAVGTAT----------GRWFVLDTETQDLVTI-HTD-NEQLSVVR---YS-PDGAFLAVGS------ 466 (626)
T ss_pred eeEeeccCcc-eEEEeecc----------ceEEEEecccceeEEE-Eec-CCceEEEE---Ec-CCCCEEEEec------
Confidence 8889999887 77777753 4566777776444433 233 78776654 32 3468999998
Q ss_pred CCCCcceEEEEEEEeCC--eEEEEEEEEecCceeEeccc-cCeEEEEECCEEEEEEeeecCCCccchhc--ccccccce-
Q 001655 743 ENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAF-NGKLLAAINQKIQLYKWMLRDDGTRELQS--ECGHHGHI- 816 (1036)
Q Consensus 743 e~~~~~Gri~v~~i~~~--kl~~~~~~~~~g~v~~i~~~-~g~Lv~~~g~~l~v~~~~~~~~~~~~L~~--~~~~~~~~- 816 (1036)
..+.||+|.++.+ |++.+.+..- .|++.+.-- ++..+.+....-.+.-|... .-++... ...|.+..
T Consensus 467 ----~d~~iyiy~Vs~~g~~y~r~~k~~g-s~ithLDwS~Ds~~~~~~S~d~eiLyW~~~--~~~~~ts~kDvkW~t~~c 539 (626)
T KOG2106|consen 467 ----HDNHIYIYRVSANGRKYSRVGKCSG-SPITHLDWSSDSQFLVSNSGDYEILYWKPS--ECKQITSVKDVKWATYTC 539 (626)
T ss_pred ----CCCeEEEEEECCCCcEEEEeeeecC-ceeEEeeecCCCceEEeccCceEEEEEccc--cCcccceecceeeeeeEE
Confidence 3468999999944 5555555544 777777633 55566655544444445321 1111111 00111110
Q ss_pred ---------------EEEEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEEEEeeCce-EEEEccCCc
Q 001655 817 ---------------LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI-YLGAENNFN 880 (1036)
Q Consensus 817 ---------------~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~~~-~l~~D~~gn 880 (1036)
.+++-.-.++.+..||..--+-+++|-....+-.-.-.--..+.++++.|+-++. ++.+-++-.
T Consensus 540 ~lGF~v~g~s~~t~i~a~~rs~~~~~lA~gdd~g~v~lf~yPc~s~rA~~he~~ghs~~vt~V~Fl~~d~~li~tg~D~S 619 (626)
T KOG2106|consen 540 TLGFEVFGGSDGTDINAVARSHCEKLLASGDDFGKVHLFSYPCSSPRAPSHEYGGHSSHVTNVAFLCKDSHLISTGKDTS 619 (626)
T ss_pred EEEEEEecccCCchHHHhhhhhhhhhhhccccCceEEEEccccCCCcccceeeccccceeEEEEEeeCCceEEecCCCce
Confidence 1112223567888999999999998865433222222233567789999997765 444445655
Q ss_pred EEEEe
Q 001655 881 LFTVR 885 (1036)
Q Consensus 881 l~~l~ 885 (1036)
++..+
T Consensus 620 i~qW~ 624 (626)
T KOG2106|consen 620 IMQWR 624 (626)
T ss_pred EEEEE
Confidence 55443
|
|
| >KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=97.78 E-value=0.15 Score=59.34 Aligned_cols=225 Identities=16% Similarity=0.248 Sum_probs=131.2
Q ss_pred CCCeEEEEeCCeEEEEEEcCCCCe-----EEEEEeCCC---ccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCC
Q 001655 629 FPDSLAIAKEGELTIGTIDDIQKL-----HIRSIPLGE---HPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQ 700 (1036)
Q Consensus 629 ~~~~l~~~~~~~l~i~~l~~~~~~-----~~~~i~l~~---tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~ 700 (1036)
.++.+++.++|.+....++...+. .++.+.+++ ..|.++++.+.. ++++. ...++++.+.+
T Consensus 334 ~~~~lv~l~nNtv~~ysl~~s~~~~p~~~~~~~i~~~GHR~dVRsl~vS~d~~--~~~Sg---------a~~SikiWn~~ 402 (888)
T KOG0306|consen 334 TENTLVLLANNTVEWYSLENSGKTSPEADRTSNIEIGGHRSDVRSLCVSSDSI--LLASG---------AGESIKIWNRD 402 (888)
T ss_pred cceeEEEeecCceEEEEeccCCCCCccccccceeeeccchhheeEEEeecCce--eeeec---------CCCcEEEEEcc
Confidence 345677788999999998873322 233444432 456666665433 22222 12479999999
Q ss_pred CceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeE-EEEEEEEecCceeEeccc
Q 001655 701 TFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKL-QLIAEKETKGAVYSLNAF 779 (1036)
Q Consensus 701 t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl-~~~~~~~~~g~v~~i~~~ 779 (1036)
+.+.+-+++-. .+.|.+.+ .+..|+++|+ ..|.+.+|++....+ +.+- .+.|++-+|...
T Consensus 403 t~kciRTi~~~---y~l~~~Fv----pgd~~Iv~G~----------k~Gel~vfdlaS~~l~Eti~--AHdgaIWsi~~~ 463 (888)
T KOG0306|consen 403 TLKCIRTITCG---YILASKFV----PGDRYIVLGT----------KNGELQVFDLASASLVETIR--AHDGAIWSISLS 463 (888)
T ss_pred CcceeEEeccc---cEEEEEec----CCCceEEEec----------cCCceEEEEeehhhhhhhhh--ccccceeeeeec
Confidence 99888876654 55555443 2478999998 679999999874432 2222 567888777766
Q ss_pred ---cCeEEEEECCEEEEEEeeecCC--Cc-cchhc---ccccccc--eEEEEEEEeCCEEEEEeccccEEEEEEeccCCe
Q 001655 780 ---NGKLLAAINQKIQLYKWMLRDD--GT-RELQS---ECGHHGH--ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848 (1036)
Q Consensus 780 ---~g~Lv~~~g~~l~v~~~~~~~~--~~-~~L~~---~~~~~~~--~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~ 848 (1036)
.|.+.++..++|.+|++..-.. |. ++... ....... ..+++++..+.++.||=+..-+.+|..+.-.-.
T Consensus 464 pD~~g~vT~saDktVkfWdf~l~~~~~gt~~k~lsl~~~rtLel~ddvL~v~~Spdgk~LaVsLLdnTVkVyflDtlKFf 543 (888)
T KOG0306|consen 464 PDNKGFVTGSADKTVKFWDFKLVVSVPGTQKKVLSLKHTRTLELEDDVLCVSVSPDGKLLAVSLLDNTVKVYFLDTLKFF 543 (888)
T ss_pred CCCCceEEecCCcEEEEEeEEEEeccCcccceeeeeccceEEeccccEEEEEEcCCCcEEEEEeccCeEEEEEecceeee
Confidence 3456667778898887753211 22 12111 1112222 334445555778888887777877766543333
Q ss_pred EEEEeccCCcceeEEEEEeeCc-eEEEEccCCcEEEEee
Q 001655 849 IEERARDYNANWMSAVEILDDD-IYLGAENNFNLFTVRK 886 (1036)
Q Consensus 849 l~~~a~D~~~~~~~~~~~l~~~-~~l~~D~~gnl~~l~~ 886 (1036)
+.+-+ +...|.++..--+. .++.+-.+.|+-+.-.
T Consensus 544 lsLYG---HkLPV~smDIS~DSklivTgSADKnVKiWGL 579 (888)
T KOG0306|consen 544 LSLYG---HKLPVLSMDISPDSKLIVTGSADKNVKIWGL 579 (888)
T ss_pred eeecc---cccceeEEeccCCcCeEEeccCCCceEEecc
Confidence 33333 34556666543232 3555445555555443
|
|
| >cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto | Back alignment and domain information |
|---|
Probab=97.60 E-value=0.09 Score=56.40 Aligned_cols=222 Identities=18% Similarity=0.201 Sum_probs=121.9
Q ss_pred eEEEEe-CCeEEEEEEcCCCCeEEEEEeCCC-ccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEE
Q 001655 632 SLAIAK-EGELTIGTIDDIQKLHIRSIPLGE-HPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYP 709 (1036)
Q Consensus 632 ~l~~~~-~~~l~i~~l~~~~~~~~~~i~l~~-tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~ 709 (1036)
.++... ++.+.+..++.... .+...... ....+.+.+..+.+++++.. +.++++|..+.+.+..+.
T Consensus 23 ~l~~~~~~g~i~i~~~~~~~~--~~~~~~~~~~i~~~~~~~~~~~l~~~~~~----------~~i~i~~~~~~~~~~~~~ 90 (289)
T cd00200 23 LLATGSGDGTIKVWDLETGEL--LRTLKGHTGPVRDVAASADGTYLASGSSD----------KTIRLWDLETGECVRTLT 90 (289)
T ss_pred EEEEeecCcEEEEEEeeCCCc--EEEEecCCcceeEEEECCCCCEEEEEcCC----------CeEEEEEcCcccceEEEe
Confidence 344444 67788887765432 22333322 33478888887766666543 478888887765555544
Q ss_pred CCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEecccc-CeE-EEEE
Q 001655 710 LDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN-GKL-LAAI 787 (1036)
Q Consensus 710 ~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~-g~L-v~~~ 787 (1036)
.. ...+.++. +..+ ..+++.|. ..|.+.+|++..++...... ...+++++++... +.+ +++.
T Consensus 91 ~~-~~~i~~~~---~~~~-~~~~~~~~----------~~~~i~~~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~l~~~~ 154 (289)
T cd00200 91 GH-TSYVSSVA---FSPD-GRILSSSS----------RDKTIKVWDVETGKCLTTLR-GHTDWVNSVAFSPDGTFVASSS 154 (289)
T ss_pred cc-CCcEEEEE---EcCC-CCEEEEec----------CCCeEEEEECCCcEEEEEec-cCCCcEEEEEEcCcCCEEEEEc
Confidence 22 22455543 2222 45665554 35789999987554433222 4567888887664 444 4444
Q ss_pred -CCEEEEEEeeecCCCccchhcccccccceEEEEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEEEE
Q 001655 788 -NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866 (1036)
Q Consensus 788 -g~~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~ 866 (1036)
+..+++|++.. + +.+............+.....++.++++.....+.++ +....+....-+ .....+.++.+
T Consensus 155 ~~~~i~i~d~~~---~-~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~--d~~~~~~~~~~~-~~~~~i~~~~~ 227 (289)
T cd00200 155 QDGTIKLWDLRT---G-KCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLW--DLSTGKCLGTLR-GHENGVNSVAF 227 (289)
T ss_pred CCCcEEEEEccc---c-ccceeEecCccccceEEECCCcCEEEEecCCCcEEEE--ECCCCceecchh-hcCCceEEEEE
Confidence 67888887631 1 1111111111122223333344588888886667665 433333222111 22335667776
Q ss_pred eeCce-EEEEccCCcEEEEeeCC
Q 001655 867 LDDDI-YLGAENNFNLFTVRKNS 888 (1036)
Q Consensus 867 l~~~~-~l~~D~~gnl~~l~~~~ 888 (1036)
..++. +++++.+|.+.++....
T Consensus 228 ~~~~~~~~~~~~~~~i~i~~~~~ 250 (289)
T cd00200 228 SPDGYLLASGSEDGTIRVWDLRT 250 (289)
T ss_pred cCCCcEEEEEcCCCcEEEEEcCC
Confidence 65554 45566699999998653
|
|
| >PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3 | Back alignment and domain information |
|---|
Probab=97.59 E-value=0.22 Score=56.16 Aligned_cols=296 Identities=19% Similarity=0.211 Sum_probs=165.5
Q ss_pred EEEEEeCC----CcEEEEEEecCCCccccceeeecCccceEEEEEEeCCceEEEEEcCC----ceEEE---ecC-CcEEE
Q 001655 543 YLLCALGD----GHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDR----PTVIY---SSN-KKLLY 610 (1036)
Q Consensus 543 ~L~vgl~~----G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~----p~~i~---~~~-~~~~~ 610 (1036)
.++||+.+ |-+..|++|..++.|........+..|--|.. ..+...++++.+. ..+.. ..+ +.+..
T Consensus 1 ~~~vgsy~~~~~~gI~~~~~d~~~g~l~~~~~~~~~~~Ps~l~~--~~~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~ 78 (345)
T PF10282_consen 1 TLYVGSYTNGKGGGIYVFRFDEETGTLTLVQTVAEGENPSWLAV--SPDGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTL 78 (345)
T ss_dssp EEEEEECCSSSSTEEEEEEEETTTTEEEEEEEEEESSSECCEEE---TTSSEEEEEETTSSTTTEEEEEEEETTTTEEEE
T ss_pred CEEEEcCCCCCCCcEEEEEEcCCCCCceEeeeecCCCCCceEEE--EeCCCEEEEEEccccCCCCEEEEEECCCcceeEE
Confidence 47888877 78999999988888876655556666644432 2345677777653 23321 112 45532
Q ss_pred eeeccccceeeecccCCCCCCeEEEEe--CCeEEEEEEcCCCCeEEE--EE-----------eCCCccceEEEecCCCEE
Q 001655 611 SNVNLKEVSHMCPFNSAAFPDSLAIAK--EGELTIGTIDDIQKLHIR--SI-----------PLGEHPRRICHQEQSRTF 675 (1036)
Q Consensus 611 ~~~~~~~i~~~~~f~~~~~~~~l~~~~--~~~l~i~~l~~~~~~~~~--~i-----------~l~~tp~~i~y~~~~~~~ 675 (1036)
..-....-..-|.+........++.+. ++.+.+..++...++... .+ .-+..|+.+.++|+.+.+
T Consensus 79 ~~~~~~~g~~p~~i~~~~~g~~l~vany~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v 158 (345)
T PF10282_consen 79 LNSVPSGGSSPCHIAVDPDGRFLYVANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFV 158 (345)
T ss_dssp EEEEEESSSCEEEEEECTTSSEEEEEETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEE
T ss_pred eeeeccCCCCcEEEEEecCCCEEEEEEccCCeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCCCEE
Confidence 211100000111111111222344443 788999999876433221 22 123568899999998877
Q ss_pred EEEEeecCCCCCceeeEEEEEEe--CCC--ceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEE
Q 001655 676 AICSLKNQSCAEESEMHFVRLLD--DQT--FEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRI 751 (1036)
Q Consensus 676 ~v~~~~~~~~~~~~~~~~l~l~d--~~t--~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri 751 (1036)
.|..... -.|.+++ ..+ .+....+.++++.-+-.+. |..+++.+++++- ..+.|
T Consensus 159 ~v~dlG~---------D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~---f~pdg~~~Yv~~e----------~s~~v 216 (345)
T PF10282_consen 159 YVPDLGA---------DRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLA---FSPDGKYAYVVNE----------LSNTV 216 (345)
T ss_dssp EEEETTT---------TEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEE---E-TTSSEEEEEET----------TTTEE
T ss_pred EEEecCC---------CEEEEEEEeCCCceEEEeeccccccCCCCcEEE---EcCCcCEEEEecC----------CCCcE
Confidence 7665422 1455554 333 3345667788887776553 4444455555442 56799
Q ss_pred EEEEEe--CCeEEEEEEEEec-----C--ceeEeccc-cC-eEEEEEC--CEEEEEEeeecCCCccchhccccccc-ceE
Q 001655 752 LVFIVE--DGKLQLIAEKETK-----G--AVYSLNAF-NG-KLLAAIN--QKIQLYKWMLRDDGTRELQSECGHHG-HIL 817 (1036)
Q Consensus 752 ~v~~i~--~~kl~~~~~~~~~-----g--~v~~i~~~-~g-~Lv~~~g--~~l~v~~~~~~~~~~~~L~~~~~~~~-~~~ 817 (1036)
.+|++. +++++.+.....- + ....|+-- +| +|+++.. +.|.+|+++. +.|..++... ..+ -..
T Consensus 217 ~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~-~~g~l~~~~~--~~~~G~~ 293 (345)
T PF10282_consen 217 SVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGSNSISVFDLDP-ATGTLTLVQT--VPTGGKF 293 (345)
T ss_dssp EEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTTTEEEEEEECT-TTTTEEEEEE--EEESSSS
T ss_pred EEEeecccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEeccCCEEEEEEEec-CCCceEEEEE--EeCCCCC
Confidence 999999 6677776655432 2 23444433 44 5666553 5899999843 2233333222 221 112
Q ss_pred EEEE--EEeCCEEEEEecc-ccEEEEEEeccCCeEEEEeccCCcceeEEEE
Q 001655 818 ALYV--QTRGDFIVVGDLM-KSISLLIYKHEEGAIEERARDYNANWMSAVE 865 (1036)
Q Consensus 818 ~~~l--~~~~~~I~vgD~~-~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~ 865 (1036)
|-.+ +-.|++++|+... +.|.+++.+.+.++|..+++....-..+|+.
T Consensus 294 Pr~~~~s~~g~~l~Va~~~s~~v~vf~~d~~tG~l~~~~~~~~~~~p~ci~ 344 (345)
T PF10282_consen 294 PRHFAFSPDGRYLYVANQDSNTVSVFDIDPDTGKLTPVGSSVPIPSPVCIV 344 (345)
T ss_dssp EEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTEEEEEEEEEESSSEEEEE
T ss_pred ccEEEEeCCCCEEEEEecCCCeEEEEEEeCCCCcEEEecccccCCCCEEEe
Confidence 4444 4478899998744 5688888888889998887643333333443
|
1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H. |
| >KOG1539 consensus WD repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=97.55 E-value=0.33 Score=57.31 Aligned_cols=208 Identities=14% Similarity=0.194 Sum_probs=105.3
Q ss_pred ccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEE-eeeecCeEEEEEEeCCCcEEEEEEecCCCccccceeeec
Q 001655 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL-LCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573 (1036)
Q Consensus 495 ~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~-~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~l 573 (1036)
+.+.+..+- +++.++|+...+.+++.. ...|..|. +.. .+.+.+++.++ .+..|.-... +. .....
T Consensus 46 ~~~~vtt~v-gksfqvYd~~kl~ll~vs----~~lp~~I~alas--~~~~vy~A~g~-~i~~~~rgk~---i~--~~~~~ 112 (910)
T KOG1539|consen 46 STFYVTTCV-GKSFQVYDVNKLNLLFVS----KPLPDKITALAS--DKDYVYVASGN-KIYAYARGKH---IR--HTTLL 112 (910)
T ss_pred ceEEEEEec-CceEEEEeccceEEEEec----CCCCCceEEEEe--cCceEEEecCc-EEEEEEccce---EE--EEecc
Confidence 445555555 889999999998887643 12445552 332 46788888765 3555543210 00 01111
Q ss_pred CccceE-EEEEEeCCceEEEEEcCC-ceEEEec-C---C-c--EEEeeeccccceeeecccCCCCCCe-EEEEeCCeEEE
Q 001655 574 GTQPIT-LRTFSSKNTTHVFAASDR-PTVIYSS-N---K-K--LLYSNVNLKEVSHMCPFNSAAFPDS-LAIAKEGELTI 643 (1036)
Q Consensus 574 G~~pv~-l~~~~~~~~~~v~~~~~~-p~~i~~~-~---~-~--~~~~~~~~~~i~~~~~f~~~~~~~~-l~~~~~~~l~i 643 (1036)
+..-|. |.+| | ..++++-.+ -..+... . + . ..+.+...+.|.+++ |...+-+- ++..+++.|.|
T Consensus 113 ~~a~v~~l~~f---G-e~lia~d~~~~l~vw~~s~~~~e~~l~~~~~~~~~~~Ital~--HP~TYLNKIvvGs~~G~lql 186 (910)
T KOG1539|consen 113 HGAKVHLLLPF---G-EHLIAVDISNILFVWKTSSIQEELYLQSTFLKVEGDFITALL--HPSTYLNKIVVGSSQGRLQL 186 (910)
T ss_pred ccceEEEEeee---c-ceEEEEEccCcEEEEEeccccccccccceeeeccCCceeeEe--cchhheeeEEEeecCCcEEE
Confidence 222222 2333 1 122222222 1222111 1 1 1 123333333344433 22223344 44478999999
Q ss_pred EEEcCCCCeEE-EEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEE
Q 001655 644 GTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722 (1036)
Q Consensus 644 ~~l~~~~~~~~-~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~ 722 (1036)
-++.+..-.+. +... ...+.+.-.|.....+|++.. +.|-+++-+..+++-+++-+. -.|+ ++
T Consensus 187 ~Nvrt~K~v~~f~~~~--s~IT~ieqsPaLDVVaiG~~~----------G~ViifNlK~dkil~sFk~d~-g~Vt---sl 250 (910)
T KOG1539|consen 187 WNVRTGKVVYTFQEFF--SRITAIEQSPALDVVAIGLEN----------GTVIIFNLKFDKILMSFKQDW-GRVT---SL 250 (910)
T ss_pred EEeccCcEEEEecccc--cceeEeccCCcceEEEEeccC----------ceEEEEEcccCcEEEEEEccc-ccee---EE
Confidence 99886543221 1111 222334445566666666642 678888888888887766552 2333 44
Q ss_pred EEcCCCceEEEEEee
Q 001655 723 SFSDDSNVYYCVGTA 737 (1036)
Q Consensus 723 ~l~~~~~~~i~VGT~ 737 (1036)
.|..++.+.+++|+.
T Consensus 251 SFrtDG~p~las~~~ 265 (910)
T KOG1539|consen 251 SFRTDGNPLLASGRS 265 (910)
T ss_pred EeccCCCeeEEeccC
Confidence 666677777777764
|
|
| >PRK11028 6-phosphogluconolactonase; Provisional | Back alignment and domain information |
|---|
Probab=97.45 E-value=0.32 Score=54.55 Aligned_cols=270 Identities=16% Similarity=0.151 Sum_probs=142.1
Q ss_pred cCCeEEEEEEcC-cEEEEEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCC-CCeeEEecCCCccCce
Q 001655 454 GGGHLVYLEIGD-GILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD-LNLITKEHLGGEIIPR 531 (1036)
Q Consensus 454 ~~~~~~~l~~~~-~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~-l~~v~~~~l~~~~~p~ 531 (1036)
.++.+..++++. +++..+...+....++.+++++ ...+++++.+.++.+.+|++.+ .++......+....|.
T Consensus 10 ~~~~I~~~~~~~~g~l~~~~~~~~~~~~~~l~~sp------d~~~lyv~~~~~~~i~~~~~~~~g~l~~~~~~~~~~~p~ 83 (330)
T PRK11028 10 ESQQIHVWNLNHEGALTLLQVVDVPGQVQPMVISP------DKRHLYVGVRPEFRVLSYRIADDGALTFAAESPLPGSPT 83 (330)
T ss_pred CCCCEEEEEECCCCceeeeeEEecCCCCccEEECC------CCCEEEEEECCCCcEEEEEECCCCceEEeeeecCCCCce
Confidence 356677777753 5666665555556677888765 2568888888788999999863 2221111111112455
Q ss_pred eEEeeeecCeEEEEEE-eCCCcEEEEEEecCCCccccceeeecCccceEEEEEEeCCceEEEEEcCCceEE-EecCCcEE
Q 001655 532 SVLLCAFEGISYLLCA-LGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI-YSSNKKLL 609 (1036)
Q Consensus 532 si~~~~~~~~~~L~vg-l~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i-~~~~~~~~ 609 (1036)
.+.+.+ +..+|+++ ..+|.+..|.++.. +.+.. ++ ..+. + ...|.-+ +..++++
T Consensus 84 ~i~~~~--~g~~l~v~~~~~~~v~v~~~~~~-g~~~~---------~~--~~~~--~-------~~~~~~~~~~p~g~~- 139 (330)
T PRK11028 84 HISTDH--QGRFLFSASYNANCVSVSPLDKD-GIPVA---------PI--QIIE--G-------LEGCHSANIDPDNRT- 139 (330)
T ss_pred EEEECC--CCCEEEEEEcCCCeEEEEEECCC-CCCCC---------ce--eecc--C-------CCcccEeEeCCCCCE-
Confidence 554442 34456654 45888888887642 21110 00 0000 0 0111111 1111221
Q ss_pred EeeeccccceeeecccCCCCCCeEEEE--eCCeEEEEEEcCCCCe-----EEEEEeCCCccceEEEecCCCEEEEEEeec
Q 001655 610 YSNVNLKEVSHMCPFNSAAFPDSLAIA--KEGELTIGTIDDIQKL-----HIRSIPLGEHPRRICHQEQSRTFAICSLKN 682 (1036)
Q Consensus 610 ~~~~~~~~i~~~~~f~~~~~~~~l~~~--~~~~l~i~~l~~~~~~-----~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~ 682 (1036)
++.+ ..+.+.+..++....+ ...+++.+..|+.++++|+.+.+.|++..
T Consensus 140 -----------------------l~v~~~~~~~v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~~~- 195 (330)
T PRK11028 140 -----------------------LWVPCLKEDRIRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVNEL- 195 (330)
T ss_pred -----------------------EEEeeCCCCEEEEEEECCCCcccccCCCceecCCCCCCceEEECCCCCEEEEEecC-
Confidence 1111 1355666666543222 12346678899999999998877776532
Q ss_pred CCCCCceeeEEEEEEeCC----CceEEEEEEC-CC---CCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEE
Q 001655 683 QSCAEESEMHFVRLLDDQ----TFEFISTYPL-DT---FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF 754 (1036)
Q Consensus 683 ~~~~~~~~~~~l~l~d~~----t~~~i~~~~~-~~---~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~ 754 (1036)
...|.+++-+ +++.+.++.. ++ ++. .. ..+.+..+ .+++.|+.. ..+.|.+|
T Consensus 196 --------~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~-~~-~~i~~~pd-g~~lyv~~~---------~~~~I~v~ 255 (330)
T PRK11028 196 --------NSSVDVWQLKDPHGEIECVQTLDMMPADFSDTR-WA-ADIHITPD-GRHLYACDR---------TASLISVF 255 (330)
T ss_pred --------CCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCc-cc-eeEEECCC-CCEEEEecC---------CCCeEEEE
Confidence 2456665532 3344444432 21 111 11 12344433 456666531 34689999
Q ss_pred EEeC--CeEEEEEEEEecCceeEeccc-cC-eEEEEE--CCEEEEEEee
Q 001655 755 IVED--GKLQLIAEKETKGAVYSLNAF-NG-KLLAAI--NQKIQLYKWM 797 (1036)
Q Consensus 755 ~i~~--~kl~~~~~~~~~g~v~~i~~~-~g-~Lv~~~--g~~l~v~~~~ 797 (1036)
++.. .+++++........+..+.-- +| +|+++. .++|.+|+++
T Consensus 256 ~i~~~~~~~~~~~~~~~~~~p~~~~~~~dg~~l~va~~~~~~v~v~~~~ 304 (330)
T PRK11028 256 SVSEDGSVLSFEGHQPTETQPRGFNIDHSGKYLIAAGQKSHHISVYEID 304 (330)
T ss_pred EEeCCCCeEEEeEEEeccccCCceEECCCCCEEEEEEccCCcEEEEEEc
Confidence 9973 467766665554344455422 34 566554 3589999875
|
|
| >KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=97.44 E-value=0.049 Score=56.53 Aligned_cols=265 Identities=16% Similarity=0.218 Sum_probs=143.3
Q ss_pred CceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCC-CeeE--EecCCCccCceeEEeeeecCeEEEEEEeCCCcE
Q 001655 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDL-NLIT--KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHL 553 (1036)
Q Consensus 477 ~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l-~~v~--~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l 553 (1036)
+..|++|+++|. ....++.|.| ||.+++|.+.+- ..+- .+.+. .++=+++- ..+...++.|.-||.+
T Consensus 27 ~DsIS~l~FSP~-----~~~~~~A~SW-D~tVR~wevq~~g~~~~ka~~~~~--~PvL~v~W--sddgskVf~g~~Dk~~ 96 (347)
T KOG0647|consen 27 EDSISALAFSPQ-----ADNLLAAGSW-DGTVRIWEVQNSGQLVPKAQQSHD--GPVLDVCW--SDDGSKVFSGGCDKQA 96 (347)
T ss_pred ccchheeEeccc-----cCceEEeccc-CCceEEEEEecCCcccchhhhccC--CCeEEEEE--ccCCceEEeeccCCce
Confidence 456899999873 2567779999 999999998762 2221 12222 12222221 2355789999999999
Q ss_pred EEEEEecCCCccccceeeecCccceEEEEEEeCCceEEEEEcCCceEEEecCCcEEEeeec---------c-ccceeeec
Q 001655 554 LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVN---------L-KEVSHMCP 623 (1036)
Q Consensus 554 ~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~~~~~~~~~~---------~-~~i~~~~~ 623 (1036)
-.|.+... .+ ..+..=..||+-..+-.+...-.+++|+ .+..++|.... . +.+.++.-
T Consensus 97 k~wDL~S~--Q~---~~v~~Hd~pvkt~~wv~~~~~~cl~TGS-------WDKTlKfWD~R~~~pv~t~~LPeRvYa~Dv 164 (347)
T KOG0647|consen 97 KLWDLASG--QV---SQVAAHDAPVKTCHWVPGMNYQCLVTGS-------WDKTLKFWDTRSSNPVATLQLPERVYAADV 164 (347)
T ss_pred EEEEccCC--Ce---eeeeecccceeEEEEecCCCcceeEecc-------cccceeecccCCCCeeeeeeccceeeehhc
Confidence 98887643 11 2233335666544441111122344442 13334443222 1 11111111
Q ss_pred ccCCCCCCeEEEEeCCeEEEEEEcCCC-CeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCC--
Q 001655 624 FNSAAFPDSLAIAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQ-- 700 (1036)
Q Consensus 624 f~~~~~~~~l~~~~~~~l~i~~l~~~~-~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~-- 700 (1036)
. .|=..+...+..+.+.+|.... .+-.-.-||....|.|+-.++...++++..+ .+-.++-+|+.
T Consensus 165 ~----~pm~vVata~r~i~vynL~n~~te~k~~~SpLk~Q~R~va~f~d~~~~alGsiE--------Grv~iq~id~~~~ 232 (347)
T KOG0647|consen 165 L----YPMAVVATAERHIAVYNLENPPTEFKRIESPLKWQTRCVACFQDKDGFALGSIE--------GRVAIQYIDDPNP 232 (347)
T ss_pred c----CceeEEEecCCcEEEEEcCCCcchhhhhcCcccceeeEEEEEecCCceEeeeec--------ceEEEEecCCCCc
Confidence 1 1112333458889999995543 2323344677778888888888889888774 23455666652
Q ss_pred ----CceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEe
Q 001655 701 ----TFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSL 776 (1036)
Q Consensus 701 ----t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i 776 (1036)
+|+...+-.. -+..|+++-++.|.-....+.-+| .-|++-.|+-+ -+-++......++|+++.
T Consensus 233 ~~nFtFkCHR~~~~-~~~~VYaVNsi~FhP~hgtlvTaG-----------sDGtf~FWDkd-ar~kLk~s~~~~qpItcc 299 (347)
T KOG0647|consen 233 KDNFTFKCHRSTNS-VNDDVYAVNSIAFHPVHGTLVTAG-----------SDGTFSFWDKD-ARTKLKTSETHPQPITCC 299 (347)
T ss_pred cCceeEEEeccCCC-CCCceEEecceEeecccceEEEec-----------CCceEEEecch-hhhhhhccCcCCCcccee
Confidence 1111111111 122467777776654223444444 34888888654 133445556678888876
Q ss_pred ccc-cCe-EEEEEC
Q 001655 777 NAF-NGK-LLAAIN 788 (1036)
Q Consensus 777 ~~~-~g~-Lv~~~g 788 (1036)
+-- +|. +++|.|
T Consensus 300 ~fn~~G~ifaYA~g 313 (347)
T KOG0647|consen 300 SFNRNGSIFAYALG 313 (347)
T ss_pred EecCCCCEEEEEee
Confidence 522 343 445665
|
|
| >KOG2055 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=97.40 E-value=0.073 Score=58.39 Aligned_cols=199 Identities=16% Similarity=0.118 Sum_probs=117.6
Q ss_pred ceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCccccceeeecCccceEEEEEEeCCceEEEEEcCCceEEEecC--Cc
Q 001655 530 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSN--KK 607 (1036)
Q Consensus 530 p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~--~~ 607 (1036)
..|+.+.+ ..+.|+++--||.+-.|++|...+.+ .....+-..|+.=-.|..+|.+.||..|.++++...+= .+
T Consensus 216 I~sv~FHp--~~plllvaG~d~~lrifqvDGk~N~~--lqS~~l~~fPi~~a~f~p~G~~~i~~s~rrky~ysyDle~ak 291 (514)
T KOG2055|consen 216 ITSVQFHP--TAPLLLVAGLDGTLRIFQVDGKVNPK--LQSIHLEKFPIQKAEFAPNGHSVIFTSGRRKYLYSYDLETAK 291 (514)
T ss_pred ceEEEecC--CCceEEEecCCCcEEEEEecCccChh--heeeeeccCccceeeecCCCceEEEecccceEEEEeeccccc
Confidence 45665543 46777777779999999998654321 12345556666544454466678888888888776652 22
Q ss_pred E-EEeeeccccceeeecccCCCCCCeEEEEeCCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCC
Q 001655 608 L-LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA 686 (1036)
Q Consensus 608 ~-~~~~~~~~~i~~~~~f~~~~~~~~l~~~~~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~ 686 (1036)
+ .+.++....=.++-.|-...+.+ ++.+.++.=.|--|....+-.+-++.+.+..+-++++.+++.+++++..
T Consensus 292 ~~k~~~~~g~e~~~~e~FeVShd~~-fia~~G~~G~I~lLhakT~eli~s~KieG~v~~~~fsSdsk~l~~~~~~----- 365 (514)
T KOG2055|consen 292 VTKLKPPYGVEEKSMERFEVSHDSN-FIAIAGNNGHIHLLHAKTKELITSFKIEGVVSDFTFSSDSKELLASGGT----- 365 (514)
T ss_pred cccccCCCCcccchhheeEecCCCC-eEEEcccCceEEeehhhhhhhhheeeeccEEeeEEEecCCcEEEEEcCC-----
Confidence 2 23343322212233332222333 5555554444555555555567778888888899999888766666532
Q ss_pred CceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEe
Q 001655 687 EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757 (1036)
Q Consensus 687 ~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~ 757 (1036)
+.|-++|-.+..++.++.=+-.=+.+++|. +.+..|+++|+ .+|-+-+|+..
T Consensus 366 -----GeV~v~nl~~~~~~~rf~D~G~v~gts~~~----S~ng~ylA~GS----------~~GiVNIYd~~ 417 (514)
T KOG2055|consen 366 -----GEVYVWNLRQNSCLHRFVDDGSVHGTSLCI----SLNGSYLATGS----------DSGIVNIYDGN 417 (514)
T ss_pred -----ceEEEEecCCcceEEEEeecCccceeeeee----cCCCceEEecc----------CcceEEEeccc
Confidence 356666666666665555444444455543 12356999997 45666666644
|
|
| >PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A | Back alignment and domain information |
|---|
Probab=97.39 E-value=0.36 Score=54.63 Aligned_cols=314 Identities=15% Similarity=0.179 Sum_probs=146.0
Q ss_pred cCcEEEEecCCccceeeeeCCCCCcEEEEE-eCCCEEEEEEcCCeEEEEEEcCcEEEEEEeccCCceeEEEEcCCCCCCC
Q 001655 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVAT-ANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENP 492 (1036)
Q Consensus 414 ~~~i~l~~~~~~~~~~~w~~~~~~~I~~as-~~~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~ 492 (1036)
..++.+++.++.+.+.++....+..+..+. -++.++.++..++.+..+++...+ .+.+.......-++++++
T Consensus 15 ~~~v~viD~~t~~~~~~i~~~~~~h~~~~~s~Dgr~~yv~~rdg~vsviD~~~~~--~v~~i~~G~~~~~i~~s~----- 87 (369)
T PF02239_consen 15 SGSVAVIDGATNKVVARIPTGGAPHAGLKFSPDGRYLYVANRDGTVSVIDLATGK--VVATIKVGGNPRGIAVSP----- 87 (369)
T ss_dssp GTEEEEEETTT-SEEEEEE-STTEEEEEE-TT-SSEEEEEETTSEEEEEETTSSS--EEEEEE-SSEEEEEEE-------
T ss_pred CCEEEEEECCCCeEEEEEcCCCCceeEEEecCCCCEEEEEcCCCeEEEEECCccc--EEEEEecCCCcceEEEcC-----
Confidence 467888988876666666432211122222 236788888878888777765544 333445566677888754
Q ss_pred CcccEEEEEEecCCEEEEEECCCCCeeEEecCC---Cc-cCceeEEeeeecCeEEEEEEeCC-CcEEEEEEecCCCcccc
Q 001655 493 SYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG---GE-IIPRSVLLCAFEGISYLLCALGD-GHLLNFLLNMKTGELTD 567 (1036)
Q Consensus 493 ~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~---~~-~~p~si~~~~~~~~~~L~vgl~~-G~l~~y~~~~~~~~l~~ 567 (1036)
.+.+++++.|.++.+.+++..+++++...... .+ ..++-..+.....++..++.+.+ |.+... +..+..-..
T Consensus 88 -DG~~~~v~n~~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd~~~I~vV--dy~d~~~~~ 164 (369)
T PF02239_consen 88 -DGKYVYVANYEPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNLKDTGEIWVV--DYSDPKNLK 164 (369)
T ss_dssp -TTTEEEEEEEETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-SSSSEEEEEETTTTEEEEE--ETTTSSCEE
T ss_pred -CCCEEEEEecCCCceeEeccccccceeecccccccccccCCCceeEEecCCCCEEEEEEccCCeEEEE--Eeccccccc
Confidence 36789999887999999999999887543222 11 12232333333445555555554 766543 332211001
Q ss_pred ceeeecCccceEEEEEEeCCceEEEEE-cCCceEEEec--CCcEEEeeecccccee---eecccCCCCCCeEEEEe--CC
Q 001655 568 RKKVSLGTQPITLRTFSSKNTTHVFAA-SDRPTVIYSS--NKKLLYSNVNLKEVSH---MCPFNSAAFPDSLAIAK--EG 639 (1036)
Q Consensus 568 ~~~~~lG~~pv~l~~~~~~~~~~v~~~-~~~p~~i~~~--~~~~~~~~~~~~~i~~---~~~f~~~~~~~~l~~~~--~~ 639 (1036)
.+....|..|--.- + ..++..+++. .....+...+ .+++... +.....-+ .+.|.... -+-+..+ .+
T Consensus 165 ~~~i~~g~~~~D~~-~-dpdgry~~va~~~sn~i~viD~~~~k~v~~-i~~g~~p~~~~~~~~php~--~g~vw~~~~~~ 239 (369)
T PF02239_consen 165 VTTIKVGRFPHDGG-F-DPDGRYFLVAANGSNKIAVIDTKTGKLVAL-IDTGKKPHPGPGANFPHPG--FGPVWATSGLG 239 (369)
T ss_dssp EEEEE--TTEEEEE-E--TTSSEEEEEEGGGTEEEEEETTTTEEEEE-EE-SSSBEETTEEEEEETT--TEEEEEEEBSS
T ss_pred eeeecccccccccc-c-CcccceeeecccccceeEEEeeccceEEEE-eeccccccccccccccCCC--cceEEeecccc
Confidence 12333444442211 1 1122223231 1122222211 2333211 11000000 01111111 1233322 23
Q ss_pred eEEEEEEcCC-------CCe-EEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECC
Q 001655 640 ELTIGTIDDI-------QKL-HIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLD 711 (1036)
Q Consensus 640 ~l~i~~l~~~-------~~~-~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~ 711 (1036)
...+..+... ..| .+++|++.+.|..+..||+++.+.|...-.. ....|.++|.++++.+.++...
T Consensus 240 ~~~~~~ig~~~v~v~d~~~wkvv~~I~~~G~glFi~thP~s~~vwvd~~~~~------~~~~v~viD~~tl~~~~~i~~~ 313 (369)
T PF02239_consen 240 YFAIPLIGTDPVSVHDDYAWKVVKTIPTQGGGLFIKTHPDSRYVWVDTFLNP------DADTVQVIDKKTLKVVKTITPG 313 (369)
T ss_dssp SSEEEEEE--TTT-STTTBTSEEEEEE-SSSS--EE--TT-SEEEEE-TT-S------SHT-EEEEECCGTEEEE-HHHH
T ss_pred ceecccccCCccccchhhcCeEEEEEECCCCcceeecCCCCccEEeeccCCC------CCceEEEEECcCcceeEEEecc
Confidence 3333333222 234 7899999999999999999997777632210 1458999999999887766544
Q ss_pred CCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCe
Q 001655 712 TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK 760 (1036)
Q Consensus 712 ~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~k 760 (1036)
++-.+ ...+|..++++.++ ..- ...|+|.+|+....+
T Consensus 314 ~~~~~---~h~ef~~dG~~v~v-S~~--------~~~~~i~v~D~~Tl~ 350 (369)
T PF02239_consen 314 PGKRV---VHMEFNPDGKEVWV-SVW--------DGNGAIVVYDAKTLK 350 (369)
T ss_dssp HT--E---EEEEE-TTSSEEEE-EEE----------TTEEEEEETTTTE
T ss_pred CCCcE---eccEECCCCCEEEE-EEe--------cCCCEEEEEECCCcE
Confidence 44333 34467666655444 211 134599999765433
|
... |
| >KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=97.36 E-value=0.26 Score=57.36 Aligned_cols=352 Identities=15% Similarity=0.161 Sum_probs=180.9
Q ss_pred CCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEE
Q 001655 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLN 555 (1036)
Q Consensus 476 l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~ 555 (1036)
-+.+++|+++.+ .+.+++.+.. +.-+++|++|..+.+-.-....+ .| ++.+.--+....|-.|-.||.+.+
T Consensus 61 d~d~ita~~l~~------d~~~L~~a~r-s~llrv~~L~tgk~irswKa~He-~P-vi~ma~~~~g~LlAtggaD~~v~V 131 (775)
T KOG0319|consen 61 DEDEITALALTP------DEEVLVTASR-SQLLRVWSLPTGKLIRSWKAIHE-AP-VITMAFDPTGTLLATGGADGRVKV 131 (775)
T ss_pred chhhhheeeecC------CccEEEEeec-cceEEEEEcccchHhHhHhhccC-CC-eEEEEEcCCCceEEeccccceEEE
Confidence 346789999865 2556666666 88999999998654321001001 12 222221122333445778999998
Q ss_pred EEEecCCCccccceeeecC-ccceEEEEEEeCCceEEEEEcCCc--eEEEec--CCcEEEeeecccc---------ceee
Q 001655 556 FLLNMKTGELTDRKKVSLG-TQPITLRTFSSKNTTHVFAASDRP--TVIYSS--NKKLLYSNVNLKE---------VSHM 621 (1036)
Q Consensus 556 y~~~~~~~~l~~~~~~~lG-~~pv~l~~~~~~~~~~v~~~~~~p--~~i~~~--~~~~~~~~~~~~~---------i~~~ 621 (1036)
+.+....+. ..+-| ..+|....| +..+ |++++. ++.....+++.+. -.++
T Consensus 132 Wdi~~~~~t-----h~fkG~gGvVssl~F-----------~~~~~~~lL~sg~~D~~v~vwnl~~~~tcl~~~~~H~S~v 195 (775)
T KOG0319|consen 132 WDIKNGYCT-----HSFKGHGGVVSSLLF-----------HPHWNRWLLASGATDGTVRVWNLNDKRTCLHTMILHKSAV 195 (775)
T ss_pred EEeeCCEEE-----EEecCCCceEEEEEe-----------CCccchhheeecCCCceEEEEEcccCchHHHHHHhhhhhe
Confidence 887654311 11112 112222222 1111 222221 2333333333111 1112
Q ss_pred ecccCCCCCCeEEEE-eCCeEEEEEEcCCCCeEEEEEeCCCccceEEEecC-----CCEEEEEEeecCCCCCceeeEEEE
Q 001655 622 CPFNSAAFPDSLAIA-KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQ-----SRTFAICSLKNQSCAEESEMHFVR 695 (1036)
Q Consensus 622 ~~f~~~~~~~~l~~~-~~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~-----~~~~~v~~~~~~~~~~~~~~~~l~ 695 (1036)
+.+....-.+-++.. .+.-+.+-.+...+ .-+.+|+.+....+++.++ ...+..+. ..+.++
T Consensus 196 tsL~~~~d~~~~ls~~RDkvi~vwd~~~~~--~l~~lp~ye~~E~vv~l~~~~~~~~~~~~TaG----------~~g~~~ 263 (775)
T KOG0319|consen 196 TSLAFSEDSLELLSVGRDKVIIVWDLVQYK--KLKTLPLYESLESVVRLREELGGKGEYIITAG----------GSGVVQ 263 (775)
T ss_pred eeeeeccCCceEEEeccCcEEEEeehhhhh--hhheechhhheeeEEEechhcCCcceEEEEec----------CCceEE
Confidence 211111111122322 23334444442221 2366787777777777665 11222221 346899
Q ss_pred EEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEE-EecCcee
Q 001655 696 LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEK-ETKGAVY 774 (1036)
Q Consensus 696 l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~-~~~g~v~ 774 (1036)
.+|+++++.+....-++.|++..+..+ .....+++|-+. -.|++|+.++ ++++... -.++-|+
T Consensus 264 ~~d~es~~~~~~~~~~~~~e~~~~~~~---~~~~~~l~vtae-----------Qnl~l~d~~~--l~i~k~ivG~ndEI~ 327 (775)
T KOG0319|consen 264 YWDSESGKCVYKQRQSDSEEIDHLLAI---ESMSQLLLVTAE-----------QNLFLYDEDE--LTIVKQIVGYNDEIL 327 (775)
T ss_pred EEecccchhhhhhccCCchhhhcceec---cccCceEEEEcc-----------ceEEEEEccc--cEEehhhcCCchhhe
Confidence 999999998876655556664433222 123566666542 1456665443 4444332 3577889
Q ss_pred Eecccc---CeEEEEEC-CEEEEEEeeecCCCccchhcccccccce-EEEEEE--EeCCEEEEEeccccEEEEEEeccCC
Q 001655 775 SLNAFN---GKLLAAIN-QKIQLYKWMLRDDGTRELQSECGHHGHI-LALYVQ--TRGDFIVVGDLMKSISLLIYKHEEG 847 (1036)
Q Consensus 775 ~i~~~~---g~Lv~~~g-~~l~v~~~~~~~~~~~~L~~~~~~~~~~-~~~~l~--~~~~~I~vgD~~~Sv~~~~~~~~~~ 847 (1036)
.++-++ .+|++|.| ..+++|+... . .-++ +.++. .+++++ ..|.+++-|---+|+-+.|++....
T Consensus 328 Dm~~lG~e~~~laVATNs~~lr~y~~~~--~-~c~i-----i~GH~e~vlSL~~~~~g~llat~sKD~svilWr~~~~~~ 399 (775)
T KOG0319|consen 328 DMKFLGPEESHLAVATNSPELRLYTLPT--S-YCQI-----IPGHTEAVLSLDVWSSGDLLATGSKDKSVILWRLNNNCS 399 (775)
T ss_pred eeeecCCccceEEEEeCCCceEEEecCC--C-ceEE-----EeCchhheeeeeecccCcEEEEecCCceEEEEEecCCcc
Confidence 999886 68988888 5788886421 1 1111 12232 345665 4566899999999999999854444
Q ss_pred eEEEEec-cCCcceeEEEEE--eeCceEEEEccCCcEEEEeeCC
Q 001655 848 AIEERAR-DYNANWMSAVEI--LDDDIYLGAENNFNLFTVRKNS 888 (1036)
Q Consensus 848 ~l~~~a~-D~~~~~~~~~~~--l~~~~~l~~D~~gnl~~l~~~~ 888 (1036)
+...++. .-+...+.++.+ +....++..-.++.+-+..++.
T Consensus 400 ~~~~~a~~~gH~~svgava~~~~~asffvsvS~D~tlK~W~l~~ 443 (775)
T KOG0319|consen 400 KSLCVAQANGHTNSVGAVAGSKLGASFFVSVSQDCTLKLWDLPK 443 (775)
T ss_pred hhhhhhhhcccccccceeeecccCccEEEEecCCceEEEecCCC
Confidence 4444442 233444555544 2222344455577777766654
|
|
| >KOG1408 consensus WD40 repeat protein [Function unknown] | Back alignment and domain information |
|---|
Probab=97.29 E-value=0.025 Score=64.83 Aligned_cols=112 Identities=13% Similarity=0.275 Sum_probs=66.5
Q ss_pred cCceeEeccc--cCeE-EEEECCEEEEEEeeecCCCcc-c-hhcccccccceEEEEEEEeCCEEEEEeccccEEEEEEec
Q 001655 770 KGAVYSLNAF--NGKL-LAAINQKIQLYKWMLRDDGTR-E-LQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844 (1036)
Q Consensus 770 ~g~v~~i~~~--~g~L-v~~~g~~l~v~~~~~~~~~~~-~-L~~~~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~ 844 (1036)
++-.|.+..- -+++ .+|...+|++|+.+ .|+. | +...-...+...-+.++-.+.||...-.-+-+.++.|-.
T Consensus 596 ktTlYDm~Vdp~~k~v~t~cQDrnirif~i~---sgKq~k~FKgs~~~eG~lIKv~lDPSgiY~atScsdktl~~~Df~s 672 (1080)
T KOG1408|consen 596 KTTLYDMAVDPTSKLVVTVCQDRNIRIFDIE---SGKQVKSFKGSRDHEGDLIKVILDPSGIYLATSCSDKTLCFVDFVS 672 (1080)
T ss_pred cceEEEeeeCCCcceEEEEecccceEEEecc---ccceeeeecccccCCCceEEEEECCCccEEEEeecCCceEEEEecc
Confidence 3444544422 2333 45677889999874 2321 1 111111122233345566688888888888888876643
Q ss_pred cCCeEEEEeccCC-cceeEEEEEeeC-ceEEEEccCCcEEEEeeCC
Q 001655 845 EEGAIEERARDYN-ANWMSAVEILDD-DIYLGAENNFNLFTVRKNS 888 (1036)
Q Consensus 845 ~~~~l~~~a~D~~-~~~~~~~~~l~~-~~~l~~D~~gnl~~l~~~~ 888 (1036)
+. .+|+-+. .--||.+.|+.+ ..+|..-.+|.||+++.+.
T Consensus 673 --gE--cvA~m~GHsE~VTG~kF~nDCkHlISvsgDgCIFvW~lp~ 714 (1080)
T KOG1408|consen 673 --GE--CVAQMTGHSEAVTGVKFLNDCKHLISVSGDGCIFVWKLPL 714 (1080)
T ss_pred --ch--hhhhhcCcchheeeeeecccchhheeecCCceEEEEECch
Confidence 22 2444332 335788999876 4578888899999999763
|
|
| >KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=97.25 E-value=0.69 Score=54.18 Aligned_cols=316 Identities=15% Similarity=0.185 Sum_probs=157.7
Q ss_pred ceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEe-CCCcEEEE
Q 001655 478 YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCAL-GDGHLLNF 556 (1036)
Q Consensus 478 ~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl-~~G~l~~y 556 (1036)
..++|.++.+ ....+++|+. +|...+|.+|++.+++.-.++. .....+.+.. ...+|-+|. +-|+|++|
T Consensus 266 ~kvtaa~fH~------~t~~lvvgFs-sG~f~LyelP~f~lih~LSis~-~~I~t~~~N~--tGDWiA~g~~klgQLlVw 335 (893)
T KOG0291|consen 266 SKVTAAAFHK------GTNLLVVGFS-SGEFGLYELPDFNLIHSLSISD-QKILTVSFNS--TGDWIAFGCSKLGQLLVW 335 (893)
T ss_pred cceeeeeccC------CceEEEEEec-CCeeEEEecCCceEEEEeeccc-ceeeEEEecc--cCCEEEEcCCccceEEEE
Confidence 4456666532 4678888887 9999999999999987655542 1222333332 246777766 45999999
Q ss_pred EEecCCCccccceeeecCccceEEEEEEeCCceEEEEEcCCceEEEecCCcEEEeeeccccceeeecccCCCC-------
Q 001655 557 LLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAF------- 629 (1036)
Q Consensus 557 ~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~~~~~~~~~~~~~i~~~~~f~~~~~------- 629 (1036)
++...+--++.+ |-.+ ++..+.......++++|. ++++.+....+. -.++..|.....
T Consensus 336 eWqsEsYVlKQQ-----gH~~-~i~~l~YSpDgq~iaTG~-------eDgKVKvWn~~S--gfC~vTFteHts~Vt~v~f 400 (893)
T KOG0291|consen 336 EWQSESYVLKQQ-----GHSD-RITSLAYSPDGQLIATGA-------EDGKVKVWNTQS--GFCFVTFTEHTSGVTAVQF 400 (893)
T ss_pred Eeeccceeeecc-----cccc-ceeeEEECCCCcEEEecc-------CCCcEEEEeccC--ceEEEEeccCCCceEEEEE
Confidence 998654333321 1111 222222222222344431 245555443321 113333432221
Q ss_pred ---CCeEEEEe-CCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEE
Q 001655 630 ---PDSLAIAK-EGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFI 705 (1036)
Q Consensus 630 ---~~~l~~~~-~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i 705 (1036)
++.++..+ ++.++.--+.....+-.-..|-.-+...++-+|....++.+.- ..+.|.+.+.+|++.+
T Consensus 401 ~~~g~~llssSLDGtVRAwDlkRYrNfRTft~P~p~QfscvavD~sGelV~AG~~---------d~F~IfvWS~qTGqll 471 (893)
T KOG0291|consen 401 TARGNVLLSSSLDGTVRAWDLKRYRNFRTFTSPEPIQFSCVAVDPSGELVCAGAQ---------DSFEIFVWSVQTGQLL 471 (893)
T ss_pred EecCCEEEEeecCCeEEeeeecccceeeeecCCCceeeeEEEEcCCCCEEEeecc---------ceEEEEEEEeecCeee
Confidence 22233222 4444444443333332222232333344555555444333332 3478888899999888
Q ss_pred EEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc--cCeE
Q 001655 706 STYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF--NGKL 783 (1036)
Q Consensus 706 ~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~--~g~L 783 (1036)
+. +.-+|-+.+-.+ |+.. ...++-|. .. -.+.+|++-..+ ..+-..++..-|+++..- +..|
T Consensus 472 Di--LsGHEgPVs~l~--f~~~-~~~LaS~S-WD---------kTVRiW~if~s~-~~vEtl~i~sdvl~vsfrPdG~el 535 (893)
T KOG0291|consen 472 DI--LSGHEGPVSGLS--FSPD-GSLLASGS-WD---------KTVRIWDIFSSS-GTVETLEIRSDVLAVSFRPDGKEL 535 (893)
T ss_pred eh--hcCCCCcceeeE--Eccc-cCeEEecc-cc---------ceEEEEEeeccC-ceeeeEeeccceeEEEEcCCCCeE
Confidence 73 455666554322 3222 23444332 11 134556554321 112233445555555533 3356
Q ss_pred EEE-ECCEEEEEEeeec-C----CCccchhc----------c-cccccceEEEEEEEeCCEEEEEeccccEEEEEEe
Q 001655 784 LAA-INQKIQLYKWMLR-D----DGTRELQS----------E-CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843 (1036)
Q Consensus 784 v~~-~g~~l~v~~~~~~-~----~~~~~L~~----------~-~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~ 843 (1036)
.++ ....|.+|+.++- + +|.+.+.+ . +...-++..++.+..|+.|+.|-..+++.++...
T Consensus 536 aVaTldgqItf~d~~~~~q~~~IdgrkD~~~gR~~~D~~ta~~sa~~K~Ftti~ySaDG~~IlAgG~sn~iCiY~v~ 612 (893)
T KOG0291|consen 536 AVATLDGQITFFDIKEAVQVGSIDGRKDLSGGRKETDRITAENSAKGKTFTTICYSADGKCILAGGESNSICIYDVP 612 (893)
T ss_pred EEEEecceEEEEEhhhceeeccccchhhccccccccceeehhhcccCCceEEEEEcCCCCEEEecCCcccEEEEECc
Confidence 544 4447777665310 0 12111111 1 1111223456677789999999999999987543
|
|
| >KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown] | Back alignment and domain information |
|---|
Probab=97.16 E-value=0.62 Score=51.93 Aligned_cols=337 Identities=15% Similarity=0.175 Sum_probs=166.8
Q ss_pred EEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCccccceeeecCccc
Q 001655 498 AAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577 (1036)
Q Consensus 498 ~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG~~p 577 (1036)
+.+.+- .+.+.+|.+..-.+.-.+.+-...--.-++-+.++++--++-|-.+|.+.+|.... ..++ +....=..-
T Consensus 215 liit~G-k~H~~Fw~~~~~~l~k~~~~fek~ekk~Vl~v~F~engdviTgDS~G~i~Iw~~~~--~~~~--k~~~aH~gg 289 (626)
T KOG2106|consen 215 LIITCG-KGHLYFWTLRGGSLVKRQGIFEKREKKFVLCVTFLENGDVITGDSGGNILIWSKGT--NRIS--KQVHAHDGG 289 (626)
T ss_pred EEEEeC-CceEEEEEccCCceEEEeeccccccceEEEEEEEcCCCCEEeecCCceEEEEeCCC--ceEE--eEeeecCCc
Confidence 344433 67788887766554444443210000222223444455678888999998887532 1111 111100000
Q ss_pred e-EEEEEEeCCceEEEEEcCCce-EEEecC--CcEEEe--eeccccceeeecccCCCCCCeEEEEeCCeEEEEEEcCCCC
Q 001655 578 I-TLRTFSSKNTTHVFAASDRPT-VIYSSN--KKLLYS--NVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651 (1036)
Q Consensus 578 v-~l~~~~~~~~~~v~~~~~~p~-~i~~~~--~~~~~~--~~~~~~i~~~~~f~~~~~~~~l~~~~~~~l~i~~l~~~~~ 651 (1036)
| .|..+ .++-++.|.+-- ++.+++ .+++-. |-...++..++ ++-++.++-.+.|.+..++++....
T Consensus 290 v~~L~~l----r~GtllSGgKDRki~~Wd~~y~k~r~~elPe~~G~iRtv~----e~~~di~vGTtrN~iL~Gt~~~~f~ 361 (626)
T KOG2106|consen 290 VFSLCML----RDGTLLSGGKDRKIILWDDNYRKLRETELPEQFGPIRTVA----EGKGDILVGTTRNFILQGTLENGFT 361 (626)
T ss_pred eEEEEEe----cCccEeecCccceEEeccccccccccccCchhcCCeeEEe----cCCCcEEEeeccceEEEeeecCCce
Confidence 0 11111 122233333322 222221 111111 11112344433 2334566678899999999987543
Q ss_pred eEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceE
Q 001655 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731 (1036)
Q Consensus 652 ~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~ 731 (1036)
+.+. .. ++.---++-||..+.|+-+.-. ..++|.+. -+.+.+..++. .+.|+ -|.. ...
T Consensus 362 ~~v~-gh-~delwgla~hps~~q~~T~gqd----------k~v~lW~~--~k~~wt~~~~d--~~~~~---~fhp--sg~ 420 (626)
T KOG2106|consen 362 LTVQ-GH-GDELWGLATHPSKNQLLTCGQD----------KHVRLWND--HKLEWTKIIED--PAECA---DFHP--SGV 420 (626)
T ss_pred EEEE-ec-ccceeeEEcCCChhheeeccCc----------ceEEEccC--CceeEEEEecC--ceeEe---eccC--cce
Confidence 3221 11 1222345667777766655432 35777762 33444433332 22232 2322 238
Q ss_pred EEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc--cCeEEEE-ECCEEEEEEeeecCCCccchhc
Q 001655 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF--NGKLLAA-INQKIQLYKWMLRDDGTRELQS 808 (1036)
Q Consensus 732 i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~--~g~Lv~~-~g~~l~v~~~~~~~~~~~~L~~ 808 (1036)
+++|| ..||.+++++.+..+-.++.. ..|..++..- ..+|.++ -++.|++|..+. +..++.+
T Consensus 421 va~Gt----------~~G~w~V~d~e~~~lv~~~~d--~~~ls~v~ysp~G~~lAvgs~d~~iyiy~Vs~---~g~~y~r 485 (626)
T KOG2106|consen 421 VAVGT----------ATGRWFVLDTETQDLVTIHTD--NEQLSVVRYSPDGAFLAVGSHDNHIYIYRVSA---NGRKYSR 485 (626)
T ss_pred EEEee----------ccceEEEEecccceeEEEEec--CCceEEEEEcCCCCEEEEecCCCeEEEEEECC---CCcEEEE
Confidence 99999 569999999988666666655 5566555533 3466554 447899999853 2344444
Q ss_pred ccccccceEEEEEEEe--CCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEEEEee-------C----------
Q 001655 809 ECGHHGHILALYVQTR--GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD-------D---------- 869 (1036)
Q Consensus 809 ~~~~~~~~~~~~l~~~--~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~-------~---------- 869 (1036)
...... .++++|++. ++++ ++-.-+ .-++-|.+..-+..--.||. .|.+..+.|. +
T Consensus 486 ~~k~~g-s~ithLDwS~Ds~~~-~~~S~d-~eiLyW~~~~~~~~ts~kDv--kW~t~~c~lGF~v~g~s~~t~i~a~~rs 560 (626)
T KOG2106|consen 486 VGKCSG-SPITHLDWSSDSQFL-VSNSGD-YEILYWKPSECKQITSVKDV--KWATYTCTLGFEVFGGSDGTDINAVARS 560 (626)
T ss_pred eeeecC-ceeEEeeecCCCceE-EeccCc-eEEEEEccccCcccceecce--eeeeeEEEEEEEEecccCCchHHHhhhh
Confidence 322233 566777653 3444 332222 22444554332222224443 3555433331 1
Q ss_pred ---ceEEEEccCCcEEEEeeCC
Q 001655 870 ---DIYLGAENNFNLFTVRKNS 888 (1036)
Q Consensus 870 ---~~~l~~D~~gnl~~l~~~~ 888 (1036)
+....+|.+|.+++|+|+-
T Consensus 561 ~~~~~lA~gdd~g~v~lf~yPc 582 (626)
T KOG2106|consen 561 HCEKLLASGDDFGKVHLFSYPC 582 (626)
T ss_pred hhhhhhhccccCceEEEEcccc
Confidence 1256789999999999874
|
|
| >KOG1274 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=97.09 E-value=0.24 Score=59.13 Aligned_cols=74 Identities=22% Similarity=0.248 Sum_probs=61.2
Q ss_pred CccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeC
Q 001655 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVL 740 (1036)
Q Consensus 661 ~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~ 740 (1036)
+-..+++++|....+++.+.+. +|++++.++|+....+..+..+...|...|. ....||+-||
T Consensus 189 ~i~~~~aW~Pk~g~la~~~~d~----------~Vkvy~r~~we~~f~Lr~~~~ss~~~~~~ws---PnG~YiAAs~---- 251 (933)
T KOG1274|consen 189 RICTRLAWHPKGGTLAVPPVDN----------TVKVYSRKGWELQFKLRDKLSSSKFSDLQWS---PNGKYIAAST---- 251 (933)
T ss_pred ceeeeeeecCCCCeEEeeccCC----------eEEEEccCCceeheeecccccccceEEEEEc---CCCcEEeeec----
Confidence 3446789999999999999874 8999999999999888888888877776764 2468999887
Q ss_pred CCCCCCcceEEEEEEEe
Q 001655 741 PEENEPTKGRILVFIVE 757 (1036)
Q Consensus 741 ~~e~~~~~Gri~v~~i~ 757 (1036)
..|.|.||+++
T Consensus 252 ------~~g~I~vWnv~ 262 (933)
T KOG1274|consen 252 ------LDGQILVWNVD 262 (933)
T ss_pred ------cCCcEEEEecc
Confidence 56889999987
|
|
| >KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown] | Back alignment and domain information |
|---|
Probab=97.06 E-value=0.29 Score=54.29 Aligned_cols=272 Identities=14% Similarity=0.140 Sum_probs=154.2
Q ss_pred CceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEE-ecCCCccCceeEEeeeecCeEEEEEEeCCCcEEE
Q 001655 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK-EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLN 555 (1036)
Q Consensus 477 ~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~-~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~ 555 (1036)
...|++++++|.. ...++|++ +-.+.||+....+..-. .... -...|+.+-. +...|.+|-..|.+-.
T Consensus 26 ~~~vssl~fsp~~-----P~d~aVt~--S~rvqly~~~~~~~~k~~srFk--~~v~s~~fR~--DG~LlaaGD~sG~V~v 94 (487)
T KOG0310|consen 26 HNSVSSLCFSPKH-----PYDFAVTS--SVRVQLYSSVTRSVRKTFSRFK--DVVYSVDFRS--DGRLLAAGDESGHVKV 94 (487)
T ss_pred cCcceeEecCCCC-----CCceEEec--ccEEEEEecchhhhhhhHHhhc--cceeEEEeec--CCeEEEccCCcCcEEE
Confidence 3567888887643 23344432 45789998766443211 0111 1223443332 3456777889999988
Q ss_pred EEEecCCCccccceeeecCccceEEEEEEeCCceEEEEEc-CCceEEEec-CCcEEEeeecc--ccceeeecccCCCCCC
Q 001655 556 FLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS-DRPTVIYSS-NKKLLYSNVNL--KEVSHMCPFNSAAFPD 631 (1036)
Q Consensus 556 y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~-~~p~~i~~~-~~~~~~~~~~~--~~i~~~~~f~~~~~~~ 631 (1036)
|...+. ..| +...--..|++.+.|...+ +.+|+.| |....-+++ .+......+.. +-+.+.. +. ...+
T Consensus 95 fD~k~r-~iL---R~~~ah~apv~~~~f~~~d-~t~l~s~sDd~v~k~~d~s~a~v~~~l~~htDYVR~g~-~~--~~~~ 166 (487)
T KOG0310|consen 95 FDMKSR-VIL---RQLYAHQAPVHVTKFSPQD-NTMLVSGSDDKVVKYWDLSTAYVQAELSGHTDYVRCGD-IS--PAND 166 (487)
T ss_pred eccccH-HHH---HHHhhccCceeEEEecccC-CeEEEecCCCceEEEEEcCCcEEEEEecCCcceeEeec-cc--cCCC
Confidence 874321 011 2233456788888775433 3455544 433333443 22222112221 2222221 11 1112
Q ss_pred eEEEEe--CCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEE
Q 001655 632 SLAIAK--EGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYP 709 (1036)
Q Consensus 632 ~l~~~~--~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~ 709 (1036)
.+++.- ++.+++--.-... -.+..+.=|.....++|.|....++-+.- +.+++.|-.++..+-.-.
T Consensus 167 hivvtGsYDg~vrl~DtR~~~-~~v~elnhg~pVe~vl~lpsgs~iasAgG-----------n~vkVWDl~~G~qll~~~ 234 (487)
T KOG0310|consen 167 HIVVTGSYDGKVRLWDTRSLT-SRVVELNHGCPVESVLALPSGSLIASAGG-----------NSVKVWDLTTGGQLLTSM 234 (487)
T ss_pred eEEEecCCCceEEEEEeccCC-ceeEEecCCCceeeEEEcCCCCEEEEcCC-----------CeEEEEEecCCceehhhh
Confidence 333332 4556555444332 34556666778899999998776665543 478999876444333333
Q ss_pred CCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEecccc--CeEEEEE
Q 001655 710 LDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN--GKLLAAI 787 (1036)
Q Consensus 710 ~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~--g~Lv~~~ 787 (1036)
+..+-.++|++... ++..++-.| --|++.+|++.+ ++++|...++|||.+|.-.. ..+++|+
T Consensus 235 ~~H~KtVTcL~l~s---~~~rLlS~s-----------LD~~VKVfd~t~--~Kvv~s~~~~~pvLsiavs~dd~t~viGm 298 (487)
T KOG0310|consen 235 FNHNKTVTCLRLAS---DSTRLLSGS-----------LDRHVKVFDTTN--YKVVHSWKYPGPVLSIAVSPDDQTVVIGM 298 (487)
T ss_pred hcccceEEEEEeec---CCceEeecc-----------cccceEEEEccc--eEEEEeeecccceeeEEecCCCceEEEec
Confidence 44677888886643 334444333 347999998654 77899999999999999875 4789999
Q ss_pred CCEEEEEE
Q 001655 788 NQKIQLYK 795 (1036)
Q Consensus 788 g~~l~v~~ 795 (1036)
++-+..+.
T Consensus 299 snGlv~~r 306 (487)
T KOG0310|consen 299 SNGLVSIR 306 (487)
T ss_pred ccceeeee
Confidence 88776554
|
|
| >KOG1273 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=96.80 E-value=0.19 Score=52.63 Aligned_cols=206 Identities=16% Similarity=0.213 Sum_probs=104.9
Q ss_pred EEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEE-EECCCCCeEeEEEEEEEcCCCceE
Q 001655 653 HIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIST-YPLDTFEYGCSILSCSFSDDSNVY 731 (1036)
Q Consensus 653 ~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~-~~~~~~E~v~si~~~~l~~~~~~~ 731 (1036)
+.++|.+....-...|||...--+|++.-. +...+.-+++..-..+.. -+-+.++...+ .+-+...+|
T Consensus 99 ~l~rirf~spv~~~q~hp~k~n~~va~~~~-------~sp~vi~~s~~~h~~Lp~d~d~dln~sas~----~~fdr~g~y 167 (405)
T KOG1273|consen 99 PLKRIRFDSPVWGAQWHPRKRNKCVATIME-------ESPVVIDFSDPKHSVLPKDDDGDLNSSASH----GVFDRRGKY 167 (405)
T ss_pred ceeEEEccCccceeeeccccCCeEEEEEec-------CCcEEEEecCCceeeccCCCcccccccccc----ccccCCCCE
Confidence 456666666667777787666666666432 112333333211111110 01111222211 122334689
Q ss_pred EEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc----cC-eEEE-EECCEEEEEEeeecC----C
Q 001655 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF----NG-KLLA-AINQKIQLYKWMLRD----D 801 (1036)
Q Consensus 732 i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~----~g-~Lv~-~~g~~l~v~~~~~~~----~ 801 (1036)
|++|| .+|.+++|+.. .++.+++..+.. +++|.++ +| +|++ +....|+.|+..+-+ +
T Consensus 168 IitGt----------sKGkllv~~a~--t~e~vas~rits-~~~IK~I~~s~~g~~liiNtsDRvIR~ye~~di~~~~r~ 234 (405)
T KOG1273|consen 168 IITGT----------SKGKLLVYDAE--TLECVASFRITS-VQAIKQIIVSRKGRFLIINTSDRVIRTYEISDIDDEGRD 234 (405)
T ss_pred EEEec----------CcceEEEEecc--hheeeeeeeech-heeeeEEEEeccCcEEEEecCCceEEEEehhhhcccCcc
Confidence 99999 56999999864 456677666544 5555544 55 4444 444567888865211 1
Q ss_pred Cc----cch---hcccccccceEEEEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEEEEeeCceEEE
Q 001655 802 GT----REL---QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874 (1036)
Q Consensus 802 ~~----~~L---~~~~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~~~~l~ 874 (1036)
+. .|+ +.+..|. -.+.+-.|.||++|. .+.-.+|-|...-+.|+.+-.-...-....+.-=-...+++
T Consensus 235 ~e~e~~~K~qDvVNk~~Wk----~ccfs~dgeYv~a~s-~~aHaLYIWE~~~GsLVKILhG~kgE~l~DV~whp~rp~i~ 309 (405)
T KOG1273|consen 235 GEVEPEHKLQDVVNKLQWK----KCCFSGDGEYVCAGS-ARAHALYIWEKSIGSLVKILHGTKGEELLDVNWHPVRPIIA 309 (405)
T ss_pred CCcChhHHHHHHHhhhhhh----heeecCCccEEEecc-ccceeEEEEecCCcceeeeecCCchhheeecccccceeeee
Confidence 11 122 2222222 122334678998888 44456777877777776553222211111110000023666
Q ss_pred EccCCcEEEEeeC
Q 001655 875 AENNFNLFTVRKN 887 (1036)
Q Consensus 875 ~D~~gnl~~l~~~ 887 (1036)
+-..|++++....
T Consensus 310 si~sg~v~iw~~~ 322 (405)
T KOG1273|consen 310 SIASGVVYIWAVV 322 (405)
T ss_pred eccCCceEEEEee
Confidence 6688998887654
|
|
| >KOG2055 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=96.74 E-value=0.13 Score=56.55 Aligned_cols=127 Identities=13% Similarity=0.197 Sum_probs=66.7
Q ss_pred cCCccceeeeeCCCCCcEEEEEeC--CCEEEEEEcCCeEEE-EEEcCcEEEEEEec-cCCc-eeEEEEcCCCCCCCCccc
Q 001655 422 STSRELRNEWKSPPGYSVNVATAN--ASQVLLATGGGHLVY-LEIGDGILTEVKHA-QLEY-EISCLDINPIGENPSYSQ 496 (1036)
Q Consensus 422 ~~~~~~~~~w~~~~~~~I~~as~~--~~~vvv~~~~~~~~~-l~~~~~~l~~~~~~-~l~~-~is~l~~~~~~~~~~~~~ 496 (1036)
++...+++.... .+.+|..|.+. +...+++.+.....| +++...++..+.+. ..+. -+-.+.+++ .+.
T Consensus 244 Gk~N~~lqS~~l-~~fPi~~a~f~p~G~~~i~~s~rrky~ysyDle~ak~~k~~~~~g~e~~~~e~FeVSh------d~~ 316 (514)
T KOG2055|consen 244 GKVNPKLQSIHL-EKFPIQKAEFAPNGHSVIFTSGRRKYLYSYDLETAKVTKLKPPYGVEEKSMERFEVSH------DSN 316 (514)
T ss_pred CccChhheeeee-ccCccceeeecCCCceEEEecccceEEEEeeccccccccccCCCCcccchhheeEecC------CCC
Confidence 333334444433 23467777754 444555544444333 34433444444321 1111 111333333 145
Q ss_pred EEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEec
Q 001655 497 IAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNM 560 (1036)
Q Consensus 497 ~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~ 560 (1036)
++++.-- +|.|+++...+.+.+.+..+. ....++.+. .+...|++..++|.++++.+..
T Consensus 317 fia~~G~-~G~I~lLhakT~eli~s~Kie--G~v~~~~fs--Sdsk~l~~~~~~GeV~v~nl~~ 375 (514)
T KOG2055|consen 317 FIAIAGN-NGHIHLLHAKTKELITSFKIE--GVVSDFTFS--SDSKELLASGGTGEVYVWNLRQ 375 (514)
T ss_pred eEEEccc-CceEEeehhhhhhhhheeeec--cEEeeEEEe--cCCcEEEEEcCCceEEEEecCC
Confidence 6666555 788999887777766554443 233444443 2346678888899888887654
|
|
| >COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism] | Back alignment and domain information |
|---|
Probab=96.59 E-value=1.4 Score=47.46 Aligned_cols=120 Identities=18% Similarity=0.246 Sum_probs=68.8
Q ss_pred cceEEEEEEEeC--CeEEEEEE-----EEecCce--eEeccc-cCeEEEEEC---CEEEEEEeeecCCCccchhcccccc
Q 001655 747 TKGRILVFIVED--GKLQLIAE-----KETKGAV--YSLNAF-NGKLLAAIN---QKIQLYKWMLRDDGTRELQSECGHH 813 (1036)
Q Consensus 747 ~~Gri~v~~i~~--~kl~~~~~-----~~~~g~v--~~i~~~-~g~Lv~~~g---~~l~v~~~~~~~~~~~~L~~~~~~~ 813 (1036)
-.++|.+|++++ ++++.+-. .++.|.. -+|.-- +|+.+.+.+ +.|.+|.+++. .|..++.......
T Consensus 211 L~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~~f~V~~~-~g~L~~~~~~~te 289 (346)
T COG2706 211 LNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIAVFSVDPD-GGKLELVGITPTE 289 (346)
T ss_pred cCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCCCCeEEEEEEcCC-CCEEEEEEEeccC
Confidence 356888998885 56654432 2344433 233322 677666666 46788888532 2333332211111
Q ss_pred cc-eEEEEEEEeCCEEEE-EeccccEEEEEEeccCCeEEEEeccCCcceeEEEEEe
Q 001655 814 GH-ILALYVQTRGDFIVV-GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867 (1036)
Q Consensus 814 ~~-~~~~~l~~~~~~I~v-gD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l 867 (1036)
.. +--..+...++++++ +--.+.+.+++-+++.++|.++.+|...-.-+++.|+
T Consensus 290 g~~PR~F~i~~~g~~Liaa~q~sd~i~vf~~d~~TG~L~~~~~~~~~p~Pvcv~f~ 345 (346)
T COG2706 290 GQFPRDFNINPSGRFLIAANQKSDNITVFERDKETGRLTLLGRYAVVPEPVCVKFL 345 (346)
T ss_pred CcCCccceeCCCCCEEEEEccCCCcEEEEEEcCCCceEEecccccCCCCcEEEEEc
Confidence 11 111234455665554 5555668888888899999999998766555666553
|
|
| >KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only] | Back alignment and domain information |
|---|
Probab=96.58 E-value=0.42 Score=48.58 Aligned_cols=221 Identities=15% Similarity=0.143 Sum_probs=123.4
Q ss_pred EeCCeEEEEEEcCCCCeEEEEEeCC-CccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCC
Q 001655 636 AKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE 714 (1036)
Q Consensus 636 ~~~~~l~i~~l~~~~~~~~~~i~l~-~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E 714 (1036)
+..-.+++..+.+.+.-++.++... .....+.|+.+.+...-.. +-+.+|+.|-.+-.. .-+|+.+-
T Consensus 58 a~~qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~dgrWMyTgs----------eDgt~kIWdlR~~~~--qR~~~~~s 125 (311)
T KOG0315|consen 58 AGNQHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQCDGRWMYTGS----------EDGTVKIWDLRSLSC--QRNYQHNS 125 (311)
T ss_pred ccCCeeEEEEccCCCCCceeEEeccCCceEEEEEeecCeEEEecC----------CCceEEEEeccCccc--chhccCCC
Confidence 3455667777766655566655543 3444455554444333221 225788887765322 22344445
Q ss_pred eEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc-cCeEEEEECCEEEE
Q 001655 715 YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF-NGKLLAAINQKIQL 793 (1036)
Q Consensus 715 ~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~-~g~Lv~~~g~~l~v 793 (1036)
.++++.. . .++.-+++|+ -+|+|.+|++.++...-.---+..-++.++... +|..++|++++=..
T Consensus 126 pVn~vvl---h-pnQteLis~d----------qsg~irvWDl~~~~c~~~liPe~~~~i~sl~v~~dgsml~a~nnkG~c 191 (311)
T KOG0315|consen 126 PVNTVVL---H-PNQTELISGD----------QSGNIRVWDLGENSCTHELIPEDDTSIQSLTVMPDGSMLAAANNKGNC 191 (311)
T ss_pred CcceEEe---c-CCcceEEeec----------CCCcEEEEEccCCccccccCCCCCcceeeEEEcCCCcEEEEecCCccE
Confidence 5555432 2 2455677786 579999999987643321112334677777755 67777777765444
Q ss_pred EEeeec-CCCccchhcccccccc-eEEEE--EEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEEEEeeC
Q 001655 794 YKWMLR-DDGTRELQSECGHHGH-ILALY--VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869 (1036)
Q Consensus 794 ~~~~~~-~~~~~~L~~~~~~~~~-~~~~~--l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~ 869 (1036)
|-|+.. ..+..+|.++-.++.+ .++++ ++..+.++...-+-+-+.++ +.++--..+..-+-..||+-.|.|-.+
T Consensus 192 yvW~l~~~~~~s~l~P~~k~~ah~~~il~C~lSPd~k~lat~ssdktv~iw--n~~~~~kle~~l~gh~rWvWdc~FS~d 269 (311)
T KOG0315|consen 192 YVWRLLNHQTASELEPVHKFQAHNGHILRCLLSPDVKYLATCSSDKTVKIW--NTDDFFKLELVLTGHQRWVWDCAFSAD 269 (311)
T ss_pred EEEEccCCCccccceEhhheecccceEEEEEECCCCcEEEeecCCceEEEE--ecCCceeeEEEeecCCceEEeeeeccC
Confidence 444322 1223345554434433 24443 33345577777777777664 444442333444566799999999888
Q ss_pred ceEEEEc-cCCcEEEE
Q 001655 870 DIYLGAE-NNFNLFTV 884 (1036)
Q Consensus 870 ~~~l~~D-~~gnl~~l 884 (1036)
+.+|++- .++-....
T Consensus 270 g~YlvTassd~~~rlW 285 (311)
T KOG0315|consen 270 GEYLVTASSDHTARLW 285 (311)
T ss_pred ccEEEecCCCCceeec
Confidence 8776643 34443333
|
|
| >KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown] | Back alignment and domain information |
|---|
Probab=96.54 E-value=1.5 Score=47.12 Aligned_cols=312 Identities=13% Similarity=0.143 Sum_probs=146.6
Q ss_pred cEEEEEeCCCEEEEEEcC--CeEEEEEEcCcEEEEEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCC
Q 001655 438 SVNVATANASQVLLATGG--GHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515 (1036)
Q Consensus 438 ~I~~as~~~~~vvv~~~~--~~~~~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~ 515 (1036)
++-+.+.+++.=+++..+ ..-.+.++..+.+. .....-...|+|++++. .+.+++-|- .+|.+.|+...+
T Consensus 66 svFavsl~P~~~l~aTGGgDD~AflW~~~~ge~~-~eltgHKDSVt~~~Fsh------dgtlLATGd-msG~v~v~~~st 137 (399)
T KOG0296|consen 66 SVFAVSLHPNNNLVATGGGDDLAFLWDISTGEFA-GELTGHKDSVTCCSFSH------DGTLLATGD-MSGKVLVFKVST 137 (399)
T ss_pred ceEEEEeCCCCceEEecCCCceEEEEEccCCcce-eEecCCCCceEEEEEcc------CceEEEecC-CCccEEEEEccc
Confidence 455666665443444433 22333455544311 11123345789998853 255666554 489999999876
Q ss_pred CCeeEEecCCCccCceeEEeeeec-CeEEEEEEeCCCcEEEEEEecCCCccccceeeecC-ccceEEEEEEeCCceEEEE
Q 001655 516 LNLITKEHLGGEIIPRSVLLCAFE-GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG-TQPITLRTFSSKNTTHVFA 593 (1036)
Q Consensus 516 l~~v~~~~l~~~~~p~si~~~~~~-~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG-~~pv~l~~~~~~~~~~v~~ 593 (1036)
-...+...- ....|.-.... ....|+.|..||.+..|++.... . -.+.-| ..|+.--.|. ..+..++.
T Consensus 138 g~~~~~~~~----e~~dieWl~WHp~a~illAG~~DGsvWmw~ip~~~----~-~kv~~Gh~~~ct~G~f~-pdGKr~~t 207 (399)
T KOG0296|consen 138 GGEQWKLDQ----EVEDIEWLKWHPRAHILLAGSTDGSVWMWQIPSQA----L-CKVMSGHNSPCTCGEFI-PDGKRILT 207 (399)
T ss_pred CceEEEeec----ccCceEEEEecccccEEEeecCCCcEEEEECCCcc----e-eeEecCCCCCccccccc-CCCceEEE
Confidence 433322110 11233323333 24568999999999999987531 1 112223 2333322221 12233333
Q ss_pred EcCCceEEEec-CCcEEEeeeccc--------cceeeecccCCCCCCe-EEEE--eCCeEEEEEEcCCCCeEEEEEeCCC
Q 001655 594 ASDRPTVIYSS-NKKLLYSNVNLK--------EVSHMCPFNSAAFPDS-LAIA--KEGELTIGTIDDIQKLHIRSIPLGE 661 (1036)
Q Consensus 594 ~~~~p~~i~~~-~~~~~~~~~~~~--------~i~~~~~f~~~~~~~~-l~~~--~~~~l~i~~l~~~~~~~~~~i~l~~ 661 (1036)
..+.-.++.+. +..-.+..++.. .+......-.....++ +.++ ..+.+..|..++...+....=...+
T Consensus 208 gy~dgti~~Wn~ktg~p~~~~~~~e~~~~~~~~~~~~~~~~~~g~~e~~~~~~~~~sgKVv~~~n~~~~~l~~~~e~~~e 287 (399)
T KOG0296|consen 208 GYDDGTIIVWNPKTGQPLHKITQAEGLELPCISLNLAGSTLTKGNSEGVACGVNNGSGKVVNCNNGTVPELKPSQEELDE 287 (399)
T ss_pred EecCceEEEEecCCCceeEEecccccCcCCccccccccceeEeccCCccEEEEccccceEEEecCCCCccccccchhhhh
Confidence 33333333222 211111111100 0000000000011122 1122 2344444443322222111112222
Q ss_pred ccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCC
Q 001655 662 HPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLP 741 (1036)
Q Consensus 662 tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~ 741 (1036)
....+.......++|++... +.|-++|-.+++.-...+.+.. |+.+ .+. +..|++-++
T Consensus 288 sve~~~~ss~lpL~A~G~vd----------G~i~iyD~a~~~~R~~c~he~~--V~~l---~w~--~t~~l~t~c----- 345 (399)
T KOG0296|consen 288 SVESIPSSSKLPLAACGSVD----------GTIAIYDLAASTLRHICEHEDG--VTKL---KWL--NTDYLLTAC----- 345 (399)
T ss_pred hhhhcccccccchhhccccc----------ceEEEEecccchhheeccCCCc--eEEE---EEc--Ccchheeec-----
Confidence 33333334444455555443 5788899877765544444433 4433 222 367888776
Q ss_pred CCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc-cCeEEEEEC--CEEEEEE
Q 001655 742 EENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF-NGKLLAAIN--QKIQLYK 795 (1036)
Q Consensus 742 ~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~-~g~Lv~~~g--~~l~v~~ 795 (1036)
..|.++.|+..-++++..+ .-+..+|++++-- ++++++.++ ++..+|+
T Consensus 346 -----~~g~v~~wDaRtG~l~~~y-~GH~~~Il~f~ls~~~~~vvT~s~D~~a~VF~ 396 (399)
T KOG0296|consen 346 -----ANGKVRQWDARTGQLKFTY-TGHQMGILDFALSPQKRLVVTVSDDNTALVFE 396 (399)
T ss_pred -----cCceEEeeeccccceEEEE-ecCchheeEEEEcCCCcEEEEecCCCeEEEEe
Confidence 4689999999877776554 3455677766644 455555444 5566665
|
|
| >PLN00181 protein SPA1-RELATED; Provisional | Back alignment and domain information |
|---|
Probab=96.48 E-value=0.7 Score=58.62 Aligned_cols=200 Identities=14% Similarity=0.127 Sum_probs=114.1
Q ss_pred ccceEEEecC-CCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeC
Q 001655 662 HPRRICHQEQ-SRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVL 740 (1036)
Q Consensus 662 tp~~i~y~~~-~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~ 740 (1036)
....+.++|. .+.++.++.+ +.+++.|..+++.+..+.. ...+.|+ .+......++++|+
T Consensus 577 ~V~~l~~~p~~~~~L~Sgs~D----------g~v~iWd~~~~~~~~~~~~--~~~v~~v---~~~~~~g~~latgs---- 637 (793)
T PLN00181 577 RVWSIDYSSADPTLLASGSDD----------GSVKLWSINQGVSIGTIKT--KANICCV---QFPSESGRSLAFGS---- 637 (793)
T ss_pred CEEEEEEcCCCCCEEEEEcCC----------CEEEEEECCCCcEEEEEec--CCCeEEE---EEeCCCCCEEEEEe----
Confidence 4567888864 3444444332 5799999888887776653 2344443 34334467788886
Q ss_pred CCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccccCe-EEEE-ECCEEEEEEeeecCC--Cccchhcccccccce
Q 001655 741 PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGK-LLAA-INQKIQLYKWMLRDD--GTRELQSECGHHGHI 816 (1036)
Q Consensus 741 ~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~g~-Lv~~-~g~~l~v~~~~~~~~--~~~~L~~~~~~~~~~ 816 (1036)
..|.|++|++.+.+..+.......++|+++.-.++. |+.+ ...+|.+|++..... ....+.....+....
T Consensus 638 ------~dg~I~iwD~~~~~~~~~~~~~h~~~V~~v~f~~~~~lvs~s~D~~ikiWd~~~~~~~~~~~~l~~~~gh~~~i 711 (793)
T PLN00181 638 ------ADHKVYYYDLRNPKLPLCTMIGHSKTVSYVRFVDSSTLVSSSTDNTLKLWDLSMSISGINETPLHSFMGHTNVK 711 (793)
T ss_pred ------CCCeEEEEECCCCCccceEecCCCCCEEEEEEeCCCEEEEEECCCEEEEEeCCCCccccCCcceEEEcCCCCCe
Confidence 347999999876543222223566789888766654 4433 346788877632100 001111111122222
Q ss_pred EEEEEEEeCCEEEEEeccccEEEEEEeccCCeEEE----------EeccCCcceeEEEEEeeCc-eEEEEccCCcEEEEe
Q 001655 817 LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE----------RARDYNANWMSAVEILDDD-IYLGAENNFNLFTVR 885 (1036)
Q Consensus 817 ~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~----------~a~D~~~~~~~~~~~l~~~-~~l~~D~~gnl~~l~ 885 (1036)
..+.+...+++|++|..-..+.++........+.. ..-+....++.++++-.++ .+++++.+|+|.+++
T Consensus 712 ~~v~~s~~~~~lasgs~D~~v~iw~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~V~~v~ws~~~~~lva~~~dG~I~i~~ 791 (793)
T PLN00181 712 NFVGLSVSDGYIATGSETNEVFVYHKAFPMPVLSYKFKTIDPVSGLEVDDASQFISSVCWRGQSSTLVAANSTGNIKILE 791 (793)
T ss_pred eEEEEcCCCCEEEEEeCCCEEEEEECCCCCceEEEecccCCcccccccCCCCcEEEEEEEcCCCCeEEEecCCCcEEEEe
Confidence 34555556789999998888877543211111100 0012233456777665443 578888999999987
Q ss_pred e
Q 001655 886 K 886 (1036)
Q Consensus 886 ~ 886 (1036)
.
T Consensus 792 ~ 792 (793)
T PLN00181 792 M 792 (793)
T ss_pred c
Confidence 4
|
|
| >TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain | Back alignment and domain information |
|---|
Probab=96.43 E-value=2.1 Score=47.59 Aligned_cols=263 Identities=11% Similarity=0.074 Sum_probs=130.9
Q ss_pred CcEEEEecCCccceeeeeCCCCCcEEEEEeCCCEEEEEEc----------CCeEEEEEEcCcEEEEEEeccCCce-----
Q 001655 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATG----------GGHLVYLEIGDGILTEVKHAQLEYE----- 479 (1036)
Q Consensus 415 ~~i~l~~~~~~~~~~~w~~~~~~~I~~as~~~~~vvv~~~----------~~~~~~l~~~~~~l~~~~~~~l~~~----- 479 (1036)
+.|.++|.++.+.+..+.....-... .+-++..+.++.. ...+..++. ..+..+.+.+++..
T Consensus 27 ~~v~ViD~~~~~v~g~i~~G~~P~~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~--~t~~~~~~i~~p~~p~~~~ 103 (352)
T TIGR02658 27 TQVYTIDGEAGRVLGMTDGGFLPNPV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDP--QTHLPIADIELPEGPRFLV 103 (352)
T ss_pred ceEEEEECCCCEEEEEEEccCCCcee-ECCCCCEEEEEeccccccccCCCCCEEEEEEC--ccCcEEeEEccCCCchhhc
Confidence 56777777765555555421111121 4445566666665 234444443 33444444444222
Q ss_pred ---eEEEEcCCCCCCCCcccEEEEEEec-CCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEE
Q 001655 480 ---ISCLDINPIGENPSYSQIAAVGMWT-DISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLN 555 (1036)
Q Consensus 480 ---is~l~~~~~~~~~~~~~~~~vg~w~-~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~ 555 (1036)
...+++++ .+.++.|.-+. ++++.++++..-+.+..-..+. + ...+..-+....++ ..||.+..
T Consensus 104 ~~~~~~~~ls~------dgk~l~V~n~~p~~~V~VvD~~~~kvv~ei~vp~---~-~~vy~t~e~~~~~~--~~Dg~~~~ 171 (352)
T TIGR02658 104 GTYPWMTSLTP------DNKTLLFYQFSPSPAVGVVDLEGKAFVRMMDVPD---C-YHIFPTANDTFFMH--CRDGSLAK 171 (352)
T ss_pred cCccceEEECC------CCCEEEEecCCCCCEEEEEECCCCcEEEEEeCCC---C-cEEEEecCCccEEE--eecCceEE
Confidence 12444543 36788888876 7899999998877765544431 1 11222222333333 46999999
Q ss_pred EEEecCCCccccceeeec--C------ccceEEEEEEeCCceEEEEEcCCceEEEe-cCCcEEEeeeccccceeeecccC
Q 001655 556 FLLNMKTGELTDRKKVSL--G------TQPITLRTFSSKNTTHVFAASDRPTVIYS-SNKKLLYSNVNLKEVSHMCPFNS 626 (1036)
Q Consensus 556 y~~~~~~~~l~~~~~~~l--G------~~pv~l~~~~~~~~~~v~~~~~~p~~i~~-~~~~~~~~~~~~~~i~~~~~f~~ 626 (1036)
+.++.... .. .+...+ + .+| . +...++..+|++.+....+.. ......+. . ......+..
T Consensus 172 v~~d~~g~-~~-~~~~~vf~~~~~~v~~rP-~---~~~~dg~~~~vs~eG~V~~id~~~~~~~~~----~-~~~~~~~~~ 240 (352)
T TIGR02658 172 VGYGTKGN-PK-IKPTEVFHPEDEYLINHP-A---YSNKSGRLVWPTYTGKIFQIDLSSGDAKFL----P-AIEAFTEAE 240 (352)
T ss_pred EEecCCCc-eE-EeeeeeecCCccccccCC-c---eEcCCCcEEEEecCCeEEEEecCCCcceec----c-eeeeccccc
Confidence 88876533 11 112111 3 444 1 111123455555442221111 11111000 0 000000100
Q ss_pred C--C-CCCe---EEEE-eCCeE-----------------EEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeec
Q 001655 627 A--A-FPDS---LAIA-KEGEL-----------------TIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN 682 (1036)
Q Consensus 627 ~--~-~~~~---l~~~-~~~~l-----------------~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~ 682 (1036)
. . -|.+ +.+. +++.+ .+.-+|....-.+++|+++..|..|+++++.+..+..+.+.
T Consensus 241 ~~~~wrP~g~q~ia~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~~t~kvi~~i~vG~~~~~iavS~Dgkp~lyvtn~~ 320 (352)
T TIGR02658 241 KADGWRPGGWQQVAYHRARDRIYLLADQRAKWTHKTASRFLFVVDAKTGKRLRKIELGHEIDSINVSQDAKPLLYALSTG 320 (352)
T ss_pred cccccCCCcceeEEEcCCCCEEEEEecCCccccccCCCCEEEEEECCCCeEEEEEeCCCceeeEEECCCCCeEEEEeCCC
Confidence 0 0 0111 1111 12222 34445555556789999999999999999988444444321
Q ss_pred CCCCCceeeEEEEEEeCCCceEEEEE-ECC
Q 001655 683 QSCAEESEMHFVRLLDDQTFEFISTY-PLD 711 (1036)
Q Consensus 683 ~~~~~~~~~~~l~l~d~~t~~~i~~~-~~~ 711 (1036)
...|.++|..+.+.+.++ .+.
T Consensus 321 --------s~~VsViD~~t~k~i~~i~~vg 342 (352)
T TIGR02658 321 --------DKTLYIFDAETGKELSSVNQLG 342 (352)
T ss_pred --------CCcEEEEECcCCeEEeeeccCC
Confidence 246889999999888776 443
|
This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome. |
| >TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein | Back alignment and domain information |
|---|
Probab=96.34 E-value=2.1 Score=46.61 Aligned_cols=258 Identities=15% Similarity=0.156 Sum_probs=126.1
Q ss_pred ecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEE-EEEeCCCcEEEEEEecCCCccccceeeecCccceEEE
Q 001655 503 WTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYL-LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581 (1036)
Q Consensus 503 w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L-~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~ 581 (1036)
-.++.+.+|++.+.+.+...... ..|.++.+.. +...+ +++..+|.+..|.... +.+ .+....+..+..+
T Consensus 8 ~~d~~v~~~d~~t~~~~~~~~~~--~~~~~l~~~~--dg~~l~~~~~~~~~v~~~d~~~--~~~--~~~~~~~~~~~~~- 78 (300)
T TIGR03866 8 EKDNTISVIDTATLEVTRTFPVG--QRPRGITLSK--DGKLLYVCASDSDTIQVIDLAT--GEV--IGTLPSGPDPELF- 78 (300)
T ss_pred cCCCEEEEEECCCCceEEEEECC--CCCCceEECC--CCCEEEEEECCCCeEEEEECCC--CcE--EEeccCCCCccEE-
Confidence 33899999999876654433222 2355554432 23345 4566788888877543 221 1122223333222
Q ss_pred EEEeCCceEEEEEcC--CceEEEecC-CcE-EEeeeccccceeeecccCCCCCC--eEEEEeCCeEEEEEEcCCCCeEEE
Q 001655 582 TFSSKNTTHVFAASD--RPTVIYSSN-KKL-LYSNVNLKEVSHMCPFNSAAFPD--SLAIAKEGELTIGTIDDIQKLHIR 655 (1036)
Q Consensus 582 ~~~~~~~~~v~~~~~--~p~~i~~~~-~~~-~~~~~~~~~i~~~~~f~~~~~~~--~l~~~~~~~l~i~~l~~~~~~~~~ 655 (1036)
.+. .+.+.+++++. ....++.-. ++. ...+.. ....+ ..| .|+ .++....+.-.+..++.......+
T Consensus 79 ~~~-~~g~~l~~~~~~~~~l~~~d~~~~~~~~~~~~~-~~~~~-~~~----~~dg~~l~~~~~~~~~~~~~d~~~~~~~~ 151 (300)
T TIGR03866 79 ALH-PNGKILYIANEDDNLVTVIDIETRKVLAEIPVG-VEPEG-MAV----SPDGKIVVNTSETTNMAHFIDTKTYEIVD 151 (300)
T ss_pred EEC-CCCCEEEEEcCCCCeEEEEECCCCeEEeEeeCC-CCcce-EEE----CCCCCEEEEEecCCCeEEEEeCCCCeEEE
Confidence 222 23345555542 222222212 121 111100 01111 111 122 233333322122223333333445
Q ss_pred EEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCC-e--EeEEEEEEEcCCCceEE
Q 001655 656 SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE-Y--GCSILSCSFSDDSNVYY 732 (1036)
Q Consensus 656 ~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E-~--v~si~~~~l~~~~~~~i 732 (1036)
.++.+..|+.+.+.++.+.+++.+.. ...+.++|..+++.+..+.+.... . ......+.+..++ .++
T Consensus 152 ~~~~~~~~~~~~~s~dg~~l~~~~~~---------~~~v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~dg-~~~ 221 (300)
T TIGR03866 152 NVLVDQRPRFAEFTADGKELWVSSEI---------GGTVSVIDVATRKVIKKITFEIPGVHPEAVQPVGIKLTKDG-KTA 221 (300)
T ss_pred EEEcCCCccEEEECCCCCEEEEEcCC---------CCEEEEEEcCcceeeeeeeecccccccccCCccceEECCCC-CEE
Confidence 56677889999999988876655431 146899999988888776654211 0 0001122343333 344
Q ss_pred EEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc-cC-eEEEEE--CCEEEEEEee
Q 001655 733 CVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF-NG-KLLAAI--NQKIQLYKWM 797 (1036)
Q Consensus 733 ~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~-~g-~Lv~~~--g~~l~v~~~~ 797 (1036)
+++.+ ..+++.+|+...+++. ......+.+.+++-- +| +|+++. .++|++|+..
T Consensus 222 ~~~~~---------~~~~i~v~d~~~~~~~--~~~~~~~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~~ 279 (300)
T TIGR03866 222 FVALG---------PANRVAVVDAKTYEVL--DYLLVGQRVWQLAFTPDEKYLLTTNGVSNDVSVIDVA 279 (300)
T ss_pred EEEcC---------CCCeEEEEECCCCcEE--EEEEeCCCcceEEECCCCCEEEEEcCCCCeEEEEECC
Confidence 44432 2357889988655543 222345677777633 34 565554 4688888753
|
Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol. |
| >KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=96.28 E-value=0.62 Score=48.74 Aligned_cols=198 Identities=9% Similarity=0.009 Sum_probs=114.9
Q ss_pred CccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCC-ceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeee
Q 001655 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT-FEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYV 739 (1036)
Q Consensus 661 ~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t-~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~ 739 (1036)
.+...++|+|..+-++++.+ |-+.||+.+-+. ...+..-.-.-.--+++++ | .+++ ..+..|.
T Consensus 28 DsIS~l~FSP~~~~~~~A~S---------WD~tVR~wevq~~g~~~~ka~~~~~~PvL~v~-W--sddg-skVf~g~--- 91 (347)
T KOG0647|consen 28 DSISALAFSPQADNLLAAGS---------WDGTVRIWEVQNSGQLVPKAQQSHDGPVLDVC-W--SDDG-SKVFSGG--- 91 (347)
T ss_pred cchheeEeccccCceEEecc---------cCCceEEEEEecCCcccchhhhccCCCeEEEE-E--ccCC-ceEEeec---
Confidence 45567889985555555544 446787776532 3333221112222345543 3 3333 4455554
Q ss_pred CCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccccCeE--EEEEC---CEEEEEEeeecCCCccchhccccccc
Q 001655 740 LPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL--LAAIN---QKIQLYKWMLRDDGTRELQSECGHHG 814 (1036)
Q Consensus 740 ~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~g~L--v~~~g---~~l~v~~~~~~~~~~~~L~~~~~~~~ 814 (1036)
+.|.+.+|++..+....+. .+.+||.++.-+++++ +.+.| ++|+.++ .+. -.+......
T Consensus 92 -------~Dk~~k~wDL~S~Q~~~v~--~Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD--~R~-----~~pv~t~~L 155 (347)
T KOG0647|consen 92 -------CDKQAKLWDLASGQVSQVA--AHDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWD--TRS-----SNPVATLQL 155 (347)
T ss_pred -------cCCceEEEEccCCCeeeee--ecccceeEEEEecCCCcceeEecccccceeecc--cCC-----CCeeeeeec
Confidence 4567888998877665555 5689999999888765 55555 4565533 221 111222333
Q ss_pred ceEEEEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEE-EEEeeCceEEEEccCCcEEEEeeCCCCC
Q 001655 815 HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA-VEILDDDIYLGAENNFNLFTVRKNSEGA 891 (1036)
Q Consensus 815 ~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~-~~~l~~~~~l~~D~~gnl~~l~~~~~~~ 891 (1036)
+--...+++...+.+|+.+-+++.++.....+..+.... .+-.-.+.+ ++|-+.+.++.+--+|-+.+...++..+
T Consensus 156 PeRvYa~Dv~~pm~vVata~r~i~vynL~n~~te~k~~~-SpLk~Q~R~va~f~d~~~~alGsiEGrv~iq~id~~~~ 232 (347)
T KOG0647|consen 156 PERVYAADVLYPMAVVATAERHIAVYNLENPPTEFKRIE-SPLKWQTRCVACFQDKDGFALGSIEGRVAIQYIDDPNP 232 (347)
T ss_pred cceeeehhccCceeEEEecCCcEEEEEcCCCcchhhhhc-CcccceeeEEEEEecCCceEeeeecceEEEEecCCCCc
Confidence 333455777889999999999999887754333332221 121122223 2355666777777899999888876433
|
|
| >KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=96.27 E-value=1 Score=48.08 Aligned_cols=72 Identities=22% Similarity=0.256 Sum_probs=50.0
Q ss_pred ceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCc-cCceeEEeeeecCeEEEEEEeCCCcEEEE
Q 001655 478 YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE-IIPRSVLLCAFEGISYLLCALGDGHLLNF 556 (1036)
Q Consensus 478 ~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~-~~p~si~~~~~~~~~~L~vgl~~G~l~~y 556 (1036)
.-|-|+++.|. .++++-|.- |+.+.||++...++..+ |+.. ...+.+.+.. .-+|||.+..|+.+-+|
T Consensus 152 gWVr~vavdP~------n~wf~tgs~-DrtikIwDlatg~Lklt--ltGhi~~vr~vavS~--rHpYlFs~gedk~VKCw 220 (460)
T KOG0285|consen 152 GWVRSVAVDPG------NEWFATGSA-DRTIKIWDLATGQLKLT--LTGHIETVRGVAVSK--RHPYLFSAGEDKQVKCW 220 (460)
T ss_pred ceEEEEeeCCC------ceeEEecCC-CceeEEEEcccCeEEEe--ecchhheeeeeeecc--cCceEEEecCCCeeEEE
Confidence 45679998773 457776666 99999999987554332 2211 2345555543 35899999999999999
Q ss_pred EEec
Q 001655 557 LLNM 560 (1036)
Q Consensus 557 ~~~~ 560 (1036)
.+..
T Consensus 221 DLe~ 224 (460)
T KOG0285|consen 221 DLEY 224 (460)
T ss_pred echh
Confidence 8864
|
|
| >KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning] | Back alignment and domain information |
|---|
Probab=96.24 E-value=0.72 Score=48.42 Aligned_cols=249 Identities=11% Similarity=0.127 Sum_probs=136.4
Q ss_pred CccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeC
Q 001655 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVL 740 (1036)
Q Consensus 661 ~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~ 740 (1036)
....++.|+|.++.++|.+- -++++++|....+.. .+|+..-.++++ .|.+ ...+++|+
T Consensus 14 d~IS~v~f~~~~~~LLvssW----------DgslrlYdv~~~~l~--~~~~~~~plL~c---~F~d--~~~~~~G~---- 72 (323)
T KOG1036|consen 14 DGISSVKFSPSSSDLLVSSW----------DGSLRLYDVPANSLK--LKFKHGAPLLDC---AFAD--ESTIVTGG---- 72 (323)
T ss_pred hceeeEEEcCcCCcEEEEec----------cCcEEEEeccchhhh--hheecCCceeee---eccC--CceEEEec----
Confidence 45567888988887777764 478999987655322 234444444444 3433 45577776
Q ss_pred CCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc--cCeEEE-EECCEEEEEEeeecCCCccchhcccccccceE
Q 001655 741 PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF--NGKLLA-AINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817 (1036)
Q Consensus 741 ~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~--~g~Lv~-~~g~~l~v~~~~~~~~~~~~L~~~~~~~~~~~ 817 (1036)
..|.|..+++.-+.-..+. ...+++.+|+.. .|.++. +=+++|.+++ .+. +... ..++..--
T Consensus 73 ------~dg~vr~~Dln~~~~~~ig--th~~~i~ci~~~~~~~~vIsgsWD~~ik~wD--~R~---~~~~--~~~d~~kk 137 (323)
T KOG1036|consen 73 ------LDGQVRRYDLNTGNEDQIG--THDEGIRCIEYSYEVGCVISGSWDKTIKFWD--PRN---KVVV--GTFDQGKK 137 (323)
T ss_pred ------cCceEEEEEecCCcceeec--cCCCceEEEEeeccCCeEEEcccCccEEEEe--ccc---cccc--cccccCce
Confidence 4588888888744333333 457889999877 566654 3457888855 321 1111 11122224
Q ss_pred EEEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEecc-CCcceeEEEEEeeC-ceEEEEccCCcEEEEeeCCCCCCccc
Q 001655 818 ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD-YNANWMSAVEILDD-DIYLGAENNFNLFTVRKNSEGATDEE 895 (1036)
Q Consensus 818 ~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D-~~~~~~~~~~~l~~-~~~l~~D~~gnl~~l~~~~~~~~~~~ 895 (1036)
+-++++.+|+++||...+.+.++....-..-+ --|+ .....+.++.++.+ .-++++--+|-+.+=-.++....+..
T Consensus 138 Vy~~~v~g~~LvVg~~~r~v~iyDLRn~~~~~--q~reS~lkyqtR~v~~~pn~eGy~~sSieGRVavE~~d~s~~~~sk 215 (323)
T KOG1036|consen 138 VYCMDVSGNRLVVGTSDRKVLIYDLRNLDEPF--QRRESSLKYQTRCVALVPNGEGYVVSSIEGRVAVEYFDDSEEAQSK 215 (323)
T ss_pred EEEEeccCCEEEEeecCceEEEEEcccccchh--hhccccceeEEEEEEEecCCCceEEEeecceEEEEccCCchHHhhh
Confidence 56678899999999999998776543311111 1112 12233445555553 34777778998876544443211111
Q ss_pred ccceeE----EEEEEcCCccceEEeeeeeecCCCCCCCCcceEEEEccccceEEEEecChhHHHHHH
Q 001655 896 RGRLEV----VGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958 (1036)
Q Consensus 896 ~~~L~~----~~~~~lg~~v~~~~~~~l~~~~~~~~~~~~~~il~~t~~GsIg~l~~l~~~~~~~L~ 958 (1036)
+..+.+ .....+.=.||++.... ....+..|+.||-+-.--+.+.+.++-|.
T Consensus 216 kyaFkCHr~~~~~~~~~yPVNai~Fhp-----------~~~tfaTgGsDG~V~~Wd~~~rKrl~q~~ 271 (323)
T KOG1036|consen 216 KYAFKCHRLSEKDTEIIYPVNAIAFHP-----------IHGTFATGGSDGIVNIWDLFNRKRLKQLA 271 (323)
T ss_pred ceeEEeeecccCCceEEEEeceeEecc-----------ccceEEecCCCceEEEccCcchhhhhhcc
Confidence 111100 01112222344442211 12345566788888777777665544333
|
|
| >PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals | Back alignment and domain information |
|---|
Probab=96.19 E-value=0.4 Score=54.51 Aligned_cols=76 Identities=17% Similarity=0.200 Sum_probs=52.4
Q ss_pred ceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCC-------ccCceeEEee--eec----CeEEE
Q 001655 478 YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-------EIIPRSVLLC--AFE----GISYL 544 (1036)
Q Consensus 478 ~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~-------~~~p~si~~~--~~~----~~~~L 544 (1036)
.+|++++.+. --+++||+- +|++.|+++..-.+++...+.. ...+.++.+. ..+ ..+.|
T Consensus 87 g~vtal~~S~-------iGFvaigy~-~G~l~viD~RGPavI~~~~i~~~~~~~~~~~~vt~ieF~vm~~~~D~ySSi~L 158 (395)
T PF08596_consen 87 GPVTALKNSD-------IGFVAIGYE-SGSLVVIDLRGPAVIYNENIRESFLSKSSSSYVTSIEFSVMTLGGDGYSSICL 158 (395)
T ss_dssp -SEEEEEE-B-------TSEEEEEET-TSEEEEEETTTTEEEEEEEGGG--T-SS----EEEEEEEEEE-TTSSSEEEEE
T ss_pred CcEeEEecCC-------CcEEEEEec-CCcEEEEECCCCeEEeeccccccccccccccCeeEEEEEEEecCCCcccceEE
Confidence 5667776643 359999986 9999999998878887655532 1234455443 222 35889
Q ss_pred EEEeCCCcEEEEEEecC
Q 001655 545 LCALGDGHLLNFLLNMK 561 (1036)
Q Consensus 545 ~vgl~~G~l~~y~~~~~ 561 (1036)
+||+..|.++.|++.+.
T Consensus 159 ~vGTn~G~v~~fkIlp~ 175 (395)
T PF08596_consen 159 LVGTNSGNVLTFKILPS 175 (395)
T ss_dssp EEEETTSEEEEEEEEE-
T ss_pred EEEeCCCCEEEEEEecC
Confidence 99999999999999863
|
The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A. |
| >KOG1539 consensus WD repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=95.99 E-value=5.2 Score=47.85 Aligned_cols=394 Identities=13% Similarity=0.121 Sum_probs=199.3
Q ss_pred eEEEEecCcEEEEecCCccceeeeeCCCCCcEEEEEeCC--CEEEEEEcCCeEEEEEEcCcEEEEEEeccC-CceeEEEE
Q 001655 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANA--SQVLLATGGGHLVYLEIGDGILTEVKHAQL-EYEISCLD 484 (1036)
Q Consensus 408 ~~vQVt~~~i~l~~~~~~~~~~~w~~~~~~~I~~as~~~--~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l-~~~is~l~ 484 (1036)
.++=.+...+.+++..+++.+.+... .+..|+++.-.+ ++++|...+|+++.|.+..++... ++.. ...|+.++
T Consensus 175 IvvGs~~G~lql~Nvrt~K~v~~f~~-~~s~IT~ieqsPaLDVVaiG~~~G~ViifNlK~dkil~--sFk~d~g~VtslS 251 (910)
T KOG1539|consen 175 IVVGSSQGRLQLWNVRTGKVVYTFQE-FFSRITAIEQSPALDVVAIGLENGTVIIFNLKFDKILM--SFKQDWGRVTSLS 251 (910)
T ss_pred EEEeecCCcEEEEEeccCcEEEEecc-cccceeEeccCCcceEEEEeccCceEEEEEcccCcEEE--EEEccccceeEEE
Confidence 34445556666677666555554321 225788887654 578888889999999886543322 2333 48899999
Q ss_pred cCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEec-CCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCC
Q 001655 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH-LGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG 563 (1036)
Q Consensus 485 ~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~-l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~ 563 (1036)
+.. .....++.|.- .|.+.+|+|++-+++.... .....++ ... -+.+++.|+-...|..+..|-+|..+|
T Consensus 252 Frt-----DG~p~las~~~-~G~m~~wDLe~kkl~~v~~nah~~sv~-~~~--fl~~epVl~ta~~DnSlk~~vfD~~dg 322 (910)
T KOG1539|consen 252 FRT-----DGNPLLASGRS-NGDMAFWDLEKKKLINVTRNAHYGSVT-GAT--FLPGEPVLVTAGADNSLKVWVFDSGDG 322 (910)
T ss_pred ecc-----CCCeeEEeccC-CceEEEEEcCCCeeeeeeeccccCCcc-cce--ecCCCceEeeccCCCceeEEEeeCCCC
Confidence 843 13567777766 8999999998755543211 1111121 111 123567777778899999999996655
Q ss_pred ccccceeeecC-ccceEEEEEEeCCceEEEEE--c-CCceEEEec-----CCcEEE-------eeec--------cccce
Q 001655 564 ELTDRKKVSLG-TQPITLRTFSSKNTTHVFAA--S-DRPTVIYSS-----NKKLLY-------SNVN--------LKEVS 619 (1036)
Q Consensus 564 ~l~~~~~~~lG-~~pv~l~~~~~~~~~~v~~~--~-~~p~~i~~~-----~~~~~~-------~~~~--------~~~i~ 619 (1036)
..+..|.+ -| ..|....+|. +.++.|+. + |+.+..++- +..+.. -+++ .+++.
T Consensus 323 ~pR~LR~R-~GHs~Pp~~irfy--~~~g~~ilsa~~Drt~r~fs~~~e~~~~~l~~~~~~~~~kk~~~~~~~~~k~p~i~ 399 (910)
T KOG1539|consen 323 VPRLLRSR-GGHSAPPSCIRFY--GSQGHFILSAKQDRTLRSFSVISESQSQELGQLHNKKRAKKVNVFSTEKLKLPPIV 399 (910)
T ss_pred cchheeec-cCCCCCchheeee--ccCcEEEEecccCcchhhhhhhHHHHhHhhcccccccccccccccchhhhcCCcce
Confidence 43322222 23 3443333432 23344443 3 343443321 111100 0111 11222
Q ss_pred eeecccC-CCCCCeEEEEeCCeEEEEEEcCCCCeE------EEEE-eCCCccceEEEecCCCEEEEEEeecCCCCCceee
Q 001655 620 HMCPFNS-AAFPDSLAIAKEGELTIGTIDDIQKLH------IRSI-PLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM 691 (1036)
Q Consensus 620 ~~~~f~~-~~~~~~l~~~~~~~l~i~~l~~~~~~~------~~~i-~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~ 691 (1036)
.++.=+. +..=+.++.+..+.......+..++.. .++. +.+.+.+.++-++=.|..+++++ .
T Consensus 400 ~fa~~~~RE~~W~Nv~~~h~~~~~~~tW~~~n~~~G~~~L~~~~~~~~~~~~~av~vs~CGNF~~IG~S----------~ 469 (910)
T KOG1539|consen 400 EFAFENAREKEWDNVITAHKGKRSAYTWNFRNKTSGRHVLDPKRFKKDDINATAVCVSFCGNFVFIGYS----------K 469 (910)
T ss_pred eeecccchhhhhcceeEEecCcceEEEEeccCcccccEEecCccccccCcceEEEEEeccCceEEEecc----------C
Confidence 2211000 011123444444444433333222110 0000 11223334444433333333332 2
Q ss_pred EEEEEEeCCCceEEEEEE--CCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEe
Q 001655 692 HFVRLLDDQTFEFISTYP--LDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKET 769 (1036)
Q Consensus 692 ~~l~l~d~~t~~~i~~~~--~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~ 769 (1036)
+.|..++-+++-...+|- -...+.+..+.. ++.+-++|.++ ..|-+.+|+.+++. ++++...
T Consensus 470 G~Id~fNmQSGi~r~sf~~~~ah~~~V~gla~-----D~~n~~~vsa~---------~~Gilkfw~f~~k~--l~~~l~l 533 (910)
T KOG1539|consen 470 GTIDRFNMQSGIHRKSFGDSPAHKGEVTGLAV-----DGTNRLLVSAG---------ADGILKFWDFKKKV--LKKSLRL 533 (910)
T ss_pred CeEEEEEcccCeeecccccCccccCceeEEEe-----cCCCceEEEcc---------CcceEEEEecCCcc--eeeeecc
Confidence 567777777765555551 233444554432 34444555553 57999999886543 5666677
Q ss_pred cCceeEecccc--CeEEEEEC-CEEEEEEeeecCCCccchhccc-ccccceEEEEEEEeCCEEEEEeccccEEEEEEecc
Q 001655 770 KGAVYSLNAFN--GKLLAAIN-QKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845 (1036)
Q Consensus 770 ~g~v~~i~~~~--g~Lv~~~g-~~l~v~~~~~~~~~~~~L~~~~-~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~ 845 (1036)
..++.++.... +.++++.+ -.|.+++. ..++..+.- .+.......+.+..|..|+.+++-.++-+ ||--
T Consensus 534 ~~~~~~iv~hr~s~l~a~~~ddf~I~vvD~-----~t~kvvR~f~gh~nritd~~FS~DgrWlisasmD~tIr~--wDlp 606 (910)
T KOG1539|consen 534 GSSITGIVYHRVSDLLAIALDDFSIRVVDV-----VTRKVVREFWGHGNRITDMTFSPDGRWLISASMDSTIRT--WDLP 606 (910)
T ss_pred CCCcceeeeeehhhhhhhhcCceeEEEEEc-----hhhhhhHHhhccccceeeeEeCCCCcEEEEeecCCcEEE--Eecc
Confidence 77777766553 34444444 35566663 344555431 12223344555566778888877777654 4543
Q ss_pred CCeE
Q 001655 846 EGAI 849 (1036)
Q Consensus 846 ~~~l 849 (1036)
...+
T Consensus 607 t~~l 610 (910)
T KOG1539|consen 607 TGTL 610 (910)
T ss_pred Ccce
Confidence 3333
|
|
| >KOG1407 consensus WD40 repeat protein [Function unknown] | Back alignment and domain information |
|---|
Probab=95.64 E-value=3.2 Score=42.80 Aligned_cols=221 Identities=13% Similarity=0.158 Sum_probs=112.8
Q ss_pred CcEEEEecCCccceeeeeCCCCCcE-EEEEeCCCEEEEEEcCCeEEEEEEcCcEEEEEEeccCCceeEEEEcCCCCCCCC
Q 001655 415 GSVRLVSSTSRELRNEWKSPPGYSV-NVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPS 493 (1036)
Q Consensus 415 ~~i~l~~~~~~~~~~~w~~~~~~~I-~~as~~~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~ 493 (1036)
+.||+.+...++..++.... +..| ...+-++.++++...+..+.++...+ .......++..++..++..
T Consensus 87 k~ir~wd~r~~k~~~~i~~~-~eni~i~wsp~g~~~~~~~kdD~it~id~r~--~~~~~~~~~~~e~ne~~w~------- 156 (313)
T KOG1407|consen 87 KTIRIWDIRSGKCTARIETK-GENINITWSPDGEYIAVGNKDDRITFIDART--YKIVNEEQFKFEVNEISWN------- 156 (313)
T ss_pred ceEEEEEeccCcEEEEeecc-CcceEEEEcCCCCEEEEecCcccEEEEEecc--cceeehhcccceeeeeeec-------
Confidence 45555554443444433321 2222 22223455666665666777765533 3333444555666555543
Q ss_pred cccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCccccceeeec
Q 001655 494 YSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573 (1036)
Q Consensus 494 ~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~l 573 (1036)
.++.+++.+...|.++|++.|.|+++..-.- .+...+++.--...-|+-+|..|-.+--+.+++ |-..|...=
T Consensus 157 ~~nd~Fflt~GlG~v~ILsypsLkpv~si~A---H~snCicI~f~p~GryfA~GsADAlvSLWD~~E----LiC~R~isR 229 (313)
T KOG1407|consen 157 NSNDLFFLTNGLGCVEILSYPSLKPVQSIKA---HPSNCICIEFDPDGRYFATGSADALVSLWDVDE----LICERCISR 229 (313)
T ss_pred CCCCEEEEecCCceEEEEecccccccccccc---CCcceEEEEECCCCceEeeccccceeeccChhH----hhhheeecc
Confidence 1445666667579999999999988643111 112233322112456788888876555444432 222222211
Q ss_pred CccceEEEEEEeCCceEEEEEcCCceEEEecCCcEEEeeeccccceeeecccCCCCCCeEEEEe-CCeEEEEEEcCCCCe
Q 001655 574 GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK-EGELTIGTIDDIQKL 652 (1036)
Q Consensus 574 G~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~~~~~~~~~~~~~i~~~~~f~~~~~~~~l~~~~-~~~l~i~~l~~~~~~ 652 (1036)
=..||+-..|.. +|++. +.++ +--+-|+.+++..+
T Consensus 230 ldwpVRTlSFS~-------------------dg~~l------------------------ASaSEDh~IDIA~vetGd~- 265 (313)
T KOG1407|consen 230 LDWPVRTLSFSH-------------------DGRML------------------------ASASEDHFIDIAEVETGDR- 265 (313)
T ss_pred ccCceEEEEecc-------------------Cccee------------------------eccCccceEEeEecccCCe-
Confidence 123444333321 22221 1111 11233444444333
Q ss_pred EEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEe
Q 001655 653 HIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLD 698 (1036)
Q Consensus 653 ~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d 698 (1036)
+..||-...--.+++||...+++.+|.+... +...+.+.++++-
T Consensus 266 -~~eI~~~~~t~tVAWHPk~~LLAyA~ddk~~-d~~reag~vKiFG 309 (313)
T KOG1407|consen 266 -VWEIPCEGPTFTVAWHPKRPLLAYACDDKDG-DSNREAGTVKIFG 309 (313)
T ss_pred -EEEeeccCCceeEEecCCCceeeEEecCCCC-ccccccceeEEec
Confidence 3566776666789999999999999986521 1223456677663
|
|
| >KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown] | Back alignment and domain information |
|---|
Probab=95.63 E-value=1.1 Score=48.26 Aligned_cols=154 Identities=17% Similarity=0.231 Sum_probs=100.4
Q ss_pred ceeeeecCCCeEEEEecCcEEEEecCCccceeeee--CCCCCcEEEEEeCCCEEEEEE----cCCeEEEEEEcCcEEEEE
Q 001655 398 TLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWK--SPPGYSVNVATANASQVLLAT----GGGHLVYLEIGDGILTEV 471 (1036)
Q Consensus 398 Tl~~~~~~~~~~vQVt~~~i~l~~~~~~~~~~~w~--~~~~~~I~~as~~~~~vvv~~----~~~~~~~l~~~~~~l~~~ 471 (1036)
.|.+-.|..+++|=+-+..|.+++.+.-+.+..+. ||......+-+.+.....++. ..|.++++...+ +..+
T Consensus 89 ~IL~VrmNr~RLvV~Lee~IyIydI~~MklLhTI~t~~~n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~n--l~~v 166 (391)
T KOG2110|consen 89 SILAVRMNRKRLVVCLEESIYIYDIKDMKLLHTIETTPPNPKGLCALSPNNANCYLAYPGSTTSGDVVLFDTIN--LQPV 166 (391)
T ss_pred ceEEEEEccceEEEEEcccEEEEecccceeehhhhccCCCccceEeeccCCCCceEEecCCCCCceEEEEEccc--ceee
Confidence 46677787788888889999999987644444332 345455666666654334443 347787777654 2222
Q ss_pred Eec-cCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeec-CeEEEEEEeC
Q 001655 472 KHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE-GISYLLCALG 549 (1036)
Q Consensus 472 ~~~-~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~-~~~~L~vgl~ 549 (1036)
... .-+..+.|+++++ .+..++-+..+.--|++|++|+.+.+.+..-+ ..|.+|.-..|. +..+|-+...
T Consensus 167 ~~I~aH~~~lAalafs~------~G~llATASeKGTVIRVf~v~~G~kl~eFRRG--~~~~~IySL~Fs~ds~~L~~sS~ 238 (391)
T KOG2110|consen 167 NTINAHKGPLAALAFSP------DGTLLATASEKGTVIRVFSVPEGQKLYEFRRG--TYPVSIYSLSFSPDSQFLAASSN 238 (391)
T ss_pred eEEEecCCceeEEEECC------CCCEEEEeccCceEEEEEEcCCccEeeeeeCC--ceeeEEEEEEECCCCCeEEEecC
Confidence 211 3357789999975 35677776664446899999998877654443 235444322332 5678888999
Q ss_pred CCcEEEEEEecC
Q 001655 550 DGHLLNFLLNMK 561 (1036)
Q Consensus 550 ~G~l~~y~~~~~ 561 (1036)
.+.+-.|+++..
T Consensus 239 TeTVHiFKL~~~ 250 (391)
T KOG2110|consen 239 TETVHIFKLEKV 250 (391)
T ss_pred CCeEEEEEeccc
Confidence 999999998754
|
|
| >KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription] | Back alignment and domain information |
|---|
Probab=95.37 E-value=0.92 Score=53.46 Aligned_cols=141 Identities=21% Similarity=0.258 Sum_probs=86.8
Q ss_pred CCeEEEEeCCeEEEEEEcCCCCeEEEEEeCCC--ccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEE
Q 001655 630 PDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE--HPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIST 707 (1036)
Q Consensus 630 ~~~l~~~~~~~l~i~~l~~~~~~~~~~i~l~~--tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~ 707 (1036)
|.++++++.+.=+.+++........-+|-.|. -...+.|||..+..+-++++ .++|+.|..++..+--
T Consensus 503 P~GyYFatas~D~tArLWs~d~~~PlRifaghlsDV~cv~FHPNs~Y~aTGSsD----------~tVRlWDv~~G~~VRi 572 (707)
T KOG0263|consen 503 PRGYYFATASHDQTARLWSTDHNKPLRIFAGHLSDVDCVSFHPNSNYVATGSSD----------RTVRLWDVSTGNSVRI 572 (707)
T ss_pred CCceEEEecCCCceeeeeecccCCchhhhcccccccceEEECCcccccccCCCC----------ceEEEEEcCCCcEEEE
Confidence 44667766544445566554444444555553 56778999998877766554 3788888877766544
Q ss_pred EECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc-cCeEEEE
Q 001655 708 YPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF-NGKLLAA 786 (1036)
Q Consensus 708 ~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~-~g~Lv~~ 786 (1036)
| .++. -.+..+.+.- ...|++-|. .-|+|.+|++.++++-.... .+.|.|+++.-- +|.++++
T Consensus 573 F---~GH~-~~V~al~~Sp-~Gr~LaSg~----------ed~~I~iWDl~~~~~v~~l~-~Ht~ti~SlsFS~dg~vLas 636 (707)
T KOG0263|consen 573 F---TGHK-GPVTALAFSP-CGRYLASGD----------EDGLIKIWDLANGSLVKQLK-GHTGTIYSLSFSRDGNVLAS 636 (707)
T ss_pred e---cCCC-CceEEEEEcC-CCceEeecc----------cCCcEEEEEcCCCcchhhhh-cccCceeEEEEecCCCEEEe
Confidence 4 2332 1222333332 356777664 45899999999876543322 228999998843 5655544
Q ss_pred --ECCEEEEEEe
Q 001655 787 --INQKIQLYKW 796 (1036)
Q Consensus 787 --~g~~l~v~~~ 796 (1036)
.+++|.+|++
T Consensus 637 gg~DnsV~lWD~ 648 (707)
T KOG0263|consen 637 GGADNSVRLWDL 648 (707)
T ss_pred cCCCCeEEEEEc
Confidence 4478888765
|
|
| >TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein | Back alignment and domain information |
|---|
Probab=95.35 E-value=5.3 Score=43.41 Aligned_cols=237 Identities=11% Similarity=0.218 Sum_probs=112.4
Q ss_pred CCEEEEEE-cCCeEEEEEEcCcEEEEEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecC
Q 001655 446 ASQVLLAT-GGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524 (1036)
Q Consensus 446 ~~~vvv~~-~~~~~~~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l 524 (1036)
+..++++. .++.+..++...+++.. .......+.++++.+ ....++++...++.+.+|++.+.+.+.....
T Consensus 42 g~~l~~~~~~~~~v~~~d~~~~~~~~--~~~~~~~~~~~~~~~------~g~~l~~~~~~~~~l~~~d~~~~~~~~~~~~ 113 (300)
T TIGR03866 42 GKLLYVCASDSDTIQVIDLATGEVIG--TLPSGPDPELFALHP------NGKILYIANEDDNLVTVIDIETRKVLAEIPV 113 (300)
T ss_pred CCEEEEEECCCCeEEEEECCCCcEEE--eccCCCCccEEEECC------CCCEEEEEcCCCCeEEEEECCCCeEEeEeeC
Confidence 33444433 34566666655444322 122233445566643 2456666666588999999987655443222
Q ss_pred CCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCccccceeeecCccceEEEEEEeCCceEEEEEcCC-ceE-EE
Q 001655 525 GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDR-PTV-IY 602 (1036)
Q Consensus 525 ~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~-p~~-i~ 602 (1036)
. ..|.++.+.+ +...++++..++..+.+ ++..++.+. .....+..|..+ .+. .+...+++.++. ..+ ++
T Consensus 114 ~--~~~~~~~~~~--dg~~l~~~~~~~~~~~~-~d~~~~~~~--~~~~~~~~~~~~-~~s-~dg~~l~~~~~~~~~v~i~ 184 (300)
T TIGR03866 114 G--VEPEGMAVSP--DGKIVVNTSETTNMAHF-IDTKTYEIV--DNVLVDQRPRFA-EFT-ADGKELWVSSEIGGTVSVI 184 (300)
T ss_pred C--CCcceEEECC--CCCEEEEEecCCCeEEE-EeCCCCeEE--EEEEcCCCccEE-EEC-CCCCEEEEEcCCCCEEEEE
Confidence 2 3455665443 34566677766654322 333322211 112233344322 221 223345454432 222 22
Q ss_pred ec-CCcEEEeeec--cccc--eeeecccCCCCCC--eEEEE--eCCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCC
Q 001655 603 SS-NKKLLYSNVN--LKEV--SHMCPFNSAAFPD--SLAIA--KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSR 673 (1036)
Q Consensus 603 ~~-~~~~~~~~~~--~~~i--~~~~~f~~~~~~~--~l~~~--~~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~ 673 (1036)
.. .++.. ..+. .... ..+.+......++ .++.. .++.+.+-.+.+. + ..+.+..+..|..+.++|+.+
T Consensus 185 d~~~~~~~-~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~-~-~~~~~~~~~~~~~~~~~~~g~ 261 (300)
T TIGR03866 185 DVATRKVI-KKITFEIPGVHPEAVQPVGIKLTKDGKTAFVALGPANRVAVVDAKTY-E-VLDYLLVGQRVWQLAFTPDEK 261 (300)
T ss_pred EcCcceee-eeeeecccccccccCCccceEECCCCCEEEEEcCCCCeEEEEECCCC-c-EEEEEEeCCCcceEEECCCCC
Confidence 11 22211 1110 0000 0111111001122 22222 2455665554322 1 234456677889999999888
Q ss_pred EEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECC
Q 001655 674 TFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLD 711 (1036)
Q Consensus 674 ~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~ 711 (1036)
.+++.... ...|.++|..+++.+.++++.
T Consensus 262 ~l~~~~~~---------~~~i~v~d~~~~~~~~~~~~~ 290 (300)
T TIGR03866 262 YLLTTNGV---------SNDVSVIDVAALKVIKSIKVG 290 (300)
T ss_pred EEEEEcCC---------CCeEEEEECCCCcEEEEEEcc
Confidence 66554321 247999999999988887763
|
Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol. |
| >KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown] | Back alignment and domain information |
|---|
Probab=95.29 E-value=4.9 Score=42.70 Aligned_cols=257 Identities=14% Similarity=0.187 Sum_probs=143.6
Q ss_pred cceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCC
Q 001655 663 PRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE 742 (1036)
Q Consensus 663 p~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~ 742 (1036)
+..+.+..+..+|+|++.. ..++++-+.++...+-++... ...+..+-+.. +-+-+||.+..
T Consensus 8 ~lsvs~NQD~ScFava~~~-----------Gfriyn~~P~ke~~~r~~~~~--G~~~veMLfR~--N~laLVGGg~~--- 69 (346)
T KOG2111|consen 8 TLSVSFNQDHSCFAVATDT-----------GFRIYNCDPFKESASRQFIDG--GFKIVEMLFRS--NYLALVGGGSR--- 69 (346)
T ss_pred eeEEEEccCCceEEEEecC-----------ceEEEecCchhhhhhhccccC--chhhhhHhhhh--ceEEEecCCCC---
Confidence 3347788888899998753 345554333433332222221 12333332321 23445674331
Q ss_pred CCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccccCeEEEEECCEEEEEEeeec----------C--C---------
Q 001655 743 ENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR----------D--D--------- 801 (1036)
Q Consensus 743 e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~g~Lv~~~g~~l~v~~~~~~----------~--~--------- 801 (1036)
..-..-.+++|+ +.|-+.+.+.++.++|.+|+--.++|++..-++|++|.+... . .
T Consensus 70 -pky~pNkviIWD--D~k~~~i~el~f~~~I~~V~l~r~riVvvl~~~I~VytF~~n~k~l~~~et~~NPkGlC~~~~~~ 146 (346)
T KOG2111|consen 70 -PKYPPNKVIIWD--DLKERCIIELSFNSEIKAVKLRRDRIVVVLENKIYVYTFPDNPKLLHVIETRSNPKGLCSLCPTS 146 (346)
T ss_pred -CCCCCceEEEEe--cccCcEEEEEEeccceeeEEEcCCeEEEEecCeEEEEEcCCChhheeeeecccCCCceEeecCCC
Confidence 122445788887 556667888899999999998899999999999999997410 0 0
Q ss_pred ------------Cccchhcccc-----------cccceEEEEEEEeCCEEEEEeccccEEEEEEeccC-CeEEEEeccCC
Q 001655 802 ------------GTRELQSECG-----------HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE-GAIEERARDYN 857 (1036)
Q Consensus 802 ------------~~~~L~~~~~-----------~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~-~~l~~~a~D~~ 857 (1036)
|+.++.-.+. ++..+..+.|.-.|. +++.-..+|.-+=-|+.++ ..+.++-|-..
T Consensus 147 ~k~~LafPg~k~GqvQi~dL~~~~~~~p~~I~AH~s~Iacv~Ln~~Gt-~vATaStkGTLIRIFdt~~g~~l~E~RRG~d 225 (346)
T KOG2111|consen 147 NKSLLAFPGFKTGQVQIVDLASTKPNAPSIINAHDSDIACVALNLQGT-LVATASTKGTLIRIFDTEDGTLLQELRRGVD 225 (346)
T ss_pred CceEEEcCCCccceEEEEEhhhcCcCCceEEEcccCceeEEEEcCCcc-EEEEeccCcEEEEEEEcCCCcEeeeeecCCc
Confidence 1111111000 011111223333444 4455556666555577654 56778888888
Q ss_pred cceeEEEEEeeCceE-EEEccCCcEEEEeeCCCCCCcccccce--------------eEEEEEEcCCccceEEeeeeeec
Q 001655 858 ANWMSAVEILDDDIY-LGAENNFNLFTVRKNSEGATDEERGRL--------------EVVGEYHLGEFVNRFRHGSLVMR 922 (1036)
Q Consensus 858 ~~~~~~~~~l~~~~~-l~~D~~gnl~~l~~~~~~~~~~~~~~L--------------~~~~~~~lg~~v~~~~~~~l~~~ 922 (1036)
+-.+.++.|=.+..+ .++-..|.+++|..-....+..+...+ -..+.|++.+..-.+. .
T Consensus 226 ~A~iy~iaFSp~~s~LavsSdKgTlHiF~l~~~~~~~~~~SSl~~~~~~lpky~~S~wS~~~f~l~~~~~~~~------~ 299 (346)
T KOG2111|consen 226 RADIYCIAFSPNSSWLAVSSDKGTLHIFSLRDTENTEDESSSLSFKRLVLPKYFSSEWSFAKFQLPQGTQCII------A 299 (346)
T ss_pred hheEEEEEeCCCccEEEEEcCCCeEEEEEeecCCCCccccccccccccccchhcccceeEEEEEccCCCcEEE------E
Confidence 878888877666554 456678999999876533221100011 1234455543321111 1
Q ss_pred CCCCCCCCcceEEEEccccceEEEEecCh
Q 001655 923 LPDSDVGQIPTVIFGTVNGVIGVIASLPH 951 (1036)
Q Consensus 923 ~~~~~~~~~~~il~~t~~GsIg~l~~l~~ 951 (1036)
| ......++..+.||+-+.+.--.+
T Consensus 300 f----g~~~nsvi~i~~Dgsy~k~~f~~~ 324 (346)
T KOG2111|consen 300 F----GSETNTVIAICADGSYYKFKFDPK 324 (346)
T ss_pred e----cCCCCeEEEEEeCCcEEEEEeccc
Confidence 1 112267788889999988776654
|
|
| >KOG2048 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=95.26 E-value=8.2 Score=45.10 Aligned_cols=493 Identities=14% Similarity=0.088 Sum_probs=225.2
Q ss_pred cceeeEEEEEecCCCEEEEEcCccCEEEEEEEecCcEEeeEEEEEcCccccceEEEEecCCeEEEEEeecCeeEEEEeeC
Q 001655 200 PSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQ 279 (1036)
Q Consensus 200 ~~~~~~~~~~~~~~~~~ll~~~~G~l~~l~l~~~~~~v~~l~i~~~~~~~~~s~l~~l~~~~lF~gS~~gds~l~~~~~~ 279 (1036)
+..++|-+... +.+++.++..+|.+-.=.+..+ --.. .+.+.++...-.+|+...++-||-. ..++.+..++.-
T Consensus 25 Ps~I~slA~s~-kS~~lAvsRt~g~IEiwN~~~~--w~~~-~vi~g~~drsIE~L~W~e~~RLFS~--g~sg~i~EwDl~ 98 (691)
T KOG2048|consen 25 PSEIVSLAYSH-KSNQLAVSRTDGNIEIWNLSNN--WFLE-PVIHGPEDRSIESLAWAEGGRLFSS--GLSGSITEWDLH 98 (691)
T ss_pred ccceEEEEEec-cCCceeeeccCCcEEEEccCCC--ceee-EEEecCCCCceeeEEEccCCeEEee--cCCceEEEEecc
Confidence 34455655444 3457888999998755544322 1111 1222344455567777788889955 455667777643
Q ss_pred CCCCCCcEEEeeec-ccCCceeeeEEeccCCCCCceEEEEeeCCCCccEEEEecCCceeEEEe-eeCCCc-ceEEEeecC
Q 001655 280 PDAKGSYVEVLERY-VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQAS-VELQGI-KGMWSLRSS 356 (1036)
Q Consensus 280 ~~~~~~~~~~~~~l-~n~~Pi~D~~~~~~~~~~~~~lv~~sG~g~~g~l~~lr~gi~~~~~~~-~~l~~~-~~iw~l~~~ 356 (1036)
. ++.+-.+ +|-|+|..|++-+.. +.+.+.| .+|.|..+. +++.++.. .-|+-. .++.++...
T Consensus 99 ~------lk~~~~~d~~gg~IWsiai~p~~----~~l~Igc---ddGvl~~~s--~~p~~I~~~r~l~rq~sRvLslsw~ 163 (691)
T KOG2048|consen 99 T------LKQKYNIDSNGGAIWSIAINPEN----TILAIGC---DDGVLYDFS--IGPDKITYKRSLMRQKSRVLSLSWN 163 (691)
T ss_pred c------CceeEEecCCCcceeEEEeCCcc----ceEEeec---CCceEEEEe--cCCceEEEEeecccccceEEEEEec
Confidence 2 2222223 577899999987642 3343332 344443333 33433332 112211 133333221
Q ss_pred CCCCcccEEEEEecCc-eEEEEecCCCCeeeeeecCeeccccceeeeecCCCeEEEEecCcEEEEecCCccceeeeeCCC
Q 001655 357 TDDPFDTFLVVSFISE-TRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435 (1036)
Q Consensus 357 ~~~~~~~~lv~S~~~~-T~vl~~~~~~~~~e~~~~~f~~~~~Tl~~~~~~~~~~vQVt~~~i~l~~~~~~~~~~~w~~~~ 435 (1036)
. .+..|+....+. -+++.... ..|++. +|-+--++ ..+....-|.
T Consensus 164 ~---~~~~i~~Gs~Dg~Iriwd~~~---------------~~t~~~----------~~~~~d~l---~k~~~~iVWS--- 209 (691)
T KOG2048|consen 164 P---TGTKIAGGSIDGVIRIWDVKS---------------GQTLHI----------ITMQLDRL---SKREPTIVWS--- 209 (691)
T ss_pred C---CccEEEecccCceEEEEEcCC---------------CceEEE----------eeeccccc---ccCCceEEEE---
Confidence 0 011111111111 11111100 011110 01000000 0011122231
Q ss_pred CCcEEEEEeCCCEEEEEEcCCeEEEEEEcCcEEEEEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCC
Q 001655 436 GYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515 (1036)
Q Consensus 436 ~~~I~~as~~~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~ 515 (1036)
.....++.++-..+.|.+.+-+-..+++.+.. .-....|-||+..+ .++.++.+.- |+.+.-|++..
T Consensus 210 -----v~~Lrd~tI~sgDS~G~V~FWd~~~gTLiqS~-~~h~adVl~Lav~~------~~d~vfsaGv-d~~ii~~~~~~ 276 (691)
T KOG2048|consen 210 -----VLFLRDSTIASGDSAGTVTFWDSIFGTLIQSH-SCHDADVLALAVAD------NEDRVFSAGV-DPKIIQYSLTT 276 (691)
T ss_pred -----EEEeecCcEEEecCCceEEEEcccCcchhhhh-hhhhcceeEEEEcC------CCCeEEEccC-CCceEEEEecC
Confidence 12234556665666677766554445544432 23467788888753 3567777766 78777777654
Q ss_pred CCeeEEecCCCc---cCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCC--cc----ccceeeecCccceEEEEEEeC
Q 001655 516 LNLITKEHLGGE---IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG--EL----TDRKKVSLGTQPITLRTFSSK 586 (1036)
Q Consensus 516 l~~v~~~~l~~~---~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~--~l----~~~~~~~lG~~pv~l~~~~~~ 586 (1036)
-.--+......+ .-.+++++.. ..|+-|-+|+.|+......... .. ...+.+..+...-+|.-+.
T Consensus 277 ~~~~wv~~~~r~~h~hdvrs~av~~----~~l~sgG~d~~l~i~~s~~~~~~~h~~~~~~p~~~~v~~a~~~~L~~~w-- 350 (691)
T KOG2048|consen 277 NKSEWVINSRRDLHAHDVRSMAVIE----NALISGGRDFTLAICSSREFKNMDHRQKNLFPASDRVSVAPENRLLVLW-- 350 (691)
T ss_pred CccceeeeccccCCcccceeeeeec----ceEEecceeeEEEEccccccCchhhhccccccccceeecCccceEEEEe--
Confidence 221121111112 2334554442 3677888888887765443110 00 0000010110000111110
Q ss_pred CceEEEEEcCCceEEEec--CC-----cE-EEeeeccccceeeecccCCCCCCeEEEEeCCeEEEEEEcCCCCeEEEEE-
Q 001655 587 NTTHVFAASDRPTVIYSS--NK-----KL-LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSI- 657 (1036)
Q Consensus 587 ~~~~v~~~~~~p~~i~~~--~~-----~~-~~~~~~~~~i~~~~~f~~~~~~~~l~~~~~~~l~i~~l~~~~~~~~~~i- 657 (1036)
+..+-.+|.+-+. .| ++ .+.--....|.+.+ ...+++-+++.+-..++|.+|........+++
T Consensus 351 -----~~h~v~lwrlGS~~~~g~~~~~~Llkl~~k~~~nIs~~a---iSPdg~~Ia~st~~~~~iy~L~~~~~vk~~~v~ 422 (691)
T KOG2048|consen 351 -----KAHGVDLWRLGSVILQGEYNYIHLLKLFTKEKENISCAA---ISPDGNLIAISTVSRTKIYRLQPDPNVKVINVD 422 (691)
T ss_pred -----ccccccceeccCcccccccChhhheeeecCCccceeeec---cCCCCCEEEEeeccceEEEEeccCcceeEEEec
Confidence 0112233333221 11 11 11100011222211 12234457777889999999988663333322
Q ss_pred --eCC-CccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCC--CceEEEEEECCCCCeEeEEEEEEEcCCCceEE
Q 001655 658 --PLG-EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQ--TFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732 (1036)
Q Consensus 658 --~l~-~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~--t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i 732 (1036)
|+. .+..++.+..+.+..+++... ...+..++.. +++.+..+ .+--.+-+|+.+... ...+||
T Consensus 423 ~~~~~~~~a~~i~ftid~~k~~~~s~~---------~~~le~~el~~ps~kel~~~--~~~~~~~~I~~l~~S-sdG~yi 490 (691)
T KOG2048|consen 423 DVPLALLDASAISFTIDKNKLFLVSKN---------IFSLEEFELETPSFKELKSI--QSQAKCPSISRLVVS-SDGNYI 490 (691)
T ss_pred cchhhhccceeeEEEecCceEEEEecc---------cceeEEEEecCcchhhhhcc--ccccCCCcceeEEEc-CCCCEE
Confidence 332 245667777666666655521 2344444332 23322221 111111223333222 236788
Q ss_pred EEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc---cCeEEE-EECCEEEEEEee
Q 001655 733 CVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF---NGKLLA-AINQKIQLYKWM 797 (1036)
Q Consensus 733 ~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~---~g~Lv~-~~g~~l~v~~~~ 797 (1036)
||-+ +.|-|++|.+...+.+.+.. ..+-+|++..-- .+.|++ -.+++++-|+++
T Consensus 491 aa~~----------t~g~I~v~nl~~~~~~~l~~-rln~~vTa~~~~~~~~~~lvvats~nQv~efdi~ 548 (691)
T KOG2048|consen 491 AAIS----------TRGQIFVYNLETLESHLLKV-RLNIDVTAAAFSPFVRNRLVVATSNNQVFEFDIE 548 (691)
T ss_pred EEEe----------ccceEEEEEcccceeecchh-ccCcceeeeeccccccCcEEEEecCCeEEEEecc
Confidence 8876 56899999888666554431 233566665532 467765 567899888874
|
|
| >KOG0282 consensus mRNA splicing factor [Function unknown] | Back alignment and domain information |
|---|
Probab=95.24 E-value=0.41 Score=53.07 Aligned_cols=251 Identities=14% Similarity=0.215 Sum_probs=132.6
Q ss_pred CCeEEEEEEcCCCCeEEEEEeCC-CccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeE
Q 001655 638 EGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYG 716 (1036)
Q Consensus 638 ~~~l~i~~l~~~~~~~~~~i~l~-~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v 716 (1036)
++.+.|-.+-...+. .|...-- ...+-+.+.+....|.-++.+ ..+++.|-+|++++.++. .++.+
T Consensus 236 D~~vklW~vy~~~~~-lrtf~gH~k~Vrd~~~s~~g~~fLS~sfD----------~~lKlwDtETG~~~~~f~--~~~~~ 302 (503)
T KOG0282|consen 236 DGLVKLWNVYDDRRC-LRTFKGHRKPVRDASFNNCGTSFLSASFD----------RFLKLWDTETGQVLSRFH--LDKVP 302 (503)
T ss_pred CceEEEEEEecCcce-ehhhhcchhhhhhhhccccCCeeeeeecc----------eeeeeeccccceEEEEEe--cCCCc
Confidence 566666666553321 2222221 233445666655555555444 489999999999887655 55677
Q ss_pred eEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEE-EecCceeEecccc-C-eEEEEE-CCEEE
Q 001655 717 CSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEK-ETKGAVYSLNAFN-G-KLLAAI-NQKIQ 792 (1036)
Q Consensus 717 ~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~-~~~g~v~~i~~~~-g-~Lv~~~-g~~l~ 792 (1036)
.|+ +|.-++.+.+++|+ ..|+|.-|++..+++ +.+. ..-|+|..|.-+. | +.+.+. .+.++
T Consensus 303 ~cv---kf~pd~~n~fl~G~----------sd~ki~~wDiRs~kv--vqeYd~hLg~i~~i~F~~~g~rFissSDdks~r 367 (503)
T KOG0282|consen 303 TCV---KFHPDNQNIFLVGG----------SDKKIRQWDIRSGKV--VQEYDRHLGAILDITFVDEGRRFISSSDDKSVR 367 (503)
T ss_pred eee---ecCCCCCcEEEEec----------CCCcEEEEeccchHH--HHHHHhhhhheeeeEEccCCceEeeeccCccEE
Confidence 665 45555678889998 458999999987663 3322 3457888888773 3 444444 46788
Q ss_pred EEEeeecCCCccchhcccccccceEEEEE---------EEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEE
Q 001655 793 LYKWMLRDDGTRELQSECGHHGHILALYV---------QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863 (1036)
Q Consensus 793 v~~~~~~~~~~~~L~~~~~~~~~~~~~~l---------~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~ 863 (1036)
+|+|... -..++...-...+. ..+.+ .+.+|+|++--+ ...|+....+ ....-..+-..+.
T Consensus 368 iWe~~~~--v~ik~i~~~~~hsm-P~~~~~P~~~~~~aQs~dN~i~ifs~-----~~~~r~nkkK--~feGh~vaGys~~ 437 (503)
T KOG0282|consen 368 IWENRIP--VPIKNIADPEMHTM-PCLTLHPNGKWFAAQSMDNYIAIFST-----VPPFRLNKKK--RFEGHSVAGYSCQ 437 (503)
T ss_pred EEEcCCC--ccchhhcchhhccC-cceecCCCCCeehhhccCceEEEEec-----ccccccCHhh--hhcceeccCceee
Confidence 8887421 12222211111111 12222 334555544111 1112211111 0111111222234
Q ss_pred EEEeeC-ceEEEEccCCcEEEEeeCCCCCCcccccceeEEEEEEcCCccceEEeeeeeecCCCCCCCCcceEEEEccccc
Q 001655 864 VEILDD-DIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942 (1036)
Q Consensus 864 ~~~l~~-~~~l~~D~~gnl~~l~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~l~~~~~~~~~~~~~~il~~t~~Gs 942 (1036)
+.|=.+ ++++.+|.+|.++++..++. +|......|-+-.+ +..-+ | .....++.++.+|.
T Consensus 438 v~fSpDG~~l~SGdsdG~v~~wdwkt~--------kl~~~lkah~~~ci-----~v~wH--P----~e~Skvat~~w~G~ 498 (503)
T KOG0282|consen 438 VDFSPDGRTLCSGDSDGKVNFWDWKTT--------KLVSKLKAHDQPCI-----GVDWH--P----VEPSKVATCGWDGL 498 (503)
T ss_pred EEEcCCCCeEEeecCCccEEEeechhh--------hhhhccccCCcceE-----EEEec--C----CCcceeEecccCce
Confidence 555444 45678999999999998752 44444444422212 11111 1 12346777788888
Q ss_pred eEE
Q 001655 943 IGV 945 (1036)
Q Consensus 943 Ig~ 945 (1036)
|.+
T Consensus 499 Iki 501 (503)
T KOG0282|consen 499 IKI 501 (503)
T ss_pred eEe
Confidence 764
|
|
| >KOG0283 consensus WD40 repeat-containing protein [Function unknown] | Back alignment and domain information |
|---|
Probab=95.20 E-value=1.3 Score=52.60 Aligned_cols=111 Identities=14% Similarity=0.123 Sum_probs=75.6
Q ss_pred cEEEEEeCCCEEEEEE-cCCeEEEEEEcCcEEEEEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCC
Q 001655 438 SVNVATANASQVLLAT-GGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDL 516 (1036)
Q Consensus 438 ~I~~as~~~~~vvv~~-~~~~~~~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l 516 (1036)
.|...+.+.++.+++. -+.++.+=++...+ -+.-+.-..-|+|+++.|+++ .|++=|+- |+.++||++++.
T Consensus 371 DILDlSWSKn~fLLSSSMDKTVRLWh~~~~~--CL~~F~HndfVTcVaFnPvDD-----ryFiSGSL-D~KvRiWsI~d~ 442 (712)
T KOG0283|consen 371 DILDLSWSKNNFLLSSSMDKTVRLWHPGRKE--CLKVFSHNDFVTCVAFNPVDD-----RYFISGSL-DGKVRLWSISDK 442 (712)
T ss_pred hheecccccCCeeEeccccccEEeecCCCcc--eeeEEecCCeeEEEEecccCC-----CcEeeccc-ccceEEeecCcC
Confidence 5777776655555544 35666554543211 122233456699999999764 57777877 999999999998
Q ss_pred CeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEec
Q 001655 517 NLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNM 560 (1036)
Q Consensus 517 ~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~ 560 (1036)
+++.-..+. -+..++++.+ +..+.+||+-+|....|....
T Consensus 443 ~Vv~W~Dl~--~lITAvcy~P--dGk~avIGt~~G~C~fY~t~~ 482 (712)
T KOG0283|consen 443 KVVDWNDLR--DLITAVCYSP--DGKGAVIGTFNGYCRFYDTEG 482 (712)
T ss_pred eeEeehhhh--hhheeEEecc--CCceEEEEEeccEEEEEEccC
Confidence 887644444 2445666554 467899999999999998653
|
|
| >PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase | Back alignment and domain information |
|---|
Probab=95.15 E-value=5.9 Score=42.87 Aligned_cols=240 Identities=15% Similarity=0.167 Sum_probs=116.3
Q ss_pred EEEeCCCEEEEEEcCCeEEEEEE-cCcEEEEEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCee
Q 001655 441 VATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLI 519 (1036)
Q Consensus 441 ~as~~~~~vvv~~~~~~~~~l~~-~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v 519 (1036)
+++..++.++++.++| +.+++. +......+.+. ..|+-+...+ + .+. ++... ++.+.+|+++.+...
T Consensus 2 c~~~~~~~L~vGt~~G-l~~~~~~~~~~~~~i~~~---~~I~ql~vl~--~----~~~-llvLs-d~~l~~~~L~~l~~~ 69 (275)
T PF00780_consen 2 CADSWGDRLLVGTEDG-LYVYDLSDPSKPTRILKL---SSITQLSVLP--E----LNL-LLVLS-DGQLYVYDLDSLEPV 69 (275)
T ss_pred CcccCCCEEEEEECCC-EEEEEecCCccceeEeec---ceEEEEEEec--c----cCE-EEEEc-CCccEEEEchhhccc
Confidence 3455678899998777 444566 22222222111 1155554422 1 122 33334 799999999887654
Q ss_pred EEe-------------cCCCccCceeEE-eeeecCeEEEEEEeCCCcEEEEEEecCCCcc-ccceeeecCccceEEEEEE
Q 001655 520 TKE-------------HLGGEIIPRSVL-LCAFEGISYLLCALGDGHLLNFLLNMKTGEL-TDRKKVSLGTQPITLRTFS 584 (1036)
Q Consensus 520 ~~~-------------~l~~~~~p~si~-~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l-~~~~~~~lG~~pv~l~~~~ 584 (1036)
... .++...-..... .....+...|.++.+. .+..|++....... +..+...+-..|..+.-+
T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~v~~f~~~~~~~~~~~L~va~kk-~i~i~~~~~~~~~f~~~~ke~~lp~~~~~i~~~- 147 (275)
T PF00780_consen 70 STSAPLAFPKSRSLPTKLPETKGVSFFAVNGGHEGSRRLCVAVKK-KILIYEWNDPRNSFSKLLKEISLPDPPSSIAFL- 147 (275)
T ss_pred cccccccccccccccccccccCCeeEEeeccccccceEEEEEECC-EEEEEEEECCcccccceeEEEEcCCCcEEEEEe-
Confidence 321 111000111111 0122355678888877 78889988743334 444555566666555443
Q ss_pred eCCceEEEEEcCCceEEEecC-CcEE-Eeeeccccc------eeeecccCCCCC--CeEEEEeCCeEEEEEEcCCCCeEE
Q 001655 585 SKNTTHVFAASDRPTVIYSSN-KKLL-YSNVNLKEV------SHMCPFNSAAFP--DSLAIAKEGELTIGTIDDIQKLHI 654 (1036)
Q Consensus 585 ~~~~~~v~~~~~~p~~i~~~~-~~~~-~~~~~~~~i------~~~~~f~~~~~~--~~l~~~~~~~l~i~~l~~~~~~~~ 654 (1036)
.+.+++...+-+.++.-+ +... +.+...... ....|......+ +.|++.++-++.+..-.. ....
T Consensus 148 ---~~~i~v~~~~~f~~idl~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~Ll~~~~~g~fv~~~G~--~~r~ 222 (275)
T PF00780_consen 148 ---GNKICVGTSKGFYLIDLNTGSPSELLDPSDSSSSFKSRNSSSKPLGIFQLSDNEFLLCYDNIGVFVNKNGE--PSRK 222 (275)
T ss_pred ---CCEEEEEeCCceEEEecCCCCceEEeCccCCcchhhhcccCCCceEEEEeCCceEEEEecceEEEEcCCCC--cCcc
Confidence 234444434444443322 2221 111111110 000011000111 122222222222222111 1113
Q ss_pred EEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCC
Q 001655 655 RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDT 712 (1036)
Q Consensus 655 ~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~ 712 (1036)
..+.-...|+.++|... |+++..+. .+++.+..+.+.+..++++.
T Consensus 223 ~~i~W~~~p~~~~~~~p---yli~~~~~----------~iEV~~~~~~~lvQ~i~~~~ 267 (275)
T PF00780_consen 223 STIQWSSAPQSVAYSSP---YLIAFSSN----------SIEVRSLETGELVQTIPLPN 267 (275)
T ss_pred cEEEcCCchhEEEEECC---EEEEECCC----------EEEEEECcCCcEEEEEECCC
Confidence 46777889999999764 66666542 58888888887777666543
|
These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity |
| >KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown] | Back alignment and domain information |
|---|
Probab=95.03 E-value=6.5 Score=42.65 Aligned_cols=228 Identities=17% Similarity=0.236 Sum_probs=129.9
Q ss_pred EEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEec-
Q 001655 692 HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETK- 770 (1036)
Q Consensus 692 ~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~- 770 (1036)
-.+++++-+...+|..+.|+. ++..+++. ++-++|.-. +.||+|++++ ++++|..+..
T Consensus 68 r~Lkv~~~Kk~~~ICe~~fpt-----~IL~VrmN---r~RLvV~Le-----------e~IyIydI~~--MklLhTI~t~~ 126 (391)
T KOG2110|consen 68 RKLKVVHFKKKTTICEIFFPT-----SILAVRMN---RKRLVVCLE-----------ESIYIYDIKD--MKLLHTIETTP 126 (391)
T ss_pred ceEEEEEcccCceEEEEecCC-----ceEEEEEc---cceEEEEEc-----------ccEEEEeccc--ceeehhhhccC
Confidence 478899888888888888874 45556664 456666642 3599999976 5566655432
Q ss_pred Cc---eeEeccccC--eEEEEE---CCEEEEEEeeecCCCccchhcc---cccccceEEEEEEEeCCEEEEEeccccEEE
Q 001655 771 GA---VYSLNAFNG--KLLAAI---NQKIQLYKWMLRDDGTRELQSE---CGHHGHILALYVQTRGDFIVVGDLMKSISL 839 (1036)
Q Consensus 771 g~---v~~i~~~~g--~Lv~~~---g~~l~v~~~~~~~~~~~~L~~~---~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~ 839 (1036)
-- +-|+..-++ +|.+-. ...|++|+.. .|++. ..+++...++.+...|.+|..+- -+|-.+
T Consensus 127 ~n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~-------nl~~v~~I~aH~~~lAalafs~~G~llATAS-eKGTVI 198 (391)
T KOG2110|consen 127 PNPKGLCALSPNNANCYLAYPGSTTSGDVVLFDTI-------NLQPVNTINAHKGPLAALAFSPDGTLLATAS-EKGTVI 198 (391)
T ss_pred CCccceEeeccCCCCceEEecCCCCCceEEEEEcc-------cceeeeEEEecCCceeEEEECCCCCEEEEec-cCceEE
Confidence 22 334333332 565532 2467887753 23333 23445555666666777655433 344322
Q ss_pred EEEe-ccCCeEEEEeccCCcceeEEEEEeeCceEE-EEccCCcEEEEeeCCCCCC------ccc--ccceeEEEEEEcCC
Q 001655 840 LIYK-HEEGAIEERARDYNANWMSAVEILDDDIYL-GAENNFNLFTVRKNSEGAT------DEE--RGRLEVVGEYHLGE 909 (1036)
Q Consensus 840 ~~~~-~~~~~l~~~a~D~~~~~~~~~~~l~~~~~l-~~D~~gnl~~l~~~~~~~~------~~~--~~~L~~~~~~~lg~ 909 (1036)
=-|. ++..++.++-|-..+..+.+..|=.+..+| ++-..+.+++|+......+ ... ..++...+.-+++.
T Consensus 199 RVf~v~~G~kl~eFRRG~~~~~IySL~Fs~ds~~L~~sS~TeTVHiFKL~~~~~~~~~~p~~~~~~~~~~sk~~~sylps 278 (391)
T KOG2110|consen 199 RVFSVPEGQKLYEFRRGTYPVSIYSLSFSPDSQFLAASSNTETVHIFKLEKVSNNPPESPTAGTSWFGKVSKAATSYLPS 278 (391)
T ss_pred EEEEcCCccEeeeeeCCceeeEEEEEEECCCCCeEEEecCCCeEEEEEecccccCCCCCCCCCCcccchhhhhhhhhcch
Confidence 2233 456778888888888888888876555565 4556899999998542211 100 11121111115666
Q ss_pred ccceEEe--eeeee-cCCCC---------CCCCcceEEEEccccceEEEEe
Q 001655 910 FVNRFRH--GSLVM-RLPDS---------DVGQIPTVIFGTVNGVIGVIAS 948 (1036)
Q Consensus 910 ~v~~~~~--~~l~~-~~~~~---------~~~~~~~il~~t~~GsIg~l~~ 948 (1036)
.|.++.- .++.. .-+.+ .....+.+..+|.||-++.+.-
T Consensus 279 ~V~~~~~~~R~FAt~~l~~s~~~~~~~l~~~~~~~~v~vas~dG~~y~y~l 329 (391)
T KOG2110|consen 279 QVSSVLDQSRKFATAKLPESGRKNICSLSSIQKIPRVLVASYDGHLYSYRL 329 (391)
T ss_pred hhhhhhhhccceeEEEccCCCccceEEeeccCCCCEEEEEEcCCeEEEEEc
Confidence 6655411 01110 00111 1124578888999999997653
|
|
| >KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport] | Back alignment and domain information |
|---|
Probab=94.94 E-value=6.4 Score=45.52 Aligned_cols=74 Identities=12% Similarity=0.243 Sum_probs=54.7
Q ss_pred CceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEE
Q 001655 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNF 556 (1036)
Q Consensus 477 ~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y 556 (1036)
+..|-|+++.| ...+++.+++ +|.++||+-++...+-+.... +.+.++..++ ..+.++++|..|+++-+|
T Consensus 13 SdRVKsVd~HP------tePw~la~Ly-nG~V~IWnyetqtmVksfeV~-~~PvRa~kfi--aRknWiv~GsDD~~IrVf 82 (794)
T KOG0276|consen 13 SDRVKSVDFHP------TEPWILAALY-NGDVQIWNYETQTMVKSFEVS-EVPVRAAKFI--ARKNWIVTGSDDMQIRVF 82 (794)
T ss_pred CCceeeeecCC------CCceEEEeee-cCeeEEEecccceeeeeeeec-ccchhhheee--eccceEEEecCCceEEEE
Confidence 35678999876 3678899999 999999998876665543333 2344555443 246899999999999999
Q ss_pred EEec
Q 001655 557 LLNM 560 (1036)
Q Consensus 557 ~~~~ 560 (1036)
+...
T Consensus 83 nynt 86 (794)
T KOG0276|consen 83 NYNT 86 (794)
T ss_pred eccc
Confidence 8764
|
|
| >KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown] | Back alignment and domain information |
|---|
Probab=94.80 E-value=5.7 Score=44.52 Aligned_cols=191 Identities=14% Similarity=0.086 Sum_probs=115.9
Q ss_pred cceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCC-eEeEEEEEEEcCCCceEEEEEeeeeCC
Q 001655 663 PRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE-YGCSILSCSFSDDSNVYYCVGTAYVLP 741 (1036)
Q Consensus 663 p~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E-~v~si~~~~l~~~~~~~i~VGT~~~~~ 741 (1036)
.+.+-|+|..++.++.+++. ..+++.|..+-.+ .+++..+| .+-|.... .. ..+++|-.+
T Consensus 113 v~~~~f~~~d~t~l~s~sDd---------~v~k~~d~s~a~v--~~~l~~htDYVR~g~~~---~~-~~hivvtGs---- 173 (487)
T KOG0310|consen 113 VHVTKFSPQDNTMLVSGSDD---------KVVKYWDLSTAYV--QAELSGHTDYVRCGDIS---PA-NDHIVVTGS---- 173 (487)
T ss_pred eeEEEecccCCeEEEecCCC---------ceEEEEEcCCcEE--EEEecCCcceeEeeccc---cC-CCeEEEecC----
Confidence 34556788888888887653 4667777766654 55555554 34444332 22 334444332
Q ss_pred CCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccccC--eEEEEECCEEEEEEeeecCCCccchhcccccccceEEE
Q 001655 742 EENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNG--KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819 (1036)
Q Consensus 742 ~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~g--~Lv~~~g~~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~ 819 (1036)
.-|.|.+|+..-.. ..+.+.+..-||-+++.+.+ .|+.|-|+.+++|++. .|.+.+.....+.-.+..+
T Consensus 174 -----YDg~vrl~DtR~~~-~~v~elnhg~pVe~vl~lpsgs~iasAgGn~vkVWDl~---~G~qll~~~~~H~KtVTcL 244 (487)
T KOG0310|consen 174 -----YDGKVRLWDTRSLT-SRVVELNHGCPVESVLALPSGSLIASAGGNSVKVWDLT---TGGQLLTSMFNHNKTVTCL 244 (487)
T ss_pred -----CCceEEEEEeccCC-ceeEEecCCCceeeEEEcCCCCEEEEcCCCeEEEEEec---CCceehhhhhcccceEEEE
Confidence 45889999886433 56777888999999999954 4556777999998763 2434443333333333344
Q ss_pred EEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEEEEeeCc-eEEEEccCCcEEEEe
Q 001655 820 YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD-IYLGAENNFNLFTVR 885 (1036)
Q Consensus 820 ~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~~-~~l~~D~~gnl~~l~ 885 (1036)
.+...+.+++.|-+-+-|-++. ..+.+++- .=..|-.+.++..-+++ +++++=.+|-+.+-+
T Consensus 245 ~l~s~~~rLlS~sLD~~VKVfd--~t~~Kvv~--s~~~~~pvLsiavs~dd~t~viGmsnGlv~~rr 307 (487)
T KOG0310|consen 245 RLASDSTRLLSGSLDRHVKVFD--TTNYKVVH--SWKYPGPVLSIAVSPDDQTVVIGMSNGLVSIRR 307 (487)
T ss_pred EeecCCceEeecccccceEEEE--ccceEEEE--eeecccceeeEEecCCCceEEEecccceeeeeh
Confidence 4455567999999999888764 34444433 22334446666555554 444554566666553
|
|
| >KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms] | Back alignment and domain information |
|---|
Probab=94.70 E-value=1.3 Score=49.43 Aligned_cols=184 Identities=14% Similarity=0.127 Sum_probs=103.1
Q ss_pred eEEEEeCCeEEEEEEcCCC-CeEEEEEeCC-CccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEE
Q 001655 632 SLAIAKEGELTIGTIDDIQ-KLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYP 709 (1036)
Q Consensus 632 ~l~~~~~~~l~i~~l~~~~-~~~~~~i~l~-~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~ 709 (1036)
+++.++++.=.+.++.+.+ ++..|+..=. .|...+.|.- +..|+...+. .+.|-+..-+|..--..+.
T Consensus 91 S~y~~sgG~~~~Vkiwdl~~kl~hr~lkdh~stvt~v~YN~-~DeyiAsvs~---------gGdiiih~~~t~~~tt~f~ 160 (673)
T KOG4378|consen 91 SLYEISGGQSGCVKIWDLRAKLIHRFLKDHQSTVTYVDYNN-TDEYIASVSD---------GGDIIIHGTKTKQKTTTFT 160 (673)
T ss_pred ceeeeccCcCceeeehhhHHHHHhhhccCCcceeEEEEecC-CcceeEEecc---------CCcEEEEecccCcccccee
Confidence 4556666555555666554 4444444321 3445555542 2334433321 2345555555544444455
Q ss_pred CCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEec--cccCeEEEEE
Q 001655 710 LDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN--AFNGKLLAAI 787 (1036)
Q Consensus 710 ~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~--~~~g~Lv~~~ 787 (1036)
.+.+..+ ..+.+..- +.+++ |++ ...|-+.+|++...+-..-....++.|...|| +.+..|++++
T Consensus 161 ~~sgqsv---Rll~ys~s-kr~lL-~~a--------sd~G~VtlwDv~g~sp~~~~~~~HsAP~~gicfspsne~l~vsV 227 (673)
T KOG4378|consen 161 IDSGQSV---RLLRYSPS-KRFLL-SIA--------SDKGAVTLWDVQGMSPIFHASEAHSAPCRGICFSPSNEALLVSV 227 (673)
T ss_pred cCCCCeE---EEeecccc-cceee-Eee--------ccCCeEEEEeccCCCcccchhhhccCCcCcceecCCccceEEEe
Confidence 5555554 23333322 23333 333 25699999999754443333444455555444 5588999888
Q ss_pred C--CEEEEEEeeecCCCccchhcccccccceEEEEEEEeCCEEEEEeccccEEEEEEe
Q 001655 788 N--QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843 (1036)
Q Consensus 788 g--~~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~ 843 (1036)
| .+|++|+. +.+++...-.|..+..++.+.-+|-++.+|...--+.+|-.+
T Consensus 228 G~Dkki~~yD~-----~s~~s~~~l~y~~Plstvaf~~~G~~L~aG~s~G~~i~YD~R 280 (673)
T KOG4378|consen 228 GYDKKINIYDI-----RSQASTDRLTYSHPLSTVAFSECGTYLCAGNSKGELIAYDMR 280 (673)
T ss_pred cccceEEEeec-----ccccccceeeecCCcceeeecCCceEEEeecCCceEEEEecc
Confidence 8 67777774 455665555566676677777778788888888877654333
|
|
| >KOG0282 consensus mRNA splicing factor [Function unknown] | Back alignment and domain information |
|---|
Probab=94.70 E-value=0.69 Score=51.41 Aligned_cols=220 Identities=12% Similarity=0.122 Sum_probs=131.1
Q ss_pred CeEEEEe-CCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCC-CEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEE
Q 001655 631 DSLAIAK-EGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQS-RTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTY 708 (1036)
Q Consensus 631 ~~l~~~~-~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~-~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~ 708 (1036)
..|+.++ +..+++--.+ ..-...+..++..|..+-++|+. +.|+++.+. +.|+..|-.+.+++..|
T Consensus 271 ~~fLS~sfD~~lKlwDtE--TG~~~~~f~~~~~~~cvkf~pd~~n~fl~G~sd----------~ki~~wDiRs~kvvqeY 338 (503)
T KOG0282|consen 271 TSFLSASFDRFLKLWDTE--TGQVLSRFHLDKVPTCVKFHPDNQNIFLVGGSD----------KKIRQWDIRSGKVVQEY 338 (503)
T ss_pred Ceeeeeecceeeeeeccc--cceEEEEEecCCCceeeecCCCCCcEEEEecCC----------CcEEEEeccchHHHHHH
Confidence 3444444 4445443332 34567889999999999999988 888888775 47888888887766544
Q ss_pred ECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCC-eEEEEEE-EEecCceeEeccccCeEEE-
Q 001655 709 PLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAE-KETKGAVYSLNAFNGKLLA- 785 (1036)
Q Consensus 709 ~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~-kl~~~~~-~~~~g~v~~i~~~~g~Lv~- 785 (1036)
. ++.-++..++|-.+++.|+- |+ + -+.+.+|+..-. .++.+.. ..+.-|..++.+.++.+++
T Consensus 339 d----~hLg~i~~i~F~~~g~rFis--sS-----D----dks~riWe~~~~v~ik~i~~~~~hsmP~~~~~P~~~~~~aQ 403 (503)
T KOG0282|consen 339 D----RHLGAILDITFVDEGRRFIS--SS-----D----DKSVRIWENRIPVPIKNIADPEMHTMPCLTLHPNGKWFAAQ 403 (503)
T ss_pred H----hhhhheeeeEEccCCceEee--ec-----c----CccEEEEEcCCCccchhhcchhhccCcceecCCCCCeehhh
Confidence 2 44456666677665565543 21 1 237788887632 3444443 3345666666766666554
Q ss_pred EECCEEEEEEeeecCCCccchhcccccccc---eEEEEEE--EeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcce
Q 001655 786 AINQKIQLYKWMLRDDGTRELQSECGHHGH---ILALYVQ--TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860 (1036)
Q Consensus 786 ~~g~~l~v~~~~~~~~~~~~L~~~~~~~~~---~~~~~l~--~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~ 860 (1036)
++++.+.+|...++ .++.++-.+..+ -+++.+. -.|.+++-||.--.+.++.|+.. ++.-.-+-. ..-
T Consensus 404 s~dN~i~ifs~~~~----~r~nkkK~feGh~vaGys~~v~fSpDG~~l~SGdsdG~v~~wdwkt~--kl~~~lkah-~~~ 476 (503)
T KOG0282|consen 404 SMDNYIAIFSTVPP----FRLNKKKRFEGHSVAGYSCQVDFSPDGRTLCSGDSDGKVNFWDWKTT--KLVSKLKAH-DQP 476 (503)
T ss_pred ccCceEEEEecccc----cccCHhhhhcceeccCceeeEEEcCCCCeEEeecCCccEEEeechhh--hhhhccccC-Ccc
Confidence 67789999886432 223222122222 3455544 35789999999999998877642 221111111 122
Q ss_pred eEEEEEeeC--ceEEEEccCCcEEEE
Q 001655 861 MSAVEILDD--DIYLGAENNFNLFTV 884 (1036)
Q Consensus 861 ~~~~~~l~~--~~~l~~D~~gnl~~l 884 (1036)
|+.+..-.. ..++.++-+|-|.++
T Consensus 477 ci~v~wHP~e~Skvat~~w~G~Ikiw 502 (503)
T KOG0282|consen 477 CIGVDWHPVEPSKVATCGWDGLIKIW 502 (503)
T ss_pred eEEEEecCCCcceeEecccCceeEec
Confidence 333333322 357778888877654
|
|
| >KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=94.69 E-value=12 Score=44.30 Aligned_cols=118 Identities=17% Similarity=0.221 Sum_probs=76.5
Q ss_pred EEEEEE--EEecCceeEeccc-cCeE-EEE-ECCEEEEEEeeecCCCccchhcccccccceEEEEEEEeCCEEEEEeccc
Q 001655 761 LQLIAE--KETKGAVYSLNAF-NGKL-LAA-INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835 (1036)
Q Consensus 761 l~~~~~--~~~~g~v~~i~~~-~g~L-v~~-~g~~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~~~~~I~vgD~~~ 835 (1036)
|++.+. .+++.-|.|+..- +|++ +++ .++++.+|-++. -+-.+..=.+..|+..+.|+...+.|+.|-+-+
T Consensus 497 lsl~~~rtLel~ddvL~v~~Spdgk~LaVsLLdnTVkVyflDt----lKFflsLYGHkLPV~smDIS~DSklivTgSADK 572 (888)
T KOG0306|consen 497 LSLKHTRTLELEDDVLCVSVSPDGKLLAVSLLDNTVKVYFLDT----LKFFLSLYGHKLPVLSMDISPDSKLIVTGSADK 572 (888)
T ss_pred eeeccceEEeccccEEEEEEcCCCcEEEEEeccCeEEEEEecc----eeeeeeecccccceeEEeccCCcCeEEeccCCC
Confidence 555444 4667888888765 4554 444 568999998741 111222233445566667777788999999999
Q ss_pred cEEEEEEeccCCeEEEEeccCCcceeEEEEEeeCce-EEEEccCCcEEEEe
Q 001655 836 SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI-YLGAENNFNLFTVR 885 (1036)
Q Consensus 836 Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~~~-~l~~D~~gnl~~l~ 885 (1036)
.|.++..+..+=.-..+|.| -.++++.|+.++. ++.+-++|-+--+.
T Consensus 573 nVKiWGLdFGDCHKS~fAHd---DSvm~V~F~P~~~~FFt~gKD~kvKqWD 620 (888)
T KOG0306|consen 573 NVKIWGLDFGDCHKSFFAHD---DSVMSVQFLPKTHLFFTCGKDGKVKQWD 620 (888)
T ss_pred ceEEeccccchhhhhhhccc---CceeEEEEcccceeEEEecCcceEEeec
Confidence 99988766543222333433 3578899998654 67787888876553
|
|
| >KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism] | Back alignment and domain information |
|---|
Probab=94.68 E-value=1 Score=45.77 Aligned_cols=118 Identities=17% Similarity=0.248 Sum_probs=82.8
Q ss_pred CeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCce
Q 001655 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 730 (1036)
Q Consensus 651 ~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~ 730 (1036)
..-++++.+...++.+-++++.+.+.++. .++|.|.|+++++.+.+|+++-+= .+.+|.. +++
T Consensus 175 gt~v~sL~~~s~VtSlEvs~dG~ilTia~-----------gssV~Fwdaksf~~lKs~k~P~nV-----~SASL~P-~k~ 237 (334)
T KOG0278|consen 175 GTEVQSLEFNSPVTSLEVSQDGRILTIAY-----------GSSVKFWDAKSFGLLKSYKMPCNV-----ESASLHP-KKE 237 (334)
T ss_pred CcEEEEEecCCCCcceeeccCCCEEEEec-----------CceeEEeccccccceeeccCcccc-----ccccccC-CCc
Confidence 34678888999999999998877555443 368999999999999998887543 3334433 346
Q ss_pred EEEEEeeeeCCCCCCCcceEEEEEEEeCC-eEEEEEEEEecCceeEeccccCeEEEEECC---EEEEEEe
Q 001655 731 YYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQ---KIQLYKW 796 (1036)
Q Consensus 731 ~i~VGT~~~~~~e~~~~~Gri~v~~i~~~-kl~~~~~~~~~g~v~~i~~~~g~Lv~~~g~---~l~v~~~ 796 (1036)
+++-|. |+ +.+|.|+...+ .+.. +.+...|||.|+.--.+..++|.|+ +|+++..
T Consensus 238 ~fVaGg------ed----~~~~kfDy~TgeEi~~-~nkgh~gpVhcVrFSPdGE~yAsGSEDGTirlWQt 296 (334)
T KOG0278|consen 238 FFVAGG------ED----FKVYKFDYNTGEEIGS-YNKGHFGPVHCVRFSPDGELYASGSEDGTIRLWQT 296 (334)
T ss_pred eEEecC------cc----eEEEEEeccCCceeee-cccCCCCceEEEEECCCCceeeccCCCceEEEEEe
Confidence 666663 32 67777777633 3332 4577899999999776666777774 6777554
|
|
| >PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase | Back alignment and domain information |
|---|
Probab=94.68 E-value=7.7 Score=41.96 Aligned_cols=221 Identities=18% Similarity=0.137 Sum_probs=115.6
Q ss_pred CeEEEEEEeCCCcEEEEEEecCCCccccceeeecCccce-EEEEEEeCCceEEEEEcCCceEEEecC---CcE-----EE
Q 001655 540 GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI-TLRTFSSKNTTHVFAASDRPTVIYSSN---KKL-----LY 610 (1036)
Q Consensus 540 ~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv-~l~~~~~~~~~~v~~~~~~p~~i~~~~---~~~-----~~ 610 (1036)
.+..|++|+.+| |+.|+...... .+......+| ++..+ ...+.+++.+++-..++.-+ ... ..
T Consensus 6 ~~~~L~vGt~~G-l~~~~~~~~~~-----~~~i~~~~~I~ql~vl--~~~~~llvLsd~~l~~~~L~~l~~~~~~~~~~~ 77 (275)
T PF00780_consen 6 WGDRLLVGTEDG-LYVYDLSDPSK-----PTRILKLSSITQLSVL--PELNLLLVLSDGQLYVYDLDSLEPVSTSAPLAF 77 (275)
T ss_pred CCCEEEEEECCC-EEEEEecCCcc-----ceeEeecceEEEEEEe--cccCEEEEEcCCccEEEEchhhccccccccccc
Confidence 467899999999 88888821111 1111222222 22222 23456777777554444211 000 00
Q ss_pred -------eeec-cccceeeecccCCCCC-CeEEEEeCCeEEEEEEcCC-CC--eEEEEEeCCCccceEEEecCCCEEEEE
Q 001655 611 -------SNVN-LKEVSHMCPFNSAAFP-DSLAIAKEGELTIGTIDDI-QK--LHIRSIPLGEHPRRICHQEQSRTFAIC 678 (1036)
Q Consensus 611 -------~~~~-~~~i~~~~~f~~~~~~-~~l~~~~~~~l~i~~l~~~-~~--~~~~~i~l~~tp~~i~y~~~~~~~~v~ 678 (1036)
..+. .+.+..++.- ..... ..++++.+..+.+.+.... .+ -..+.+.+.++|+.+++. .+.++|+
T Consensus 78 ~~~~~~~~~~~~~~~v~~f~~~-~~~~~~~~L~va~kk~i~i~~~~~~~~~f~~~~ke~~lp~~~~~i~~~--~~~i~v~ 154 (275)
T PF00780_consen 78 PKSRSLPTKLPETKGVSFFAVN-GGHEGSRRLCVAVKKKILIYEWNDPRNSFSKLLKEISLPDPPSSIAFL--GNKICVG 154 (275)
T ss_pred cccccccccccccCCeeEEeec-cccccceEEEEEECCEEEEEEEECCcccccceeEEEEcCCCcEEEEEe--CCEEEEE
Confidence 0111 1234444401 11111 2466677889999999874 33 478999999999999998 4566776
Q ss_pred EeecCCCCCceeeEEEEEEeCCCceEEEEEECC--------CCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceE
Q 001655 679 SLKNQSCAEESEMHFVRLLDDQTFEFISTYPLD--------TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGR 750 (1036)
Q Consensus 679 ~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~--------~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gr 750 (1036)
+.+ ...++|..+.....-+... ..+....+....+.+ .+++++-... |
T Consensus 155 ~~~-----------~f~~idl~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~--~e~Ll~~~~~----------g- 210 (275)
T PF00780_consen 155 TSK-----------GFYLIDLNTGSPSELLDPSDSSSSFKSRNSSSKPLGIFQLSD--NEFLLCYDNI----------G- 210 (275)
T ss_pred eCC-----------ceEEEecCCCCceEEeCccCCcchhhhcccCCCceEEEEeCC--ceEEEEecce----------E-
Confidence 643 3455555444332222111 112222333344432 3555532111 1
Q ss_pred EEEEEEeCCeEEEEEEEEecCceeEeccccCeEEEEECCEEEEEEee
Q 001655 751 ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797 (1036)
Q Consensus 751 i~v~~i~~~kl~~~~~~~~~g~v~~i~~~~g~Lv~~~g~~l~v~~~~ 797 (1036)
+|-=.+++...-...+.++.+.+++....+|++...+.|.++++.
T Consensus 211 --~fv~~~G~~~r~~~i~W~~~p~~~~~~~pyli~~~~~~iEV~~~~ 255 (275)
T PF00780_consen 211 --VFVNKNGEPSRKSTIQWSSAPQSVAYSSPYLIAFSSNSIEVRSLE 255 (275)
T ss_pred --EEEcCCCCcCcccEEEcCCchhEEEEECCEEEEECCCEEEEEECc
Confidence 110002221112344567788888888889998888899998864
|
These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity |
| >PF14727 PHTB1_N: PTHB1 N-terminus | Back alignment and domain information |
|---|
Probab=94.63 E-value=11 Score=43.23 Aligned_cols=125 Identities=22% Similarity=0.354 Sum_probs=77.1
Q ss_pred cceeeeeCCCCCcEEEEE---eC-CCEEEEEEcCCeEEEEEEcCcEEEEEEeccCCceeEEEEcCCCC--CCCCcccEEE
Q 001655 426 ELRNEWKSPPGYSVNVAT---AN-ASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIG--ENPSYSQIAA 499 (1036)
Q Consensus 426 ~~~~~w~~~~~~~I~~as---~~-~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l~~~is~l~~~~~~--~~~~~~~~~~ 499 (1036)
+...+|...-|+.|.... .. ...-++++....+..+. ++|++.-+++ ++..++|+..-.+. ........++
T Consensus 228 ~l~~dWs~nlGE~~l~i~v~~~~~~~~~IvvLger~Lf~l~-~~G~l~~~kr--Ld~~p~~~~~Y~~~~~~~~~~~~~ll 304 (418)
T PF14727_consen 228 KLNPDWSFNLGEQALDIQVVRFSSSESDIVVLGERSLFCLK-DNGSLRFQKR--LDYNPSCFCPYRVPWYNEPSTRLNLL 304 (418)
T ss_pred cccceeEEECCceeEEEEEEEcCCCCceEEEEecceEEEEc-CCCeEEEEEe--cCCceeeEEEEEeecccCCCCceEEE
Confidence 355689877777655444 21 22223333345554443 3477766654 56777777532221 1111233588
Q ss_pred EEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeC-CCcEEEEEE
Q 001655 500 VGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALG-DGHLLNFLL 558 (1036)
Q Consensus 500 vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~-~G~l~~y~~ 558 (1036)
|++. ++.+.||+ +.+++|...++ ..|-++.+..+.+..-+++.+. +|.|-+.-+
T Consensus 305 V~t~-t~~LlVy~--d~~L~WsA~l~--~~PVal~v~~~~~~~G~IV~Ls~~G~L~v~YL 359 (418)
T PF14727_consen 305 VGTH-TGTLLVYE--DTTLVWSAQLP--HVPVALSVANFNGLKGLIVSLSDEGQLSVSYL 359 (418)
T ss_pred EEec-CCeEEEEe--CCeEEEecCCC--CCCEEEEecccCCCCceEEEEcCCCcEEEEEe
Confidence 8888 99999998 66778877775 5788888887776666677665 888876654
|
|
| >TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain | Back alignment and domain information |
|---|
Probab=94.62 E-value=9.4 Score=42.60 Aligned_cols=231 Identities=14% Similarity=0.151 Sum_probs=114.5
Q ss_pred cCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCc-EEEEEEecCCCccccceeeecCccceEEEE
Q 001655 504 TDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGH-LLNFLLNMKTGELTDRKKVSLGTQPITLRT 582 (1036)
Q Consensus 504 ~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~-l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~ 582 (1036)
.+..|.+|+..+++.+....++. .|+-.. +..+..+.-+.||. |+++++++.+. +. .+-+....+ +..
T Consensus 75 ~~d~V~v~D~~t~~~~~~i~~p~--~p~~~~----~~~~~~~~ls~dgk~l~V~n~~p~~~-V~---VvD~~~~kv-v~e 143 (352)
T TIGR02658 75 RTDYVEVIDPQTHLPIADIELPE--GPRFLV----GTYPWMTSLTPDNKTLLFYQFSPSPA-VG---VVDLEGKAF-VRM 143 (352)
T ss_pred CCCEEEEEECccCcEEeEEccCC--Cchhhc----cCccceEEECCCCCEEEEecCCCCCE-EE---EEECCCCcE-EEE
Confidence 47899999999999887666652 333111 12355666777775 44455554321 11 011111111 122
Q ss_pred EEeCCceEEEEEcCCceEEEecCCcEEEeeecccc--ceeeec---------ccCC--CC-CCeEEEEe-CCeEEEEEEc
Q 001655 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCP---------FNSA--AF-PDSLAIAK-EGELTIGTID 647 (1036)
Q Consensus 583 ~~~~~~~~v~~~~~~p~~i~~~~~~~~~~~~~~~~--i~~~~~---------f~~~--~~-~~~l~~~~-~~~l~i~~l~ 647 (1036)
+...+...++.+++.-.+..+.++++....+..+. ...-.+ |..+ .- .+..++++ ++.+.+..+.
T Consensus 144 i~vp~~~~vy~t~e~~~~~~~~Dg~~~~v~~d~~g~~~~~~~~vf~~~~~~v~~rP~~~~~dg~~~~vs~eG~V~~id~~ 223 (352)
T TIGR02658 144 MDVPDCYHIFPTANDTFFMHCRDGSLAKVGYGTKGNPKIKPTEVFHPEDEYLINHPAYSNKSGRLVWPTYTGKIFQIDLS 223 (352)
T ss_pred EeCCCCcEEEEecCCccEEEeecCceEEEEecCCCceEEeeeeeecCCccccccCCceEcCCCcEEEEecCCeEEEEecC
Confidence 22334555666665555555444443321111100 000001 1112 11 23455554 6777766653
Q ss_pred CCC---CeEEEEEeCCC-----ccce---EEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeE
Q 001655 648 DIQ---KLHIRSIPLGE-----HPRR---ICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYG 716 (1036)
Q Consensus 648 ~~~---~~~~~~i~l~~-----tp~~---i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v 716 (1036)
... .-....+..++ .|-. ++++++.+.+.|+...............|.++|..+++.+..+.... .+
T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~wrP~g~q~ia~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~~t~kvi~~i~vG~--~~ 301 (352)
T TIGR02658 224 SGDAKFLPAIEAFTEAEKADGWRPGGWQQVAYHRARDRIYLLADQRAKWTHKTASRFLFVVDAKTGKRLRKIELGH--EI 301 (352)
T ss_pred CCcceecceeeeccccccccccCCCcceeEEEcCCCCEEEEEecCCccccccCCCCEEEEEECCCCeEEEEEeCCC--ce
Confidence 332 11112222211 1222 99999988888866432100001123589999999999999888754 33
Q ss_pred eEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCC
Q 001655 717 CSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG 759 (1036)
Q Consensus 717 ~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~ 759 (1036)
..+ .+..+++.++.+-.. ..|.|.+++...+
T Consensus 302 ~~i---avS~Dgkp~lyvtn~---------~s~~VsViD~~t~ 332 (352)
T TIGR02658 302 DSI---NVSQDAKPLLYALST---------GDKTLYIFDAETG 332 (352)
T ss_pred eeE---EECCCCCeEEEEeCC---------CCCcEEEEECcCC
Confidence 333 455555545544331 3466888876544
|
This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome. |
| >KOG0283 consensus WD40 repeat-containing protein [Function unknown] | Back alignment and domain information |
|---|
Probab=94.44 E-value=2 Score=51.13 Aligned_cols=156 Identities=15% Similarity=0.138 Sum_probs=98.4
Q ss_pred CCccceEEEecCCCEEEEE-EeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeee
Q 001655 660 GEHPRRICHQEQSRTFAIC-SLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY 738 (1036)
Q Consensus 660 ~~tp~~i~y~~~~~~~~v~-~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~ 738 (1036)
..-++.|+|+|--.-|.+. +.+ ..+||.+-...++++-++++ |.|++++.. . ..++.+|||
T Consensus 409 ndfVTcVaFnPvDDryFiSGSLD----------~KvRiWsI~d~~Vv~W~Dl~--~lITAvcy~---P-dGk~avIGt-- 470 (712)
T KOG0283|consen 409 NDFVTCVAFNPVDDRYFISGSLD----------GKVRLWSISDKKVVDWNDLR--DLITAVCYS---P-DGKGAVIGT-- 470 (712)
T ss_pred CCeeEEEEecccCCCcEeecccc----------cceEEeecCcCeeEeehhhh--hhheeEEec---c-CCceEEEEE--
Confidence 3455677888755544443 333 47888888778888777776 788877543 2 368999999
Q ss_pred eCCCCCCCcceEEEEEEEeCCeEEEEEE------EEe-cCceeEeccccC---eE-EEEECCEEEEEEeeecCCCccchh
Q 001655 739 VLPEENEPTKGRILVFIVEDGKLQLIAE------KET-KGAVYSLNAFNG---KL-LAAINQKIQLYKWMLRDDGTRELQ 807 (1036)
Q Consensus 739 ~~~~e~~~~~Gri~v~~i~~~kl~~~~~------~~~-~g~v~~i~~~~g---~L-v~~~g~~l~v~~~~~~~~~~~~L~ 807 (1036)
..|..++|...+.|+..-.. +.. .+.||.++.+-+ .+ |.+...+|+||+..+++ -..++.
T Consensus 471 --------~~G~C~fY~t~~lk~~~~~~I~~~~~Kk~~~~rITG~Q~~p~~~~~vLVTSnDSrIRI~d~~~~~-lv~KfK 541 (712)
T KOG0283|consen 471 --------FNGYCRFYDTEGLKLVSDFHIRLHNKKKKQGKRITGLQFFPGDPDEVLVTSNDSRIRIYDGRDKD-LVHKFK 541 (712)
T ss_pred --------eccEEEEEEccCCeEEEeeeEeeccCccccCceeeeeEecCCCCCeEEEecCCCceEEEeccchh-hhhhhc
Confidence 56899999998877653221 222 235999887732 35 45566899999963221 112332
Q ss_pred cccccccce-EEEEEEEeCCEEEEEeccccEEEEEEec
Q 001655 808 SECGHHGHI-LALYVQTRGDFIVVGDLMKSISLLIYKH 844 (1036)
Q Consensus 808 ~~~~~~~~~-~~~~l~~~~~~I~vgD~~~Sv~~~~~~~ 844 (1036)
+ +..+.. .--++...|.+|+.|---..|++.+++.
T Consensus 542 G--~~n~~SQ~~Asfs~Dgk~IVs~seDs~VYiW~~~~ 577 (712)
T KOG0283|consen 542 G--FRNTSSQISASFSSDGKHIVSASEDSWVYIWKNDS 577 (712)
T ss_pred c--cccCCcceeeeEccCCCEEEEeecCceEEEEeCCC
Confidence 2 222222 1234566889999998666676665543
|
|
| >KOG0294 consensus WD40 repeat-containing protein [Function unknown] | Back alignment and domain information |
|---|
Probab=94.43 E-value=8.1 Score=41.04 Aligned_cols=254 Identities=15% Similarity=0.156 Sum_probs=114.9
Q ss_pred ceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEE
Q 001655 478 YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFL 557 (1036)
Q Consensus 478 ~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~ 557 (1036)
..|+|++.+ ..+++-|.. |-.|+||+++.-..... -+........+.+..--+..+|+-|..||.++.++
T Consensus 44 ~sitavAVs--------~~~~aSGss-DetI~IYDm~k~~qlg~-ll~HagsitaL~F~~~~S~shLlS~sdDG~i~iw~ 113 (362)
T KOG0294|consen 44 GSITALAVS--------GPYVASGSS-DETIHIYDMRKRKQLGI-LLSHAGSITALKFYPPLSKSHLLSGSDDGHIIIWR 113 (362)
T ss_pred cceeEEEec--------ceeEeccCC-CCcEEEEeccchhhhcc-eeccccceEEEEecCCcchhheeeecCCCcEEEEE
Confidence 567888884 467666655 88999999986221100 00000111111111101234899999999999998
Q ss_pred EecCCCccccceeeecCccceEEEEEEeCCceEEEEEcCCceEEEec--CCcEEE-eeeccccceeeecccCCCCCCeEE
Q 001655 558 LNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLY-SNVNLKEVSHMCPFNSAAFPDSLA 634 (1036)
Q Consensus 558 ~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~--~~~~~~-~~~~~~~i~~~~~f~~~~~~~~l~ 634 (1036)
..+= .-.++..--...|.-..++-.+.=++=+-+|... =.++ +||-.| .++ +.......|. .-++.|+
T Consensus 114 ~~~W----~~~~slK~H~~~Vt~lsiHPS~KLALsVg~D~~l-r~WNLV~Gr~a~v~~L--~~~at~v~w~--~~Gd~F~ 184 (362)
T KOG0294|consen 114 VGSW----ELLKSLKAHKGQVTDLSIHPSGKLALSVGGDQVL-RTWNLVRGRVAFVLNL--KNKATLVSWS--PQGDHFV 184 (362)
T ss_pred cCCe----EEeeeecccccccceeEecCCCceEEEEcCCcee-eeehhhcCccceeecc--CCcceeeEEc--CCCCEEE
Confidence 6531 1011111001112111111111112222222221 1111 222211 111 1111122222 2345688
Q ss_pred EEeCCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCC
Q 001655 635 IAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE 714 (1036)
Q Consensus 635 ~~~~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E 714 (1036)
+.-.+.+-|.+++...-+ +.+....-+..+.+. +.+.++|+... ..+.+.|.++... -++|..+|
T Consensus 185 v~~~~~i~i~q~d~A~v~--~~i~~~~r~l~~~~l-~~~~L~vG~d~----------~~i~~~D~ds~~~--~~~~~AH~ 249 (362)
T KOG0294|consen 185 VSGRNKIDIYQLDNASVF--REIENPKRILCATFL-DGSELLVGGDN----------EWISLKDTDSDTP--LTEFLAHE 249 (362)
T ss_pred EEeccEEEEEecccHhHh--hhhhccccceeeeec-CCceEEEecCC----------ceEEEeccCCCcc--ceeeecch
Confidence 888999999999875311 111111112223333 33344444322 3567777655322 23454544
Q ss_pred e-EeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEe---CCeEEEEEEEEecCceeEec
Q 001655 715 Y-GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE---DGKLQLIAEKETKGAVYSLN 777 (1036)
Q Consensus 715 ~-v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~---~~kl~~~~~~~~~g~v~~i~ 777 (1036)
. |-.+... .++...|| +|+ .+-|.|.+|+++ +.+-+++...+.+...+|+.
T Consensus 250 ~RVK~i~~~--~~~~~~~l--vTa--------SSDG~I~vWd~~~~~k~~~~~l~e~n~~~RltCl~ 304 (362)
T KOG0294|consen 250 NRVKDIASY--TNPEHEYL--VTA--------SSDGFIKVWDIDMETKKRPTLLAELNTNVRLTCLR 304 (362)
T ss_pred hheeeeEEE--ecCCceEE--EEe--------ccCceEEEEEccccccCCcceeEEeecCCccceee
Confidence 3 4444433 22222333 344 256999999987 22345566555554444443
|
|
| >KOG1273 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=94.35 E-value=8.4 Score=40.88 Aligned_cols=185 Identities=14% Similarity=0.168 Sum_probs=102.1
Q ss_pred eEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCcc-CceeEEeeeecCeEEEEEEeCCCcEEEEEE
Q 001655 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI-IPRSVLLCAFEGISYLLCALGDGHLLNFLL 558 (1036)
Q Consensus 480 is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~-~p~si~~~~~~~~~~L~vgl~~G~l~~y~~ 558 (1036)
..|+.+++ -+.++++|+. +|.+.||++.++.+-. -++... +..|++-.+ +...|+-+.+|-.+..+.+
T Consensus 26 a~~~~Fs~------~G~~lAvGc~-nG~vvI~D~~T~~iar--~lsaH~~pi~sl~WS~--dgr~LltsS~D~si~lwDl 94 (405)
T KOG1273|consen 26 AECCQFSR------WGDYLAVGCA-NGRVVIYDFDTFRIAR--MLSAHVRPITSLCWSR--DGRKLLTSSRDWSIKLWDL 94 (405)
T ss_pred cceEEecc------Ccceeeeecc-CCcEEEEEccccchhh--hhhccccceeEEEecC--CCCEeeeecCCceeEEEec
Confidence 34777754 3689999999 9999999998765411 111111 223443222 3456777888887777765
Q ss_pred ecCCCccccceeeecCccceEEEEEEeCCceEE--EEEcCCceEEEecCCcEEEeeeccc-cceeeec---ccCCCCCCe
Q 001655 559 NMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV--FAASDRPTVIYSSNKKLLYSNVNLK-EVSHMCP---FNSAAFPDS 632 (1036)
Q Consensus 559 ~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v--~~~~~~p~~i~~~~~~~~~~~~~~~-~i~~~~~---f~~~~~~~~ 632 (1036)
-..+ .+. .+.+ ..||-.-.+.-...+.+ ...-..|.++..++++-.+.|.+.+ .....+. |+ .-++.
T Consensus 95 ~~gs-~l~---rirf-~spv~~~q~hp~k~n~~va~~~~~sp~vi~~s~~~h~~Lp~d~d~dln~sas~~~fd--r~g~y 167 (405)
T KOG1273|consen 95 LKGS-PLK---RIRF-DSPVWGAQWHPRKRNKCVATIMEESPVVIDFSDPKHSVLPKDDDGDLNSSASHGVFD--RRGKY 167 (405)
T ss_pred cCCC-cee---EEEc-cCccceeeeccccCCeEEEEEecCCcEEEEecCCceeeccCCCcccccccccccccc--CCCCE
Confidence 4332 111 1111 23332222211122222 3345668888776666556666532 2222222 22 12333
Q ss_pred EEE-EeCCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeec
Q 001655 633 LAI-AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN 682 (1036)
Q Consensus 633 l~~-~~~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~ 682 (1036)
++. .++|.+.+...++..=...-||..-...+.|.+....+.+++-|+++
T Consensus 168 IitGtsKGkllv~~a~t~e~vas~rits~~~IK~I~~s~~g~~liiNtsDR 218 (405)
T KOG1273|consen 168 IITGTSKGKLLVYDAETLECVASFRITSVQAIKQIIVSRKGRFLIINTSDR 218 (405)
T ss_pred EEEecCcceEEEEecchheeeeeeeechheeeeEEEEeccCcEEEEecCCc
Confidence 333 45788998888877544444555455667777777777777777654
|
|
| >KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown] | Back alignment and domain information |
|---|
Probab=94.31 E-value=3.9 Score=41.51 Aligned_cols=109 Identities=17% Similarity=0.278 Sum_probs=71.6
Q ss_pred CCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEE-CCCCCeEeEEEEEEEcCCCceEEEEEee
Q 001655 659 LGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYP-LDTFEYGCSILSCSFSDDSNVYYCVGTA 737 (1036)
Q Consensus 659 l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~-~~~~E~v~si~~~~l~~~~~~~i~VGT~ 737 (1036)
+|...+.+.++++.++..+.+.. +.++|+|..|++.+.+|. .+.-|.=+-++ |. +...+++-|.
T Consensus 182 ~g~pit~vs~s~d~nc~La~~l~----------stlrLlDk~tGklL~sYkGhkn~eykldc~---l~-qsdthV~sgS- 246 (307)
T KOG0316|consen 182 FGHPITSVSFSKDGNCSLASSLD----------STLRLLDKETGKLLKSYKGHKNMEYKLDCC---LN-QSDTHVFSGS- 246 (307)
T ss_pred cCCcceeEEecCCCCEEEEeecc----------ceeeecccchhHHHHHhcccccceeeeeee---ec-ccceeEEecc-
Confidence 45566788899999988888765 699999999999998776 33334322211 11 1233444332
Q ss_pred eeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecC-c-eeEeccc--cCeEEEEECCEEEE
Q 001655 738 YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKG-A-VYSLNAF--NGKLLAAINQKIQL 793 (1036)
Q Consensus 738 ~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g-~-v~~i~~~--~g~Lv~~~g~~l~v 793 (1036)
.-|.+|+|++.+.++ +.+.++.+ + |+++... ...++.|.|+....
T Consensus 247 ---------EDG~Vy~wdLvd~~~--~sk~~~~~~v~v~dl~~hp~~~~f~~A~~~~~~~ 295 (307)
T KOG0316|consen 247 ---------EDGKVYFWDLVDETQ--ISKLSVVSTVIVTDLSCHPTMDDFITATGHGDLF 295 (307)
T ss_pred ---------CCceEEEEEecccee--eeeeccCCceeEEeeecccCccceeEecCCceec
Confidence 349999999987543 44445444 3 5666655 56788888887765
|
|
| >TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL | Back alignment and domain information |
|---|
Probab=94.17 E-value=13 Score=42.39 Aligned_cols=135 Identities=19% Similarity=0.249 Sum_probs=76.8
Q ss_pred CCeEEEEe-cCcEEEEecCCccceeeeeCCCCCcEEEEEeCCCEEEEEEcCCeEEEEEEcCcEEEEEEeccCCce-eEEE
Q 001655 406 YNQLVQVT-SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE-ISCL 483 (1036)
Q Consensus 406 ~~~~vQVt-~~~i~l~~~~~~~~~~~w~~~~~~~I~~as~~~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l~~~-is~l 483 (1036)
++.++-.+ ...++.++.++++. .|..+.+. ....+..+..+++...++.+..++.++++..-.. ..+... .+..
T Consensus 241 ~~~vy~~~~~g~l~a~d~~tG~~--~W~~~~~~-~~~p~~~~~~vyv~~~~G~l~~~d~~tG~~~W~~-~~~~~~~~ssp 316 (377)
T TIGR03300 241 GGQVYAVSYQGRVAALDLRSGRV--LWKRDASS-YQGPAVDDNRLYVTDADGVVVALDRRSGSELWKN-DELKYRQLTAP 316 (377)
T ss_pred CCEEEEEEcCCEEEEEECCCCcE--EEeeccCC-ccCceEeCCEEEEECCCCeEEEEECCCCcEEEcc-ccccCCccccC
Confidence 34444444 34567777665443 36543221 2223355677777777788888887665432111 112211 1221
Q ss_pred EcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEE
Q 001655 484 DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFL 557 (1036)
Q Consensus 484 ~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~ 557 (1036)
.+ ....++++.. +|.+.+++..+.+.+++..+.......+..+. ...|++++.||.|++++
T Consensus 317 ~i--------~g~~l~~~~~-~G~l~~~d~~tG~~~~~~~~~~~~~~~sp~~~----~~~l~v~~~dG~l~~~~ 377 (377)
T TIGR03300 317 AV--------VGGYLVVGDF-EGYLHWLSREDGSFVARLKTDGSGIASPPVVV----GDGLLVQTRDGDLYAFR 377 (377)
T ss_pred EE--------ECCEEEEEeC-CCEEEEEECCCCCEEEEEEcCCCccccCCEEE----CCEEEEEeCCceEEEeC
Confidence 12 1457777776 89999999888888876655432111122222 24699999999998863
|
Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ. |
| >PLN00181 protein SPA1-RELATED; Provisional | Back alignment and domain information |
|---|
Probab=93.80 E-value=26 Score=44.53 Aligned_cols=218 Identities=15% Similarity=0.177 Sum_probs=111.8
Q ss_pred CCeEEEEEEcCCC------CeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECC
Q 001655 638 EGELTIGTIDDIQ------KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLD 711 (1036)
Q Consensus 638 ~~~l~i~~l~~~~------~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~ 711 (1036)
++.+++-.+.... ..+...+.-......+++++..+.+++.+.. .+.|++.|..+.+.+..++-.
T Consensus 504 D~~I~iwd~~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~~~~~~las~~~---------Dg~v~lWd~~~~~~~~~~~~H 574 (793)
T PLN00181 504 NKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVASSNF---------EGVVQVWDVARSQLVTEMKEH 574 (793)
T ss_pred CCEEEEEECCcccccccccccceEEecccCceeeEEeccCCCCEEEEEeC---------CCeEEEEECCCCeEEEEecCC
Confidence 5667766654210 1122333333445667777654444444332 258999998888776655322
Q ss_pred CCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc--cCe-EEEEE-
Q 001655 712 TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF--NGK-LLAAI- 787 (1036)
Q Consensus 712 ~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~--~g~-Lv~~~- 787 (1036)
...+.++ .+......+++.|. ..|.+.+|++..++. +......+.|.++.-. +|. |++|.
T Consensus 575 -~~~V~~l---~~~p~~~~~L~Sgs----------~Dg~v~iWd~~~~~~--~~~~~~~~~v~~v~~~~~~g~~latgs~ 638 (793)
T PLN00181 575 -EKRVWSI---DYSSADPTLLASGS----------DDGSVKLWSINQGVS--IGTIKTKANICCVQFPSESGRSLAFGSA 638 (793)
T ss_pred -CCCEEEE---EEcCCCCCEEEEEc----------CCCEEEEEECCCCcE--EEEEecCCCeEEEEEeCCCCCEEEEEeC
Confidence 2234444 33322346777775 347899999875432 2233345677777632 354 44433
Q ss_pred CCEEEEEEeeecCCCccchhcccccccceEEEEEEE-eCCEEEEEeccccEEEEEEecc-----CCeEEEEeccCCccee
Q 001655 788 NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQT-RGDFIVVGDLMKSISLLIYKHE-----EGAIEERARDYNANWM 861 (1036)
Q Consensus 788 g~~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~-~~~~I~vgD~~~Sv~~~~~~~~-----~~~l~~~a~D~~~~~~ 861 (1036)
.+.|++|++.. +...+.....+... +.++.. .+++++.|..-..+.++..... ...+..+.. ....+
T Consensus 639 dg~I~iwD~~~---~~~~~~~~~~h~~~--V~~v~f~~~~~lvs~s~D~~ikiWd~~~~~~~~~~~~l~~~~g--h~~~i 711 (793)
T PLN00181 639 DHKVYYYDLRN---PKLPLCTMIGHSKT--VSYVRFVDSSTLVSSSTDNTLKLWDLSMSISGINETPLHSFMG--HTNVK 711 (793)
T ss_pred CCeEEEEECCC---CCccceEecCCCCC--EEEEEEeCCCEEEEEECCCEEEEEeCCCCccccCCcceEEEcC--CCCCe
Confidence 35788877531 11112111112222 233433 4678888877777777654321 112222221 12223
Q ss_pred EEEEEeeCc-eEEEEccCCcEEEEeeC
Q 001655 862 SAVEILDDD-IYLGAENNFNLFTVRKN 887 (1036)
Q Consensus 862 ~~~~~l~~~-~~l~~D~~gnl~~l~~~ 887 (1036)
..+.+-.++ .++++..+|.+.++...
T Consensus 712 ~~v~~s~~~~~lasgs~D~~v~iw~~~ 738 (793)
T PLN00181 712 NFVGLSVSDGYIATGSETNEVFVYHKA 738 (793)
T ss_pred eEEEEcCCCCEEEEEeCCCEEEEEECC
Confidence 333333233 45667778999998764
|
|
| >KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=93.67 E-value=0.53 Score=49.20 Aligned_cols=270 Identities=18% Similarity=0.246 Sum_probs=146.3
Q ss_pred EEEEeCCC--ccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCC-------CeEeEEEEEEE
Q 001655 654 IRSIPLGE--HPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTF-------EYGCSILSCSF 724 (1036)
Q Consensus 654 ~~~i~l~~--tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~-------E~v~si~~~~l 724 (1036)
.|+|..|. -|....|+|+.+.++-.+.+ +.+++.|-.+++.-.....+.. ..+.|+. |
T Consensus 205 ~r~IKFg~KSh~EcA~FSPDgqyLvsgSvD----------GFiEVWny~~GKlrKDLkYQAqd~fMMmd~aVlci~---F 271 (508)
T KOG0275|consen 205 ARSIKFGQKSHVECARFSPDGQYLVSGSVD----------GFIEVWNYTTGKLRKDLKYQAQDNFMMMDDAVLCIS---F 271 (508)
T ss_pred hhheecccccchhheeeCCCCceEeecccc----------ceeeeehhccchhhhhhhhhhhcceeecccceEEEe---e
Confidence 45666653 46667788888755544432 4666666655554432222222 3344443 3
Q ss_pred cCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEecccc--CeEE-EEECCEEEEEEeeecCC
Q 001655 725 SDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN--GKLL-AAINQKIQLYKWMLRDD 801 (1036)
Q Consensus 725 ~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~--g~Lv-~~~g~~l~v~~~~~~~~ 801 (1036)
.. ..++++-|. -.|.|.+|++..+..--.+.+.....|+|+.--+ ..|+ ++..++++++.+. .
T Consensus 272 SR-DsEMlAsGs----------qDGkIKvWri~tG~ClRrFdrAHtkGvt~l~FSrD~SqiLS~sfD~tvRiHGlK---S 337 (508)
T KOG0275|consen 272 SR-DSEMLASGS----------QDGKIKVWRIETGQCLRRFDRAHTKGVTCLSFSRDNSQILSASFDQTVRIHGLK---S 337 (508)
T ss_pred cc-cHHHhhccC----------cCCcEEEEEEecchHHHHhhhhhccCeeEEEEccCcchhhcccccceEEEeccc---c
Confidence 33 356666554 4589999999866544444455566788877543 3555 4455788898864 1
Q ss_pred Cccchhcccccccc-eEEEE--EEEeCCEEEEEeccccEEEEEEeccC--CeEEEEeccCCcceeEEEEEeeC--ceEEE
Q 001655 802 GTRELQSECGHHGH-ILALY--VQTRGDFIVVGDLMKSISLLIYKHEE--GAIEERARDYNANWMSAVEILDD--DIYLG 874 (1036)
Q Consensus 802 ~~~~L~~~~~~~~~-~~~~~--l~~~~~~I~vgD~~~Sv~~~~~~~~~--~~l~~~a~D~~~~~~~~~~~l~~--~~~l~ 874 (1036)
| ++.+. ++++ +++.. .+..|++|+-+..--++.++.-+..+ ..+.+.+.|.. +.++..+.. ..+++
T Consensus 338 G--K~LKE--frGHsSyvn~a~ft~dG~~iisaSsDgtvkvW~~KtteC~~Tfk~~~~d~~---vnsv~~~PKnpeh~iV 410 (508)
T KOG0275|consen 338 G--KCLKE--FRGHSSYVNEATFTDDGHHIISASSDGTVKVWHGKTTECLSTFKPLGTDYP---VNSVILLPKNPEHFIV 410 (508)
T ss_pred c--hhHHH--hcCccccccceEEcCCCCeEEEecCCccEEEecCcchhhhhhccCCCCccc---ceeEEEcCCCCceEEE
Confidence 2 22221 2233 23322 34568899888877777765433221 23444454443 444555543 45899
Q ss_pred EccCCcEEEEeeCCCCCCcccccceeEEEEEEcCCccceEEeeeeeecCCCCCCCCcceEEEEccccceEEEEecChhHH
Q 001655 875 AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954 (1036)
Q Consensus 875 ~D~~gnl~~l~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~l~~~~~~~~~~~~~~il~~t~~GsIg~l~~l~~~~~ 954 (1036)
+.+...++++....+.-.+. ..+.=.=|+.|++... ..+.=+.+.+.||-+|+|.-++-.
T Consensus 411 CNrsntv~imn~qGQvVrsf------sSGkREgGdFi~~~lS------------pkGewiYcigED~vlYCF~~~sG~-- 470 (508)
T KOG0275|consen 411 CNRSNTVYIMNMQGQVVRSF------SSGKREGGDFINAILS------------PKGEWIYCIGEDGVLYCFSVLSGK-- 470 (508)
T ss_pred EcCCCeEEEEeccceEEeee------ccCCccCCceEEEEec------------CCCcEEEEEccCcEEEEEEeecCc--
Confidence 99999999998754211100 0111123445544321 123345555777777777776532
Q ss_pred HHHHHHHHHHHhcccCCCCCCccccccc
Q 001655 955 LFLEKLQTNLRKVIKGVGGLNHEQWRSF 982 (1036)
Q Consensus 955 ~~L~~lq~~l~~~~~~~~gl~~~~~R~~ 982 (1036)
||+-|.-..+.+-|+.|--+++.
T Consensus 471 -----LE~tl~VhEkdvIGl~HHPHqNl 493 (508)
T KOG0275|consen 471 -----LERTLPVHEKDVIGLTHHPHQNL 493 (508)
T ss_pred -----eeeeeecccccccccccCcccch
Confidence 22223333344556666655553
|
|
| >PF14727 PHTB1_N: PTHB1 N-terminus | Back alignment and domain information |
|---|
Probab=93.47 E-value=17 Score=41.50 Aligned_cols=140 Identities=14% Similarity=0.172 Sum_probs=77.0
Q ss_pred CCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeecC-----------CCCCcee-eEEEEEEeCCCceEE
Q 001655 638 EGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-----------SCAEESE-MHFVRLLDDQTFEFI 705 (1036)
Q Consensus 638 ~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~-----------~~~~~~~-~~~l~l~d~~t~~~i 705 (1036)
++.|.|..=+.. ...+.+|----|--++|.+.++.|+++.+.-. ..+.+.+ .....-.. ...+..
T Consensus 154 DG~L~~feqe~~--~f~~~lp~~llPgPl~Y~~~tDsfvt~sss~~l~~Yky~~La~~s~~~~~~~~~~~~~~-~~k~l~ 230 (418)
T PF14727_consen 154 DGSLSFFEQESF--AFSRFLPDFLLPGPLCYCPRTDSFVTASSSWTLECYKYQDLASASEASSRQSGTEQDIS-SGKKLN 230 (418)
T ss_pred CceEEEEeCCcE--EEEEEcCCCCCCcCeEEeecCCEEEEecCceeEEEecHHHhhhcccccccccccccccc-cccccc
Confidence 666666653321 12445555456777999999999988876420 0010000 00000000 111112
Q ss_pred EEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEecccc-----
Q 001655 706 STYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN----- 780 (1036)
Q Consensus 706 ~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~----- 780 (1036)
..-.|.-+|.+..|..+++.......+|.|.. .+++++- ++ ++.+.+..+..+.|++.|.
T Consensus 231 ~dWs~nlGE~~l~i~v~~~~~~~~~IvvLger------------~Lf~l~~-~G--~l~~~krLd~~p~~~~~Y~~~~~~ 295 (418)
T PF14727_consen 231 PDWSFNLGEQALDIQVVRFSSSESDIVVLGER------------SLFCLKD-NG--SLRFQKRLDYNPSCFCPYRVPWYN 295 (418)
T ss_pred ceeEEECCceeEEEEEEEcCCCCceEEEEecc------------eEEEEcC-CC--eEEEEEecCCceeeEEEEEeeccc
Confidence 22356678999999998886544556666643 3333332 33 4455566677777777661
Q ss_pred --C----eEEEEECCEEEEEE
Q 001655 781 --G----KLLAAINQKIQLYK 795 (1036)
Q Consensus 781 --g----~Lv~~~g~~l~v~~ 795 (1036)
+ .|+++..+++.+|+
T Consensus 296 ~~~~~~~llV~t~t~~LlVy~ 316 (418)
T PF14727_consen 296 EPSTRLNLLVGTHTGTLLVYE 316 (418)
T ss_pred CCCCceEEEEEecCCeEEEEe
Confidence 1 45666778899988
|
|
| >KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms] | Back alignment and domain information |
|---|
Probab=93.47 E-value=14 Score=41.60 Aligned_cols=60 Identities=18% Similarity=0.209 Sum_probs=32.1
Q ss_pred ccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeee-cCeEEEEEEeCCCcEEEEEEe
Q 001655 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAF-EGISYLLCALGDGHLLNFLLN 559 (1036)
Q Consensus 495 ~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~-~~~~~L~vgl~~G~l~~y~~~ 559 (1036)
+.+++-|-- ++.+.||+|.. ++++.. +. +.. ..+..+.. ....|+-.+.-.|.++.-.+.
T Consensus 91 S~y~~sgG~-~~~Vkiwdl~~-kl~hr~-lk-dh~-stvt~v~YN~~DeyiAsvs~gGdiiih~~~ 151 (673)
T KOG4378|consen 91 SLYEISGGQ-SGCVKIWDLRA-KLIHRF-LK-DHQ-STVTYVDYNNTDEYIASVSDGGDIIIHGTK 151 (673)
T ss_pred ceeeeccCc-CceeeehhhHH-HHHhhh-cc-CCc-ceeEEEEecCCcceeEEeccCCcEEEEecc
Confidence 455555545 78999999873 222211 11 111 12222222 245677777778888776554
|
|
| >KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=93.35 E-value=8.3 Score=42.99 Aligned_cols=133 Identities=18% Similarity=0.188 Sum_probs=85.3
Q ss_pred eCCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCC--
Q 001655 637 KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE-- 714 (1036)
Q Consensus 637 ~~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E-- 714 (1036)
.+..+++-+++...++..+ +....+..++|..+. .|+.++. .+.|.|.+-.+++++.++....+-
T Consensus 306 rDrT~rlwKi~eesqlifr--g~~~sidcv~~In~~-HfvsGSd----------nG~IaLWs~~KKkplf~~~~AHgv~~ 372 (479)
T KOG0299|consen 306 RDRTVRLWKIPEESQLIFR--GGEGSIDCVAFINDE-HFVSGSD----------NGSIALWSLLKKKPLFTSRLAHGVIP 372 (479)
T ss_pred ccceeEEEeccccceeeee--CCCCCeeeEEEeccc-ceeeccC----------CceEEEeeecccCceeEeeccccccC
Confidence 4667777777654443222 224466666666443 2443332 367888888888888766543221
Q ss_pred ---------eEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCC--eEEEEEEEEecCceeEeccc-cC-
Q 001655 715 ---------YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAF-NG- 781 (1036)
Q Consensus 715 ---------~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~--kl~~~~~~~~~g~v~~i~~~-~g- 781 (1036)
-++++..+ .+..+++-|. ..|.+.+|.+.++ +++++++..+.|-|.+++-. +|
T Consensus 373 ~~~~~~~~~Witsla~i----~~sdL~asGS----------~~G~vrLW~i~~g~r~i~~l~~ls~~GfVNsl~f~~sgk 438 (479)
T KOG0299|consen 373 ELDPVNGNFWITSLAVI----PGSDLLASGS----------WSGCVRLWKIEDGLRAINLLYSLSLVGFVNSLAFSNSGK 438 (479)
T ss_pred CccccccccceeeeEec----ccCceEEecC----------CCCceEEEEecCCccccceeeecccccEEEEEEEccCCC
Confidence 23333332 1234444443 6789999999976 89999999999999999944 55
Q ss_pred eEEEEECCEEEEEEe
Q 001655 782 KLLAAINQKIQLYKW 796 (1036)
Q Consensus 782 ~Lv~~~g~~l~v~~~ 796 (1036)
+|++++|+.=++=+|
T Consensus 439 ~ivagiGkEhRlGRW 453 (479)
T KOG0299|consen 439 RIVAGIGKEHRLGRW 453 (479)
T ss_pred EEEEeccccccccee
Confidence 499999986665455
|
|
| >KOG2321 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=93.33 E-value=1.6 Score=49.71 Aligned_cols=138 Identities=15% Similarity=0.153 Sum_probs=77.1
Q ss_pred EEEEEEcCCeEEEEEEcCcEEEEEEeccCC-ceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecC--
Q 001655 448 QVLLATGGGHLVYLEIGDGILTEVKHAQLE-YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL-- 524 (1036)
Q Consensus 448 ~vvv~~~~~~~~~l~~~~~~l~~~~~~~l~-~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l-- 524 (1036)
..+++ ++.++.-|.++.|++. .++..+ .++.|+++++. ...+++|+. +|.++.|+-.+-+.+..-..
T Consensus 148 ly~~g-sg~evYRlNLEqGrfL--~P~~~~~~~lN~v~in~~------hgLla~Gt~-~g~VEfwDpR~ksrv~~l~~~~ 217 (703)
T KOG2321|consen 148 LYLVG-SGSEVYRLNLEQGRFL--NPFETDSGELNVVSINEE------HGLLACGTE-DGVVEFWDPRDKSRVGTLDAAS 217 (703)
T ss_pred EEEee-cCcceEEEEccccccc--cccccccccceeeeecCc------cceEEeccc-CceEEEecchhhhhheeeeccc
Confidence 44443 4666766666666542 223333 56677787653 568888988 99999998765433332111
Q ss_pred -----CC-c--cCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCccccceeeecCccceEEEEEEeCCceEEEEEcC
Q 001655 525 -----GG-E--IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASD 596 (1036)
Q Consensus 525 -----~~-~--~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~ 596 (1036)
+. + ..+.++.+- ++..++=||+++|.++.|.+......+ .....+..|+....+...+....+++.+
T Consensus 218 ~v~s~pg~~~~~svTal~F~--d~gL~~aVGts~G~v~iyDLRa~~pl~---~kdh~~e~pi~~l~~~~~~~q~~v~S~D 292 (703)
T KOG2321|consen 218 SVNSHPGGDAAPSVTALKFR--DDGLHVAVGTSTGSVLIYDLRASKPLL---VKDHGYELPIKKLDWQDTDQQNKVVSMD 292 (703)
T ss_pred ccCCCccccccCcceEEEec--CCceeEEeeccCCcEEEEEcccCCcee---ecccCCccceeeecccccCCCceEEecc
Confidence 11 1 113334322 236789999999999999987543211 1224555666554442223334445444
Q ss_pred CceE
Q 001655 597 RPTV 600 (1036)
Q Consensus 597 ~p~~ 600 (1036)
...+
T Consensus 293 k~~~ 296 (703)
T KOG2321|consen 293 KRIL 296 (703)
T ss_pred hHHh
Confidence 4433
|
|
| >KOG0649 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=93.33 E-value=2 Score=43.63 Aligned_cols=145 Identities=18% Similarity=0.298 Sum_probs=92.7
Q ss_pred CceEEEEEeeeeCCCCCCCcceEEEEEEEe---------CCeEEEEEEEEecCceeEeccccCeEEEEECCEEEEEEeee
Q 001655 728 SNVYYCVGTAYVLPEENEPTKGRILVFIVE---------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798 (1036)
Q Consensus 728 ~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~---------~~kl~~~~~~~~~g~v~~i~~~~g~Lv~~~g~~l~v~~~~~ 798 (1036)
.++|+++|.. -|.|-+++++ .+|++++.....+||+|.++..+..|+.|-...++=+.|.+
T Consensus 21 ~~~~l~agn~----------~G~iav~sl~sl~s~sa~~~gk~~iv~eqahdgpiy~~~f~d~~Lls~gdG~V~gw~W~E 90 (325)
T KOG0649|consen 21 SKQYLFAGNL----------FGDIAVLSLKSLDSGSAEPPGKLKIVPEQAHDGPIYYLAFHDDFLLSGGDGLVYGWEWNE 90 (325)
T ss_pred cceEEEEecC----------CCeEEEEEehhhhccccCCCCCcceeeccccCCCeeeeeeehhheeeccCceEEEeeehh
Confidence 3688888864 3788888888 46888999889999999999999999888778888888864
Q ss_pred cC--CCccchhc-cccccc-----c-eEEEEEEEeCC-EEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEEEEee
Q 001655 799 RD--DGTRELQS-ECGHHG-----H-ILALYVQTRGD-FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868 (1036)
Q Consensus 799 ~~--~~~~~L~~-~~~~~~-----~-~~~~~l~~~~~-~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~ 868 (1036)
.. ++.|+|-. +..++. + +.++.++-..| .+++| -+++ ++.++-+++++...-|--.. .+ .+....
T Consensus 91 ~~es~~~K~lwe~~~P~~~~~~evPeINam~ldP~enSi~~Ag--GD~~-~y~~dlE~G~i~r~~rGHtD-Yv-H~vv~R 165 (325)
T KOG0649|consen 91 EEESLATKRLWEVKIPMQVDAVEVPEINAMWLDPSENSILFAG--GDGV-IYQVDLEDGRIQREYRGHTD-YV-HSVVGR 165 (325)
T ss_pred hhhhccchhhhhhcCccccCcccCCccceeEeccCCCcEEEec--CCeE-EEEEEecCCEEEEEEcCCcc-ee-eeeeec
Confidence 32 23333322 111211 1 24566664444 55555 4444 57888888888766543211 11 122221
Q ss_pred C--ceEEEEccCCcEEEEeeC
Q 001655 869 D--DIYLGAENNFNLFTVRKN 887 (1036)
Q Consensus 869 ~--~~~l~~D~~gnl~~l~~~ 887 (1036)
+ ..++.+-.+|.+.+....
T Consensus 166 ~~~~qilsG~EDGtvRvWd~k 186 (325)
T KOG0649|consen 166 NANGQILSGAEDGTVRVWDTK 186 (325)
T ss_pred ccCcceeecCCCccEEEEecc
Confidence 2 347777778888887643
|
|
| >KOG0289 consensus mRNA splicing factor [General function prediction only] | Back alignment and domain information |
|---|
Probab=93.21 E-value=16 Score=40.47 Aligned_cols=193 Identities=15% Similarity=0.138 Sum_probs=101.5
Q ss_pred eCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEee
Q 001655 658 PLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA 737 (1036)
Q Consensus 658 ~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~ 737 (1036)
+..+..+-+.-||....++-++.+ ++..|-|-.+...+....-+..+.- +.+..|.. .=++.||+
T Consensus 301 ~h~~~V~~ls~h~tgeYllsAs~d----------~~w~Fsd~~~g~~lt~vs~~~s~v~--~ts~~fHp---DgLifgtg 365 (506)
T KOG0289|consen 301 PHEEPVTGLSLHPTGEYLLSASND----------GTWAFSDISSGSQLTVVSDETSDVE--YTSAAFHP---DGLIFGTG 365 (506)
T ss_pred cccccceeeeeccCCcEEEEecCC----------ceEEEEEccCCcEEEEEeeccccce--eEEeeEcC---CceEEecc
Confidence 333444555556665544433322 4555666666666554443333321 22233432 22445554
Q ss_pred eeCCCCCCCcceEEEEEEEeCCeEEEEEEEE-ecCceeEeccc-cCe-EEEEECCE-EEEEEeeecCCCccchhcccc--
Q 001655 738 YVLPEENEPTKGRILVFIVEDGKLQLIAEKE-TKGAVYSLNAF-NGK-LLAAINQK-IQLYKWMLRDDGTRELQSECG-- 811 (1036)
Q Consensus 738 ~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~-~~g~v~~i~~~-~g~-Lv~~~g~~-l~v~~~~~~~~~~~~L~~~~~-- 811 (1036)
- ..|-+.+|+++... .+..+. ..|||.+|.-- ||| |+++.... |++|++. |+...-.
T Consensus 366 t--------~d~~vkiwdlks~~--~~a~Fpght~~vk~i~FsENGY~Lat~add~~V~lwDLR-------Kl~n~kt~~ 428 (506)
T KOG0289|consen 366 T--------PDGVVKIWDLKSQT--NVAKFPGHTGPVKAISFSENGYWLATAADDGSVKLWDLR-------KLKNFKTIQ 428 (506)
T ss_pred C--------CCceEEEEEcCCcc--ccccCCCCCCceeEEEeccCceEEEEEecCCeEEEEEeh-------hhcccceee
Confidence 2 55899999988654 334333 67899999855 675 66677765 8887652 2221111
Q ss_pred ccc--ceEEEEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcc-eeEEEEEeeCceEEEE-ccCCcEEEE
Q 001655 812 HHG--HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN-WMSAVEILDDDIYLGA-ENNFNLFTV 884 (1036)
Q Consensus 812 ~~~--~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~-~~~~~~~l~~~~~l~~-D~~gnl~~l 884 (1036)
.+. ....++++-.|.++.++ -..+.++.|+....+..++..-.... ..+.+.|=+...++++ -.+.++.++
T Consensus 429 l~~~~~v~s~~fD~SGt~L~~~--g~~l~Vy~~~k~~k~W~~~~~~~~~sg~st~v~Fg~~aq~l~s~smd~~l~~~ 503 (506)
T KOG0289|consen 429 LDEKKEVNSLSFDQSGTYLGIA--GSDLQVYICKKKTKSWTEIKELADHSGLSTGVRFGEHAQYLASTSMDAILRLY 503 (506)
T ss_pred ccccccceeEEEcCCCCeEEee--cceeEEEEEecccccceeeehhhhcccccceeeecccceEEeeccchhheEEe
Confidence 111 13456666678888887 77888998886554443333221111 2345555444444443 234444433
|
|
| >PTZ00420 coronin; Provisional | Back alignment and domain information |
|---|
Probab=93.16 E-value=24 Score=42.25 Aligned_cols=214 Identities=14% Similarity=0.109 Sum_probs=104.4
Q ss_pred CccceEEEecC-CCEEEEEEeecCCCCCceeeEEEEEEeCCCce-EEE-----EEECCCCC-eEeEEEEEEEcCCCceEE
Q 001655 661 EHPRRICHQEQ-SRTFAICSLKNQSCAEESEMHFVRLLDDQTFE-FIS-----TYPLDTFE-YGCSILSCSFSDDSNVYY 732 (1036)
Q Consensus 661 ~tp~~i~y~~~-~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~-~i~-----~~~~~~~E-~v~si~~~~l~~~~~~~i 732 (1036)
..+..+.++|. .+.++.++.+ +.|++.|..+.. ... ...+..+. .+.+ +.+......++
T Consensus 75 ~~V~~lafsP~~~~lLASgS~D----------gtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~s---Vaf~P~g~~iL 141 (568)
T PTZ00420 75 SSILDLQFNPCFSEILASGSED----------LTIRVWEIPHNDESVKEIKDPQCILKGHKKKISI---IDWNPMNYYIM 141 (568)
T ss_pred CCEEEEEEcCCCCCEEEEEeCC----------CeEEEEECCCCCccccccccceEEeecCCCcEEE---EEECCCCCeEE
Confidence 45677888886 4555544432 478888764321 110 01222222 2333 33333333334
Q ss_pred EEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc-cCeEEEEE--CCEEEEEEeeecCCCccchhcc
Q 001655 733 CVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF-NGKLLAAI--NQKIQLYKWMLRDDGTRELQSE 809 (1036)
Q Consensus 733 ~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~-~g~Lv~~~--g~~l~v~~~~~~~~~~~~L~~~ 809 (1036)
+.|. .-|.|.+|++..++.. ......++|++++-- +|.++++. +++|++|+.. . + ..+...
T Consensus 142 aSgS----------~DgtIrIWDl~tg~~~--~~i~~~~~V~SlswspdG~lLat~s~D~~IrIwD~R--s-g-~~i~tl 205 (568)
T PTZ00420 142 CSSG----------FDSFVNIWDIENEKRA--FQINMPKKLSSLKWNIKGNLLSGTCVGKHMHIIDPR--K-Q-EIASSF 205 (568)
T ss_pred EEEe----------CCCeEEEEECCCCcEE--EEEecCCcEEEEEECCCCCEEEEEecCCEEEEEECC--C-C-cEEEEE
Confidence 4443 2378999999866532 233456788888743 56666544 5689998753 1 1 111111
Q ss_pred cccccce--EEEEE---EEeCCEEEEEecc----ccEEEEEEeccCCeEEEEeccCCcceeEEEEEeeC--c-eEEEEcc
Q 001655 810 CGHHGHI--LALYV---QTRGDFIVVGDLM----KSISLLIYKHEEGAIEERARDYNANWMSAVEILDD--D-IYLGAEN 877 (1036)
Q Consensus 810 ~~~~~~~--~~~~l---~~~~~~I~vgD~~----~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~--~-~~l~~D~ 877 (1036)
..+.... ..+.+ ...+++|+.+-.- +.+.++..+.....+..+.-|..+-.++. +.+. + .++++-.
T Consensus 206 ~gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~~~~R~VkLWDlr~~~~pl~~~~ld~~~~~L~p--~~D~~tg~l~lsGkG 283 (568)
T PTZ00420 206 HIHDGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREMKLWDLKNTTSALVTMSIDNASAPLIP--HYDESTGLIYLIGKG 283 (568)
T ss_pred ecccCCceeEEEEeeeEcCCCCEEEEEEcCCCCccEEEEEECCCCCCceEEEEecCCccceEE--eeeCCCCCEEEEEEC
Confidence 1122111 11121 1345677764433 34666544322333333333333322222 2232 2 3567778
Q ss_pred CCcEEEEeeCCCCCCcccccceeEEEEEEcCCccce
Q 001655 878 NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913 (1036)
Q Consensus 878 ~gnl~~l~~~~~~~~~~~~~~L~~~~~~~lg~~v~~ 913 (1036)
++++.+|++..+ .+....+|+.++....
T Consensus 284 D~tIr~~e~~~~--------~~~~l~~~~s~~p~~g 311 (568)
T PTZ00420 284 DGNCRYYQHSLG--------SIRKVNEYKSCSPFRS 311 (568)
T ss_pred CCeEEEEEccCC--------cEEeecccccCCCccc
Confidence 999999998642 2334445554444433
|
|
| >KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism] | Back alignment and domain information |
|---|
Probab=93.05 E-value=3.4 Score=42.19 Aligned_cols=142 Identities=12% Similarity=0.089 Sum_probs=89.8
Q ss_pred CCeEEE-EecCcEEEEecCCccceeeeeCCCCCcEEEEEeC--CCEEEEEEcCCeEEEEEEcCcEEEEEEeccCCceeEE
Q 001655 406 YNQLVQ-VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATAN--ASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISC 482 (1036)
Q Consensus 406 ~~~~vQ-Vt~~~i~l~~~~~~~~~~~w~~~~~~~I~~as~~--~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l~~~is~ 482 (1036)
+..|+- --...||+.|-.+++.++....+. .++.+.+. +.++.++. ++.+.+.+.+ .|..+++++++..|.+
T Consensus 155 D~~iLSSadd~tVRLWD~rTgt~v~sL~~~s--~VtSlEvs~dG~ilTia~-gssV~Fwdak--sf~~lKs~k~P~nV~S 229 (334)
T KOG0278|consen 155 DKCILSSADDKTVRLWDHRTGTEVQSLEFNS--PVTSLEVSQDGRILTIAY-GSSVKFWDAK--SFGLLKSYKMPCNVES 229 (334)
T ss_pred CceEEeeccCCceEEEEeccCcEEEEEecCC--CCcceeeccCCCEEEEec-CceeEEeccc--cccceeeccCcccccc
Confidence 566666 346789999988877787776543 56677654 55666665 6777776653 5667788888888887
Q ss_pred EEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecC
Q 001655 483 LDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMK 561 (1036)
Q Consensus 483 l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~ 561 (1036)
-++.| ...+++.|-. +..++-|+..+-+-+-...-....+.+++.+.+ +...--.|..||.+..++..+.
T Consensus 230 ASL~P------~k~~fVaGge-d~~~~kfDy~TgeEi~~~nkgh~gpVhcVrFSP--dGE~yAsGSEDGTirlWQt~~~ 299 (334)
T KOG0278|consen 230 ASLHP------KKEFFVAGGE-DFKVYKFDYNTGEEIGSYNKGHFGPVHCVRFSP--DGELYASGSEDGTIRLWQTTPG 299 (334)
T ss_pred ccccC------CCceEEecCc-ceEEEEEeccCCceeeecccCCCCceEEEEECC--CCceeeccCCCceEEEEEecCC
Confidence 77765 1345555544 667777776654332221111112445555443 2333456999999999987764
|
|
| >KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport] | Back alignment and domain information |
|---|
Probab=93.03 E-value=3.3 Score=43.70 Aligned_cols=134 Identities=12% Similarity=0.191 Sum_probs=82.5
Q ss_pred CeEEEEEeCC--CccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeC------CCceEEEEEE-----CCCCCeEe
Q 001655 651 KLHIRSIPLG--EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD------QTFEFISTYP-----LDTFEYGC 717 (1036)
Q Consensus 651 ~~~~~~i~l~--~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~------~t~~~i~~~~-----~~~~E~v~ 717 (1036)
.|.-++.-.+ ....-|.|-|...-+-+++... -+.||+++. ..|.....+. ...++...
T Consensus 101 ~Wv~~ttl~DsrssV~DV~FaP~hlGLklA~~~a--------DG~lRIYEA~dp~nLs~W~Lq~Ei~~~~~pp~~~~~~~ 172 (361)
T KOG2445|consen 101 RWVRRTTLVDSRSSVTDVKFAPKHLGLKLAAASA--------DGILRIYEAPDPMNLSQWTLQHEIQNVIDPPGKNKQPC 172 (361)
T ss_pred eeEEEEEeecCCcceeEEEecchhcceEEEEecc--------CcEEEEEecCCccccccchhhhhhhhccCCcccccCcc
Confidence 4554444332 3455666766444333333221 157777753 3344333222 23566667
Q ss_pred EEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCC--eEEEEEEE-EecCceeEeccc--cC--e--EEEEEC
Q 001655 718 SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG--KLQLIAEK-ETKGAVYSLNAF--NG--K--LLAAIN 788 (1036)
Q Consensus 718 si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~--kl~~~~~~-~~~g~v~~i~~~--~g--~--Lv~~~g 788 (1036)
.+.+|.......++|+||. +|+.+..++.++|+..++ |...+.+. ....||++|+-- -| + |.+|.+
T Consensus 173 ~CvsWn~sr~~~p~iAvgs-----~e~a~~~~~~~Iye~~e~~rKw~kva~L~d~~dpI~di~wAPn~Gr~y~~lAvA~k 247 (361)
T KOG2445|consen 173 FCVSWNPSRMHEPLIAVGS-----DEDAPHLNKVKIYEYNENGRKWLKVAELPDHTDPIRDISWAPNIGRSYHLLAVATK 247 (361)
T ss_pred eEEeeccccccCceEEEEc-----ccCCccccceEEEEecCCcceeeeehhcCCCCCcceeeeeccccCCceeeEEEeec
Confidence 7778876556689999998 567789999999999843 66555543 567899998733 23 2 455666
Q ss_pred CEEEEEEee
Q 001655 789 QKIQLYKWM 797 (1036)
Q Consensus 789 ~~l~v~~~~ 797 (1036)
.-|++|.+.
T Consensus 248 Dgv~I~~v~ 256 (361)
T KOG2445|consen 248 DGVRIFKVK 256 (361)
T ss_pred CcEEEEEEe
Confidence 669999874
|
|
| >PRK11138 outer membrane biogenesis protein BamB; Provisional | Back alignment and domain information |
|---|
Probab=92.90 E-value=21 Score=40.94 Aligned_cols=127 Identities=14% Similarity=0.180 Sum_probs=73.1
Q ss_pred cCcEEEEecCCccceeeeeCCCCCcEEEEEeCCCEEEEEEcCCeEEEEEEcCcEEEEEEeccCCce-eEEEEcCCCCCCC
Q 001655 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE-ISCLDINPIGENP 492 (1036)
Q Consensus 414 ~~~i~l~~~~~~~~~~~w~~~~~~~I~~as~~~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l~~~-is~l~~~~~~~~~ 492 (1036)
...+..++..+++ ..|..+.+.. ......++.+.+...++.++.++..+++..=... .+... .+...+
T Consensus 265 ~g~l~ald~~tG~--~~W~~~~~~~-~~~~~~~~~vy~~~~~g~l~ald~~tG~~~W~~~-~~~~~~~~sp~v------- 333 (394)
T PRK11138 265 NGNLVALDLRSGQ--IVWKREYGSV-NDFAVDGGRIYLVDQNDRVYALDTRGGVELWSQS-DLLHRLLTAPVL------- 333 (394)
T ss_pred CCeEEEEECCCCC--EEEeecCCCc-cCcEEECCEEEEEcCCCeEEEEECCCCcEEEccc-ccCCCcccCCEE-------
Confidence 3456667766543 3476543321 1223456667777677888888776654321111 11111 111111
Q ss_pred CcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEE
Q 001655 493 SYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFL 557 (1036)
Q Consensus 493 ~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~ 557 (1036)
....+.++.- +|.+..++..+.+++++..+.......+..+. ...|++++.||.|+.++
T Consensus 334 -~~g~l~v~~~-~G~l~~ld~~tG~~~~~~~~~~~~~~s~P~~~----~~~l~v~t~~G~l~~~~ 392 (394)
T PRK11138 334 -YNGYLVVGDS-EGYLHWINREDGRFVAQQKVDSSGFLSEPVVA----DDKLLIQARDGTVYAIT 392 (394)
T ss_pred -ECCEEEEEeC-CCEEEEEECCCCCEEEEEEcCCCcceeCCEEE----CCEEEEEeCCceEEEEe
Confidence 1456777766 89999999888888887666422222222222 34799999999998865
|
|
| >KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones] | Back alignment and domain information |
|---|
Probab=92.80 E-value=7 Score=42.85 Aligned_cols=64 Identities=11% Similarity=0.118 Sum_probs=44.9
Q ss_pred EEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCc
Q 001655 497 IAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE 564 (1036)
Q Consensus 497 ~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~ 564 (1036)
.+.++.- ++.|.|++++.-..+...... ..+-|.+.. ..+..++++|+.+|.+++|.+...++.
T Consensus 208 Ll~~asl-~nkiki~dlet~~~vssy~a~--~~~wSC~wD-lde~h~IYaGl~nG~VlvyD~R~~~~~ 271 (463)
T KOG1645|consen 208 LLGLASL-GNKIKIMDLETSCVVSSYIAY--NQIWSCCWD-LDERHVIYAGLQNGMVLVYDMRQPEGP 271 (463)
T ss_pred eeeeecc-CceEEEEecccceeeeheecc--CCceeeeec-cCCcceeEEeccCceEEEEEccCCCch
Confidence 4556655 889999999875555444333 345566543 345889999999999999998765443
|
|
| >KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown] | Back alignment and domain information |
|---|
Probab=92.75 E-value=13 Score=37.97 Aligned_cols=154 Identities=15% Similarity=0.143 Sum_probs=86.0
Q ss_pred eeeeecCeeccccceeeeecCCCeEEEEecCcEEEEecCC--ccceeeeeCCCCCcEEEEEeCCCEEEEEEcCCeEEEEE
Q 001655 385 EETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS--RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462 (1036)
Q Consensus 385 ~e~~~~~f~~~~~Tl~~~~~~~~~~vQVt~~~i~l~~~~~--~~~~~~w~~~~~~~I~~as~~~~~vvv~~~~~~~~~l~ 462 (1036)
-+++.-.|..+..-++.|.+ ..++|+.+-.+ .+.++-... .-..|....+++.-++-..-+|.+.-+.
T Consensus 102 aqVNtV~fNeesSVv~Sgsf---------D~s~r~wDCRS~s~ePiQilde-a~D~V~Si~v~~heIvaGS~DGtvRtyd 171 (307)
T KOG0316|consen 102 AQVNTVRFNEESSVVASGSF---------DSSVRLWDCRSRSFEPIQILDE-AKDGVSSIDVAEHEIVAGSVDGTVRTYD 171 (307)
T ss_pred ceeeEEEecCcceEEEeccc---------cceeEEEEcccCCCCccchhhh-hcCceeEEEecccEEEeeccCCcEEEEE
Confidence 34444556555555666655 34556665432 112221111 1123555555555555444466776667
Q ss_pred EcCcEEEEEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeE--EecCCCccCceeEEeeeecC
Q 001655 463 IGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT--KEHLGGEIIPRSVLLCAFEG 540 (1036)
Q Consensus 463 ~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~--~~~l~~~~~p~si~~~~~~~ 540 (1036)
+..|++ .+--+...|+|+++++ .++..+++.- ++++++++-.+.+++. +.+...+...++ ++ .++
T Consensus 172 iR~G~l---~sDy~g~pit~vs~s~------d~nc~La~~l-~stlrLlDk~tGklL~sYkGhkn~eykldc-~l--~qs 238 (307)
T KOG0316|consen 172 IRKGTL---SSDYFGHPITSVSFSK------DGNCSLASSL-DSTLRLLDKETGKLLKSYKGHKNMEYKLDC-CL--NQS 238 (307)
T ss_pred eeccee---ehhhcCCcceeEEecC------CCCEEEEeec-cceeeecccchhHHHHHhcccccceeeeee-ee--ccc
Confidence 755543 2234677899999864 3678888877 9999998866544432 111111111111 11 134
Q ss_pred eEEEEEEeCCCcEEEEEEecC
Q 001655 541 ISYLLCALGDGHLLNFLLNMK 561 (1036)
Q Consensus 541 ~~~L~vgl~~G~l~~y~~~~~ 561 (1036)
..+++-|..||.++.|.+...
T Consensus 239 dthV~sgSEDG~Vy~wdLvd~ 259 (307)
T KOG0316|consen 239 DTHVFSGSEDGKVYFWDLVDE 259 (307)
T ss_pred ceeEEeccCCceEEEEEeccc
Confidence 678999999999999887643
|
|
| >KOG0289 consensus mRNA splicing factor [General function prediction only] | Back alignment and domain information |
|---|
Probab=92.69 E-value=19 Score=39.94 Aligned_cols=144 Identities=16% Similarity=0.169 Sum_probs=85.8
Q ss_pred CCCCeEEEEeCCe-EEEEEEcCCCCeEEEEEe-CCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEE
Q 001655 628 AFPDSLAIAKEGE-LTIGTIDDIQKLHIRSIP-LGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFI 705 (1036)
Q Consensus 628 ~~~~~l~~~~~~~-l~i~~l~~~~~~~~~~i~-l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i 705 (1036)
..++.|+.+++++ .-+..+.....+..-+-. -+-..+..++||+...|+..+.. +.|+++|-++...+
T Consensus 313 ~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~~s~v~~ts~~fHpDgLifgtgt~d----------~~vkiwdlks~~~~ 382 (506)
T KOG0289|consen 313 PTGEYLLSASNDGTWAFSDISSGSQLTVVSDETSDVEYTSAAFHPDGLIFGTGTPD----------GVVKIWDLKSQTNV 382 (506)
T ss_pred cCCcEEEEecCCceEEEEEccCCcEEEEEeeccccceeEEeeEcCCceEEeccCCC----------ceEEEEEcCCcccc
Confidence 3456777776544 444444444333332222 12245678899998888877654 68999998776644
Q ss_pred EEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCe-EEEEEEEEecCceeEeccc-cCeE
Q 001655 706 STYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAF-NGKL 783 (1036)
Q Consensus 706 ~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~k-l~~~~~~~~~g~v~~i~~~-~g~L 783 (1036)
. .|+.++.. +..+.|.. +.-|+++++ ..|.+.+|++.+.+ ++.+...+ +.+|-+++-- .|.+
T Consensus 383 a--~Fpght~~--vk~i~FsE-NGY~Lat~a----------dd~~V~lwDLRKl~n~kt~~l~~-~~~v~s~~fD~SGt~ 446 (506)
T KOG0289|consen 383 A--KFPGHTGP--VKAISFSE-NGYWLATAA----------DDGSVKLWDLRKLKNFKTIQLDE-KKEVNSLSFDQSGTY 446 (506)
T ss_pred c--cCCCCCCc--eeEEEecc-CceEEEEEe----------cCCeEEEEEehhhcccceeeccc-cccceeEEEcCCCCe
Confidence 4 56665543 45667753 455677666 23568999987533 22222222 2245555522 4555
Q ss_pred EEEECCEEEEEEee
Q 001655 784 LAAINQKIQLYKWM 797 (1036)
Q Consensus 784 v~~~g~~l~v~~~~ 797 (1036)
+++.|+.+++|.+.
T Consensus 447 L~~~g~~l~Vy~~~ 460 (506)
T KOG0289|consen 447 LGIAGSDLQVYICK 460 (506)
T ss_pred EEeecceeEEEEEe
Confidence 55559999999874
|
|
| >KOG0646 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=92.29 E-value=9 Score=42.79 Aligned_cols=204 Identities=15% Similarity=0.187 Sum_probs=111.9
Q ss_pred EEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEE
Q 001655 655 RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734 (1036)
Q Consensus 655 ~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~V 734 (1036)
+++-+.+-.+.++-.|. ..|+++... ...|.+..-.|++.+..+ ......++|+. |.+ +..+|+-
T Consensus 76 q~~v~Pg~v~al~s~n~-G~~l~ag~i---------~g~lYlWelssG~LL~v~-~aHYQ~ITcL~---fs~-dgs~iiT 140 (476)
T KOG0646|consen 76 QYIVLPGPVHALASSNL-GYFLLAGTI---------SGNLYLWELSSGILLNVL-SAHYQSITCLK---FSD-DGSHIIT 140 (476)
T ss_pred hhcccccceeeeecCCC-ceEEEeecc---------cCcEEEEEeccccHHHHH-HhhccceeEEE---EeC-CCcEEEe
Confidence 55556666666665544 456666532 135666666666655433 33445566654 443 4566665
Q ss_pred EeeeeCCCCCCCcceEEEEEEEe-------CCeEEEEEEE-EecCceeEecc----ccCeEE-EEECCEEEEEEeeecCC
Q 001655 735 GTAYVLPEENEPTKGRILVFIVE-------DGKLQLIAEK-ETKGAVYSLNA----FNGKLL-AAINQKIQLYKWMLRDD 801 (1036)
Q Consensus 735 GT~~~~~~e~~~~~Gri~v~~i~-------~~kl~~~~~~-~~~g~v~~i~~----~~g~Lv-~~~g~~l~v~~~~~~~~ 801 (1036)
|. .-|++++|.+. ++..+.++.. ++.=||+.+.. .+.+|+ ++..+++++|++..
T Consensus 141 gs----------kDg~V~vW~l~~lv~a~~~~~~~p~~~f~~HtlsITDl~ig~Gg~~~rl~TaS~D~t~k~wdlS~--- 207 (476)
T KOG0646|consen 141 GS----------KDGAVLVWLLTDLVSADNDHSVKPLHIFSDHTLSITDLQIGSGGTNARLYTASEDRTIKLWDLSL--- 207 (476)
T ss_pred cC----------CCccEEEEEEEeecccccCCCccceeeeccCcceeEEEEecCCCccceEEEecCCceEEEEEecc---
Confidence 53 45889999877 2233444433 23446665542 244666 45668889988741
Q ss_pred CccchhcccccccceEEEEEEEeCCEEEEEeccccEEEEEEeccC---CeEEEEecc------------CCcceeEEEEE
Q 001655 802 GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE---GAIEERARD------------YNANWMSAVEI 866 (1036)
Q Consensus 802 ~~~~L~~~~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~---~~l~~~a~D------------~~~~~~~~~~~ 866 (1036)
| .|+....++....++.++-.+..+++|..--.+++..+-.-+ .-+..-+++ .....+++...
T Consensus 208 g--~LLlti~fp~si~av~lDpae~~~yiGt~~G~I~~~~~~~~~~~~~~v~~k~~~~~~t~~~~~~Gh~~~~~ITcLai 285 (476)
T KOG0646|consen 208 G--VLLLTITFPSSIKAVALDPAERVVYIGTEEGKIFQNLLFKLSGQSAGVNQKGRHEENTQINVLVGHENESAITCLAI 285 (476)
T ss_pred c--eeeEEEecCCcceeEEEcccccEEEecCCcceEEeeehhcCCcccccccccccccccceeeeeccccCCcceeEEEE
Confidence 2 344333333334455556667789999988888766543211 100011111 11123444443
Q ss_pred eeCce-EEEEccCCcEEEEeeCC
Q 001655 867 LDDDI-YLGAENNFNLFTVRKNS 888 (1036)
Q Consensus 867 l~~~~-~l~~D~~gnl~~l~~~~ 888 (1036)
=-+++ .+.+|.+|++.+.+...
T Consensus 286 s~DgtlLlSGd~dg~VcvWdi~S 308 (476)
T KOG0646|consen 286 STDGTLLLSGDEDGKVCVWDIYS 308 (476)
T ss_pred ecCccEEEeeCCCCCEEEEecch
Confidence 33444 67899999999988653
|
|
| >KOG0645 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=92.06 E-value=17 Score=37.89 Aligned_cols=256 Identities=15% Similarity=0.181 Sum_probs=141.0
Q ss_pred CCeEEEEEEcCCCCeEEEEEeC---CCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEe--CCCceEEEEEECCC
Q 001655 638 EGELTIGTIDDIQKLHIRSIPL---GEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLD--DQTFEFISTYPLDT 712 (1036)
Q Consensus 638 ~~~l~i~~l~~~~~~~~~~i~l---~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d--~~t~~~i~~~~~~~ 712 (1036)
+..++|-.......|..|.+-- .++.|++++.|..+.++.++.+ +++.+.. ...|+.++.+|=.+
T Consensus 36 Dk~vriw~~~~~~s~~ck~vld~~hkrsVRsvAwsp~g~~La~aSFD----------~t~~Iw~k~~~efecv~~lEGHE 105 (312)
T KOG0645|consen 36 DKAVRIWSTSSGDSWTCKTVLDDGHKRSVRSVAWSPHGRYLASASFD----------ATVVIWKKEDGEFECVATLEGHE 105 (312)
T ss_pred CceEEEEecCCCCcEEEEEeccccchheeeeeeecCCCcEEEEeecc----------ceEEEeecCCCceeEEeeeeccc
Confidence 5677777777555677776653 3689999999999977777664 2344443 25688888888777
Q ss_pred CCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEe-CCeEEEEEE-----EEecCceeEeccccCeEE-E
Q 001655 713 FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIAE-----KETKGAVYSLNAFNGKLL-A 785 (1036)
Q Consensus 713 ~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~-~~kl~~~~~-----~~~~g~v~~i~~~~g~Lv-~ 785 (1036)
+|. +.+.+. ....|++.++- --.+.++++. +..++.+.. .++++.+ -.+-.+-|+ +
T Consensus 106 nEV----K~Vaws-~sG~~LATCSR----------DKSVWiWe~deddEfec~aVL~~HtqDVK~V~--WHPt~dlL~S~ 168 (312)
T KOG0645|consen 106 NEV----KCVAWS-ASGNYLATCSR----------DKSVWIWEIDEDDEFECIAVLQEHTQDVKHVI--WHPTEDLLFSC 168 (312)
T ss_pred cce----eEEEEc-CCCCEEEEeeC----------CCeEEEEEecCCCcEEEEeeeccccccccEEE--EcCCcceeEEe
Confidence 775 333343 24678886652 1267888888 334543332 2233221 112234444 3
Q ss_pred EECCEEEEEEeeecCCCccchhcccccccc---eEEEEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeE
Q 001655 786 AINQKIQLYKWMLRDDGTRELQSECGHHGH---ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMS 862 (1036)
Q Consensus 786 ~~g~~l~v~~~~~~~~~~~~L~~~~~~~~~---~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~ 862 (1036)
+..++|++|+-. +++++++.. .++.+ ...+...-.|++++.++--.-+.+.++... + ...+.+.+.
T Consensus 169 SYDnTIk~~~~~--~dddW~c~~--tl~g~~~TVW~~~F~~~G~rl~s~sdD~tv~Iw~~~~~------~-~~~~sr~~Y 237 (312)
T KOG0645|consen 169 SYDNTIKVYRDE--DDDDWECVQ--TLDGHENTVWSLAFDNIGSRLVSCSDDGTVSIWRLYTD------L-SGMHSRALY 237 (312)
T ss_pred ccCCeEEEEeec--CCCCeeEEE--EecCccceEEEEEecCCCceEEEecCCcceEeeeeccC------c-chhcccceE
Confidence 445899998843 234555543 22333 233334445678777776666766653211 0 122333333
Q ss_pred EEEEeeCceEEEEccCCcEEEEeeCCCCCCccccccee-EEEEEEcCCccceEEeeeeeecCCCCCCCCcceEEEEcccc
Q 001655 863 AVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE-VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941 (1036)
Q Consensus 863 ~~~~l~~~~~l~~D~~gnl~~l~~~~~~~~~~~~~~L~-~~~~~~lg~~v~~~~~~~l~~~~~~~~~~~~~~il~~t~~G 941 (1036)
.+-- .++.+..+-.++-|.+|+........ ..++. .....| +..||++.-. +.....++.|+-||
T Consensus 238 ~v~W-~~~~IaS~ggD~~i~lf~~s~~~d~p--~~~l~~~~~~aH-e~dVNsV~w~----------p~~~~~L~s~~DDG 303 (312)
T KOG0645|consen 238 DVPW-DNGVIASGGGDDAIRLFKESDSPDEP--SWNLLAKKEGAH-EVDVNSVQWN----------PKVSNRLASGGDDG 303 (312)
T ss_pred eeee-cccceEeccCCCEEEEEEecCCCCCc--hHHHHHhhhccc-ccccceEEEc----------CCCCCceeecCCCc
Confidence 3322 24566666678888888876321111 11221 122222 3467765421 12345777788888
Q ss_pred ceEE
Q 001655 942 VIGV 945 (1036)
Q Consensus 942 sIg~ 945 (1036)
.+-.
T Consensus 304 ~v~~ 307 (312)
T KOG0645|consen 304 IVNF 307 (312)
T ss_pred eEEE
Confidence 7654
|
|
| >KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport] | Back alignment and domain information |
|---|
Probab=91.62 E-value=17 Score=37.51 Aligned_cols=66 Identities=17% Similarity=0.257 Sum_probs=53.3
Q ss_pred ceEEEEEeeeeCCCCCCCcceEEEEEEEe-CCeEEEEEEEEecCceeEeccc---cCeEEEEEC-CEEEEEEee
Q 001655 729 NVYYCVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIAEKETKGAVYSLNAF---NGKLLAAIN-QKIQLYKWM 797 (1036)
Q Consensus 729 ~~~i~VGT~~~~~~e~~~~~Gri~v~~i~-~~kl~~~~~~~~~g~v~~i~~~---~g~Lv~~~g-~~l~v~~~~ 797 (1036)
.++++|.|+..+ .....|||+++++. .+.+.++.+.+....++.++=- .+.+++|+| ..+++|+..
T Consensus 21 ~nrLavAt~q~y---Gl~G~G~L~ile~~~~~gi~e~~s~d~~D~LfdV~Wse~~e~~~~~a~GDGSLrl~d~~ 91 (311)
T KOG0277|consen 21 ENRLAVATAQHY---GLAGNGRLFILEVTDPKGIQECQSYDTEDGLFDVAWSENHENQVIAASGDGSLRLFDLT 91 (311)
T ss_pred cchhheeehhhc---ccccCceEEEEecCCCCCeEEEEeeecccceeEeeecCCCcceEEEEecCceEEEeccC
Confidence 467899998877 35688999999996 5679999999999999988733 346778888 689998853
|
|
| >KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis] | Back alignment and domain information |
|---|
Probab=91.40 E-value=18 Score=41.86 Aligned_cols=184 Identities=15% Similarity=0.219 Sum_probs=100.4
Q ss_pred eCCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCc---------eEEEE
Q 001655 637 KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF---------EFIST 707 (1036)
Q Consensus 637 ~~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~---------~~i~~ 707 (1036)
+++.++|-.+-+. -.++++.+++..++|++.|..+.-+++..-. .++.++++.=+ +.+.+
T Consensus 420 dDGtvriWEi~Tg--Rcvr~~~~d~~I~~vaw~P~~~~~vLAvA~~---------~~~~ivnp~~G~~~e~~~t~ell~~ 488 (733)
T KOG0650|consen 420 DDGTVRIWEIATG--RCVRTVQFDSEIRSVAWNPLSDLCVLAVAVG---------ECVLIVNPIFGDRLEVGPTKELLAS 488 (733)
T ss_pred CCCcEEEEEeecc--eEEEEEeecceeEEEEecCCCCceeEEEEec---------CceEEeCccccchhhhcchhhhhhc
Confidence 3577777777653 4578999999999999998777555544321 12344444221 11111
Q ss_pred E----------------ECCCCCeEeEEEEEEEcC-------CCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEE
Q 001655 708 Y----------------PLDTFEYGCSILSCSFSD-------DSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLI 764 (1036)
Q Consensus 708 ~----------------~~~~~E~v~si~~~~l~~-------~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~ 764 (1036)
. ..++.|.-.|+.....+. ....||++-..- ...-++++.++.+++-+ .
T Consensus 489 ~~~~~~p~~~~~~W~~~~~~e~~~~v~~~I~~~k~i~~vtWHrkGDYlatV~~~-------~~~~~VliHQLSK~~sQ-~ 560 (733)
T KOG0650|consen 489 APNESEPDAAVVTWSRASLDELEKGVCIVIKHPKSIRQVTWHRKGDYLATVMPD-------SGNKSVLIHQLSKRKSQ-S 560 (733)
T ss_pred CCCccCCcccceeechhhhhhhccceEEEEecCCccceeeeecCCceEEEeccC-------CCcceEEEEeccccccc-C
Confidence 1 111222222322222211 123555543321 12336777777654433 2
Q ss_pred EEEEecCceeEec--cccCeEEEEECCEEEEEEeeecCCCccchhcccccccceEEEEEEEeCCEEEEEeccccEEEEEE
Q 001655 765 AEKETKGAVYSLN--AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842 (1036)
Q Consensus 765 ~~~~~~g~v~~i~--~~~g~Lv~~~g~~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~ 842 (1036)
.-..-+|-|.++. +...+|++|.-..|++|++.-+ .-.++|...|.+. ..+++...|+-+++|..-+-+..+-.
T Consensus 561 PF~kskG~vq~v~FHPs~p~lfVaTq~~vRiYdL~kq-elvKkL~tg~kwi---S~msihp~GDnli~gs~d~k~~WfDl 636 (733)
T KOG0650|consen 561 PFRKSKGLVQRVKFHPSKPYLFVATQRSVRIYDLSKQ-ELVKKLLTGSKWI---SSMSIHPNGDNLILGSYDKKMCWFDL 636 (733)
T ss_pred chhhcCCceeEEEecCCCceEEEEeccceEEEehhHH-HHHHHHhcCCeee---eeeeecCCCCeEEEecCCCeeEEEEc
Confidence 2224567777765 3366899999999999998422 1224444444332 23455566788888887777655443
Q ss_pred e
Q 001655 843 K 843 (1036)
Q Consensus 843 ~ 843 (1036)
+
T Consensus 637 d 637 (733)
T KOG0650|consen 637 D 637 (733)
T ss_pred c
Confidence 3
|
|
| >KOG2321 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=91.31 E-value=12 Score=42.87 Aligned_cols=207 Identities=15% Similarity=0.178 Sum_probs=114.0
Q ss_pred CeEEEE-EEeCCCcEEEEEEecCCCccccceeeecCccceEEEEEEeCCceEEEEEcCCceEEEecCCcEEEe---eecc
Q 001655 540 GISYLL-CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYS---NVNL 615 (1036)
Q Consensus 540 ~~~~L~-vgl~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~~~~~~~---~~~~ 615 (1036)
+..|++ .|+--=.+-+|.+... +|+. .+.+-..+|.+.++..+-...||...+|..=+....|+ .|. |-..
T Consensus 62 DGqY~lAtG~YKP~ikvydlanL--SLKF--ERhlDae~V~feiLsDD~SK~v~L~~DR~IefHak~G~-hy~~RIP~~G 136 (703)
T KOG2321|consen 62 DGQYLLATGTYKPQIKVYDLANL--SLKF--ERHLDAEVVDFEILSDDYSKSVFLQNDRTIEFHAKYGR-HYRTRIPKFG 136 (703)
T ss_pred CCcEEEEecccCCceEEEEcccc--eeee--eecccccceeEEEeccchhhheEeecCceeeehhhcCe-eeeeecCcCC
Confidence 345554 4666667778887642 2333 34567778888888544456788888876544433232 211 1111
Q ss_pred ccceeeecccCCCCCCeEEEEeCCeEEEEEEcCCCCeEEEEEeCC-CccceEEEecCCCEEEEEEeecCCCCCceeeEEE
Q 001655 616 KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694 (1036)
Q Consensus 616 ~~i~~~~~f~~~~~~~~l~~~~~~~l~i~~l~~~~~~~~~~i~l~-~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l 694 (1036)
.++ .++.+.|. .++..++.. +.+++-.+.-..+.+.+. .....+--.+...++++++.. +.+
T Consensus 137 RDm----~y~~~scD-ly~~gsg~e--vYRlNLEqGrfL~P~~~~~~~lN~v~in~~hgLla~Gt~~----------g~V 199 (703)
T KOG2321|consen 137 RDM----KYHKPSCD-LYLVGSGSE--VYRLNLEQGRFLNPFETDSGELNVVSINEEHGLLACGTED----------GVV 199 (703)
T ss_pred ccc----cccCCCcc-EEEeecCcc--eEEEEccccccccccccccccceeeeecCccceEEecccC----------ceE
Confidence 111 13333332 233334444 445444333222223332 222334445555666665532 689
Q ss_pred EEEeCCCceEEEEEECCCC-------CeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEE
Q 001655 695 RLLDDQTFEFISTYPLDTF-------EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEK 767 (1036)
Q Consensus 695 ~l~d~~t~~~i~~~~~~~~-------E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~ 767 (1036)
+++||.+...+.+++...+ ...-++..+.|.++ .=.++||| +.|.+|+|++.-.+--++..+
T Consensus 200 EfwDpR~ksrv~~l~~~~~v~s~pg~~~~~svTal~F~d~-gL~~aVGt----------s~G~v~iyDLRa~~pl~~kdh 268 (703)
T KOG2321|consen 200 EFWDPRDKSRVGTLDAASSVNSHPGGDAAPSVTALKFRDD-GLHVAVGT----------STGSVLIYDLRASKPLLVKDH 268 (703)
T ss_pred EEecchhhhhheeeecccccCCCccccccCcceEEEecCC-ceeEEeec----------cCCcEEEEEcccCCceeeccc
Confidence 9999987776666554433 11223555667653 45689998 569999999986665556656
Q ss_pred EecCceeEeccc
Q 001655 768 ETKGAVYSLNAF 779 (1036)
Q Consensus 768 ~~~g~v~~i~~~ 779 (1036)
...-|+-.+.-.
T Consensus 269 ~~e~pi~~l~~~ 280 (703)
T KOG2321|consen 269 GYELPIKKLDWQ 280 (703)
T ss_pred CCccceeeeccc
Confidence 666666666543
|
|
| >KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown] | Back alignment and domain information |
|---|
Probab=91.28 E-value=18 Score=41.04 Aligned_cols=240 Identities=16% Similarity=0.201 Sum_probs=122.7
Q ss_pred cceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEE-----EE--C--CCCCeE--eEEEEEEEcCCCceE
Q 001655 663 PRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIST-----YP--L--DTFEYG--CSILSCSFSDDSNVY 731 (1036)
Q Consensus 663 p~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~-----~~--~--~~~E~v--~si~~~~l~~~~~~~ 731 (1036)
.+.+.|++..+.++|++.. +..+++|-..++..-. |. + -.+..+ +|. .|. ..+++-
T Consensus 217 i~sl~ys~Tg~~iLvvsg~----------aqakl~DRdG~~~~e~~KGDQYI~Dm~nTKGHia~lt~g-~wh--P~~k~~ 283 (641)
T KOG0772|consen 217 INSLQYSVTGDQILVVSGS----------AQAKLLDRDGFEIVEFSKGDQYIRDMYNTKGHIAELTCG-CWH--PDNKEE 283 (641)
T ss_pred cceeeecCCCCeEEEEecC----------cceeEEccCCceeeeeeccchhhhhhhccCCceeeeecc-ccc--cCcccc
Confidence 4567799999999888765 3578888766655422 11 1 111111 111 121 111222
Q ss_pred EEEEeeeeCCCCCCCcceEEEEEEEeCCe--EEEEEEEEecC---ceeEeccc--cCeEE-EEE-CCEEEEEEeeecCCC
Q 001655 732 YCVGTAYVLPEENEPTKGRILVFIVEDGK--LQLIAEKETKG---AVYSLNAF--NGKLL-AAI-NQKIQLYKWMLRDDG 802 (1036)
Q Consensus 732 i~VGT~~~~~~e~~~~~Gri~v~~i~~~k--l~~~~~~~~~g---~v~~i~~~--~g~Lv-~~~-g~~l~v~~~~~~~~~ 802 (1036)
++ -+ ..-|.+.+|++.+.| ++++..+...| +|++++ | +|.++ +|+ ...|.+|+. +
T Consensus 284 Fl-T~---------s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~tsC~-~nrdg~~iAagc~DGSIQ~W~~-----~ 347 (641)
T KOG0772|consen 284 FL-TC---------SYDGTLRIWDVNNTKSQLQVIKTKPAGGKRVPVTSCA-WNRDGKLIAAGCLDGSIQIWDK-----G 347 (641)
T ss_pred eE-Ee---------cCCCcEEEEecCCchhheeEEeeccCCCcccCceeee-cCCCcchhhhcccCCceeeeec-----C
Confidence 22 11 134888999998643 55555444433 565544 5 45554 333 467877653 1
Q ss_pred ccch-----hccccccc-ceEEEEEEEeCCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEEEEeeCceEEE--
Q 001655 803 TREL-----QSECGHHG-HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-- 874 (1036)
Q Consensus 803 ~~~L-----~~~~~~~~-~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~~~~l~-- 874 (1036)
.+-. .+.|-... -+..++.+..||+++-=-.-.++.++..+.-...|.....=+.+...+.|+|-.++.+|+
T Consensus 348 ~~~v~p~~~vk~AH~~g~~Itsi~FS~dg~~LlSRg~D~tLKvWDLrq~kkpL~~~tgL~t~~~~tdc~FSPd~kli~TG 427 (641)
T KOG0772|consen 348 SRTVRPVMKVKDAHLPGQDITSISFSYDGNYLLSRGFDDTLKVWDLRQFKKPLNVRTGLPTPFPGTDCCFSPDDKLILTG 427 (641)
T ss_pred CcccccceEeeeccCCCCceeEEEeccccchhhhccCCCceeeeeccccccchhhhcCCCccCCCCccccCCCceEEEec
Confidence 1111 12221111 122344455677776655555666554433222232222223444566777776665443
Q ss_pred -----EccCCcEEEEeeCCCCCCcccccceeEEEEEEcCCccceEEeeeeeecCCCCCCCCcceEEEEccccceEEEEec
Q 001655 875 -----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL 949 (1036)
Q Consensus 875 -----~D~~gnl~~l~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~l~~~~~~~~~~~~~~il~~t~~GsIg~l~~l 949 (1036)
.+..|+|++|.. ..|+.+..+.+. -.+..+. .|. ..-++|+.|+.+|.+.++..=
T Consensus 428 tS~~~~~~~g~L~f~d~----------~t~d~v~ki~i~--~aSvv~~----~Wh----pkLNQi~~gsgdG~~~vyYdp 487 (641)
T KOG0772|consen 428 TSAPNGMTAGTLFFFDR----------MTLDTVYKIDIS--TASVVRC----LWH----PKLNQIFAGSGDGTAHVYYDP 487 (641)
T ss_pred ccccCCCCCceEEEEec----------cceeeEEEecCC--CceEEEE----eec----chhhheeeecCCCceEEEECc
Confidence 345678888864 245555544443 1112121 121 134689999999999887665
Q ss_pred Ch
Q 001655 950 PH 951 (1036)
Q Consensus 950 ~~ 951 (1036)
++
T Consensus 488 ~~ 489 (641)
T KOG0772|consen 488 NE 489 (641)
T ss_pred cc
Confidence 44
|
|
| >PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A | Back alignment and domain information |
|---|
Probab=91.27 E-value=31 Score=39.18 Aligned_cols=200 Identities=17% Similarity=0.191 Sum_probs=106.5
Q ss_pred ccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCccccceeeecC
Q 001655 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574 (1036)
Q Consensus 495 ~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG 574 (1036)
.++++|....+|++.+++..+.+.+........ .+.++.+. .+..++++..+||.+..+.+ .+++ -.+++..|
T Consensus 5 ~~l~~V~~~~~~~v~viD~~t~~~~~~i~~~~~-~h~~~~~s--~Dgr~~yv~~rdg~vsviD~--~~~~--~v~~i~~G 77 (369)
T PF02239_consen 5 GNLFYVVERGSGSVAVIDGATNKVVARIPTGGA-PHAGLKFS--PDGRYLYVANRDGTVSVIDL--ATGK--VVATIKVG 77 (369)
T ss_dssp GGEEEEEEGGGTEEEEEETTT-SEEEEEE-STT-EEEEEE-T--T-SSEEEEEETTSEEEEEET--TSSS--EEEEEE-S
T ss_pred ccEEEEEecCCCEEEEEECCCCeEEEEEcCCCC-ceeEEEec--CCCCEEEEEcCCCeEEEEEC--Cccc--EEEEEecC
Confidence 567777777689999999988887765433321 22233222 23457888889997765544 3322 34567788
Q ss_pred ccceEEEEEEeCCceEEEEEcCC-ceEEEecCCcE---EEeeeccccc----eeeecccCCCCCCeEEEEe--CCeEEEE
Q 001655 575 TQPITLRTFSSKNTTHVFAASDR-PTVIYSSNKKL---LYSNVNLKEV----SHMCPFNSAAFPDSLAIAK--EGELTIG 644 (1036)
Q Consensus 575 ~~pv~l~~~~~~~~~~v~~~~~~-p~~i~~~~~~~---~~~~~~~~~i----~~~~~f~~~~~~~~l~~~~--~~~l~i~ 644 (1036)
..|..+.- ..++..+++.+.. +.+...+...+ ...|....+. ..++.+-.......+++.- .+++.+-
T Consensus 78 ~~~~~i~~--s~DG~~~~v~n~~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd~~~I~vV 155 (369)
T PF02239_consen 78 GNPRGIAV--SPDGKYVYVANYEPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNLKDTGEIWVV 155 (369)
T ss_dssp SEEEEEEE----TTTEEEEEEEETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-SSSSEEEEEETTTTEEEEE
T ss_pred CCcceEEE--cCCCCEEEEEecCCCceeEeccccccceeecccccccccccCCCceeEEecCCCCEEEEEEccCCeEEEE
Confidence 87755432 2344566666533 33333332222 2222211000 0111111111223355443 3555555
Q ss_pred EEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCC
Q 001655 645 TIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDT 712 (1036)
Q Consensus 645 ~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~ 712 (1036)
.......+..+.++.+..|+-..+.|+.+.|.++.... ..+-++|.++.+.+...+...
T Consensus 156 dy~d~~~~~~~~i~~g~~~~D~~~dpdgry~~va~~~s---------n~i~viD~~~~k~v~~i~~g~ 214 (369)
T PF02239_consen 156 DYSDPKNLKVTTIKVGRFPHDGGFDPDGRYFLVAANGS---------NKIAVIDTKTGKLVALIDTGK 214 (369)
T ss_dssp ETTTSSCEEEEEEE--TTEEEEEE-TTSSEEEEEEGGG---------TEEEEEETTTTEEEEEEE-SS
T ss_pred EeccccccceeeecccccccccccCcccceeeeccccc---------ceeEEEeeccceEEEEeeccc
Confidence 44445567889999999999999999988887765432 267788888877665555433
|
... |
| >KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=91.15 E-value=7.7 Score=40.93 Aligned_cols=211 Identities=15% Similarity=0.213 Sum_probs=107.8
Q ss_pred cCCeEEEEEEcCcEEEE-EEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCcee
Q 001655 454 GGGHLVYLEIGDGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532 (1036)
Q Consensus 454 ~~~~~~~l~~~~~~l~~-~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~s 532 (1036)
.++.+.+|++..-.... .+-++-...+-|+++.| .+++++||+. -..+++|++...+--.... +.+..-.+
T Consensus 192 rD~tvKlFDfsK~saKrA~K~~qd~~~vrsiSfHP------sGefllvgTd-Hp~~rlYdv~T~Qcfvsan-Pd~qht~a 263 (430)
T KOG0640|consen 192 RDNTVKLFDFSKTSAKRAFKVFQDTEPVRSISFHP------SGEFLLVGTD-HPTLRLYDVNTYQCFVSAN-PDDQHTGA 263 (430)
T ss_pred CCCeEEEEecccHHHHHHHHHhhccceeeeEeecC------CCceEEEecC-CCceeEEeccceeEeeecC-cccccccc
Confidence 46778888765311000 00111224567888866 3789999998 8899999998755422211 22211223
Q ss_pred EEeeeecCeEEEEE-EeCCCcEEEEEEecCCCccccceeee----cCccceEEEEEEeCCceEEEEEcCCceEEEec--C
Q 001655 533 VLLCAFEGISYLLC-ALGDGHLLNFLLNMKTGELTDRKKVS----LGTQPITLRTFSSKNTTHVFAASDRPTVIYSS--N 605 (1036)
Q Consensus 533 i~~~~~~~~~~L~v-gl~~G~l~~y~~~~~~~~l~~~~~~~----lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~--~ 605 (1036)
|.-++..+.-.|++ |..||.+-.+.=. +++-... -|...|.=-.| ..++..++-.|.-..+-.+. .
T Consensus 264 i~~V~Ys~t~~lYvTaSkDG~IklwDGV------S~rCv~t~~~AH~gsevcSa~F-tkn~kyiLsSG~DS~vkLWEi~t 336 (430)
T KOG0640|consen 264 ITQVRYSSTGSLYVTASKDGAIKLWDGV------SNRCVRTIGNAHGGSEVCSAVF-TKNGKYILSSGKDSTVKLWEIST 336 (430)
T ss_pred eeEEEecCCccEEEEeccCCcEEeeccc------cHHHHHHHHhhcCCceeeeEEE-ccCCeEEeecCCcceeeeeeecC
Confidence 32222222333444 8899988776421 1111111 12222211112 11233344444333222222 2
Q ss_pred CcEE--Eeeecc---ccceeeecccCCCCCCeEEEEeCCeEEEEEEcCCCCeEEEEEeCC--CccceEEEecCCCEEEEE
Q 001655 606 KKLL--YSNVNL---KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG--EHPRRICHQEQSRTFAIC 678 (1036)
Q Consensus 606 ~~~~--~~~~~~---~~i~~~~~f~~~~~~~~l~~~~~~~l~i~~l~~~~~~~~~~i~l~--~tp~~i~y~~~~~~~~v~ 678 (1036)
+|.. |+.-.. +....-+.||. ..+.+++.++..-.+|.-+..+.--.+..++| ..||-|.++|....|.-+
T Consensus 337 ~R~l~~YtGAg~tgrq~~rtqAvFNh--tEdyVl~pDEas~slcsWdaRtadr~~l~slgHn~a~R~i~HSP~~p~FmTc 414 (430)
T KOG0640|consen 337 GRMLKEYTGAGTTGRQKHRTQAVFNH--TEDYVLFPDEASNSLCSWDARTADRVALLSLGHNGAVRWIVHSPVEPAFMTC 414 (430)
T ss_pred CceEEEEecCCcccchhhhhhhhhcC--ccceEEccccccCceeeccccchhhhhhcccCCCCCceEEEeCCCCCceeee
Confidence 3322 221110 11223344543 23577778888888888776654445666776 468899999988877766
Q ss_pred Eee
Q 001655 679 SLK 681 (1036)
Q Consensus 679 ~~~ 681 (1036)
+.+
T Consensus 415 sdD 417 (430)
T KOG0640|consen 415 SDD 417 (430)
T ss_pred ccc
Confidence 544
|
|
| >KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair] | Back alignment and domain information |
|---|
Probab=91.14 E-value=47 Score=41.13 Aligned_cols=464 Identities=13% Similarity=0.130 Sum_probs=215.3
Q ss_pred CeEEEEEeecCeeEEEEeeCCCCCCCcEEEeee---cccCCceeeeEEeccCCCCCceEEEEeeCCCCccEEEEecCCce
Q 001655 260 AVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER---YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGI 336 (1036)
Q Consensus 260 ~~lF~gS~~gds~l~~~~~~~~~~~~~~~~~~~---l~n~~Pi~D~~~~~~~~~~~~~lv~~sG~g~~g~l~~lr~gi~~ 336 (1036)
..+|++...||+ |+|+.++. ++..++ ..-|-|.-++++.... .+..+++.+.|.+ ..+-.++..
T Consensus 452 ~Tif~S~i~g~~-lvQvTs~~------iRl~ss~~~~~~W~~p~~~ti~~~~-~n~sqVvvA~~~~-----~l~y~~i~~ 518 (1096)
T KOG1897|consen 452 QTIFCSTINGNQ-LVQVTSNS------IRLVSSAGLRSEWRPPGKITIGVVS-ANASQVVVAGGGL-----ALFYLEIED 518 (1096)
T ss_pred ceEEEEccCCce-EEEEeccc------EEEEcchhhhhcccCCCceEEEEEe-ecceEEEEecCcc-----EEEEEEeec
Confidence 467888778887 89998743 555543 3567787777776543 3445777765522 222222222
Q ss_pred eEEEeeeCCCcceEEEeecCCCCCcccEEEEEecCceEEEEecCCCCeeeeeecCeeccccceeeeecCCC-eEEEEecC
Q 001655 337 NEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN-QLVQVTSG 415 (1036)
Q Consensus 337 ~~~~~~~l~~~~~iw~l~~~~~~~~~~~lv~S~~~~T~vl~~~~~~~~~e~~~~~f~~~~~Tl~~~~~~~~-~~vQVt~~ 415 (1036)
..+... -- . -+..+-..|.+.. +.+..+ ..+-+++|.+.+- .++-..|+
T Consensus 519 ~~l~e~--------~~---------~-----~~e~evaCLDisp------~~d~~~--~s~~~aVG~Ws~~~~~l~~~pd 568 (1096)
T KOG1897|consen 519 GGLREV--------SH---------K-----EFEYEVACLDISP------LGDAPN--KSRLLAVGLWSDISMILTFLPD 568 (1096)
T ss_pred cceeee--------ee---------h-----eecceeEEEeccc------CCCCCC--cceEEEEEeecceEEEEEECCC
Confidence 111110 00 0 0111112222211 000111 1234556666443 56666677
Q ss_pred cEEEEecCCccceeeeeCCCCCcEEEEEeC--CCEEEEEEcCCeEEEEEEcC--cEEEEEEeccCCceeEEE-EcCCCCC
Q 001655 416 SVRLVSSTSRELRNEWKSPPGYSVNVATAN--ASQVLLATGGGHLVYLEIGD--GILTEVKHAQLEYEISCL-DINPIGE 490 (1036)
Q Consensus 416 ~i~l~~~~~~~~~~~w~~~~~~~I~~as~~--~~~vvv~~~~~~~~~l~~~~--~~l~~~~~~~l~~~is~l-~~~~~~~ 490 (1036)
.+.+..... -.+. .| ++|....+. ..++++++.+|.+.+|.++. +++.+.++..+...+..+ .+..
T Consensus 569 ~~~~~~~~l---~~~~-iP--RSIl~~~~e~d~~yLlvalgdG~l~~fv~d~~tg~lsd~Kk~~lGt~P~~Lr~f~s--- 639 (1096)
T KOG1897|consen 569 LILITHEQL---SGEI-IP--RSILLTTFEGDIHYLLVALGDGALLYFVLDINTGQLSDRKKVTLGTQPISLRTFSS--- 639 (1096)
T ss_pred cceeeeecc---CCCc-cc--hheeeEEeeccceEEEEEcCCceEEEEEEEcccceEccccccccCCCCcEEEEEee---
Confidence 766654321 1111 12 467777655 46899999999999988876 667777777787766555 3321
Q ss_pred CCCcccEEEEEEecCCEEEEEECCCCCeeE----------EecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEec
Q 001655 491 NPSYSQIAAVGMWTDISVRIFSLPDLNLIT----------KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNM 560 (1036)
Q Consensus 491 ~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~----------~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~ 560 (1036)
++...++++- |.--.+|+-.+ ++++ ..++..+..|.+++.+ .+|.|....+|+
T Consensus 640 ---k~~t~vfa~s-drP~viY~~n~-kLv~spls~kev~~~c~f~s~a~~d~l~~~------------~~~~l~i~tid~ 702 (1096)
T KOG1897|consen 640 ---KSRTAVFALS-DRPTVIYSSNG-KLVYSPLSLKEVNHMCPFNSDAYPDSLASA------------NGGALTIGTIDE 702 (1096)
T ss_pred ---CCceEEEEeC-CCCEEEEecCC-cEEEeccchHHhhhhcccccccCCceEEEe------------cCCceEEEEecc
Confidence 1122222232 33344444332 1111 1122223344444333 355666666776
Q ss_pred CCCccccceeeecCccceEEEEEEeCCceEEEEEcCCceEEEecCCcEEEeeeccccceeeecccCCCCCCeEEEEeCCe
Q 001655 561 KTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640 (1036)
Q Consensus 561 ~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~~~~~~~~~~~~~i~~~~~f~~~~~~~~l~~~~~~~ 640 (1036)
.. ++. .+++.+|..|.++.. + .....+++++.|-= .... .+..+..-.++...+++.
T Consensus 703 iq-kl~-irtvpl~~~prrI~~-q-~~sl~~~v~s~r~e--------~~~~-----------~~~ee~~~s~l~vlD~nT 759 (1096)
T KOG1897|consen 703 IQ-KLH-IRTVPLGESPRRICY-Q-ESSLTFGVLSNRIE--------SSAE-----------YYGEEYEVSFLRVLDQNT 759 (1096)
T ss_pred hh-hcc-eeeecCCCChhheEe-c-ccceEEEEEecccc--------cchh-----------hcCCcceEEEEEEecCCc
Confidence 42 232 345788888766532 1 11222333332210 0000 011111112445556777
Q ss_pred EEEEEEcCCCCeEEEEEeCCCccceEE---EecC-CCEEEEEEeecCCCCCceeeEEEEEE---eCCCceEEEEEECCCC
Q 001655 641 LTIGTIDDIQKLHIRSIPLGEHPRRIC---HQEQ-SRTFAICSLKNQSCAEESEMHFVRLL---DDQTFEFISTYPLDTF 713 (1036)
Q Consensus 641 l~i~~l~~~~~~~~~~i~l~~tp~~i~---y~~~-~~~~~v~~~~~~~~~~~~~~~~l~l~---d~~t~~~i~~~~~~~~ 713 (1036)
+..........+ ++..+++ ++.+ ...|+|++.-..++..+-..+.+-++ +.++.+.++..+.+-
T Consensus 760 f~vl~~hef~~~--------E~~~Si~s~~~~~d~~t~~vVGT~~v~Pde~ep~~GRIivfe~~e~~~L~~v~e~~v~G- 830 (1096)
T KOG1897|consen 760 FEVLSSHEFERN--------ETALSIISCKFTDDPNTYYVVGTGLVYPDENEPVNGRIIVFEFEELNSLELVAETVVKG- 830 (1096)
T ss_pred eeEEeecccccc--------ceeeeeeeeeecCCCceEEEEEEEeeccCCCCcccceEEEEEEecCCceeeeeeeeecc-
Confidence 766666555433 2222232 3444 33456666654333333334444443 223344444333221
Q ss_pred CeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCC-eEEEEEEEEecCceeEeccccCeEEEEEC-CEE
Q 001655 714 EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAIN-QKI 791 (1036)
Q Consensus 714 E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~-kl~~~~~~~~~g~v~~i~~~~g~Lv~~~g-~~l 791 (1036)
.++++ +.| +..+++|-+ -.+-+|+...+ .|...+.....-.+..+...++.|++|-- ..+
T Consensus 831 -av~aL--~~f----ngkllA~In-----------~~vrLye~t~~~eLr~e~~~~~~~~aL~l~v~gdeI~VgDlm~Si 892 (1096)
T KOG1897|consen 831 -AVYAL--VEF----NGKLLAGIN-----------QSVRLYEWTTERELRIECNISNPIIALDLQVKGDEIAVGDLMRSI 892 (1096)
T ss_pred -ceeeh--hhh----CCeEEEecC-----------cEEEEEEccccceehhhhcccCCeEEEEEEecCcEEEEeeccceE
Confidence 22222 222 233444432 14456666544 34444433333334444555778877633 345
Q ss_pred EEEEeeecCCCccchhcccccccceEEEEEEE-eCCEEEEEeccccEEEEEEecc
Q 001655 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQT-RGDFIVVGDLMKSISLLIYKHE 845 (1036)
Q Consensus 792 ~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~-~~~~I~vgD~~~Sv~~~~~~~~ 845 (1036)
.+..+... +|. +.-.|..-.+-+.++... .++..+.+|..--+.+++++.+
T Consensus 893 tll~y~~~-eg~--f~evArD~~p~Wmtaveil~~d~ylgae~~gNlf~v~~d~~ 944 (1096)
T KOG1897|consen 893 TLLQYKGD-EGN--FEEVARDYNPNWMTAVEILDDDTYLGAENSGNLFTVRKDSD 944 (1096)
T ss_pred EEEEEecc-CCc--eEEeehhhCccceeeEEEecCceEEeecccccEEEEEecCC
Confidence 55444322 221 222222212223333332 3556677888888888888753
|
|
| >KOG0646 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=90.72 E-value=21 Score=40.02 Aligned_cols=203 Identities=13% Similarity=0.110 Sum_probs=103.3
Q ss_pred ceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccc-cCeEEEEECCEEEEEEeeecC----CCc
Q 001655 729 NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF-NGKLLAAINQKIQLYKWMLRD----DGT 803 (1036)
Q Consensus 729 ~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~-~g~Lv~~~g~~l~v~~~~~~~----~~~ 803 (1036)
..|++.|| -+|.||+|++..++|-.+. ..+-.+|+++..- +|..++..|+.=.++-|...+ +..
T Consensus 93 G~~l~ag~----------i~g~lYlWelssG~LL~v~-~aHYQ~ITcL~fs~dgs~iiTgskDg~V~vW~l~~lv~a~~~ 161 (476)
T KOG0646|consen 93 GYFLLAGT----------ISGNLYLWELSSGILLNVL-SAHYQSITCLKFSDDGSHIITGSKDGAVLVWLLTDLVSADND 161 (476)
T ss_pred ceEEEeec----------ccCcEEEEEeccccHHHHH-HhhccceeEEEEeCCCcEEEecCCCccEEEEEEEeecccccC
Confidence 45666666 5689999999977653222 3455688888866 567777777654444453211 111
Q ss_pred cchhcccccccce-EEEEEEEe----CCEEEEEeccccEEEEEEeccCCeEEEEeccCCcceeEEEEEeeC--ceEEEEc
Q 001655 804 RELQSECGHHGHI-LALYVQTR----GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD--DIYLGAE 876 (1036)
Q Consensus 804 ~~L~~~~~~~~~~-~~~~l~~~----~~~I~vgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~--~~~l~~D 876 (1036)
....+.+.+..+. .++++.+- +.+++..-.-+-+.++ +-..+.|..-. ..|+..+|+ .+|. ..+.++-
T Consensus 162 ~~~~p~~~f~~HtlsITDl~ig~Gg~~~rl~TaS~D~t~k~w--dlS~g~LLlti--~fp~si~av-~lDpae~~~yiGt 236 (476)
T KOG0646|consen 162 HSVKPLHIFSDHTLSITDLQIGSGGTNARLYTASEDRTIKLW--DLSLGVLLLTI--TFPSSIKAV-ALDPAERVVYIGT 236 (476)
T ss_pred CCccceeeeccCcceeEEEEecCCCccceEEEecCCceEEEE--EeccceeeEEE--ecCCcceeE-EEcccccEEEecC
Confidence 1122222233332 34445432 2356555555555443 32233332222 233333333 2333 2466677
Q ss_pred cCCcEEEEeeCCCCCCccc----ccce--eE--EEEEEcC-CccceEEeeeeeecCCCCCCCCcceEEEEccccceEEEE
Q 001655 877 NNFNLFTVRKNSEGATDEE----RGRL--EV--VGEYHLG-EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIA 947 (1036)
Q Consensus 877 ~~gnl~~l~~~~~~~~~~~----~~~L--~~--~~~~~lg-~~v~~~~~~~l~~~~~~~~~~~~~~il~~t~~GsIg~l~ 947 (1036)
.+|+|++..+..-...+.+ +... .. ...=|-+ ..|+++.- ..+..-++-|..||.+-.--
T Consensus 237 ~~G~I~~~~~~~~~~~~~~v~~k~~~~~~t~~~~~~Gh~~~~~ITcLai-----------s~DgtlLlSGd~dg~VcvWd 305 (476)
T KOG0646|consen 237 EEGKIFQNLLFKLSGQSAGVNQKGRHEENTQINVLVGHENESAITCLAI-----------STDGTLLLSGDEDGKVCVWD 305 (476)
T ss_pred CcceEEeeehhcCCcccccccccccccccceeeeeccccCCcceeEEEE-----------ecCccEEEeeCCCCCEEEEe
Confidence 8999999887654432210 0000 00 1111222 34555432 13456788888999887666
Q ss_pred ecChhHHHHHH
Q 001655 948 SLPHEQYLFLE 958 (1036)
Q Consensus 948 ~l~~~~~~~L~ 958 (1036)
+-+.+-.|.|.
T Consensus 306 i~S~Q~iRtl~ 316 (476)
T KOG0646|consen 306 IYSKQCIRTLQ 316 (476)
T ss_pred cchHHHHHHHh
Confidence 55655444443
|
|
| >KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only] | Back alignment and domain information |
|---|
Probab=90.22 E-value=25 Score=36.34 Aligned_cols=96 Identities=14% Similarity=0.149 Sum_probs=61.3
Q ss_pred CCeEEEEEEcCcEEEEEEeccCC-ceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeE
Q 001655 455 GGHLVYLEIGDGILTEVKHAQLE-YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV 533 (1036)
Q Consensus 455 ~~~~~~l~~~~~~l~~~~~~~l~-~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si 533 (1036)
...+.+|++..++-..+..++.. +.|+++.+.. .+.++.-|.. ||+++||.+..++. .-.+....+..++
T Consensus 60 ~qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~------dgrWMyTgse-Dgt~kIWdlR~~~~--qR~~~~~spVn~v 130 (311)
T KOG0315|consen 60 NQHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQC------DGRWMYTGSE-DGTVKIWDLRSLSC--QRNYQHNSPVNTV 130 (311)
T ss_pred CCeeEEEEccCCCCCceeEEeccCCceEEEEEee------cCeEEEecCC-CceEEEEeccCccc--chhccCCCCcceE
Confidence 56677777765444344444443 5677777742 2566666666 99999999876432 1112212345566
Q ss_pred EeeeecCeEEEEEEeCCCcEEEEEEecC
Q 001655 534 LLCAFEGISYLLCALGDGHLLNFLLNMK 561 (1036)
Q Consensus 534 ~~~~~~~~~~L~vgl~~G~l~~y~~~~~ 561 (1036)
++.+ .+..|++|..+|.+-++.+...
T Consensus 131 vlhp--nQteLis~dqsg~irvWDl~~~ 156 (311)
T KOG0315|consen 131 VLHP--NQTELISGDQSGNIRVWDLGEN 156 (311)
T ss_pred EecC--CcceEEeecCCCcEEEEEccCC
Confidence 5553 4678999999999999887643
|
|
| >TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL | Back alignment and domain information |
|---|
Probab=90.21 E-value=38 Score=38.52 Aligned_cols=100 Identities=18% Similarity=0.188 Sum_probs=60.0
Q ss_pred eCCCEEEEEEcCCeEEEEEEcCcEEEEEEeccCCceeE-EEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEe
Q 001655 444 ANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS-CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKE 522 (1036)
Q Consensus 444 ~~~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l~~~is-~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~ 522 (1036)
+.++.+++...++.+..++..+|+..=. .++...+. ...+ ....+++++. ++.+..++..+.+++++.
T Consensus 63 v~~~~v~v~~~~g~v~a~d~~tG~~~W~--~~~~~~~~~~p~v--------~~~~v~v~~~-~g~l~ald~~tG~~~W~~ 131 (377)
T TIGR03300 63 VAGGKVYAADADGTVVALDAETGKRLWR--VDLDERLSGGVGA--------DGGLVFVGTE-KGEVIALDAEDGKELWRA 131 (377)
T ss_pred EECCEEEEECCCCeEEEEEccCCcEeee--ecCCCCcccceEE--------cCCEEEEEcC-CCEEEEEECCCCcEeeee
Confidence 4467777777677777776655544322 22333221 1122 1457788888 899999998888888876
Q ss_pred cCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEe
Q 001655 523 HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLN 559 (1036)
Q Consensus 523 ~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~ 559 (1036)
.+..... .+..+ ....++++..+|.|+.+...
T Consensus 132 ~~~~~~~-~~p~v----~~~~v~v~~~~g~l~a~d~~ 163 (377)
T TIGR03300 132 KLSSEVL-SPPLV----ANGLVVVRTNDGRLTALDAA 163 (377)
T ss_pred ccCceee-cCCEE----ECCEEEEECCCCeEEEEEcC
Confidence 5543211 11111 13468889999988776654
|
Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ. |
| >COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism] | Back alignment and domain information |
|---|
Probab=89.69 E-value=35 Score=37.26 Aligned_cols=232 Identities=18% Similarity=0.215 Sum_probs=125.8
Q ss_pred CCeEEEEEEcC--cEEEEEEeccCCcee-EEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCC-CCe--e-E-EecCCC
Q 001655 455 GGHLVYLEIGD--GILTEVKHAQLEYEI-SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD-LNL--I-T-KEHLGG 526 (1036)
Q Consensus 455 ~~~~~~l~~~~--~~l~~~~~~~l~~~i-s~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~-l~~--v-~-~~~l~~ 526 (1036)
.+.+..|++|+ |+|..+.+..++... +.+++++ ...+++++.+..|++.+|.+.+ ..+ + . ..+...
T Consensus 63 ~ggvaay~iD~~~G~Lt~ln~~~~~g~~p~yvsvd~------~g~~vf~AnY~~g~v~v~p~~~dG~l~~~v~~~~h~g~ 136 (346)
T COG2706 63 EGGVAAYRIDPDDGRLTFLNRQTLPGSPPCYVSVDE------DGRFVFVANYHSGSVSVYPLQADGSLQPVVQVVKHTGS 136 (346)
T ss_pred cCcEEEEEEcCCCCeEEEeeccccCCCCCeEEEECC------CCCEEEEEEccCceEEEEEcccCCccccceeeeecCCC
Confidence 35566678875 889888887776655 5667653 3678999988889999999964 222 1 1 111211
Q ss_pred ---c--cC--ceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCccccce--eeecCccceEEEEEEeCCceEEEEEcCC
Q 001655 527 ---E--II--PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRK--KVSLGTQPITLRTFSSKNTTHVFAASDR 597 (1036)
Q Consensus 527 ---~--~~--p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~--~~~lG~~pv~l~~~~~~~~~~v~~~~~~ 597 (1036)
+ .. +++..+.+- ++..+.+-|+-..+..|.++ +|.|.... ...-|.-|..+ .|+- +...+++.++=
T Consensus 137 ~p~~rQ~~~h~H~a~~tP~-~~~l~v~DLG~Dri~~y~~~--dg~L~~~~~~~v~~G~GPRHi-~FHp-n~k~aY~v~EL 211 (346)
T COG2706 137 GPHERQESPHVHSANFTPD-GRYLVVPDLGTDRIFLYDLD--DGKLTPADPAEVKPGAGPRHI-VFHP-NGKYAYLVNEL 211 (346)
T ss_pred CCCccccCCccceeeeCCC-CCEEEEeecCCceEEEEEcc--cCccccccccccCCCCCcceE-EEcC-CCcEEEEEecc
Confidence 0 01 233333321 23445568888899999988 45554332 22345545433 2322 34455555432
Q ss_pred --ceEEEecC---CcEEEe-eec-----cccceeeecccCCCCCCeEEEEe---CCeEEEEEEcCCC-Ce-EEEEEeCCC
Q 001655 598 --PTVIYSSN---KKLLYS-NVN-----LKEVSHMCPFNSAAFPDSLAIAK---EGELTIGTIDDIQ-KL-HIRSIPLGE 661 (1036)
Q Consensus 598 --p~~i~~~~---~~~~~~-~~~-----~~~i~~~~~f~~~~~~~~l~~~~---~~~l~i~~l~~~~-~~-~~~~i~l~~ 661 (1036)
...++..+ +++.-. .+. ...-...+.++....+. +++++ .+.|.+.+++... ++ .+...+..+
T Consensus 212 ~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGr-FLYasNRg~dsI~~f~V~~~~g~L~~~~~~~teg 290 (346)
T COG2706 212 NSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGR-FLYASNRGHDSIAVFSVDPDGGKLELVGITPTEG 290 (346)
T ss_pred CCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCC-EEEEecCCCCeEEEEEEcCCCCEEEEEEEeccCC
Confidence 22222222 333211 111 11122233333333333 55554 4789999998874 33 345566655
Q ss_pred -ccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEE
Q 001655 662 -HPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFI 705 (1036)
Q Consensus 662 -tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i 705 (1036)
+||-....+..+.++++--+ ...-.+--+|++|++.-
T Consensus 291 ~~PR~F~i~~~g~~Liaa~q~-------sd~i~vf~~d~~TG~L~ 328 (346)
T COG2706 291 QFPRDFNINPSGRFLIAANQK-------SDNITVFERDKETGRLT 328 (346)
T ss_pred cCCccceeCCCCCEEEEEccC-------CCcEEEEEEcCCCceEE
Confidence 59999998887766655321 12233444577777643
|
|
| >KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism] | Back alignment and domain information |
|---|
Probab=89.68 E-value=27 Score=36.31 Aligned_cols=106 Identities=16% Similarity=0.129 Sum_probs=63.8
Q ss_pred EEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCC
Q 001655 666 ICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENE 745 (1036)
Q Consensus 666 i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~ 745 (1036)
++..+++++++++.++. ..+.-+|+++++.+..-++...-+...+ + . ..++++|+
T Consensus 16 LVV~~dskT~v~igSHs---------~~~~avd~~sG~~~We~ilg~RiE~sa~--v-v----gdfVV~GC--------- 70 (354)
T KOG4649|consen 16 LVVCNDSKTLVVIGSHS---------GIVIAVDPQSGNLIWEAILGVRIECSAI--V-V----GDFVVLGC--------- 70 (354)
T ss_pred EEEecCCceEEEEecCC---------ceEEEecCCCCcEEeehhhCceeeeeeE--E-E----CCEEEEEE---------
Confidence 34456778999888764 4567789999888877666543332222 2 1 46799998
Q ss_pred CcceEEEEEEEeCCeEEEEEEEEecCceeEeccccCeEEEEECCEEEEEEee
Q 001655 746 PTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797 (1036)
Q Consensus 746 ~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~g~Lv~~~g~~l~v~~~~ 797 (1036)
.+|.+|++.++.+..---+...-.-.+.+.+.+.+.|+.+-...=..|.++
T Consensus 71 -y~g~lYfl~~~tGs~~w~f~~~~~vk~~a~~d~~~glIycgshd~~~yalD 121 (354)
T KOG4649|consen 71 -YSGGLYFLCVKTGSQIWNFVILETVKVRAQCDFDGGLIYCGSHDGNFYALD 121 (354)
T ss_pred -ccCcEEEEEecchhheeeeeehhhhccceEEcCCCceEEEecCCCcEEEec
Confidence 678999999886532111111112234566677776665544333344443
|
|
| >KOG0294 consensus WD40 repeat-containing protein [Function unknown] | Back alignment and domain information |
|---|
Probab=89.62 E-value=32 Score=36.78 Aligned_cols=195 Identities=12% Similarity=0.110 Sum_probs=108.4
Q ss_pred EEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecC
Q 001655 692 HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKG 771 (1036)
Q Consensus 692 ~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g 771 (1036)
+.+.+.+.++|+.+.++.=-.+. |+-+.. ...++=-+.||+- +.+-.|++.++|--.+.......
T Consensus 107 G~i~iw~~~~W~~~~slK~H~~~-Vt~lsi---HPS~KLALsVg~D-----------~~lr~WNLV~Gr~a~v~~L~~~a 171 (362)
T KOG0294|consen 107 GHIIIWRVGSWELLKSLKAHKGQ-VTDLSI---HPSGKLALSVGGD-----------QVLRTWNLVRGRVAFVLNLKNKA 171 (362)
T ss_pred CcEEEEEcCCeEEeeeecccccc-cceeEe---cCCCceEEEEcCC-----------ceeeeehhhcCccceeeccCCcc
Confidence 57888899999988876654444 544432 2223445566652 35556666554432222211111
Q ss_pred ceeEeccccCeEEEEECCEEEEEEeeecCCCccchhcccccccceEEEEEEE-eCCEEEEEeccccEEEEEEeccCCeEE
Q 001655 772 AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQT-RGDFIVVGDLMKSISLLIYKHEEGAIE 850 (1036)
Q Consensus 772 ~v~~i~~~~g~Lv~~~g~~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~-~~~~I~vgD~~~Sv~~~~~~~~~~~l~ 850 (1036)
-...-.+-+++.++..-++|-+|.++ ..++.+..... --+.++.. .+++++||=-..-+.+ ++.++ ...
T Consensus 172 t~v~w~~~Gd~F~v~~~~~i~i~q~d-----~A~v~~~i~~~--~r~l~~~~l~~~~L~vG~d~~~i~~--~D~ds-~~~ 241 (362)
T KOG0294|consen 172 TLVSWSPQGDHFVVSGRNKIDIYQLD-----NASVFREIENP--KRILCATFLDGSELLVGGDNEWISL--KDTDS-DTP 241 (362)
T ss_pred eeeEEcCCCCEEEEEeccEEEEEecc-----cHhHhhhhhcc--ccceeeeecCCceEEEecCCceEEE--eccCC-Ccc
Confidence 11111122457888889999999974 22333322211 11233333 3567777665555543 45443 111
Q ss_pred EEeccCCcceeEEEEEeeCc--e-EEEEccCCcEEEEeeCCCCCCcccccceeEEEEEEcCCccceEEe
Q 001655 851 ERARDYNANWMSAVEILDDD--I-YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916 (1036)
Q Consensus 851 ~~a~D~~~~~~~~~~~l~~~--~-~l~~D~~gnl~~l~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~ 916 (1036)
+..-+.++..|-++.+..+. . ++.+-.+|-|.+...+.+.. .+-...+++++|.++|+++-
T Consensus 242 ~~~~~AH~~RVK~i~~~~~~~~~~lvTaSSDG~I~vWd~~~~~k-----~~~~~l~e~n~~~RltCl~~ 305 (362)
T KOG0294|consen 242 LTEFLAHENRVKDIASYTNPEHEYLVTASSDGFIKVWDIDMETK-----KRPTLLAELNTNVRLTCLRV 305 (362)
T ss_pred ceeeecchhheeeeEEEecCCceEEEEeccCceEEEEEcccccc-----CCcceeEEeecCCccceeee
Confidence 11223345556677755442 3 45677799999998875432 24557899999999999864
|
|
| >PRK11138 outer membrane biogenesis protein BamB; Provisional | Back alignment and domain information |
|---|
Probab=88.82 E-value=50 Score=37.90 Aligned_cols=104 Identities=15% Similarity=0.078 Sum_probs=57.8
Q ss_pred eCCCEEEEEEcCCeEEEEEEcCcEEEEEEeccCCcee---E-----EEEcCCCCCCCCcccEEEEEEecCCEEEEEECCC
Q 001655 444 ANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI---S-----CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515 (1036)
Q Consensus 444 ~~~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l~~~i---s-----~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~ 515 (1036)
+.++.+.+...++.+..++.++|+..=.. ++..+. + -++-.+. .....+.++.. ++.+.-++..+
T Consensus 67 v~~~~vy~~~~~g~l~ald~~tG~~~W~~--~~~~~~~~~~~~~~~~~~~~~~----v~~~~v~v~~~-~g~l~ald~~t 139 (394)
T PRK11138 67 VAYNKVYAADRAGLVKALDADTGKEIWSV--DLSEKDGWFSKNKSALLSGGVT----VAGGKVYIGSE-KGQVYALNAED 139 (394)
T ss_pred EECCEEEEECCCCeEEEEECCCCcEeeEE--cCCCcccccccccccccccccE----EECCEEEEEcC-CCEEEEEECCC
Confidence 45667777766677777776655432211 111100 0 0000000 01456778877 88888888888
Q ss_pred CCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEe
Q 001655 516 LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLN 559 (1036)
Q Consensus 516 l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~ 559 (1036)
-+++++..++.... .+..+. ...++++..+|.|+.+...
T Consensus 140 G~~~W~~~~~~~~~-ssP~v~----~~~v~v~~~~g~l~ald~~ 178 (394)
T PRK11138 140 GEVAWQTKVAGEAL-SRPVVS----DGLVLVHTSNGMLQALNES 178 (394)
T ss_pred CCCcccccCCCcee-cCCEEE----CCEEEEECCCCEEEEEEcc
Confidence 88888766653211 111222 3468889999987766543
|
|
| >PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment | Back alignment and domain information |
|---|
Probab=88.71 E-value=25 Score=40.76 Aligned_cols=165 Identities=15% Similarity=0.130 Sum_probs=80.4
Q ss_pred eCCceEEEEEcCCceEEEecCCcEEEeeeccccceeeecccCCCCCCeEEEEe-CCeEEEEEEcCCCCeEEEEEeCCCcc
Q 001655 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK-EGELTIGTIDDIQKLHIRSIPLGEHP 663 (1036)
Q Consensus 585 ~~~~~~v~~~~~~p~~i~~~~~~~~~~~~~~~~i~~~~~f~~~~~~~~l~~~~-~~~l~i~~l~~~~~~~~~~i~l~~tp 663 (1036)
..++..|.++|+.-+.+++..+-..-. ..... .++ |.. .+.++..+ ++.+.|.+ ....-..++++++.++
T Consensus 41 npngr~v~V~g~geY~iyt~~~~r~k~--~G~g~-~~v-w~~---~n~yAv~~~~~~I~I~k--n~~~~~~k~i~~~~~~ 111 (443)
T PF04053_consen 41 NPNGRFVLVCGDGEYEIYTALAWRNKA--FGSGL-SFV-WSS---RNRYAVLESSSTIKIYK--NFKNEVVKSIKLPFSV 111 (443)
T ss_dssp -TTSSEEEEEETTEEEEEETTTTEEEE--EEE-S-EEE-E-T---SSEEEEE-TTS-EEEEE--TTEE-TT-----SS-E
T ss_pred CCCCCEEEEEcCCEEEEEEccCCcccc--cCcee-EEE-Eec---CccEEEEECCCeEEEEE--cCccccceEEcCCccc
Confidence 345678888999999999853221100 00111 111 111 23455554 56677642 2222234688888888
Q ss_pred ceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCC
Q 001655 664 RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEE 743 (1036)
Q Consensus 664 ~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e 743 (1036)
.+|.+ ..++++.+. ..|.|+|-.+.+.+.+++.++ +.. ..|. +..+++++-|..
T Consensus 112 ~~If~---G~LL~~~~~-----------~~i~~yDw~~~~~i~~i~v~~---vk~-V~Ws---~~g~~val~t~~----- 165 (443)
T PF04053_consen 112 EKIFG---GNLLGVKSS-----------DFICFYDWETGKLIRRIDVSA---VKY-VIWS---DDGELVALVTKD----- 165 (443)
T ss_dssp EEEE----SSSEEEEET-----------TEEEEE-TTT--EEEEESS-E----EE-EEE----TTSSEEEEE-S------
T ss_pred ceEEc---CcEEEEECC-----------CCEEEEEhhHcceeeEEecCC---CcE-EEEE---CCCCEEEEEeCC-----
Confidence 88887 445555443 269999999999999988774 222 1342 346777777632
Q ss_pred CCCcceEEEEEEEeCC------------eEEEEEEEEecCceeEeccccCeEEEEECCEEE
Q 001655 744 NEPTKGRILVFIVEDG------------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792 (1036)
Q Consensus 744 ~~~~~Gri~v~~i~~~------------kl~~~~~~~~~g~v~~i~~~~g~Lv~~~g~~l~ 792 (1036)
.+++++.+.. .++.++ ++...|.+.+=.++-++....+.|.
T Consensus 166 ------~i~il~~~~~~~~~~~~~g~e~~f~~~~--E~~~~IkSg~W~~d~fiYtT~~~lk 218 (443)
T PF04053_consen 166 ------SIYILKYNLEAVAAIPEEGVEDAFELIH--EISERIKSGCWVEDCFIYTTSNHLK 218 (443)
T ss_dssp ------SEEEEEE-HHHHHHBTTTB-GGGEEEEE--EE-S--SEEEEETTEEEEE-TTEEE
T ss_pred ------eEEEEEecchhcccccccCchhceEEEE--EecceeEEEEEEcCEEEEEcCCeEE
Confidence 4566666533 566665 3455566655445555666666554
|
This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B. |
| >KOG1274 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=88.60 E-value=69 Score=39.30 Aligned_cols=198 Identities=13% Similarity=0.086 Sum_probs=103.3
Q ss_pred CCcEEEEEeCCCEEEEEEcCCeEEEEEEcCcEEE-EEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECC
Q 001655 436 GYSVNVATANASQVLLATGGGHLVYLEIGDGILT-EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP 514 (1036)
Q Consensus 436 ~~~I~~as~~~~~vvv~~~~~~~~~l~~~~~~l~-~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~ 514 (1036)
|..|.+.++....++.+...+.+..+.+++++.. .+.+++++-...|+.- .+.++++|.. |-.|.+.++.
T Consensus 56 g~~v~~ia~~s~~f~~~s~~~tv~~y~fps~~~~~iL~Rftlp~r~~~v~g--------~g~~iaagsd-D~~vK~~~~~ 126 (933)
T KOG1274|consen 56 GELVSSIACYSNHFLTGSEQNTVLRYKFPSGEEDTILARFTLPIRDLAVSG--------SGKMIAAGSD-DTAVKLLNLD 126 (933)
T ss_pred CceeEEEeecccceEEeeccceEEEeeCCCCCccceeeeeeccceEEEEec--------CCcEEEeecC-ceeEEEEecc
Confidence 4566777777777777777788877787764433 2445555544444432 2578888877 8889999988
Q ss_pred CCCeeEE-ecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCc--cccc--eeeecCccceEEEEEEeCCce
Q 001655 515 DLNLITK-EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE--LTDR--KKVSLGTQPITLRTFSSKNTT 589 (1036)
Q Consensus 515 ~l~~v~~-~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~--l~~~--~~~~lG~~pv~l~~~~~~~~~ 589 (1036)
|...... ... +...-.+-+. ..+..|-+.+-||.|.+|+++...-. +... ..-..-.+++.--.++-.++.
T Consensus 127 D~s~~~~lrgh--~apVl~l~~~--p~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~ 202 (933)
T KOG1274|consen 127 DSSQEKVLRGH--DAPVLQLSYD--PKGNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGT 202 (933)
T ss_pred ccchheeeccc--CCceeeeeEc--CCCCEEEEEecCceEEEEEcccchhhhhcccCCccccccccceeeeeeecCCCCe
Confidence 7433211 111 1122222222 24567888999999999998753211 1110 011111222221222223444
Q ss_pred EEEEEcCCceEEEecCCc---EEEeeeccccceeeecccCCCCCCeEEEEe-CCeEEEEEEcC
Q 001655 590 HVFAASDRPTVIYSSNKK---LLYSNVNLKEVSHMCPFNSAAFPDSLAIAK-EGELTIGTIDD 648 (1036)
Q Consensus 590 ~v~~~~~~p~~i~~~~~~---~~~~~~~~~~i~~~~~f~~~~~~~~l~~~~-~~~l~i~~l~~ 648 (1036)
-.+.+.+.-..+|..++- +.+.+-......+.+.|. .++..++..+ ++++.+-..++
T Consensus 203 la~~~~d~~Vkvy~r~~we~~f~Lr~~~~ss~~~~~~ws--PnG~YiAAs~~~g~I~vWnv~t 263 (933)
T KOG1274|consen 203 LAVPPVDNTVKVYSRKGWELQFKLRDKLSSSKFSDLQWS--PNGKYIAASTLDGQILVWNVDT 263 (933)
T ss_pred EEeeccCCeEEEEccCCceeheeecccccccceEEEEEc--CCCcEEeeeccCCcEEEEeccc
Confidence 555566666666654332 122211111112233332 2334444433 67787777764
|
|
| >KOG0266 consensus WD40 repeat-containing protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=88.50 E-value=58 Score=38.25 Aligned_cols=180 Identities=17% Similarity=0.157 Sum_probs=98.4
Q ss_pred ceEEEEEEEeCCeEEEEEEEEecCceeEeccc-cCeEEEEEC--CEEEEEEeeecCCCccchhc-ccccccceEEEEEEE
Q 001655 748 KGRILVFIVEDGKLQLIAEKETKGAVYSLNAF-NGKLLAAIN--QKIQLYKWMLRDDGTRELQS-ECGHHGHILALYVQT 823 (1036)
Q Consensus 748 ~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~-~g~Lv~~~g--~~l~v~~~~~~~~~~~~L~~-~~~~~~~~~~~~l~~ 823 (1036)
.+.+.+|++..+.-.+..-+.+..+|++++-. +|.++++.+ .+|++++.. . + +..+ ...+...+..++...
T Consensus 224 D~tiriwd~~~~~~~~~~l~gH~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~--~-~--~~~~~l~~hs~~is~~~f~~ 298 (456)
T KOG0266|consen 224 DKTLRIWDLKDDGRNLKTLKGHSTYVTSVAFSPDGNLLVSGSDDGTVRIWDVR--T-G--ECVRKLKGHSDGISGLAFSP 298 (456)
T ss_pred CceEEEeeccCCCeEEEEecCCCCceEEEEecCCCCEEEEecCCCcEEEEecc--C-C--eEEEeeeccCCceEEEEECC
Confidence 46888999943322222233788899998855 455555444 678886653 1 2 2222 233344445556666
Q ss_pred eCCEEEEEeccccEEEEEEeccCCe---EEEEeccCCcceeEEEEEeeCce-EEEEccCCcEEEEeeCCCCCCcccccce
Q 001655 824 RGDFIVVGDLMKSISLLIYKHEEGA---IEERARDYNANWMSAVEILDDDI-YLGAENNFNLFTVRKNSEGATDEERGRL 899 (1036)
Q Consensus 824 ~~~~I~vgD~~~Sv~~~~~~~~~~~---l~~~a~D~~~~~~~~~~~l~~~~-~l~~D~~gnl~~l~~~~~~~~~~~~~~L 899 (1036)
.+++++.++. ++.. .-|+...++ +..+..+..+..++.+.|=-+.. ++.+=.++.+.++.+.... .+
T Consensus 299 d~~~l~s~s~-d~~i-~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~~d~~~~~w~l~~~~-------~~ 369 (456)
T KOG0266|consen 299 DGNLLVSASY-DGTI-RVWDLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSASLDRTLKLWDLRSGK-------SV 369 (456)
T ss_pred CCCEEEEcCC-CccE-EEEECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEecCCCeEEEEEccCCc-------ce
Confidence 7889999965 5442 336776666 34555554444566776644444 4555456677776665321 11
Q ss_pred eEEEEEEcCCccceEEeeeeeecCCCCCCCCcceEEEEccccceEEEEecCh
Q 001655 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH 951 (1036)
Q Consensus 900 ~~~~~~~lg~~v~~~~~~~l~~~~~~~~~~~~~~il~~t~~GsIg~l~~l~~ 951 (1036)
..... |..+ +.++..-.. .....-++-|..+|.|...-+-+.
T Consensus 370 ~~~~~-~~~~-~~~~~~~~~--------~~~~~~i~sg~~d~~v~~~~~~s~ 411 (456)
T KOG0266|consen 370 GTYTG-HSNL-VRCIFSPTL--------STGGKLIYSGSEDGSVYVWDSSSG 411 (456)
T ss_pred eeecc-cCCc-ceeEecccc--------cCCCCeEEEEeCCceEEEEeCCcc
Confidence 11111 1111 223322111 224567788899999997766553
|
|
| >KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis] | Back alignment and domain information |
|---|
Probab=88.46 E-value=28 Score=40.42 Aligned_cols=181 Identities=14% Similarity=0.159 Sum_probs=94.7
Q ss_pred ccEEEEEecC--ceEEE--EecCCCCeeeeeecCeeccccceeeeecC--CCeEEEEecCcEEEEecCCccceeeeeCCC
Q 001655 362 DTFLVVSFIS--ETRIL--AMNLEDELEETEIEGFCSQTQTLFCHDAI--YNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435 (1036)
Q Consensus 362 ~~~lv~S~~~--~T~vl--~~~~~~~~~e~~~~~f~~~~~Tl~~~~~~--~~~~vQVt~~~i~l~~~~~~~~~~~w~~~~ 435 (1036)
.+||....++ .+.|+ ++... ...+.|.....++.+..+. ..+++=.|...||+++...+..+.... +.
T Consensus 533 GDYlatV~~~~~~~~VliHQLSK~-----~sQ~PF~kskG~vq~v~FHPs~p~lfVaTq~~vRiYdL~kqelvKkL~-tg 606 (733)
T KOG0650|consen 533 GDYLATVMPDSGNKSVLIHQLSKR-----KSQSPFRKSKGLVQRVKFHPSKPYLFVATQRSVRIYDLSKQELVKKLL-TG 606 (733)
T ss_pred CceEEEeccCCCcceEEEEecccc-----cccCchhhcCCceeEEEecCCCceEEEEeccceEEEehhHHHHHHHHh-cC
Confidence 3577666663 23343 23221 1235666666666666663 347777899999999976533333222 12
Q ss_pred CCcEEEEEeC--CCEEEEEEcCCeEEEEEEcCc-EEEEEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEE
Q 001655 436 GYSVNVATAN--ASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFS 512 (1036)
Q Consensus 436 ~~~I~~as~~--~~~vvv~~~~~~~~~l~~~~~-~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~ 512 (1036)
-.-|..-++. ++.+++...++.++.|+++-+ +--.. ..--+..+..+++.. .-..++.|.- ||++.||.
T Consensus 607 ~kwiS~msihp~GDnli~gs~d~k~~WfDldlsskPyk~-lr~H~~avr~Va~H~------ryPLfas~sd-Dgtv~Vfh 678 (733)
T KOG0650|consen 607 SKWISSMSIHPNGDNLILGSYDKKMCWFDLDLSSKPYKT-LRLHEKAVRSVAFHK------RYPLFASGSD-DGTVIVFH 678 (733)
T ss_pred CeeeeeeeecCCCCeEEEecCCCeeEEEEcccCcchhHH-hhhhhhhhhhhhhcc------ccceeeeecC-CCcEEEEe
Confidence 2235555554 578888777888888887531 10000 001123344555432 2345666655 78888875
Q ss_pred C---CCC----CeeEEecCCCccCceeEEeee--ec-CeEEEEEEeCCCcEEEE
Q 001655 513 L---PDL----NLITKEHLGGEIIPRSVLLCA--FE-GISYLLCALGDGHLLNF 556 (1036)
Q Consensus 513 l---~~l----~~v~~~~l~~~~~p~si~~~~--~~-~~~~L~vgl~~G~l~~y 556 (1036)
- .|| .++-...|.....-+++.++. +. ..++||..-+||.+-.|
T Consensus 679 g~VY~Dl~qnpliVPlK~L~gH~~~~~~gVLd~~wHP~qpWLfsAGAd~tirlf 732 (733)
T KOG0650|consen 679 GMVYNDLLQNPLIVPLKRLRGHEKTNDLGVLDTIWHPRQPWLFSAGADGTIRLF 732 (733)
T ss_pred eeeehhhhcCCceEeeeeccCceeecccceEeecccCCCceEEecCCCceEEee
Confidence 2 222 112122222211112332332 22 46889988888887655
|
|
| >KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport] | Back alignment and domain information |
|---|
Probab=88.43 E-value=58 Score=38.17 Aligned_cols=174 Identities=15% Similarity=0.141 Sum_probs=98.2
Q ss_pred cCcEEEEecCCccceeeeeCCCCCcEEEEEeCCC--EEEEEEcCCeEEEEEEcCcEEEEEEecc-CCceeEEEEcCCCCC
Q 001655 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANAS--QVLLATGGGHLVYLEIGDGILTEVKHAQ-LEYEISCLDINPIGE 490 (1036)
Q Consensus 414 ~~~i~l~~~~~~~~~~~w~~~~~~~I~~as~~~~--~vvv~~~~~~~~~l~~~~~~l~~~~~~~-l~~~is~l~~~~~~~ 490 (1036)
...||+++.++..++..+..-+ .=|.+-++.+. +++-+.++-.+.+..-++ +......++ -++-|-++++.|-+
T Consensus 76 D~~IrVfnynt~ekV~~FeAH~-DyIR~iavHPt~P~vLtsSDDm~iKlW~we~-~wa~~qtfeGH~HyVMqv~fnPkD- 152 (794)
T KOG0276|consen 76 DMQIRVFNYNTGEKVKTFEAHS-DYIRSIAVHPTLPYVLTSSDDMTIKLWDWEN-EWACEQTFEGHEHYVMQVAFNPKD- 152 (794)
T ss_pred CceEEEEecccceeeEEeeccc-cceeeeeecCCCCeEEecCCccEEEEeeccC-ceeeeeEEcCcceEEEEEEecCCC-
Confidence 4467889998888888776433 34777777754 455444444454444443 222211121 24567888987632
Q ss_pred CCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCcccccee
Q 001655 491 NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKK 570 (1036)
Q Consensus 491 ~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~ 570 (1036)
.+..+-++- |.++.+|++..-.+.++-. +.+.-..++.+..-+++|||+-|..|-.+-++.....++- .+
T Consensus 153 ----~ntFaS~sL-DrTVKVWslgs~~~nfTl~-gHekGVN~Vdyy~~gdkpylIsgaDD~tiKvWDyQtk~CV----~T 222 (794)
T KOG0276|consen 153 ----PNTFASASL-DRTVKVWSLGSPHPNFTLE-GHEKGVNCVDYYTGGDKPYLISGADDLTIKVWDYQTKSCV----QT 222 (794)
T ss_pred ----ccceeeeec-cccEEEEEcCCCCCceeee-ccccCcceEEeccCCCcceEEecCCCceEEEeecchHHHH----HH
Confidence 233444444 8999999996544433211 1122345666666678999999998887777665432210 00
Q ss_pred eecCccceEEEEEEeCCceEEEEEcCCceEEEec-CCcEEE
Q 001655 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSS-NKKLLY 610 (1036)
Q Consensus 571 ~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~-~~~~~~ 610 (1036)
. -| +.++-+.++-..+-|+++..+ +|.+++
T Consensus 223 L-eG---------Ht~Nvs~v~fhp~lpiiisgsEDGTvri 253 (794)
T KOG0276|consen 223 L-EG---------HTNNVSFVFFHPELPIIISGSEDGTVRI 253 (794)
T ss_pred h-hc---------ccccceEEEecCCCcEEEEecCCccEEE
Confidence 0 01 122345666667777777655 455543
|
|
| >KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only] | Back alignment and domain information |
|---|
Probab=87.91 E-value=63 Score=38.03 Aligned_cols=40 Identities=10% Similarity=0.193 Sum_probs=28.4
Q ss_pred eEEEEeCCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEee
Q 001655 632 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK 681 (1036)
Q Consensus 632 ~l~~~~~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~ 681 (1036)
++-+.+.+++++|++.....|.-. ++.+|.+..++++|-+
T Consensus 245 ~L~~fTR~GvrLGTvg~~D~WIWt----------V~~~PNsQ~v~~GCqD 284 (1081)
T KOG1538|consen 245 QLSLFTRDGVRLGTVGEQDSWIWT----------VQAKPNSQYVVVGCQD 284 (1081)
T ss_pred ceEEEeecCeEEeeccccceeEEE----------EEEccCCceEEEEEcc
Confidence 455566777788888776666332 4568899999999864
|
|
| >PTZ00421 coronin; Provisional | Back alignment and domain information |
|---|
Probab=87.54 E-value=68 Score=37.97 Aligned_cols=77 Identities=19% Similarity=0.110 Sum_probs=47.4
Q ss_pred CceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeE-----EecCCC-ccCceeEEeeeecCeEEEEEEeCC
Q 001655 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT-----KEHLGG-EIIPRSVLLCAFEGISYLLCALGD 550 (1036)
Q Consensus 477 ~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~-----~~~l~~-~~~p~si~~~~~~~~~~L~vgl~~ 550 (1036)
...|.++++++.. ...++.|.+ |++|.+|++++..... ...+.. .....++.+.+. +..+|+.|..|
T Consensus 75 ~~~V~~v~fsP~d-----~~~LaSgS~-DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~-~~~iLaSgs~D 147 (493)
T PTZ00421 75 EGPIIDVAFNPFD-----PQKLFTASE-DGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPS-AMNVLASAGAD 147 (493)
T ss_pred CCCEEEEEEcCCC-----CCEEEEEeC-CCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcC-CCCEEEEEeCC
Confidence 4578899987532 457888888 9999999997532100 011111 112233333321 23578889999
Q ss_pred CcEEEEEEec
Q 001655 551 GHLLNFLLNM 560 (1036)
Q Consensus 551 G~l~~y~~~~ 560 (1036)
|.+..|.+..
T Consensus 148 gtVrIWDl~t 157 (493)
T PTZ00421 148 MVVNVWDVER 157 (493)
T ss_pred CEEEEEECCC
Confidence 9999988754
|
|
| >KOG0266 consensus WD40 repeat-containing protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=87.49 E-value=66 Score=37.76 Aligned_cols=243 Identities=19% Similarity=0.212 Sum_probs=128.1
Q ss_pred CceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCC-CccCceeEEeeeecCeEEEEEEeCCCcEEE
Q 001655 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG-GEIIPRSVLLCAFEGISYLLCALGDGHLLN 555 (1036)
Q Consensus 477 ~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~-~~~~p~si~~~~~~~~~~L~vgl~~G~l~~ 555 (1036)
...++|+++++ .+..++-+.+ ++.+.++.....+......+. ......++.+.. +..+++-+..|+.+-.
T Consensus 159 ~~sv~~~~fs~------~g~~l~~~~~-~~~i~~~~~~~~~~~~~~~l~~h~~~v~~~~fs~--d~~~l~s~s~D~tiri 229 (456)
T KOG0266|consen 159 CPSVTCVDFSP------DGRALAAASS-DGLIRIWKLEGIKSNLLRELSGHTRGVSDVAFSP--DGSYLLSGSDDKTLRI 229 (456)
T ss_pred cCceEEEEEcC------CCCeEEEccC-CCcEEEeecccccchhhccccccccceeeeEECC--CCcEEEEecCCceEEE
Confidence 35578888865 2456665556 899999988544410111111 112334444443 3448888999999999
Q ss_pred EEEecCCCccccceeeecCc-cceEEEEEEeCCceEEEEEc-CCceEEEecC-CcE-EEeeeccccceeeecccCCCCCC
Q 001655 556 FLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTTHVFAAS-DRPTVIYSSN-KKL-LYSNVNLKEVSHMCPFNSAAFPD 631 (1036)
Q Consensus 556 y~~~~~~~~l~~~~~~~lG~-~pv~l~~~~~~~~~~v~~~~-~~p~~i~~~~-~~~-~~~~~~~~~i~~~~~f~~~~~~~ 631 (1036)
+.+.. .+ +..++.. |- .+|.=..|...+ +-++..+ |....+..-+ +.. ...+.+.+.+.+ +.|+. ...
T Consensus 230 wd~~~-~~--~~~~~l~-gH~~~v~~~~f~p~g-~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~~is~-~~f~~--d~~ 301 (456)
T KOG0266|consen 230 WDLKD-DG--RNLKTLK-GHSTYVTSVAFSPDG-NLLVSGSDDGTVRIWDVRTGECVRKLKGHSDGISG-LAFSP--DGN 301 (456)
T ss_pred eeccC-CC--eEEEEec-CCCCceEEEEecCCC-CEEEEecCCCcEEEEeccCCeEEEeeeccCCceEE-EEECC--CCC
Confidence 88721 11 1112221 32 222112222222 2222222 3333333222 221 112222233332 23332 122
Q ss_pred eEEEEe-CCeEEEEEEcCCCCeEEEEEeCCCcc---ceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEE
Q 001655 632 SLAIAK-EGELTIGTIDDIQKLHIRSIPLGEHP---RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIST 707 (1036)
Q Consensus 632 ~l~~~~-~~~l~i~~l~~~~~~~~~~i~l~~tp---~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~ 707 (1036)
.++..+ ++.+++-.+........+.+.-.+.+ +.+.++|+.+.+++++.. ..+++.|....+.+..
T Consensus 302 ~l~s~s~d~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~~d----------~~~~~w~l~~~~~~~~ 371 (456)
T KOG0266|consen 302 LLVSASYDGTIRVWDLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSASLD----------RTLKLWDLRSGKSVGT 371 (456)
T ss_pred EEEEcCCCccEEEEECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEecCC----------CeEEEEEccCCcceee
Confidence 333333 66777777665543333444545556 778889988877777654 4789988877777776
Q ss_pred EECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeC
Q 001655 708 YPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED 758 (1036)
Q Consensus 708 ~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~ 758 (1036)
+.-..++ +.|+....+ ....++++.|. .-|.+++|....
T Consensus 372 ~~~~~~~-~~~~~~~~~-~~~~~~i~sg~----------~d~~v~~~~~~s 410 (456)
T KOG0266|consen 372 YTGHSNL-VRCIFSPTL-STGGKLIYSGS----------EDGSVYVWDSSS 410 (456)
T ss_pred ecccCCc-ceeEecccc-cCCCCeEEEEe----------CCceEEEEeCCc
Confidence 6655555 355544433 33567888886 347888888764
|
|
| >KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription] | Back alignment and domain information |
|---|
Probab=87.33 E-value=18 Score=38.85 Aligned_cols=123 Identities=19% Similarity=0.254 Sum_probs=75.2
Q ss_pred EEEEEECC-CCCeEeEEEEEEEcCC-CceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccccC
Q 001655 704 FISTYPLD-TFEYGCSILSCSFSDD-SNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNG 781 (1036)
Q Consensus 704 ~i~~~~~~-~~E~v~si~~~~l~~~-~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~g 781 (1036)
.+.+|.=+ ..|... -++|.++.. +.+|+|+|. .+|-|+++++..+++.-- -..+.+++..|.....
T Consensus 79 ~lq~y~D~d~~Esfy-tcsw~yd~~~~~p~la~~G----------~~GvIrVid~~~~~~~~~-~~ghG~sINeik~~p~ 146 (385)
T KOG1034|consen 79 LLQSYADEDHDESFY-TCSWSYDSNTGNPFLAAGG----------YLGVIRVIDVVSGQCSKN-YRGHGGSINEIKFHPD 146 (385)
T ss_pred eeeeccCCCCCcceE-EEEEEecCCCCCeeEEeec----------ceeEEEEEecchhhhccc-eeccCccchhhhcCCC
Confidence 33444333 344433 346777654 588999885 679999999886654321 1245667777776643
Q ss_pred --eEEEEEC--CEEEEEEeeecCCCccchh----cccccccceEEEEEEEeCCEEEEEeccccEEEEEEe
Q 001655 782 --KLLAAIN--QKIQLYKWMLRDDGTRELQ----SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843 (1036)
Q Consensus 782 --~Lv~~~g--~~l~v~~~~~~~~~~~~L~----~~~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~ 843 (1036)
.|+++.. +.|++++++ ++ ..+ ..-.++..+..+..+..+++|+-+-+-+|+.+.+.+
T Consensus 147 ~~qlvls~SkD~svRlwnI~--~~---~Cv~VfGG~egHrdeVLSvD~~~~gd~i~ScGmDhslk~W~l~ 211 (385)
T KOG1034|consen 147 RPQLVLSASKDHSVRLWNIQ--TD---VCVAVFGGVEGHRDEVLSVDFSLDGDRIASCGMDHSLKLWRLN 211 (385)
T ss_pred CCcEEEEecCCceEEEEecc--CC---eEEEEecccccccCcEEEEEEcCCCCeeeccCCcceEEEEecC
Confidence 3655444 678887653 11 111 111233344555666678999999999999988766
|
|
| >PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A | Back alignment and domain information |
|---|
Probab=86.44 E-value=45 Score=34.78 Aligned_cols=172 Identities=12% Similarity=0.092 Sum_probs=93.4
Q ss_pred EEEEEEeCCCceEEEEEECCCC-CeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEec
Q 001655 692 HFVRLLDDQTFEFISTYPLDTF-EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETK 770 (1036)
Q Consensus 692 ~~l~l~d~~t~~~i~~~~~~~~-E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~ 770 (1036)
+.|.-+|+.+++.+.++.+++. ....+. .+.. ..++++++ ..|.|+.+++..+++ +.+.+.+
T Consensus 3 g~l~~~d~~tG~~~W~~~~~~~~~~~~~~---~~~~--~~~v~~~~----------~~~~l~~~d~~tG~~--~W~~~~~ 65 (238)
T PF13360_consen 3 GTLSALDPRTGKELWSYDLGPGIGGPVAT---AVPD--GGRVYVAS----------GDGNLYALDAKTGKV--LWRFDLP 65 (238)
T ss_dssp SEEEEEETTTTEEEEEEECSSSCSSEEET---EEEE--TTEEEEEE----------TTSEEEEEETTTSEE--EEEEECS
T ss_pred CEEEEEECCCCCEEEEEECCCCCCCccce---EEEe--CCEEEEEc----------CCCEEEEEECCCCCE--EEEeecc
Confidence 4678899999999999998652 222211 1111 34555554 468899998877754 5555555
Q ss_pred CceeEe-ccccCeEEEEEC-CEEEEEEeeecCCCccchh-ccccc--ccceEEEEEEEeCCEEEEEeccccEEEEEEecc
Q 001655 771 GAVYSL-NAFNGKLLAAIN-QKIQLYKWMLRDDGTRELQ-SECGH--HGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845 (1036)
Q Consensus 771 g~v~~i-~~~~g~Lv~~~g-~~l~v~~~~~~~~~~~~L~-~~~~~--~~~~~~~~l~~~~~~I~vgD~~~Sv~~~~~~~~ 845 (1036)
+++... ...++.++++.. +.++. ++. .+|+..+. ..... ..........+.++++++++....+ +.++.+
T Consensus 66 ~~~~~~~~~~~~~v~v~~~~~~l~~--~d~-~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~l--~~~d~~ 140 (238)
T PF13360_consen 66 GPISGAPVVDGGRVYVGTSDGSLYA--LDA-KTGKVLWSIYLTSSPPAGVRSSSSPAVDGDRLYVGTSSGKL--VALDPK 140 (238)
T ss_dssp SCGGSGEEEETTEEEEEETTSEEEE--EET-TTSCEEEEEEE-SSCTCSTB--SEEEEETTEEEEEETCSEE--EEEETT
T ss_pred ccccceeeecccccccccceeeeEe--ccc-CCcceeeeeccccccccccccccCceEecCEEEEEeccCcE--EEEecC
Confidence 554443 334678877764 44554 432 24544433 11110 0111223344458889888886555 456766
Q ss_pred CCeEEEEeccCCcce---------eEEEEEeeCceEEEEccCCcEEEEe
Q 001655 846 EGAIEERARDYNANW---------MSAVEILDDDIYLGAENNFNLFTVR 885 (1036)
Q Consensus 846 ~~~l~~~a~D~~~~~---------~~~~~~l~~~~~l~~D~~gnl~~l~ 885 (1036)
.+++.--..-..+.. ..+.-.+.++.+++++.+|.++.+.
T Consensus 141 tG~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~g~~~~~d 189 (238)
T PF13360_consen 141 TGKLLWKYPVGEPRGSSPISSFSDINGSPVISDGRVYVSSGDGRVVAVD 189 (238)
T ss_dssp TTEEEEEEESSTT-SS--EEEETTEEEEEECCTTEEEEECCTSSEEEEE
T ss_pred CCcEEEEeecCCCCCCcceeeecccccceEEECCEEEEEcCCCeEEEEE
Confidence 666532221112111 1233344456778888888755553
|
... |
| >KOG2096 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=86.19 E-value=36 Score=36.35 Aligned_cols=149 Identities=13% Similarity=0.267 Sum_probs=78.4
Q ss_pred eEEEE-eCCeEEEEEEcCCC----CeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeC--C----
Q 001655 632 SLAIA-KEGELTIGTIDDIQ----KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD--Q---- 700 (1036)
Q Consensus 632 ~l~~~-~~~~l~i~~l~~~~----~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~--~---- 700 (1036)
.|+-+ .+..+++-.+.+.. +.....++.+ +|++++|.|+-+.++|.+.+- +.|+++-. +
T Consensus 100 ~lat~~~Dr~Ir~w~~~DF~~~eHr~~R~nve~d-hpT~V~FapDc~s~vv~~~~g---------~~l~vyk~~K~~dG~ 169 (420)
T KOG2096|consen 100 KLATISGDRSIRLWDVRDFENKEHRCIRQNVEYD-HPTRVVFAPDCKSVVVSVKRG---------NKLCVYKLVKKTDGS 169 (420)
T ss_pred eeEEEeCCceEEEEecchhhhhhhhHhhccccCC-CceEEEECCCcceEEEEEccC---------CEEEEEEeeecccCC
Confidence 45544 36778888887754 2333455665 999999999999999998642 34554421 1
Q ss_pred ---CceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEec
Q 001655 701 ---TFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777 (1036)
Q Consensus 701 ---t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~ 777 (1036)
++..++-++|+....+.-+ .+.-.+...+++..+ ..-.|.+|..+..-|.-+...+.+.---++.
T Consensus 170 ~~~~~v~~D~~~f~~kh~v~~i---~iGiA~~~k~imsas---------~dt~i~lw~lkGq~L~~idtnq~~n~~aavS 237 (420)
T KOG2096|consen 170 GSHHFVHIDNLEFERKHQVDII---NIGIAGNAKYIMSAS---------LDTKICLWDLKGQLLQSIDTNQSSNYDAAVS 237 (420)
T ss_pred CCcccccccccccchhcccceE---EEeecCCceEEEEec---------CCCcEEEEecCCceeeeeccccccccceeeC
Confidence 1112233445444433222 121111222222221 1125677776644455454444433322333
Q ss_pred cccCeEEEEEC--CEEEEEEeeecCCCc
Q 001655 778 AFNGKLLAAIN--QKIQLYKWMLRDDGT 803 (1036)
Q Consensus 778 ~~~g~Lv~~~g--~~l~v~~~~~~~~~~ 803 (1036)
+ +|+.+++.| ..+.+|+.-...+|.
T Consensus 238 P-~GRFia~~gFTpDVkVwE~~f~kdG~ 264 (420)
T KOG2096|consen 238 P-DGRFIAVSGFTPDVKVWEPIFTKDGT 264 (420)
T ss_pred C-CCcEEEEecCCCCceEEEEEeccCcc
Confidence 2 677777777 567776665444443
|
|
| >KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown] | Back alignment and domain information |
|---|
Probab=85.96 E-value=54 Score=35.23 Aligned_cols=121 Identities=21% Similarity=0.329 Sum_probs=71.4
Q ss_pred CCCcEEEEEeC--CCEEEEEEcCCe-EEEEEEcCc-EEEEEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEE
Q 001655 435 PGYSVNVATAN--ASQVLLATGGGH-LVYLEIGDG-ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRI 510 (1036)
Q Consensus 435 ~~~~I~~as~~--~~~vvv~~~~~~-~~~l~~~~~-~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i 510 (1036)
.+..|.+.+.+ +..+.-+...|+ |..|...+| .+.|..+=.-..+|-|+++++ .+.+++|+.. .|+++|
T Consensus 180 H~s~Iacv~Ln~~Gt~vATaStkGTLIRIFdt~~g~~l~E~RRG~d~A~iy~iaFSp------~~s~LavsSd-KgTlHi 252 (346)
T KOG2111|consen 180 HDSDIACVALNLQGTLVATASTKGTLIRIFDTEDGTLLQELRRGVDRADIYCIAFSP------NSSWLAVSSD-KGTLHI 252 (346)
T ss_pred ccCceeEEEEcCCccEEEEeccCcEEEEEEEcCCCcEeeeeecCCchheEEEEEeCC------CccEEEEEcC-CCeEEE
Confidence 34578888876 444444444455 445555444 455655444457889999976 3678999887 899999
Q ss_pred EECCCCCe--e--EEecCCCccCce----------------eEEeeeecCe-EEEEEEeCCCcEEEEEEecCC
Q 001655 511 FSLPDLNL--I--TKEHLGGEIIPR----------------SVLLCAFEGI-SYLLCALGDGHLLNFLLNMKT 562 (1036)
Q Consensus 511 ~~l~~l~~--v--~~~~l~~~~~p~----------------si~~~~~~~~-~~L~vgl~~G~l~~y~~~~~~ 562 (1036)
|+|.+-+. - ++.++..-..|+ ..+++.++.+ ..+++-..||....|.+++.+
T Consensus 253 F~l~~~~~~~~~~SSl~~~~~~lpky~~S~wS~~~f~l~~~~~~~~~fg~~~nsvi~i~~Dgsy~k~~f~~~~ 325 (346)
T KOG2111|consen 253 FSLRDTENTEDESSSLSFKRLVLPKYFSSEWSFAKFQLPQGTQCIIAFGSETNTVIAICADGSYYKFKFDPKN 325 (346)
T ss_pred EEeecCCCCccccccccccccccchhcccceeEEEEEccCCCcEEEEecCCCCeEEEEEeCCcEEEEEecccc
Confidence 99975221 1 110000001111 1223445544 556655568999999998764
|
|
| >KOG3881 consensus Uncharacterized conserved protein [Function unknown] | Back alignment and domain information |
|---|
Probab=85.39 E-value=7.3 Score=42.56 Aligned_cols=90 Identities=23% Similarity=0.318 Sum_probs=65.5
Q ss_pred EEEEEEeCCCc-eEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEec
Q 001655 692 HFVRLLDDQTF-EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETK 770 (1036)
Q Consensus 692 ~~l~l~d~~t~-~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~ 770 (1036)
+.++++|+... +++.++.|. |.+.+... +. ...++|++|+ ++|.+-.|+...+++-.+.-..+.
T Consensus 226 hqvR~YDt~~qRRPV~~fd~~--E~~is~~~--l~-p~gn~Iy~gn----------~~g~l~~FD~r~~kl~g~~~kg~t 290 (412)
T KOG3881|consen 226 HQVRLYDTRHQRRPVAQFDFL--ENPISSTG--LT-PSGNFIYTGN----------TKGQLAKFDLRGGKLLGCGLKGIT 290 (412)
T ss_pred eeEEEecCcccCcceeEeccc--cCcceeee--ec-CCCcEEEEec----------ccchhheecccCceeeccccCCcc
Confidence 68999999754 466666665 55544322 21 2356788886 678889999988888777778899
Q ss_pred CceeEeccccC-eEEEEEC--CEEEEEEe
Q 001655 771 GAVYSLNAFNG-KLLAAIN--QKIQLYKW 796 (1036)
Q Consensus 771 g~v~~i~~~~g-~Lv~~~g--~~l~v~~~ 796 (1036)
|.+++|....+ .+++++| .-|+||+.
T Consensus 291 Gsirsih~hp~~~~las~GLDRyvRIhD~ 319 (412)
T KOG3881|consen 291 GSIRSIHCHPTHPVLASCGLDRYVRIHDI 319 (412)
T ss_pred CCcceEEEcCCCceEEeeccceeEEEeec
Confidence 99999998865 6777776 56778875
|
|
| >KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only] | Back alignment and domain information |
|---|
Probab=84.52 E-value=25 Score=42.42 Aligned_cols=189 Identities=14% Similarity=0.123 Sum_probs=106.3
Q ss_pred EEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCC--CceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCC
Q 001655 666 ICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQ--TFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEE 743 (1036)
Q Consensus 666 i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~--t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e 743 (1036)
++... .+.|+++.... .....+++..+|+. +|..+... ...-.-.++ +.+ +...+|||.....
T Consensus 328 ~~~~~-~~lYv~GG~~~----~~~~l~~ve~YD~~~~~W~~~a~M--~~~R~~~~v--~~l---~g~iYavGG~dg~--- 392 (571)
T KOG4441|consen 328 VAVLN-GKLYVVGGYDS----GSDRLSSVERYDPRTNQWTPVAPM--NTKRSDFGV--AVL---DGKLYAVGGFDGE--- 392 (571)
T ss_pred EEEEC-CEEEEEccccC----CCcccceEEEecCCCCceeccCCc--cCcccccee--EEE---CCEEEEEeccccc---
Confidence 44443 35677665431 12344678888884 45443322 111111122 222 2578888865411
Q ss_pred CCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccccCeEEEEEC--------CEEEEEEeeecCCCccchhcccccccc
Q 001655 744 NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAIN--------QKIQLYKWMLRDDGTRELQSECGHHGH 815 (1036)
Q Consensus 744 ~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~g~Lv~~~g--------~~l~v~~~~~~~~~~~~L~~~~~~~~~ 815 (1036)
..-..+-.|+...++.+.+......-.=.+++.++|+|++.-| +++..|+ +. .+.++. .+.+...
T Consensus 393 --~~l~svE~YDp~~~~W~~va~m~~~r~~~gv~~~~g~iYi~GG~~~~~~~l~sve~YD--P~-t~~W~~--~~~M~~~ 465 (571)
T KOG4441|consen 393 --KSLNSVECYDPVTNKWTPVAPMLTRRSGHGVAVLGGKLYIIGGGDGSSNCLNSVECYD--PE-TNTWTL--IAPMNTR 465 (571)
T ss_pred --cccccEEEecCCCCcccccCCCCcceeeeEEEEECCEEEEEcCcCCCccccceEEEEc--CC-CCceee--cCCcccc
Confidence 1222455566667788888876665555677788999988877 3455555 32 233443 3323222
Q ss_pred eEEEEEEEeCCEEEE-Eecccc---EEEEEEeccCCeEEEEeccCCcceeEEEEEeeCceEEEEc
Q 001655 816 ILALYVQTRGDFIVV-GDLMKS---ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876 (1036)
Q Consensus 816 ~~~~~l~~~~~~I~v-gD~~~S---v~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~~~~l~~D 876 (1036)
-....+.+.+++|++ |..... -++-+|+++.++...++.-..++....+..++...++++-
T Consensus 466 R~~~g~a~~~~~iYvvGG~~~~~~~~~VE~ydp~~~~W~~v~~m~~~rs~~g~~~~~~~ly~vGG 530 (571)
T KOG4441|consen 466 RSGFGVAVLNGKIYVVGGFDGTSALSSVERYDPETNQWTMVAPMTSPRSAVGVVVLGGKLYAVGG 530 (571)
T ss_pred cccceEEEECCEEEEECCccCCCccceEEEEcCCCCceeEcccCccccccccEEEECCEEEEEec
Confidence 222336677888775 443221 1145789999999999877777777777777655555443
|
|
| >KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown] | Back alignment and domain information |
|---|
Probab=84.43 E-value=83 Score=36.07 Aligned_cols=180 Identities=12% Similarity=0.168 Sum_probs=91.6
Q ss_pred EEEEeCCCcEEEEEEecCCCccccceeeecCccceEEEEEEe-CCceEEEEEcCCceEEEecCCc------EEEeeecc-
Q 001655 544 LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKK------LLYSNVNL- 615 (1036)
Q Consensus 544 L~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~-~~~~~v~~~~~~p~~i~~~~~~------~~~~~~~~- 615 (1036)
.+-...||.|-.++++....++.-.+.+..|..-|....|.- .+..-+-+.|..-.+-.++.+. +.+..-+.
T Consensus 284 FlT~s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~tsC~~nrdg~~iAagc~DGSIQ~W~~~~~~v~p~~~vk~AH~~ 363 (641)
T KOG0772|consen 284 FLTCSYDGTLRIWDVNNTKSQLQVIKTKPAGGKRVPVTSCAWNRDGKLIAAGCLDGSIQIWDKGSRTVRPVMKVKDAHLP 363 (641)
T ss_pred eEEecCCCcEEEEecCCchhheeEEeeccCCCcccCceeeecCCCcchhhhcccCCceeeeecCCcccccceEeeeccCC
Confidence 344445999999998754333433344444543333333321 1222222222222233333222 22211111
Q ss_pred -ccceeeecccCCCCCCe-EEEEe--CCeEEEEEEcCCCC-eEEEE-EeCCCccceEEEecCCCEEEEEEeecCCCCCce
Q 001655 616 -KEVSHMCPFNSAAFPDS-LAIAK--EGELTIGTIDDIQK-LHIRS-IPLGEHPRRICHQEQSRTFAICSLKNQSCAEES 689 (1036)
Q Consensus 616 -~~i~~~~~f~~~~~~~~-l~~~~--~~~l~i~~l~~~~~-~~~~~-i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~ 689 (1036)
..|.+++ |.. ++ .++.. ++.|++=.|...++ ++++. ++....-.-.+|+|+-++++.+++-.. ..
T Consensus 364 g~~Itsi~-FS~----dg~~LlSRg~D~tLKvWDLrq~kkpL~~~tgL~t~~~~tdc~FSPd~kli~TGtS~~~----~~ 434 (641)
T KOG0772|consen 364 GQDITSIS-FSY----DGNYLLSRGFDDTLKVWDLRQFKKPLNVRTGLPTPFPGTDCCFSPDDKLILTGTSAPN----GM 434 (641)
T ss_pred CCceeEEE-ecc----ccchhhhccCCCceeeeeccccccchhhhcCCCccCCCCccccCCCceEEEecccccC----CC
Confidence 1333322 221 22 22222 57788877776654 33332 222233455789999888777776431 11
Q ss_pred eeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEee
Q 001655 690 EMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA 737 (1036)
Q Consensus 690 ~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~ 737 (1036)
....|.|+|..+++.+..+.+.....+- +.|.- +-+-|.+|++
T Consensus 435 ~~g~L~f~d~~t~d~v~ki~i~~aSvv~--~~Whp---kLNQi~~gsg 477 (641)
T KOG0772|consen 435 TAGTLFFFDRMTLDTVYKIDISTASVVR--CLWHP---KLNQIFAGSG 477 (641)
T ss_pred CCceEEEEeccceeeEEEecCCCceEEE--Eeecc---hhhheeeecC
Confidence 2347999999999999999888433322 34532 2345777875
|
|
| >KOG0288 consensus WD40 repeat protein TipD [General function prediction only] | Back alignment and domain information |
|---|
Probab=84.35 E-value=9.1 Score=42.12 Aligned_cols=104 Identities=23% Similarity=0.388 Sum_probs=68.0
Q ss_pred EEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEE
Q 001655 653 HIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732 (1036)
Q Consensus 653 ~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i 732 (1036)
..+.+|+++-...+-...+..-+ ..|++. -.+.++|..+.++...|.-+..--..-..-+.|. .+..|+
T Consensus 334 ~~~sv~~gg~vtSl~ls~~g~~l-LsssRD---------dtl~viDlRt~eI~~~~sA~g~k~asDwtrvvfS-pd~~Yv 402 (459)
T KOG0288|consen 334 KTRSVPLGGRVTSLDLSMDGLEL-LSSSRD---------DTLKVIDLRTKEIRQTFSAEGFKCASDWTRVVFS-PDGSYV 402 (459)
T ss_pred eeeEeecCcceeeEeeccCCeEE-eeecCC---------CceeeeecccccEEEEeeccccccccccceeEEC-CCCcee
Confidence 45778888766666666554433 333332 3688899888888876653322222222223444 357899
Q ss_pred EEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecC-ceeEec
Q 001655 733 CVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKG-AVYSLN 777 (1036)
Q Consensus 733 ~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g-~v~~i~ 777 (1036)
+-|. +.|++|+|++..+|++.+-+..... +++++.
T Consensus 403 aAGS----------~dgsv~iW~v~tgKlE~~l~~s~s~~aI~s~~ 438 (459)
T KOG0288|consen 403 AAGS----------ADGSVYIWSVFTGKLEKVLSLSTSNAAITSLS 438 (459)
T ss_pred eecc----------CCCcEEEEEccCceEEEEeccCCCCcceEEEE
Confidence 8886 5699999999999999888776655 666665
|
|
| >KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones] | Back alignment and domain information |
|---|
Probab=84.14 E-value=91 Score=36.31 Aligned_cols=253 Identities=17% Similarity=0.190 Sum_probs=127.5
Q ss_pred CCEEEEEEcCCeEEEE-EEcCcEEEEEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecC
Q 001655 446 ASQVLLATGGGHLVYL-EIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524 (1036)
Q Consensus 446 ~~~vvv~~~~~~~~~l-~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l 524 (1036)
.+.+.|++ +.-+|+ ....+.+.++.... +..|+++..++ .+.+++||+- +|.++||+..+.+.+.....
T Consensus 188 ~n~laVal--g~~vylW~~~s~~v~~l~~~~-~~~vtSv~ws~------~G~~LavG~~-~g~v~iwD~~~~k~~~~~~~ 257 (484)
T KOG0305|consen 188 ANVLAVAL--GQSVYLWSASSGSVTELCSFG-EELVTSVKWSP------DGSHLAVGTS-DGTVQIWDVKEQKKTRTLRG 257 (484)
T ss_pred CCeEEEEe--cceEEEEecCCCceEEeEecC-CCceEEEEECC------CCCEEEEeec-CCeEEEEehhhccccccccC
Confidence 45667776 334454 33335555554443 77899998754 3789999999 99999999876544322111
Q ss_pred CCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCccc---cceeeecCcc-ceEEEEEEeCC-ceEEEEEcCCce
Q 001655 525 GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELT---DRKKVSLGTQ-PITLRTFSSKN-TTHVFAASDRPT 599 (1036)
Q Consensus 525 ~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~---~~~~~~lG~~-pv~l~~~~~~~-~~~v~~~~~~p~ 599 (1036)
. ..++-.+ .... ...+..|.++|.++.+.+........ ..+....|-+ ...-..+..++ .+.+++.-.
T Consensus 258 ~--h~~rvg~-laW~-~~~lssGsr~~~I~~~dvR~~~~~~~~~~~H~qeVCgLkws~d~~~lASGgnDN~~~Iwd~--- 330 (484)
T KOG0305|consen 258 S--HASRVGS-LAWN-SSVLSSGSRDGKILNHDVRISQHVVSTLQGHRQEVCGLKWSPDGNQLASGGNDNVVFIWDG--- 330 (484)
T ss_pred C--cCceeEE-Eecc-CceEEEecCCCcEEEEEEecchhhhhhhhcccceeeeeEECCCCCeeccCCCccceEeccC---
Confidence 1 1222111 1222 45688999999999998764321111 0000001100 00001111111 122322211
Q ss_pred EEEecCCcEEEeeecc-ccceeeecccCCCCCCeEEEEe----CCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCE
Q 001655 600 VIYSSNKKLLYSNVNL-KEVSHMCPFNSAAFPDSLAIAK----EGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRT 674 (1036)
Q Consensus 600 ~i~~~~~~~~~~~~~~-~~i~~~~~f~~~~~~~~l~~~~----~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~ 674 (1036)
....+.+.+..... --..+.||+.. +++.+- +..++|-... ..-.++.+..+.....++|++..+-
T Consensus 331 --~~~~p~~~~~~H~aAVKA~awcP~q~-----~lLAsGGGs~D~~i~fwn~~--~g~~i~~vdtgsQVcsL~Wsk~~kE 401 (484)
T KOG0305|consen 331 --LSPEPKFTFTEHTAAVKALAWCPWQS-----GLLATGGGSADRCIKFWNTN--TGARIDSVDTGSQVCSLIWSKKYKE 401 (484)
T ss_pred --CCccccEEEeccceeeeEeeeCCCcc-----CceEEcCCCcccEEEEEEcC--CCcEecccccCCceeeEEEcCCCCE
Confidence 01123344433221 01234566653 333322 3344444433 3456788888999999999999886
Q ss_pred EEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEee
Q 001655 675 FAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA 737 (1036)
Q Consensus 675 ~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~ 737 (1036)
++..---. ...|.+.+-.+++.+... .....+++.+.. +...+++++|.+
T Consensus 402 i~sthG~s--------~n~i~lw~~ps~~~~~~l-~gH~~RVl~la~----SPdg~~i~t~a~ 451 (484)
T KOG0305|consen 402 LLSTHGYS--------ENQITLWKYPSMKLVAEL-LGHTSRVLYLAL----SPDGETIVTGAA 451 (484)
T ss_pred EEEecCCC--------CCcEEEEeccccceeeee-cCCcceeEEEEE----CCCCCEEEEecc
Confidence 66543211 125666665555443321 122334444432 123577888875
|
|
| >KOG0649 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=83.94 E-value=56 Score=33.70 Aligned_cols=205 Identities=15% Similarity=0.173 Sum_probs=109.2
Q ss_pred EECCCCCEEEEEecCCeEEEEeCCCC--Ccccc-----ceecccCCceEEEEEeecCCCCCEEEEEEEecCCcceEEEEE
Q 001655 70 IIDPDCRLIGLHLYDGLFKVIPFDNK--GQLKE-----AFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYE 142 (1036)
Q Consensus 70 ~vdp~~r~~~~~~~~~~~~ilp~~~~--~~~~~-----~~~~~l~~~~i~d~~FL~~~~~ptlavL~~~~~~~~~l~~~~ 142 (1036)
++.|.+|+.+..-..|.++|+.+++- +.... .+.-+--+..+.+|+|-.. +++-. ....++.++
T Consensus 17 a~sp~~~~l~agn~~G~iav~sl~sl~s~sa~~~gk~~iv~eqahdgpiy~~~f~d~-----~Lls~----gdG~V~gw~ 87 (325)
T KOG0649|consen 17 AISPSKQYLFAGNLFGDIAVLSLKSLDSGSAEPPGKLKIVPEQAHDGPIYYLAFHDD-----FLLSG----GDGLVYGWE 87 (325)
T ss_pred hhCCcceEEEEecCCCeEEEEEehhhhccccCCCCCcceeeccccCCCeeeeeeehh-----heeec----cCceEEEee
Confidence 57888898877777899999988652 11111 1111223356777777421 12211 224566666
Q ss_pred EeccCCce-eeccccccccCCCCCEEEecCCCCcceEEEeCceEEEEcCCceeeeccCcceeeEEEEEecCCCEEEEEcC
Q 001655 143 VALKDKDF-VEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDH 221 (1036)
Q Consensus 143 ~~~~~~~~-~~~~~~~~~lp~~~~~lip~p~~~~G~lv~~~~~i~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~ 221 (1036)
|+--.+.. ..+.|.. ..|.... .+|+|.. ....++...+.++++--
T Consensus 88 W~E~~es~~~K~lwe~-~~P~~~~-~~evPeI-------------------------------Nam~ldP~enSi~~AgG 134 (325)
T KOG0649|consen 88 WNEEEESLATKRLWEV-KIPMQVD-AVEVPEI-------------------------------NAMWLDPSENSILFAGG 134 (325)
T ss_pred ehhhhhhccchhhhhh-cCccccC-cccCCcc-------------------------------ceeEeccCCCcEEEecC
Confidence 64433212 2234542 1222211 3344421 11224545566777778
Q ss_pred ccCEEEEEEEecCcEEeeEEEEEcCccccceEEEE-ecCCeEEEEEeecCeeEEEEeeCCCCCCCcEEEeeeccc-----
Q 001655 222 AGLLHLLVITHEKEKVTGLKIELLGETSIASTISY-LDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVN----- 295 (1036)
Q Consensus 222 ~G~l~~l~l~~~~~~v~~l~i~~~~~~~~~s~l~~-l~~~~lF~gS~~gds~l~~~~~~~~~~~~~~~~~~~l~n----- 295 (1036)
||.+|.+.++. | .+.-++-|.+----|++- -.++-+|-|++-|--.++...... ..++++.+.|
T Consensus 135 D~~~y~~dlE~-G----~i~r~~rGHtDYvH~vv~R~~~~qilsG~EDGtvRvWd~kt~k-----~v~~ie~yk~~~~lR 204 (325)
T KOG0649|consen 135 DGVIYQVDLED-G----RIQREYRGHTDYVHSVVGRNANGQILSGAEDGTVRVWDTKTQK-----HVSMIEPYKNPNLLR 204 (325)
T ss_pred CeEEEEEEecC-C----EEEEEEcCCcceeeeeeecccCcceeecCCCccEEEEeccccc-----eeEEeccccChhhcC
Confidence 88888888853 2 244445555443334333 446778999998888887765422 2444444433
Q ss_pred --CCceeeeEEeccCCCCCceEEEEeeCCCCccEEEEecC
Q 001655 296 --LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNG 333 (1036)
Q Consensus 296 --~~Pi~D~~~~~~~~~~~~~lv~~sG~g~~g~l~~lr~g 333 (1036)
||--. +++.- ++=++.||.|+.-+|-.||..
T Consensus 205 p~~g~wi-gala~------~edWlvCGgGp~lslwhLrss 237 (325)
T KOG0649|consen 205 PDWGKWI-GALAV------NEDWLVCGGGPKLSLWHLRSS 237 (325)
T ss_pred cccCcee-EEEec------cCceEEecCCCceeEEeccCC
Confidence 33311 22211 122667899998877777664
|
|
| >KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton] | Back alignment and domain information |
|---|
Probab=83.47 E-value=1.1e+02 Score=36.63 Aligned_cols=80 Identities=19% Similarity=0.252 Sum_probs=47.6
Q ss_pred cCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCe--eEE-ecCCC-ccCce-eEEeeeecCeEEEEEEeC
Q 001655 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL--ITK-EHLGG-EIIPR-SVLLCAFEGISYLLCALG 549 (1036)
Q Consensus 475 ~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~--v~~-~~l~~-~~~p~-si~~~~~~~~~~L~vgl~ 549 (1036)
.-..+|.|+.+.+.. ...++.|++ +|.+.+|.+....- .+. ..... ...|. .+.=........++.+..
T Consensus 240 ~~~s~v~~~~f~p~~-----p~ll~gG~y-~GqV~lWD~~~~~~~~~s~ls~~~~sh~~~v~~vvW~~~~~~~~f~s~ss 313 (555)
T KOG1587|consen 240 ESPSEVTCLKFCPFD-----PNLLAGGCY-NGQVVLWDLRKGSDTPPSGLSALEVSHSEPVTAVVWLQNEHNTEFFSLSS 313 (555)
T ss_pred ecCCceeEEEeccCC-----cceEEeecc-CceEEEEEccCCCCCCCcccccccccCCcCeEEEEEeccCCCCceEEEec
Confidence 345678899997743 578888999 99999999875332 211 11110 01111 111111112244777778
Q ss_pred CCcEEEEEEec
Q 001655 550 DGHLLNFLLNM 560 (1036)
Q Consensus 550 ~G~l~~y~~~~ 560 (1036)
||.+..+..+.
T Consensus 314 DG~i~~W~~~~ 324 (555)
T KOG1587|consen 314 DGSICSWDTDM 324 (555)
T ss_pred CCcEeeeeccc
Confidence 99999997764
|
|
| >KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning] | Back alignment and domain information |
|---|
Probab=83.46 E-value=1.3e+02 Score=37.76 Aligned_cols=142 Identities=18% Similarity=0.201 Sum_probs=80.9
Q ss_pred eEEEEeCCeEEEEEEcCC-CCeEEEEEeCCC--ccceE-EEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCc---eE
Q 001655 632 SLAIAKEGELTIGTIDDI-QKLHIRSIPLGE--HPRRI-CHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF---EF 704 (1036)
Q Consensus 632 ~l~~~~~~~l~i~~l~~~-~~~~~~~i~l~~--tp~~i-~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~---~~ 704 (1036)
+.++++++ .++-++.+. ....++.||.+. .++.+ ++..+.+.+++++.+ ++|+++|...- ..
T Consensus 1178 G~Ll~tGd-~r~IRIWDa~~E~~~~diP~~s~t~vTaLS~~~~~gn~i~AGfaD----------GsvRvyD~R~a~~ds~ 1246 (1387)
T KOG1517|consen 1178 GHLLVTGD-VRSIRIWDAHKEQVVADIPYGSSTLVTALSADLVHGNIIAAGFAD----------GSVRVYDRRMAPPDSL 1246 (1387)
T ss_pred CeEEecCC-eeEEEEEecccceeEeecccCCCccceeecccccCCceEEEeecC----------CceEEeecccCCcccc
Confidence 55566655 555555443 356677777763 23333 245566788888765 58899886321 11
Q ss_pred EEEE-ECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeC-C---eEEEEEEEEecCceeEeccc
Q 001655 705 ISTY-PLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED-G---KLQLIAEKETKGAVYSLNAF 779 (1036)
Q Consensus 705 i~~~-~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~-~---kl~~~~~~~~~g~v~~i~~~ 779 (1036)
+..+ ++...|. +..+++...+-.-++-| ...|+|+++++.- . -+.++...+..+..+++...
T Consensus 1247 v~~~R~h~~~~~---Iv~~slq~~G~~elvSg----------s~~G~I~~~DlR~~~~e~~~~iv~~~~yGs~lTal~VH 1313 (1387)
T KOG1517|consen 1247 VCVYREHNDVEP---IVHLSLQRQGLGELVSG----------SQDGDIQLLDLRMSSKETFLTIVAHWEYGSALTALTVH 1313 (1387)
T ss_pred ceeecccCCccc---ceeEEeecCCCcceeee----------ccCCeEEEEecccCcccccceeeeccccCccceeeeec
Confidence 2222 2333333 22333433222233333 3569999999984 2 24556666667789999877
Q ss_pred cC-eEEEE-ECCEEEEEEee
Q 001655 780 NG-KLLAA-INQKIQLYKWM 797 (1036)
Q Consensus 780 ~g-~Lv~~-~g~~l~v~~~~ 797 (1036)
.. .++++ .++.+.+|+..
T Consensus 1314 ~hapiiAsGs~q~ikIy~~~ 1333 (1387)
T KOG1517|consen 1314 EHAPIIASGSAQLIKIYSLS 1333 (1387)
T ss_pred cCCCeeeecCcceEEEEecC
Confidence 54 45544 44889999864
|
|
| >PTZ00421 coronin; Provisional | Back alignment and domain information |
|---|
Probab=83.36 E-value=1e+02 Score=36.41 Aligned_cols=113 Identities=14% Similarity=0.067 Sum_probs=68.1
Q ss_pred cEEEEEeC---CCEEEEEEcCCeEEEEEEcCcEEE-----EEEe-ccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEE
Q 001655 438 SVNVATAN---ASQVLLATGGGHLVYLEIGDGILT-----EVKH-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISV 508 (1036)
Q Consensus 438 ~I~~as~~---~~~vvv~~~~~~~~~l~~~~~~l~-----~~~~-~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i 508 (1036)
.|...+.+ +..++.+..++.+...++..+.+. .+.. ..-...|.++++.+. ...+++.|.+ |+.|
T Consensus 77 ~V~~v~fsP~d~~~LaSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~-----~~~iLaSgs~-DgtV 150 (493)
T PTZ00421 77 PIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPS-----AMNVLASAGA-DMVV 150 (493)
T ss_pred CEEEEEEcCCCCCEEEEEeCCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcC-----CCCEEEEEeC-CCEE
Confidence 56666654 345655666777777777543211 1111 122456889988652 1357888888 9999
Q ss_pred EEEECCCCCeeEEecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEe
Q 001655 509 RIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLN 559 (1036)
Q Consensus 509 ~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~ 559 (1036)
.||++...+.+....-. .....++.+.. +...|..|..||.+..|.+.
T Consensus 151 rIWDl~tg~~~~~l~~h-~~~V~sla~sp--dG~lLatgs~Dg~IrIwD~r 198 (493)
T PTZ00421 151 NVWDVERGKAVEVIKCH-SDQITSLEWNL--DGSLLCTTSKDKKLNIIDPR 198 (493)
T ss_pred EEEECCCCeEEEEEcCC-CCceEEEEEEC--CCCEEEEecCCCEEEEEECC
Confidence 99999875543321111 12234454432 34578889999999887754
|
|
| >KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=83.23 E-value=23 Score=39.02 Aligned_cols=253 Identities=16% Similarity=0.190 Sum_probs=130.2
Q ss_pred CeEEEEEEeCCCcEEEEEEecCCCccccceeeecCccceEEEEEEeCCceEEEEEcCCceEEEecCCc-EEEeeeccccc
Q 001655 540 GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK-LLYSNVNLKEV 618 (1036)
Q Consensus 540 ~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~~~-~~~~~~~~~~i 618 (1036)
+.-.|++|+..|.+.-++... +.+.-..+.=..||+ ++-...+.-|+|..+++. ++|...+...+
T Consensus 107 eGRRLltgs~SGEFtLWNg~~----fnFEtilQaHDs~Vr----------~m~ws~~g~wmiSgD~gG~iKyWqpnmnnV 172 (464)
T KOG0284|consen 107 EGRRLLTGSQSGEFTLWNGTS----FNFETILQAHDSPVR----------TMKWSHNGTWMISGDKGGMIKYWQPNMNNV 172 (464)
T ss_pred CCceeEeecccccEEEecCce----eeHHHHhhhhcccce----------eEEEccCCCEEEEcCCCceEEecccchhhh
Confidence 346799999999887765321 111001112233432 233334556777777654 55665554333
Q ss_pred eeeecccCCC-----C--CC--eEEEEeCCeEEEEEEcCCCCeEEEEEeC-CCccceEEEecCCCEEEEEEeecCCCCCc
Q 001655 619 SHMCPFNSAA-----F--PD--SLAIAKEGELTIGTIDDIQKLHIRSIPL-GEHPRRICHQEQSRTFAICSLKNQSCAEE 688 (1036)
Q Consensus 619 ~~~~~f~~~~-----~--~~--~l~~~~~~~l~i~~l~~~~~~~~~~i~l-~~tp~~i~y~~~~~~~~v~~~~~~~~~~~ 688 (1036)
..+-.-+.++ + .| ++-+.+++.++|=..-.. -..+.+.= |.-++.+.+||...+++.+.-+
T Consensus 173 k~~~ahh~eaIRdlafSpnDskF~t~SdDg~ikiWdf~~~--kee~vL~GHgwdVksvdWHP~kgLiasgskD------- 243 (464)
T KOG0284|consen 173 KIIQAHHAEAIRDLAFSPNDSKFLTCSDDGTIKIWDFRMP--KEERVLRGHGWDVKSVDWHPTKGLIASGSKD------- 243 (464)
T ss_pred HHhhHhhhhhhheeccCCCCceeEEecCCCeEEEEeccCC--chhheeccCCCCcceeccCCccceeEEccCC-------
Confidence 2221111111 1 12 233345677766443221 11222221 3578889999998877776543
Q ss_pred eeeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEE-EE
Q 001655 689 SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIA-EK 767 (1036)
Q Consensus 689 ~~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~-~~ 767 (1036)
.-|+|.||++.+.+.+.- ++=.++-.++|..++ ++++-|.. | -.+.+|++. .++.+. ..
T Consensus 244 ---nlVKlWDprSg~cl~tlh----~HKntVl~~~f~~n~-N~Llt~sk------D----~~~kv~DiR--~mkEl~~~r 303 (464)
T KOG0284|consen 244 ---NLVKLWDPRSGSCLATLH----GHKNTVLAVKFNPNG-NWLLTGSK------D----QSCKVFDIR--TMKELFTYR 303 (464)
T ss_pred ---ceeEeecCCCcchhhhhh----hccceEEEEEEcCCC-CeeEEccC------C----ceEEEEehh--HhHHHHHhh
Confidence 289999999998876532 222455666676544 78876542 2 156677775 222222 22
Q ss_pred EecCceeEe--ccccCeEEEEEC--CEEEEEEeeecCCCccchhcc-cccccceEEEEEEEeCCEEEEEeccccEEE
Q 001655 768 ETKGAVYSL--NAFNGKLLAAIN--QKIQLYKWMLRDDGTRELQSE-CGHHGHILALYVQTRGDFIVVGDLMKSISL 839 (1036)
Q Consensus 768 ~~~g~v~~i--~~~~g~Lv~~~g--~~l~v~~~~~~~~~~~~L~~~-~~~~~~~~~~~l~~~~~~I~vgD~~~Sv~~ 839 (1036)
..+.-|+++ .+++..|++..| ..|+.+.+.. .+.+... -.++..+..++-.-.|-.+.-|+--+.+-|
T Consensus 304 ~Hkkdv~~~~WhP~~~~lftsgg~Dgsvvh~~v~~----~~p~~~i~~AHd~~iwsl~~hPlGhil~tgsnd~t~rf 376 (464)
T KOG0284|consen 304 GHKKDVTSLTWHPLNESLFTSGGSDGSVVHWVVGL----EEPLGEIPPAHDGEIWSLAYHPLGHILATGSNDRTVRF 376 (464)
T ss_pred cchhhheeeccccccccceeeccCCCceEEEeccc----cccccCCCcccccceeeeeccccceeEeecCCCcceee
Confidence 345566666 566777766655 4566655531 1112111 112222222233334655556777776644
|
|
| >PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1 | Back alignment and domain information |
|---|
Probab=82.87 E-value=81 Score=34.79 Aligned_cols=116 Identities=18% Similarity=0.133 Sum_probs=68.5
Q ss_pred eEEEEe-CCeEEEEEEcCCCCeEEEEEeCCC-----------ccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeC
Q 001655 632 SLAIAK-EGELTIGTIDDIQKLHIRSIPLGE-----------HPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699 (1036)
Q Consensus 632 ~l~~~~-~~~l~i~~l~~~~~~~~~~i~l~~-----------tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~ 699 (1036)
.+++.+ ++.+.-..+.....-..++.++-. --.-++||+..+.+.|+..+............|-.+|.
T Consensus 197 ~~~F~Sy~G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~rlyvLMh~g~~gsHKdpgteVWv~D~ 276 (342)
T PF06433_consen 197 RLYFVSYEGNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGRLYVLMHQGGEGSHKDPGTEVWVYDL 276 (342)
T ss_dssp EEEEEBTTSEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTEEEEEEEE--TT-TTS-EEEEEEEET
T ss_pred eEEEEecCCEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCeEEEEecCCCCCCccCCceEEEEEEC
Confidence 577776 788877777766533445555431 12448999888777676643221111224567888999
Q ss_pred CCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeE
Q 001655 700 QTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKL 761 (1036)
Q Consensus 700 ~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl 761 (1036)
++.+.+.+++++. .+.|| .+..+.+++++.-+. ..|-|++|+...+|+
T Consensus 277 ~t~krv~Ri~l~~--~~~Si---~Vsqd~~P~L~~~~~---------~~~~l~v~D~~tGk~ 324 (342)
T PF06433_consen 277 KTHKRVARIPLEH--PIDSI---AVSQDDKPLLYALSA---------GDGTLDVYDAATGKL 324 (342)
T ss_dssp TTTEEEEEEEEEE--EESEE---EEESSSS-EEEEEET---------TTTEEEEEETTT--E
T ss_pred CCCeEEEEEeCCC--ccceE---EEccCCCcEEEEEcC---------CCCeEEEEeCcCCcE
Confidence 9999999999863 24444 344555666653321 347888888776543
|
4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). |
| >KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=82.33 E-value=35 Score=37.13 Aligned_cols=70 Identities=19% Similarity=0.272 Sum_probs=42.9
Q ss_pred ceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCe--e-EEecCCCccCceeEEeeeecCeEEEEEEeCCCcEE
Q 001655 478 YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL--I-TKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLL 554 (1036)
Q Consensus 478 ~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~--v-~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~ 554 (1036)
..|..++++|. +.-++-|.+ |.+|+||...+..- + +...+. +-+++.-..+..|++.|..||.+-
T Consensus 273 sAV~dVdfspt------G~Efvsgsy-DksIRIf~~~~~~SRdiYhtkRMq-----~V~~Vk~S~Dskyi~SGSdd~nvR 340 (433)
T KOG0268|consen 273 SAVMDVDFSPT------GQEFVSGSY-DKSIRIFPVNHGHSRDIYHTKRMQ-----HVFCVKYSMDSKYIISGSDDGNVR 340 (433)
T ss_pred eeEEEeccCCC------cchhccccc-cceEEEeecCCCcchhhhhHhhhh-----eeeEEEEeccccEEEecCCCccee
Confidence 34556677763 344566778 99999999876321 1 111111 111222223578999999999998
Q ss_pred EEEEe
Q 001655 555 NFLLN 559 (1036)
Q Consensus 555 ~y~~~ 559 (1036)
.++..
T Consensus 341 lWka~ 345 (433)
T KOG0268|consen 341 LWKAK 345 (433)
T ss_pred eeecc
Confidence 88754
|
|
| >KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport] | Back alignment and domain information |
|---|
Probab=82.23 E-value=1.2e+02 Score=37.24 Aligned_cols=211 Identities=18% Similarity=0.227 Sum_probs=100.8
Q ss_pred cEEEEEeCCCEEEEEEcCCeEEEEEEcCcEEEEEEecc-CCce-eEEEEcCCCCCCCCcccEEEEEEecC--C-EEEEEE
Q 001655 438 SVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ-LEYE-ISCLDINPIGENPSYSQIAAVGMWTD--I-SVRIFS 512 (1036)
Q Consensus 438 ~I~~as~~~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~-l~~~-is~l~~~~~~~~~~~~~~~~vg~w~~--~-~i~i~~ 512 (1036)
.|++++.+...|++....|.++.+. ..+.....+. .+.. +.-+-+. + +..-.+.||-... . .+.||+
T Consensus 27 ~isc~~s~~~~vvigt~~G~V~~Ln---~s~~~~~~fqa~~~siv~~L~~~---~--~~~~L~sv~Ed~~~np~llkiw~ 98 (933)
T KOG2114|consen 27 AISCCSSSTGSVVIGTADGRVVILN---SSFQLIRGFQAYEQSIVQFLYIL---N--KQNFLFSVGEDEQGNPVLLKIWD 98 (933)
T ss_pred ceeEEcCCCceEEEeeccccEEEec---ccceeeehheecchhhhhHhhcc---c--CceEEEEEeecCCCCceEEEEec
Confidence 6777777777888877777777653 1222211111 0111 2222111 0 1123445554422 1 578888
Q ss_pred CCCC---C-eeEE-----e--cCCCccCceeEEeeeecCeEEEEEEeCCCcEEEEEEecCCCccccceeeecCccceEEE
Q 001655 513 LPDL---N-LITK-----E--HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581 (1036)
Q Consensus 513 l~~l---~-~v~~-----~--~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~ 581 (1036)
++.. + +.+. . ..|.+..|-+++.+. ++-..+.||..||.++.|+=|.....-...+-..-|..|+.=-
T Consensus 99 lek~~~n~sP~c~~~~ri~~~~np~~~~p~s~l~Vs-~~l~~Iv~Gf~nG~V~~~~GDi~RDrgsr~~~~~~~~~pITgL 177 (933)
T KOG2114|consen 99 LEKVDKNNSPQCLYEHRIFTIKNPTNPSPASSLAVS-EDLKTIVCGFTNGLVICYKGDILRDRGSRQDYSHRGKEPITGL 177 (933)
T ss_pred ccccCCCCCcceeeeeeeeccCCCCCCCcceEEEEE-ccccEEEEEecCcEEEEEcCcchhccccceeeeccCCCCceee
Confidence 7642 2 2111 1 222233454443332 2345688999999999998654221111111223467776543
Q ss_pred EEEeCCceEEEEEcCCceEEEecCCc---EEEeeeccccceeeecccCCCCCCeEEEEeCCeEEEEEEcCCCCeEEEEEe
Q 001655 582 TFSSKNTTHVFAASDRPTVIYSSNKK---LLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIP 658 (1036)
Q Consensus 582 ~~~~~~~~~v~~~~~~p~~i~~~~~~---~~~~~~~~~~i~~~~~f~~~~~~~~l~~~~~~~l~i~~l~~~~~~~~~~i~ 658 (1036)
.+...+...+|+....-..+|.-.++ .....-+..++.+ ..|+ +.. ..|+++..+.|.+...+... +.-..+
T Consensus 178 ~~~~d~~s~lFv~Tt~~V~~y~l~gr~p~~~~ld~~G~~lnC-ss~~-~~t-~qfIca~~e~l~fY~sd~~~--~cfaf~ 252 (933)
T KOG2114|consen 178 ALRSDGKSVLFVATTEQVMLYSLSGRTPSLKVLDNNGISLNC-SSFS-DGT-YQFICAGSEFLYFYDSDGRG--PCFAFE 252 (933)
T ss_pred EEecCCceeEEEEecceeEEEEecCCCcceeeeccCCcccee-eecC-CCC-ccEEEecCceEEEEcCCCcc--eeeeec
Confidence 44444444477776665444433222 2211111111211 1111 111 14888888888888877432 233444
Q ss_pred CCCc
Q 001655 659 LGEH 662 (1036)
Q Consensus 659 l~~t 662 (1036)
+|+.
T Consensus 253 ~g~k 256 (933)
T KOG2114|consen 253 VGEK 256 (933)
T ss_pred CCCe
Confidence 4444
|
|
| >KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning] | Back alignment and domain information |
|---|
Probab=82.16 E-value=1.5e+02 Score=37.37 Aligned_cols=223 Identities=11% Similarity=0.128 Sum_probs=131.3
Q ss_pred ccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEEC--CCCCeEeEEEEEEEcCCCceEEEEEeeee
Q 001655 662 HPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPL--DTFEYGCSILSCSFSDDSNVYYCVGTAYV 739 (1036)
Q Consensus 662 tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~--~~~E~v~si~~~~l~~~~~~~i~VGT~~~ 739 (1036)
.|+.+.+||-...++++-. +..+++.|-...+.+..|.. .+.-.|+.++-+ +.....++++|+
T Consensus 1066 ~pk~~~~hpf~p~i~~ad~----------r~~i~vwd~e~~~~l~~F~n~~~~~t~Vs~l~li--Ne~D~aLlLtas--- 1130 (1387)
T KOG1517|consen 1066 PPKTLKFHPFEPQIAAADD----------RERIRVWDWEKGRLLNGFDNGAFPDTRVSDLELI--NEQDDALLLTAS--- 1130 (1387)
T ss_pred CCceeeecCCCceeEEcCC----------cceEEEEecccCceeccccCCCCCCCccceeeee--cccchhheeeec---
Confidence 4788889998887776642 24678887766666666543 344566666554 334578888887
Q ss_pred CCCCCCCcceEEEEEEEe-C--CeEEEEEEEE----------ecCceeEeccccCeEEEEECC-EEEEEEeeecCCCccc
Q 001655 740 LPEENEPTKGRILVFIVE-D--GKLQLIAEKE----------TKGAVYSLNAFNGKLLAAINQ-KIQLYKWMLRDDGTRE 805 (1036)
Q Consensus 740 ~~~e~~~~~Gri~v~~i~-~--~kl~~~~~~~----------~~g~v~~i~~~~g~Lv~~~g~-~l~v~~~~~~~~~~~~ 805 (1036)
+.|-|.+|+=- + ++.++|.... -.|.|+.=.+-.|+|+++-+. .|++++.. ...
T Consensus 1131 -------~dGvIRIwk~y~~~~~~~eLVTaw~~Ls~~~~~~r~~~~v~dWqQ~~G~Ll~tGd~r~IRIWDa~-----~E~ 1198 (1387)
T KOG1517|consen 1131 -------SDGVIRIWKDYADKWKKPELVTAWSSLSDQLPGARGTGLVVDWQQQSGHLLVTGDVRSIRIWDAH-----KEQ 1198 (1387)
T ss_pred -------cCceEEEecccccccCCceeEEeeccccccCccCCCCCeeeehhhhCCeEEecCCeeEEEEEecc-----cce
Confidence 45777777533 2 4667766432 123555656668888877653 34454421 111
Q ss_pred hhcccccccceEEEEEEE---eCCEEEEEeccccEEEEEEec-cCCeEEEEeccCCcc-eeEEEEEeeC--ceEEEEccC
Q 001655 806 LQSECGHHGHILALYVQT---RGDFIVVGDLMKSISLLIYKH-EEGAIEERARDYNAN-WMSAVEILDD--DIYLGAENN 878 (1036)
Q Consensus 806 L~~~~~~~~~~~~~~l~~---~~~~I~vgD~~~Sv~~~~~~~-~~~~l~~~a~D~~~~-~~~~~~~l~~--~~~l~~D~~ 878 (1036)
+....-+.+...+++|+. .||.|++|=.--|+-+|--+. .+..++-.-|-.... .+..+.+-.. +.++.+-.+
T Consensus 1199 ~~~diP~~s~t~vTaLS~~~~~gn~i~AGfaDGsvRvyD~R~a~~ds~v~~~R~h~~~~~Iv~~slq~~G~~elvSgs~~ 1278 (1387)
T KOG1517|consen 1199 VVADIPYGSSTLVTALSADLVHGNIIAAGFADGSVRVYDRRMAPPDSLVCVYREHNDVEPIVHLSLQRQGLGELVSGSQD 1278 (1387)
T ss_pred eEeecccCCCccceeecccccCCceEEEeecCCceEEeecccCCccccceeecccCCcccceeEEeecCCCcceeeeccC
Confidence 111111222334455443 479999999999988764332 234455555443332 1555555443 347777789
Q ss_pred CcEEEEeeCCCCCCcccccceeEEEEEEcCCccceEE
Q 001655 879 FNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915 (1036)
Q Consensus 879 gnl~~l~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~ 915 (1036)
|.+..++..-. . ...-+.....+.-|+-.|+|.
T Consensus 1279 G~I~~~DlR~~-~---~e~~~~iv~~~~yGs~lTal~ 1311 (1387)
T KOG1517|consen 1279 GDIQLLDLRMS-S---KETFLTIVAHWEYGSALTALT 1311 (1387)
T ss_pred CeEEEEecccC-c---ccccceeeeccccCccceeee
Confidence 99999886542 1 112455566666687676664
|
|
| >KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=82.09 E-value=95 Score=35.06 Aligned_cols=240 Identities=17% Similarity=0.192 Sum_probs=124.1
Q ss_pred ceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECCCCCeeEEecCCCc-cCceeEEeeeecCeEEEEEEeCCCcEEEE
Q 001655 478 YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE-IIPRSVLLCAFEGISYLLCALGDGHLLNF 556 (1036)
Q Consensus 478 ~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~~l~~v~~~~l~~~-~~p~si~~~~~~~~~~L~vgl~~G~l~~y 556 (1036)
.++.|+++++ ...|++-|-- +..++||...+++.+.. +..- -...++++- .+...|+.+..|..+-.|
T Consensus 203 keil~~avS~------Dgkylatgg~-d~~v~Iw~~~t~ehv~~--~~ghr~~V~~L~fr--~gt~~lys~s~Drsvkvw 271 (479)
T KOG0299|consen 203 KEILTLAVSS------DGKYLATGGR-DRHVQIWDCDTLEHVKV--FKGHRGAVSSLAFR--KGTSELYSASADRSVKVW 271 (479)
T ss_pred ceeEEEEEcC------CCcEEEecCC-CceEEEecCcccchhhc--ccccccceeeeeee--cCccceeeeecCCceEEE
Confidence 5778888864 2678888766 88999999988876532 1110 011223222 245679999999999999
Q ss_pred EEecCCCccccceeeecCccceEEEEEEeCCceEEEEEc--CCceEEE--ecCCcEEEeeeccccceeeecccCCCCCCe
Q 001655 557 LLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS--DRPTVIY--SSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632 (1036)
Q Consensus 557 ~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~--~~p~~i~--~~~~~~~~~~~~~~~i~~~~~f~~~~~~~~ 632 (1036)
.++... ... ...|-+.. +..+..-+...+.+.| |+...++ .+..++.|.+- ...+.+++-++.+ +.
T Consensus 272 ~~~~~s----~ve-tlyGHqd~-v~~IdaL~reR~vtVGgrDrT~rlwKi~eesqlifrg~-~~sidcv~~In~~---Hf 341 (479)
T KOG0299|consen 272 SIDQLS----YVE-TLYGHQDG-VLGIDALSRERCVTVGGRDRTVRLWKIPEESQLIFRGG-EGSIDCVAFINDE---HF 341 (479)
T ss_pred ehhHhH----HHH-HHhCCccc-eeeechhcccceEEeccccceeEEEeccccceeeeeCC-CCCeeeEEEeccc---ce
Confidence 887532 100 11222221 1111111122333333 4444333 23455666542 2345555544432 12
Q ss_pred EEEEeCCeEEEEEEcCCCCeEEEEEeCCCcc-----------ceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeC-C
Q 001655 633 LAIAKEGELTIGTIDDIQKLHIRSIPLGEHP-----------RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD-Q 700 (1036)
Q Consensus 633 l~~~~~~~l~i~~l~~~~~~~~~~i~l~~tp-----------~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~-~ 700 (1036)
+.-.+++.+-+=.+-....+.+.+++=+-.| ..++-.+.+++++-.+ +.++|+|.-- .
T Consensus 342 vsGSdnG~IaLWs~~KKkplf~~~~AHgv~~~~~~~~~~~Witsla~i~~sdL~asGS----------~~G~vrLW~i~~ 411 (479)
T KOG0299|consen 342 VSGSDNGSIALWSLLKKKPLFTSRLAHGVIPELDPVNGNFWITSLAVIPGSDLLASGS----------WSGCVRLWKIED 411 (479)
T ss_pred eeccCCceEEEeeecccCceeEeeccccccCCccccccccceeeeEecccCceEEecC----------CCCceEEEEecC
Confidence 3334456666555555555666665543332 2345556666555443 3468888632 2
Q ss_pred C---ceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCe
Q 001655 701 T---FEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK 760 (1036)
Q Consensus 701 t---~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~k 760 (1036)
. ++.+.++.+ .=.|+++ .|..+ ...|++|++. ++.-||-....-..++
T Consensus 412 g~r~i~~l~~ls~--~GfVNsl---~f~~s-gk~ivagiGk------EhRlGRW~~~k~~~~~ 462 (479)
T KOG0299|consen 412 GLRAINLLYSLSL--VGFVNSL---AFSNS-GKRIVAGIGK------EHRLGRWWCLKSGKNS 462 (479)
T ss_pred Cccccceeeeccc--ccEEEEE---EEccC-CCEEEEeccc------ccccceeeEeeccccc
Confidence 2 334443332 2334444 34332 3489999874 3567887665444333
|
|
| >KOG1188 consensus WD40 repeat protein [General function prediction only] | Back alignment and domain information |
|---|
Probab=82.03 E-value=59 Score=35.25 Aligned_cols=141 Identities=23% Similarity=0.294 Sum_probs=82.1
Q ss_pred EEEecCcEEEEecCCccceeeeeCCCC--CcEEEEEe-CCCEEEEEEcCCeEEEEEEcCcEEEEEEeccC----CceeEE
Q 001655 410 VQVTSGSVRLVSSTSRELRNEWKSPPG--YSVNVATA-NASQVLLATGGGHLVYLEIGDGILTEVKHAQL----EYEISC 482 (1036)
Q Consensus 410 vQVt~~~i~l~~~~~~~~~~~w~~~~~--~~I~~as~-~~~~vvv~~~~~~~~~l~~~~~~l~~~~~~~l----~~~is~ 482 (1036)
+-....+||+++..+++.+.+++.+.+ ..+..+++ .+..|+.+.++|.+..+++.. ..++.++.. +..-+|
T Consensus 45 v~lSngsv~lyd~~tg~~l~~fk~~~~~~N~vrf~~~ds~h~v~s~ssDG~Vr~wD~Rs--~~e~a~~~~~~~~~~~f~~ 122 (376)
T KOG1188|consen 45 VSLSNGSVRLYDKGTGQLLEEFKGPPATTNGVRFISCDSPHGVISCSSDGTVRLWDIRS--QAESARISWTQQSGTPFIC 122 (376)
T ss_pred EEecCCeEEEEeccchhhhheecCCCCcccceEEecCCCCCeeEEeccCCeEEEEEeec--chhhhheeccCCCCCcceE
Confidence 334467899999988777877764432 23455555 466777777889998887743 112222222 223468
Q ss_pred EEcCCCCCCCCcccEEEEEEec---CCEEEEEECCCCCe-eE--EecCCCccCceeEEeeeecCeEEEEEEeCCCcEEEE
Q 001655 483 LDINPIGENPSYSQIAAVGMWT---DISVRIFSLPDLNL-IT--KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNF 556 (1036)
Q Consensus 483 l~~~~~~~~~~~~~~~~vg~w~---~~~i~i~~l~~l~~-v~--~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y 556 (1036)
++..- +...++.|+-. +-.+.+|++..-+. +. .+.-.. -..++.+.+. +...|+-|.-||.+-.|
T Consensus 123 ld~nc------k~~ii~~GtE~~~s~A~v~lwDvR~~qq~l~~~~eSH~D--DVT~lrFHP~-~pnlLlSGSvDGLvnlf 193 (376)
T KOG1188|consen 123 LDLNC------KKNIIACGTELTRSDASVVLWDVRSEQQLLRQLNESHND--DVTQLRFHPS-DPNLLLSGSVDGLVNLF 193 (376)
T ss_pred eeccC------cCCeEEeccccccCceEEEEEEeccccchhhhhhhhccC--cceeEEecCC-CCCeEEeecccceEEee
Confidence 87632 25567777643 34677777765322 21 111111 1234444432 34568889999999999
Q ss_pred EEecC
Q 001655 557 LLNMK 561 (1036)
Q Consensus 557 ~~~~~ 561 (1036)
.....
T Consensus 194 D~~~d 198 (376)
T KOG1188|consen 194 DTKKD 198 (376)
T ss_pred ecCCC
Confidence 87643
|
|
| >PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain | Back alignment and domain information |
|---|
Probab=81.95 E-value=3.8 Score=30.70 Aligned_cols=38 Identities=24% Similarity=0.417 Sum_probs=30.8
Q ss_pred EEEeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEECC
Q 001655 470 EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP 514 (1036)
Q Consensus 470 ~~~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~l~ 514 (1036)
.+....+..+|++++.+| .-+.+++|+. +|.|.+|.+.
T Consensus 4 ~~~~k~l~~~v~~~~w~P------~mdLiA~~t~-~g~v~v~Rl~ 41 (47)
T PF12894_consen 4 QLGEKNLPSRVSCMSWCP------TMDLIALGTE-DGEVLVYRLN 41 (47)
T ss_pred eecccCCCCcEEEEEECC------CCCEEEEEEC-CCeEEEEECC
Confidence 445566778899999977 3679999998 9999999983
|
|
| >PF14761 HPS3_N: Hermansky-Pudlak syndrome 3 | Back alignment and domain information |
|---|
Probab=81.87 E-value=65 Score=32.96 Aligned_cols=165 Identities=14% Similarity=0.126 Sum_probs=85.5
Q ss_pred EeCCeEEEEEEcCCCCeEEEEEeCCCccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEEEEEECCCCCe
Q 001655 636 AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEY 715 (1036)
Q Consensus 636 ~~~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i~~~~~~~~E~ 715 (1036)
+.+..+....+.....-..-+++.-++..+++|.+..+.++-+..+.... ....+|.+- +|+.. ..++.
T Consensus 35 ~~g~~Vev~~l~~~~~~~~~~F~Tv~~V~~l~y~~~GDYlvTlE~k~~~~----~~~fvR~Y~--NWr~~-----~~~~~ 103 (215)
T PF14761_consen 35 ASGCKVEVYDLEQEECPLLCTFSTVGRVLQLVYSEAGDYLVTLEEKNKRS----PVDFVRAYF--NWRSQ-----KEENS 103 (215)
T ss_pred cCCCEEEEEEcccCCCceeEEEcchhheeEEEeccccceEEEEEeecCCc----cceEEEEEE--Ehhhh-----cccCC
Confidence 44566666666633333445566667788999999888666665443111 011555552 34432 11222
Q ss_pred EeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeE--eccccCeEEEEECCEEEE
Q 001655 716 GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYS--LNAFNGKLLAAINQKIQL 793 (1036)
Q Consensus 716 v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~--i~~~~g~Lv~~~g~~l~v 793 (1036)
..++ -++|.....+ -.++.+.. ++ +-+...+.++.+ .|+..|.|++|+++++.+
T Consensus 104 ~v~v------------RiaG~~v~~~-~~~~~~~q----------le-iiElPl~~~p~ciaCC~~tG~LlVg~~~~l~l 159 (215)
T PF14761_consen 104 PVRV------------RIAGHRVTPS-FNESSKDQ----------LE-IIELPLSEPPLCIACCPVTGNLLVGCGNKLVL 159 (215)
T ss_pred cEEE------------EEcccccccC-CCCccccc----------eE-EEEecCCCCCCEEEecCCCCCEEEEcCCEEEE
Confidence 2222 2235321110 01111112 22 223455556654 467799999999999999
Q ss_pred EEeeecCCCccch--hc----ccccccceEEEEEEEeCCEEEEEeccc
Q 001655 794 YKWMLRDDGTREL--QS----ECGHHGHILALYVQTRGDFIVVGDLMK 835 (1036)
Q Consensus 794 ~~~~~~~~~~~~L--~~----~~~~~~~~~~~~l~~~~~~I~vgD~~~ 835 (1036)
|.+....-...++ .. .......+.+..+...++||.+-+-..
T Consensus 160 f~l~~~~~~~~~~~~lDFe~~l~~~~~~~~p~~v~ic~~yiA~~s~~e 207 (215)
T PF14761_consen 160 FTLKYQTIQSEKFSFLDFERSLIDHIDNFKPTQVAICEGYIAVMSDLE 207 (215)
T ss_pred EEEEEEEEecccccEEechhhhhheecCceEEEEEEEeeEEEEecCCE
Confidence 9985321101111 11 111122345778888899988755444
|
|
| >PHA02713 hypothetical protein; Provisional | Back alignment and domain information |
|---|
Probab=81.31 E-value=39 Score=40.78 Aligned_cols=150 Identities=12% Similarity=0.107 Sum_probs=82.2
Q ss_pred ceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEecCceeEeccccCeEEEEECC-------------------
Q 001655 729 NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQ------------------- 789 (1036)
Q Consensus 729 ~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~g~Lv~~~g~------------------- 789 (1036)
...+++|... +. .....+..|+...++.+.+.....+-.-.+++.++|+|++.-|.
T Consensus 352 g~IYviGG~~---~~--~~~~sve~Ydp~~~~W~~~~~mp~~r~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~ 426 (557)
T PHA02713 352 DTIYAIGGQN---GT--NVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEE 426 (557)
T ss_pred CEEEEECCcC---CC--CCCceEEEEECCCCeEEECCCCCcccccccEEEECCEEEEEeCCCcccccccccccccccccc
Confidence 5678888642 11 12335777877777777665443332223455668888765552
Q ss_pred ------EEEEEEeeecCCCccchhcccccccceEEEEEEEeCCEEEE-Eecc---ccE-EEEEEeccC-CeEEEEeccCC
Q 001655 790 ------KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV-GDLM---KSI-SLLIYKHEE-GAIEERARDYN 857 (1036)
Q Consensus 790 ------~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~~~~~I~v-gD~~---~Sv-~~~~~~~~~-~~l~~~a~D~~ 857 (1036)
++..|+ +.. .. +...+.....-....+.+.++.|+| |... ... .+.+|+++. ++...++.=+.
T Consensus 427 ~~~~~~~ve~YD--P~t-d~--W~~v~~m~~~r~~~~~~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~W~~~~~m~~ 501 (557)
T PHA02713 427 DTHSSNKVIRYD--TVN-NI--WETLPNFWTGTIRPGVVSHKDDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELITTTES 501 (557)
T ss_pred cccccceEEEEC--CCC-Ce--EeecCCCCcccccCcEEEECCEEEEEeCCCCCCccceeEEEecCCCCCCeeEccccCc
Confidence 233343 322 22 3333322222122345677887765 5332 111 356899987 78998887777
Q ss_pred cceeEEEEEeeCceEEEEccCCcEEEEeeCC
Q 001655 858 ANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888 (1036)
Q Consensus 858 ~~~~~~~~~l~~~~~l~~D~~gnl~~l~~~~ 888 (1036)
+|....+..+++.-++++..+|.-.+-.|++
T Consensus 502 ~r~~~~~~~~~~~iyv~Gg~~~~~~~e~yd~ 532 (557)
T PHA02713 502 RLSALHTILHDNTIMMLHCYESYMLQDTFNV 532 (557)
T ss_pred ccccceeEEECCEEEEEeeecceeehhhcCc
Confidence 7777777777655555554455433334443
|
|
| >KOG1896 consensus mRNA cleavage and polyadenylation factor II complex, subunit CFT1 (CPSF subunit) [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=81.28 E-value=1.7e+02 Score=37.45 Aligned_cols=94 Identities=19% Similarity=0.205 Sum_probs=67.9
Q ss_pred EEEEEEEe-CCeEEEEEEEEecCceeEeccccCeEEEE-ECCEEEEEEeeecCCCccch--hcccccccceEEEEEEEeC
Q 001655 750 RILVFIVE-DGKLQLIAEKETKGAVYSLNAFNGKLLAA-INQKIQLYKWMLRDDGTREL--QSECGHHGHILALYVQTRG 825 (1036)
Q Consensus 750 ri~v~~i~-~~kl~~~~~~~~~g~v~~i~~~~g~Lv~~-~g~~l~v~~~~~~~~~~~~L--~~~~~~~~~~~~~~l~~~~ 825 (1036)
.|+++.++ +..|.=++-.+.+--|+++..+++.|++| +=+.+.+..|+++ -.+| ..+.+-....+++..-+.|
T Consensus 1117 KI~v~~l~r~~~ligVaFiD~~~yv~s~~~vknlIl~gDV~ksisfl~fqee---p~rlsL~srd~~~l~v~s~EFLVdg 1193 (1366)
T KOG1896|consen 1117 KIIVRKLDRDSELIGVAFIDLPLYVHSMKVVKNLILAGDVMKSISFLGFQEE---PYRLSLLSRDFEPLNVYSTEFLVDG 1193 (1366)
T ss_pred EEEEEEeccCCcceeeEEeccceeEEehhhhhhheehhhhhhceEEEEEccC---ceEEEEeecCCchhhceeeeeEEcC
Confidence 68889884 55677677677788888999999998877 4477888888642 2333 3333323344666666777
Q ss_pred C--EEEEEeccccEEEEEEeccC
Q 001655 826 D--FIVVGDLMKSISLLIYKHEE 846 (1036)
Q Consensus 826 ~--~I~vgD~~~Sv~~~~~~~~~ 846 (1036)
+ ..+|.|+.+-+.++.|.++.
T Consensus 1194 ~~L~flvsDa~rNi~vy~Y~Pe~ 1216 (1366)
T KOG1896|consen 1194 SNLSFLVSDADRNIHVYMYAPEN 1216 (1366)
T ss_pred CeeEEEEEcCCCcEEEEEeCCCC
Confidence 6 67899999999999999863
|
|
| >KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only] | Back alignment and domain information |
|---|
Probab=81.22 E-value=33 Score=41.46 Aligned_cols=174 Identities=17% Similarity=0.148 Sum_probs=100.7
Q ss_pred eeEEEEEEeCCCceEEEEEECCCCCeEeEEEEEEEcCCCceEEEEEeeeeCCCCCCCcceEEEEEEEeCCeEEEEEEEEe
Q 001655 690 EMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKET 769 (1036)
Q Consensus 690 ~~~~l~l~d~~t~~~i~~~~~~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~ 769 (1036)
....++.+|+.+.+-...-+++..-.-.+++.+ +...+++|.-.. .....-.+..|+...++...+.+...
T Consensus 299 ~~~~ve~yd~~~~~w~~~a~m~~~r~~~~~~~~-----~~~lYv~GG~~~----~~~~l~~ve~YD~~~~~W~~~a~M~~ 369 (571)
T KOG4441|consen 299 SLRSVECYDPKTNEWSSLAPMPSPRCRVGVAVL-----NGKLYVVGGYDS----GSDRLSSVERYDPRTNQWTPVAPMNT 369 (571)
T ss_pred ccceeEEecCCcCcEeecCCCCcccccccEEEE-----CCEEEEEccccC----CCcccceEEEecCCCCceeccCCccC
Confidence 345788888865432222233322222333333 246777775431 12344567777777666666665555
Q ss_pred cCceeEeccccCeEEEEECC-------EEEEEEeeecCCCccchhcccccccceEEEEEEEeCCEEEE--E-e----ccc
Q 001655 770 KGAVYSLNAFNGKLLAAINQ-------KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV--G-D----LMK 835 (1036)
Q Consensus 770 ~g~v~~i~~~~g~Lv~~~g~-------~l~v~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~~~~~I~v--g-D----~~~ 835 (1036)
.-.=.+++.++|.|++.-|. ++..|+. . ..++...+.+...-......+.+++|++ | | ..+
T Consensus 370 ~R~~~~v~~l~g~iYavGG~dg~~~l~svE~YDp--~---~~~W~~va~m~~~r~~~gv~~~~g~iYi~GG~~~~~~~l~ 444 (571)
T KOG4441|consen 370 KRSDFGVAVLDGKLYAVGGFDGEKSLNSVECYDP--V---TNKWTPVAPMLTRRSGHGVAVLGGKLYIIGGGDGSSNCLN 444 (571)
T ss_pred ccccceeEEECCEEEEEeccccccccccEEEecC--C---CCcccccCCCCcceeeeEEEEECCEEEEEcCcCCCccccc
Confidence 55555555566666554443 2444442 2 2334444444444445566777887765 3 1 334
Q ss_pred cEEEEEEeccCCeEEEEeccCCcceeEEEEEeeCceEEEEccCC
Q 001655 836 SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNF 879 (1036)
Q Consensus 836 Sv~~~~~~~~~~~l~~~a~D~~~~~~~~~~~l~~~~~l~~D~~g 879 (1036)
| +..|++..++...++.=..+|....+..+++.-++++..+|
T Consensus 445 s--ve~YDP~t~~W~~~~~M~~~R~~~g~a~~~~~iYvvGG~~~ 486 (571)
T KOG4441|consen 445 S--VECYDPETNTWTLIAPMNTRRSGFGVAVLNGKIYVVGGFDG 486 (571)
T ss_pred e--EEEEcCCCCceeecCCcccccccceEEEECCEEEEECCccC
Confidence 4 45789999999999988888888888888766666655554
|
|
| >KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification] | Back alignment and domain information |
|---|
Probab=81.15 E-value=98 Score=34.59 Aligned_cols=73 Identities=12% Similarity=0.180 Sum_probs=46.3
Q ss_pred CCCCeEEEEeCCeEEEEEEcCCCC-eEEEEEeCC-CccceEEEecCCCEEEEEEeecCCCCCceeeEEEEEEeCCCceEE
Q 001655 628 AFPDSLAIAKEGELTIGTIDDIQK-LHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFI 705 (1036)
Q Consensus 628 ~~~~~l~~~~~~~l~i~~l~~~~~-~~~~~i~l~-~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~l~l~d~~t~~~i 705 (1036)
..|+|+-++++++=..|++.+..+ -..-.||=. .-+.+|.|.|..+.|.+-++-. .++++.+..+|..+
T Consensus 353 fsPNGy~lATgs~Dnt~kVWDLR~r~~ly~ipAH~nlVS~Vk~~p~~g~fL~TasyD---------~t~kiWs~~~~~~~ 423 (459)
T KOG0272|consen 353 FSPNGYHLATGSSDNTCKVWDLRMRSELYTIPAHSNLVSQVKYSPQEGYFLVTASYD---------NTVKIWSTRTWSPL 423 (459)
T ss_pred ECCCceEEeecCCCCcEEEeeecccccceecccccchhhheEecccCCeEEEEcccC---------cceeeecCCCcccc
Confidence 347888777754444444444321 113344432 2467899999888887776532 58999999999887
Q ss_pred EEEE
Q 001655 706 STYP 709 (1036)
Q Consensus 706 ~~~~ 709 (1036)
.+..
T Consensus 424 ksLa 427 (459)
T KOG0272|consen 424 KSLA 427 (459)
T ss_pred hhhc
Confidence 6543
|
|
| >PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins | Back alignment and domain information |
|---|
Probab=80.24 E-value=1.6e+02 Score=36.74 Aligned_cols=96 Identities=18% Similarity=0.244 Sum_probs=64.4
Q ss_pred CeEEE--EecCcEEEEecCCccceeeeeCCCCCcEEEEE-------eCCCEEEEEEcCCeEEEEEEcC----cEEEEE--
Q 001655 407 NQLVQ--VTSGSVRLVSSTSRELRNEWKSPPGYSVNVAT-------ANASQVLLATGGGHLVYLEIGD----GILTEV-- 471 (1036)
Q Consensus 407 ~~~vQ--Vt~~~i~l~~~~~~~~~~~w~~~~~~~I~~as-------~~~~~vvv~~~~~~~~~l~~~~----~~l~~~-- 471 (1036)
++++| ..++.+.-+|.+.++.+.+|+...+..|..-+ ..+.+.+++.+++.+.. +|+ .++...
T Consensus 494 ~mil~~~~~~~~ly~mDLe~GKVV~eW~~~~~~~v~~~~p~~K~aqlt~e~tflGls~n~lfr--iDpR~~~~k~v~~~~ 571 (794)
T PF08553_consen 494 NMILLDPNNPNKLYKMDLERGKVVEEWKVHDDIPVVDIAPDSKFAQLTNEQTFLGLSDNSLFR--IDPRLSGNKLVDSQS 571 (794)
T ss_pred ceEeecCCCCCceEEEecCCCcEEEEeecCCCcceeEecccccccccCCCceEEEECCCceEE--eccCCCCCceeeccc
Confidence 46666 35788888999999999999986654444332 23567888888777654 444 223321
Q ss_pred EeccCCceeEEEEcCCCCCCCCcccEEEEEEecCCEEEEEE
Q 001655 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFS 512 (1036)
Q Consensus 472 ~~~~l~~~is~l~~~~~~~~~~~~~~~~vg~w~~~~i~i~~ 512 (1036)
........-+|++-. ...+++||.- +|.|++|+
T Consensus 572 k~Y~~~~~Fs~~aTt-------~~G~iavgs~-~G~IRLyd 604 (794)
T PF08553_consen 572 KQYSSKNNFSCFATT-------EDGYIAVGSN-KGDIRLYD 604 (794)
T ss_pred cccccCCCceEEEec-------CCceEEEEeC-CCcEEeec
Confidence 122233456788764 2568999988 99999997
|
Vid27p is a cytoplasmic protein of unknown function, possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase) [, ]. |
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() | |
| Query | 1036 | ||||
| 4a0b_A | 1159 | Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Dup | 0.0 | ||
| 4a0a_A | 1159 | Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Dup | 0.0 | ||
| 3e0c_A | 1140 | Crystal Structure Of Dna Damage-Binding Protein 1(D | 0.0 | ||
| 4e54_A | 1150 | Damaged Dna Induced Uv-Damaged Dna-Binding Protein | 0.0 | ||
| 3ei1_A | 1158 | Structure Of Hsddb1-Drddb2 Bound To A 14 Bp 6-4 Pho | 0.0 | ||
| 4a11_A | 1159 | Structure Of The Hsddb1-Hscsa Complex Length = 1159 | 0.0 | ||
| 4a0l_A | 1144 | Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp | 0.0 | ||
| 4a08_A | 1159 | Structure Of Hsddb1-Drddb2 Bound To A 13 Bp Cpd-Dup | 0.0 | ||
| 2b5l_A | 1140 | Crystal Structure Of Ddb1 In Complex With Simian Vi | 0.0 | ||
| 3ei4_A | 1158 | Structure Of The Hsddb1-Hsddb2 Complex Length = 115 | 0.0 | ||
| 3i7h_A | 1143 | Crystal Structure Of Ddb1 In Complex With The H-Box | 0.0 | ||
| 2b5n_A | 323 | Crystal Structure Of The Ddb1 Bpb Domain Length = 3 | 7e-98 |
| >pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex ( Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4) Length = 1159 | Back alignment and structure |
|
| >pdb|4A0A|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex ( Pyrimidine At D-1 Position) At 3.6 A Resolution (Cpd 3) Length = 1159 | Back alignment and structure |
|
| >pdb|3E0C|A Chain A, Crystal Structure Of Dna Damage-Binding Protein 1(Ddb1) Length = 1140 | Back alignment and structure |
|
| >pdb|4E54|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb) Dimerization And Its Roles In Chromatinized Dna Repair Length = 1150 | Back alignment and structure |
|
| >pdb|3EI1|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 14 Bp 6-4 Photoproduct Containing Dna-Duplex Length = 1158 | Back alignment and structure |
|
| >pdb|4A11|A Chain A, Structure Of The Hsddb1-Hscsa Complex Length = 1159 | Back alignment and structure |
|
| >pdb|4A0L|A Chain A, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic Site Containing Dna-Duplex Length = 1144 | Back alignment and structure |
|
| >pdb|4A08|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 13 Bp Cpd-Duplex ( Purine At D-1 Position) At 3.0 A Resolution (Cpd 1) Length = 1159 | Back alignment and structure |
|
| >pdb|2B5L|A Chain A, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V Protein Length = 1140 | Back alignment and structure |
|
| >pdb|3EI4|A Chain A, Structure Of The Hsddb1-Hsddb2 Complex Length = 1158 | Back alignment and structure |
|
| >pdb|3I7H|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of Hbx Length = 1143 | Back alignment and structure |
|
| >pdb|2B5N|A Chain A, Crystal Structure Of The Ddb1 Bpb Domain Length = 323 | Back alignment and structure |
|
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 1036 | |||
| 3ei3_A | 1158 | DNA damage-binding protein 1; UV-damage, DDB, nucl | 0.0 | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 3e-11 | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 3e-07 | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 2e-06 |
| >3ei3_A DNA damage-binding protein 1; UV-damage, DDB, nucleotide excision repair, xeroderma pigmentosum, cytoplasm, DNA repair; HET: DNA PG4; 2.30A {Homo sapiens} PDB: 3ei1_A* 3ei2_A* 3ei4_A* 4a0l_A* 3e0c_A* 3i7k_A* 3i7h_A* 3i7l_A* 3i7n_A* 3i7o_A* 3i7p_A* 3i89_A* 3i8c_A* 3i8e_A* 2b5l_A 2b5m_A 2hye_A* 4a11_A* 4a0k_C* 4a0a_A* ... Length = 1158 | Back alignment and structure |
|---|
Score = 911 bits (2355), Expect = 0.0
Identities = 572/1096 (52%), Positives = 755/1096 (68%), Gaps = 67/1096 (6%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDR 58
+ +V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ++ITRA G+V DR
Sbjct: 70 VKEVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDR 129
Query: 59 IGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
IGRP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLY
Sbjct: 130 IGRPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLY 189
Query: 119 GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVL 178
GC PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +
Sbjct: 190 GCQAPTICFVYQDPQG-RHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAI 248
Query: 179 IIGEETIVYCSANAF---KAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
IIG+E+I Y + + + I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 249 IIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQ 308
Query: 236 -----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +
Sbjct: 309 MDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAM 368
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 369 ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGL 428
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W LRS + D LV+SF+ +TR+L +N +E+EETE+ GF QT FC + + QL+
Sbjct: 429 WPLRSDPNRETDDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLI 487
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
Q+TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L +
Sbjct: 488 QITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGR-ALYYLQIHPQELRQ 546
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
+ H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIP
Sbjct: 547 ISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIP 606
Query: 531 RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
RS+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+
Sbjct: 607 RSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTN 666
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQ
Sbjct: 667 VFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQ 726
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--------------------------- 683
KLHIR++PL E PR+IC+QE S+ F + S + +
Sbjct: 727 KLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSS 786
Query: 684 ------------SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
S EE E+H + ++D TFE + + EY S++SC D N Y
Sbjct: 787 KLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTY 846
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN +
Sbjct: 847 FIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTV 906
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE
Sbjct: 907 RLYEWTTEK----ELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEE 962
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFV
Sbjct: 963 IARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFV 1022
Query: 912 NRFRHGSLVMRLP-DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
N F HGSLVM+ ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK
Sbjct: 1023 NVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKS 1082
Query: 971 VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------V 1020
VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ +
Sbjct: 1083 VGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1142
Query: 1021 SVEELCKRVEELTRLH 1036
+ ++L K VEELTR+H
Sbjct: 1143 TADDLIKVVEELTRIH 1158
|
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Score = 66.8 bits (162), Expect = 3e-11
Identities = 92/699 (13%), Positives = 210/699 (30%), Gaps = 197/699 (28%)
Query: 285 SYVEVLERYVNLGPIV-DF-C--VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQA 340
Y ++L V V +F C V D+ + +++ I
Sbjct: 17 QYKDILS--VFEDAFVDNFDCKDVQDMPKS----ILS-KEEIDH-----------IIMSK 58
Query: 341 SVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
++ W+L S ++ F+ + L ++ E + + +
Sbjct: 59 DAVSGTLRLFWTLLSKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMM----TRMYIE 114
Query: 401 CHDAIYNQLVQVTSGSV-RLVSSTSRELRN---EWKSPP----------GYSVNVATANA 446
D +YN +V RL +LR E + G +
Sbjct: 115 QRDRLYNDNQVFAKYNVSRL--QPYLKLRQALLELRPAKNVLIDGVLGSGKTWVALDVCL 172
Query: 447 SQVLLATGGGHLVYLEIG-----DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVG 501
S + + +L + + +L ++ +L Y+I P+++ +
Sbjct: 173 SYKVQCKMDFKIFWLNLKNCNSPETVLEMLQ--KLLYQID----------PNWTSRS--- 217
Query: 502 MWTDISVRI-FSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGD----GHLLNF 556
D S I + + + L + +L+ L + F
Sbjct: 218 ---DHSSNIKLRIHSIQAELRRLLKSKPYENCLLV------------LLNVQNAKAWNAF 262
Query: 557 LLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLL--YSNVN 614
L+ K T K+V+ S+ TTH+ T+ K LL Y +
Sbjct: 263 NLSCKILLTTRFKQVT--------DFLSAATTTHISLDHHSMTLTPDEVKSLLLKYLDCR 314
Query: 615 LKEVSH-MCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSR 673
+++ + N P ++ E + + + + +
Sbjct: 315 PQDLPREVLTTN----P--RRLSIIAESIRDGLATWDNW-----------KHVNCDKLTT 357
Query: 674 TFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
SL AE +M F+ S+ F +++
Sbjct: 358 IIES-SLNVLEPAEYRKM---------------------FDR-LSV----FPPSAHI--- 387
Query: 734 VGTAYVL------PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAI 787
+L +++ +V + + K L+ EK+ K + S+ + +L +
Sbjct: 388 --PTILLSLIWFDVIKSDV-----MVVVNKLHKYSLV-EKQPKESTISIPSIYLELKVKL 439
Query: 788 NQKIQLYKWML-RDDGTRELQSE--------------CGHHGHILALYVQTRGDFIVVGD 832
+ L++ ++ + + S+ GHH L + +
Sbjct: 440 ENEYALHRSIVDHYNIPKTFDSDDLIPPYLDQYFYSHIGHH-----LKNIEHPERM---T 491
Query: 833 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
L + + L ++ +E++ R + W ++ IL+ L ++
Sbjct: 492 LFRMV-FLDFRF----LEQKIRHDSTAWNASGSILNT-----------LQQLKFYKPYIC 535
Query: 893 DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
D + +V + +F+ + ++ +D+ +I
Sbjct: 536 DNDPKYERLVNA--ILDFL--PKIEENLICSKYTDLLRI 570
|
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Score = 53.7 bits (128), Expect = 3e-07
Identities = 69/448 (15%), Positives = 132/448 (29%), Gaps = 124/448 (27%)
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
K +L +V + F+ D + L+ ID I +
Sbjct: 17 QYKDIL--SVFEDAF--VDNFDCKDVQDMP----KSILSKEEIDHI---------IMSKD 59
Query: 664 RRICHQEQS-RTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
+ R F K + ++ FV + ++F+ + P+ T + S+++
Sbjct: 60 A----VSGTLRLFWTLLSKQEEMVQK----FVEEVLRINYKFLMS-PIKTEQRQPSMMTR 110
Query: 723 SFSDDSNVYYCVGTAYVLPEENEPTKGRI-LVFIVEDGKLQLIAEKETKG-AVYSLNAFN 780
+ + + Y V + N R+ + L+L + K + +
Sbjct: 111 MYIEQRDRLY--NDNQVFAKYNVS---RLQPYLKLRQALLEL---RPAKNVLIDGVLGS- 161
Query: 781 GK-LLAAI---NQKIQLYK-----WM-LRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
GK +A + K+Q W+ L++ + E E + L Q ++
Sbjct: 162 GKTWVALDVCLSYKVQCKMDFKIFWLNLKNCNSPETVLE-----MLQKLLYQIDPNWTSR 216
Query: 831 GDLMKSISLLIYKHEEGAIEERARDYNANWM--SAVEILDDDIYLGAENNFN-----LFT 883
D +I K +I+ R + + + +L + A N FN L T
Sbjct: 217 SDHSSNI-----KLRIHSIQAELRRLLKSKPYENCLLVLLNVQNAKAWNAFNLSCKILLT 271
Query: 884 VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI 943
R TD L H+ H S+ + + +
Sbjct: 272 TRFKQ--VTD----FLSAATTTHI-----SLDHHSMTLTPDEVKS------LLLKYLDCR 314
Query: 944 GVIASLPHEQYLFLEKLQTNLRKVIKGVG-----GLNH-EQWRSFNNEKKTVDAKNFLDG 997
LP E L TN R + + GL + W+ N +K
Sbjct: 315 --PQDLPRE------VLTTNPR-RLSIIAESIRDGLATWDNWKHVNCDK----------- 354
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEEL 1025
++ + S+ L
Sbjct: 355 ----------------LTTIIESSLNVL 366
|
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Score = 51.4 bits (122), Expect = 2e-06
Identities = 79/601 (13%), Positives = 163/601 (27%), Gaps = 176/601 (29%)
Query: 473 HAQLEYEIS-----CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
H +++E DI + E ++ D+ I S +++ I +
Sbjct: 4 HHHMDFETGEHQYQYKDILSVFE-DAFVDNFDCKDVQDMPKSILSKEEIDHIIMS---KD 59
Query: 528 IIPRSVLLCAFEGISYLLCALGDGHLLNFL---LNMKTGELTDR-KKVSLGTQPITLRTF 583
+ ++ L F L + + + F+ L + L K +T R +
Sbjct: 60 AVSGTLRL--FW----TLLSKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMT-RMY 112
Query: 584 SSK-----NTTHVFAASD--RPTVIYSSNKKLLYSNVNLKEVSH-MCPF------NSAAF 629
+ N VFA + R + LL + +
Sbjct: 113 IEQRDRLYNDNQVFAKYNVSRLQPYLKLRQALLELRPAKNVLIDGVLGSGKTWVALDVCL 172
Query: 630 PDSLAIAKEGE---LTIG---TIDDI----QKLHIRSIP----LGEHPRRICH-----QE 670
+ + + L + + + + QKL + P +H I Q
Sbjct: 173 SYKVQCKMDFKIFWLNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQA 232
Query: 671 QSRTFAICSLKNQSCAEESEMHFVRL--LDD-QTFEFISTYPLDTFEYGCSILSCSFSDD 727
+ R + S ++C L L + Q + + F C IL +
Sbjct: 233 ELRRL-LKSKPYENC----------LLVLLNVQNAKAW-----NAFNLSCKILLTT-RFK 275
Query: 728 SNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLA-A 786
+ + + + E K LL
Sbjct: 276 QVTDFLSAATTTHISLDHHSMT--------------LTPDEVKS-----------LLLKY 310
Query: 787 INQKIQLYKWMLRDDGTRELQSE-CGHHGHILALYVQTRGDFIVVGDLMKSISLLI--YK 843
++ + Q +L E + L++ + + ++ +K
Sbjct: 311 LDCRPQ------------DLPREVLTTNPRRLSI----------IAESIRDGLATWDNWK 348
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV-RKNSEGATDEERGRLEVV 902
H + S++ +L+ Y F+ +V + L ++
Sbjct: 349 HVNCDKLTTIIE------SSLNVLEPAEY---RKMFDRLSVFPPS----AHIPTILLSLI 395
Query: 903 ----GEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
+ + VN+ SLV + P I YL L+
Sbjct: 396 WFDVIKSDVMVVVNKLHKYSLVEKQPKESTISI-------------------PSIYLELK 436
Query: 959 KLQTN---LRKVIKGVGGLNH-EQWRSFNNEKKTVDAKNFLDGDLIESFLD--LSRTRMD 1012
N L + I ++H ++F+++ +LD S + L
Sbjct: 437 VKLENEYALHRSI-----VDHYNIPKTFDSDDLI---PPYLDQ-YFYSHIGHHLKNIEHP 487
Query: 1013 E 1013
E
Sbjct: 488 E 488
|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 1036 | ||||
| d1qksa2 | 432 | b.70.2.1 (A:136-567) C-terminal (heme d1) domain o | 0.003 |
| >d1qksa2 b.70.2.1 (A:136-567) C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase {Paracoccus denitrificans [TaxId: 266]} Length = 432 | Back information, alignment and structure |
|---|
class: All beta proteins fold: 8-bladed beta-propeller superfamily: C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase family: C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase domain: C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase species: Paracoccus denitrificans [TaxId: 266]
Score = 39.0 bits (90), Expect = 0.003
Identities = 39/394 (9%), Positives = 102/394 (25%), Gaps = 60/394 (15%)
Query: 645 TIDDIQKLHIRSIPLGEHPRRICHQEQSRT-FAICSLKNQSCAEESEMHFVRLLDDQTFE 703
+ ++++ + + P + + F++ + + L+D T+E
Sbjct: 3 GMKEMRESWKVHVAPEDRPTQQMNDWDLENLFSVT---------LRDAGQIALIDGSTYE 53
Query: 704 FISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQL 763
+ G ++ S + +G G++ + + +
Sbjct: 54 -----IKTVLDTGYAVHISRLSASGRYLFVIG-----------RDGKVNMIDLWMKEPTT 97
Query: 764 IAE----KETKGAVYSLN-AFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-------- 810
+AE E + S + K A Y M + +
Sbjct: 98 VAEIKIGSEARSIETSKMEGWEDKYAIAGAYWPPQYVIMDGETLEPKKIQSTRGMTYDEQ 157
Query: 811 --GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
+ A+ +V + ++ +L+ + + + ++ +
Sbjct: 158 EYHPEPRVAAILASHYRPEFIVN-VKETGKILLVDYTDLNNLKTTEISAERFLHDGGLDG 216
Query: 869 DDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
Y T + ++V G G+ + V
Sbjct: 217 SHRY--------FITAANARNKLVVIDTKEGKLVAIEDTGGQTPHPGRGANFVHPTFGPV 268
Query: 929 GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
+ + + +I + P K+ +GG + N++
Sbjct: 269 WATSHM----GDDSVALIGTDPEGHPDNAWKIL----DSFPALGGGSLFIKTHPNSQYLY 320
Query: 989 VDAKNFLDGDLIES--FLDLSRTRMDEISKTMNV 1020
VDA + ++ S D+ D
Sbjct: 321 VDATLNPEAEISGSVAVFDIKAMTGDGSDPEFKT 354
|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 1036 | |||
| d1gxra_ | 337 | Groucho/tle1, C-terminal domain {Human (Homo sapie | 98.51 | |
| d1tbga_ | 340 | beta1-subunit of the signal-transducing G protein | 98.5 | |
| d1erja_ | 388 | Tup1, C-terminal domain {Baker's yeast (Saccharomy | 98.38 | |
| d1nr0a2 | 299 | Actin interacting protein 1 {Nematode (Caenorhabdi | 98.34 | |
| d1gxra_ | 337 | Groucho/tle1, C-terminal domain {Human (Homo sapie | 98.2 | |
| d1nexb2 | 355 | Cdc4 propeller domain {Baker's yeast (Saccharomyce | 98.17 | |
| d1nr0a1 | 311 | Actin interacting protein 1 {Nematode (Caenorhabdi | 97.73 | |
| d1nr0a2 | 299 | Actin interacting protein 1 {Nematode (Caenorhabdi | 97.67 | |
| d1nexb2 | 355 | Cdc4 propeller domain {Baker's yeast (Saccharomyce | 97.6 | |
| d1nr0a1 | 311 | Actin interacting protein 1 {Nematode (Caenorhabdi | 97.45 | |
| d1l0qa2 | 301 | Surface layer protein {Archaeon Methanosarcina maz | 97.42 | |
| d1k8kc_ | 371 | Arp2/3 complex 41 kDa subunit ARPC1 {Cow (Bos taur | 97.05 | |
| d1k8kc_ | 371 | Arp2/3 complex 41 kDa subunit ARPC1 {Cow (Bos taur | 96.99 | |
| d1l0qa2 | 301 | Surface layer protein {Archaeon Methanosarcina maz | 96.96 | |
| d1yfqa_ | 342 | Cell cycle arrest protein BUB3 {Baker's yeast (Sac | 96.75 | |
| d1jmxb_ | 346 | Quinohemoprotein amine dehydrogenase B chain {Pseu | 96.72 | |
| d1tbga_ | 340 | beta1-subunit of the signal-transducing G protein | 96.67 | |
| d1yfqa_ | 342 | Cell cycle arrest protein BUB3 {Baker's yeast (Sac | 96.67 | |
| d1pgua1 | 325 | Actin interacting protein 1 {Baker's yeast (Saccha | 96.44 | |
| d1pgua2 | 287 | Actin interacting protein 1 {Baker's yeast (Saccha | 96.3 | |
| d1p22a2 | 293 | F-box/WD-repeat protein 1 (beta-TRCP1) {Human (Hom | 96.25 | |
| d1jmxb_ | 346 | Quinohemoprotein amine dehydrogenase B chain {Pseu | 96.04 | |
| d1pbyb_ | 337 | Quinohemoprotein amine dehydrogenase B chain {Para | 96.03 | |
| d1erja_ | 388 | Tup1, C-terminal domain {Baker's yeast (Saccharomy | 95.58 | |
| d1vyhc1 | 317 | Platelet-activating factor acetylhydrolase IB subu | 95.08 | |
| d1ri6a_ | 333 | Putative isomerase YbhE {Escherichia coli [TaxId: | 94.79 | |
| d1pgua2 | 287 | Actin interacting protein 1 {Baker's yeast (Saccha | 93.93 | |
| d1qksa2 | 432 | C-terminal (heme d1) domain of cytochrome cd1-nitr | 93.76 | |
| d1ri6a_ | 333 | Putative isomerase YbhE {Escherichia coli [TaxId: | 93.75 | |
| d1q7fa_ | 279 | Brain tumor cg10719-pa {Fruit fly (Drosophila mela | 91.95 | |
| d1vyhc1 | 317 | Platelet-activating factor acetylhydrolase IB subu | 91.57 | |
| d1hzua2 | 426 | C-terminal (heme d1) domain of cytochrome cd1-nitr | 82.24 | |
| d1mdah_ | 368 | Methylamine dehydrogenase, H-chain {Paracoccus den | 81.22 | |
| d2ovrb2 | 342 | F-box/WD repeat-containing protein 7, FBXW7 {Human | 81.1 |
| >d1gxra_ b.69.4.1 (A:) Groucho/tle1, C-terminal domain {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Groucho/tle1, C-terminal domain species: Human (Homo sapiens) [TaxId: 9606]
Probab=98.51 E-value=2.5e-05 Score=47.31 Aligned_cols=76 Identities=13% Similarity=0.098 Sum_probs=33.5
Q ss_pred CCCCCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECCCCC---EEEE-ECCCCCCCCEEEEEEEECCEEEEEEEEC
Q ss_conf 13882347997089899998663999998557789999779987---0688-4289966721589456449079999958
Q 001655 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLN---LITK-EHLGGEIIPRSVLLCAFEGISYLLCALG 549 (1036)
Q Consensus 474 ~~l~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~~l~---~v~~-~~l~~~~~p~si~~~~~~~~~~L~vgl~ 549 (1036)
......|+|+++++ .+.+++.| . ++.|.||++...+ .+.. .....+....++.+.+ +..+|++|..
T Consensus 48 ~~H~~~V~~v~fs~------~g~~latg-~-dg~V~iWd~~~~~~~~~~~~~~~~~h~~~I~~v~~s~--dg~~l~s~~~ 117 (337)
T d1gxra_ 48 LNHGEVVCAVTISN------PTRHVYTG-G-KGCVKVWDISHPGNKSPVSQLDCLNRDNYIRSCKLLP--DGCTLIVGGE 117 (337)
T ss_dssp ECCSSCCCEEEECS------SSSEEEEE-C-BSEEEEEETTSTTCCSCSEEEECSCTTSBEEEEEECT--TSSEEEEEES
T ss_pred CCCCCCEEEEEECC------CCCEEEEE-E-CCEEEEEECCCCCCCCEEEEEEECCCCCCEEEEEECC--CCCEEEEEEC
T ss_conf 79999289999989------99999999-7-9988997736776331168764048899689999867--9988988612
Q ss_pred CCCEEEEEEE
Q ss_conf 9839999993
Q 001655 550 DGHLLNFLLN 559 (1036)
Q Consensus 550 ~G~l~~y~~~ 559 (1036)
||.+..|.+.
T Consensus 118 dg~i~iwd~~ 127 (337)
T d1gxra_ 118 ASTLSIWDLA 127 (337)
T ss_dssp SSEEEEEECC
T ss_pred CCCCCCCCCC
T ss_conf 3321111111
|
| >d1tbga_ b.69.4.1 (A:) beta1-subunit of the signal-transducing G protein heterotrimer {Cow (Bos taurus) [TaxId: 9913]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: beta1-subunit of the signal-transducing G protein heterotrimer species: Cow (Bos taurus) [TaxId: 9913]
Probab=98.50 E-value=2.7e-05 Score=47.09 Aligned_cols=276 Identities=13% Similarity=0.084 Sum_probs=118.7
Q ss_pred CCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCEEEEECCCCCCCCEEEEEEEECCEEEEEEEECCCCEEEE
Q ss_conf 82347997089899998663999998557789999779987068842899667215894564490799999589839999
Q 001655 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNF 556 (1036)
Q Consensus 477 ~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y 556 (1036)
...|+|+++++ ...+++.|.. ||.|.||++.+.+.+...... ...+.++.+. .+..++..|..||.+..+
T Consensus 55 ~~~I~~l~~s~------~~~~l~sgs~-Dg~v~iWd~~~~~~~~~~~~~-~~~v~~v~~~--~~~~~l~~~~~d~~i~~~ 124 (340)
T d1tbga_ 55 LAKIYAMHWGT------DSRLLVSASQ-DGKLIIWDSYTTNKVHAIPLR-SSWVMTCAYA--PSGNYVACGGLDNICSIY 124 (340)
T ss_dssp SSCEEEEEECT------TSSEEEEEET-TTEEEEEETTTTEEEEEEECS-CSCEEEEEEC--TTSSEEEEEETTCCEEEE
T ss_pred CCCEEEEEECC------CCCEEEEEEC-CCCEEEEECCCCEEEEEEECC-CCCEEEEEEE--CCCEEEEEECCCCEEECC
T ss_conf 89888999989------9999999978-995556310210257997246-5337756760--121144310133201013
Q ss_pred EEECCCCCCCCCEEEECCCCCEEEEEEEECCCEEEEEECCCCEEEEECCCC-EEE-EEECCCCCEEEECCCCCCCCCEEE
Q ss_conf 993688764332143128543379999839921799972986599962990-898-220534541452545799999199
Q 001655 557 LLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK-LLY-SNVNLKEVSHMCPFNSAAFPDSLA 634 (1036)
Q Consensus 557 ~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~~~-~~~-~~~~~~~i~~~~~~~~~~~~~~li 634 (1036)
............+. ..|............+........+........... ... .......+...... .....++
T Consensus 125 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~ 200 (340)
T d1tbga_ 125 NLKTREGNVRVSRE-LAGHTGYLSCCRFLDDNQIVTSSGDTTCALWDIETGQQTTTFTGHTGDVMSLSLA---PDTRLFV 200 (340)
T ss_dssp ESSSSCSCCCEEEE-ECCCSSCEEEEEEEETTEEEEEETTTEEEEEETTTTEEEEEEECCSSCEEEEEEC---TTSSEEE
T ss_pred CCCCCCCCCCCCEE-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCC---CCCCEEE
T ss_conf 32222122211100-1354211011111111111111244543200123221111123310157630012---4421268
Q ss_pred EE-ECCEEEEEEECCCCCEEEEEEEC-CCCCCEEEEECCCCEEEEEEEECCCCCCCEEEEEEEEEECCCCEEEEEEECCC
Q ss_conf 98-58918999985888758899708-98546689746998999998304788883035799999689953999897799
Q 001655 635 IA-KEGELTIGTIDDIQKLHIRSIPL-GEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDT 712 (1036)
Q Consensus 635 ~~-~~~~l~i~~l~~~~~~~~~~i~l-~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~i~lid~~t~~~i~~~~l~~ 712 (1036)
.. .++.+.+..+.. .-..+++.- ...++.++++|..+.+++++.+ +.+++++....+.+..+....
T Consensus 201 ~~~~d~~v~i~d~~~--~~~~~~~~~h~~~i~~v~~~p~~~~l~s~s~d----------~~i~~~~~~~~~~~~~~~~~~ 268 (340)
T d1tbga_ 201 SGACDASAKLWDVRE--GMCRQTFTGHESDINAICFFPNGNAFATGSDD----------ATCRLFDLRADQELMTYSHDN 268 (340)
T ss_dssp EEETTTEEEEEETTT--TEEEEEECCCSSCEEEEEECTTSSEEEEEETT----------SCEEEEETTTTEEEEEECCTT
T ss_pred EEECCCEEEEEECCC--CCEEEEEECCCCCEEEEEECCCCCEEEEEECC----------CEEEEEEECCCCCCCCCCCCC
T ss_conf 760573699999999--94889995788985899997998999999699----------969997521221111111224
Q ss_pred CCE-EEEEEEEEECCCCCEEEEEEEEEECCCCCCCCCEEEEEEEEECCEEEEEEEE-EECCCEEEECCC-CC-EEEEEE-
Q ss_conf 870-7689999976899439999855437998877414999999969969999998-735940274434-68-199998-
Q 001655 713 FEY-GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEK-ETKGAVYSLNAF-NG-KLLAAI- 787 (1036)
Q Consensus 713 ~E~-v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~-~~~g~v~~i~~~-~g-~li~~~- 787 (1036)
... +.++ .+.. ...++++|+ ..|.|+++++..+++. ... .+.++|++++-- ++ +|+.+.
T Consensus 269 ~~~~i~~~---~~s~-~~~~l~~g~----------~dg~i~iwd~~~~~~~--~~~~~H~~~V~~l~~s~d~~~l~s~s~ 332 (340)
T d1tbga_ 269 IICGITSV---SFSK-SGRLLLAGY----------DDFNCNVWDALKADRA--GVLAGHDNRVSCLGVTDDGMAVATGSW 332 (340)
T ss_dssp CCSCEEEE---EECS-SSCEEEEEE----------TTSCEEEEETTTCCEE--EEECCCSSCEEEEEECTTSSCEEEEET
T ss_pred CCCCEEEE---EECC-CCCEEEEEE----------CCCEEEEEECCCCCEE--EEECCCCCCEEEEEEECCCCEEEEECC
T ss_conf 45745899---9989-999999997----------9798999999999398--998489997899999089999999906
Q ss_pred CCEEEEE
Q ss_conf 7979999
Q 001655 788 NQKIQLY 794 (1036)
Q Consensus 788 g~~i~i~ 794 (1036)
.+.|+++
T Consensus 333 Dg~v~iW 339 (340)
T d1tbga_ 333 DSFLKIW 339 (340)
T ss_dssp TSCEEEE
T ss_pred CCEEEEE
T ss_conf 9979985
|
| >d1erja_ b.69.4.1 (A:) Tup1, C-terminal domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Tup1, C-terminal domain species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=98.38 E-value=5.3e-05 Score=45.19 Aligned_cols=36 Identities=14% Similarity=0.365 Sum_probs=20.7
Q ss_pred CCCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCEE
Q ss_conf 88234799708989999866399999855778999977998706
Q 001655 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLI 519 (1036)
Q Consensus 476 l~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~~l~~v 519 (1036)
-...|+|+++++ .+.+++.|. ++.+.||++.+.+.+
T Consensus 61 H~~~V~~l~fs~------dg~~lasg~--d~~i~iW~~~~~~~~ 96 (388)
T d1erja_ 61 HTSVVCCVKFSN------DGEYLATGC--NKTTQVYRVSDGSLV 96 (388)
T ss_dssp CSSCCCEEEECT------TSSEEEEEC--BSCEEEEETTTCCEE
T ss_pred CCCCEEEEEECC------CCCEEEEEE--CCEEEEEEECCCCEE
T ss_conf 999689999999------999999994--994899981364057
|
| >d1nr0a2 b.69.4.1 (A:313-611) Actin interacting protein 1 {Nematode (Caenorhabditis elegans) [TaxId: 6239]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Actin interacting protein 1 species: Nematode (Caenorhabditis elegans) [TaxId: 6239]
Probab=98.34 E-value=6.7e-05 Score=44.57 Aligned_cols=144 Identities=15% Similarity=0.106 Sum_probs=62.4
Q ss_pred CEEEEEEEEEECCCCCCCCCEEEEEEEEECCEEEEEEEEEECCCEEEECCC-CC-EEEEEE-CCEEEEEEEEECCCCCCC
Q ss_conf 439999855437998877414999999969969999998735940274434-68-199998-797999994525777200
Q 001655 729 NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF-NG-KLLAAI-NQKIQLYKWMLRDDGTRE 805 (1036)
Q Consensus 729 ~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~-~g-~li~~~-g~~i~i~~~~~~~~~~~~ 805 (1036)
..++++|+ ..|.+.++++...++..+.....+++|++++.- ++ +|+++. +..+.+|+... + ..
T Consensus 148 ~~~l~~g~----------~dg~i~~~d~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~d~~i~~~~~~~---~-~~ 213 (299)
T d1nr0a2 148 KQFVAVGG----------QDSKVHVYKLSGASVSEVKTIVHPAEITSVAFSNNGAFLVATDQSRKVIPYSVAN---N-FE 213 (299)
T ss_dssp SCEEEEEE----------TTSEEEEEEEETTEEEEEEEEECSSCEEEEEECTTSSEEEEEETTSCEEEEEGGG---T-TE
T ss_pred CCCCCCCC----------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---C-CC
T ss_conf 11111111----------1111111111111111111111111111111111111111111111111111111---1-11
Q ss_pred HHCCCCCCCC-EEEEEEE--EECCEEEEEECCCCEEEEEEECCCCEEEEEECCCCCCEEEEEEEEECCEEEEECCCCCEE
Q ss_conf 0200243222-1899999--829999998524409999996248919997124886106999996186199970688599
Q 001655 806 LQSECGHHGH-ILALYVQ--TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882 (1036)
Q Consensus 806 L~~~~~~~~~-~~~~~l~--~~~n~IivgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~v~~~~~l~~~~~l~~D~~gnl~ 882 (1036)
+.....+..+ ..+.++. ..+++++.|..-..+.++..+.................+.++.+.+++.++.+..+|++.
T Consensus 214 ~~~~~~~~~h~~~v~~l~~s~~~~~l~sgs~dg~i~iwd~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~l~s~s~D~~i~ 293 (299)
T d1nr0a2 214 LAHTNSWTFHTAKVACVSWSPDNVRLATGSLDNSVIVWNMNKPSDHPIIIKGAHAMSSVNSVIWLNETTIVSAGQDSNIK 293 (299)
T ss_dssp ESCCCCCCCCSSCEEEEEECTTSSEEEEEETTSCEEEEETTCTTSCCEEETTSSTTSCEEEEEEEETTEEEEEETTSCEE
T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCEEEEEECCCCCCCEEEEECCCCCCCEEEEEECCCCEEEEEECCCEEE
T ss_conf 11111111111111111246664513888289979999899997314898348988968999977989999992899799
Q ss_pred EEEE
Q ss_conf 9951
Q 001655 883 TVRK 886 (1036)
Q Consensus 883 ~l~~ 886 (1036)
+...
T Consensus 294 iWdl 297 (299)
T d1nr0a2 294 FWNV 297 (299)
T ss_dssp EEEC
T ss_pred EEEC
T ss_conf 9944
|
| >d1gxra_ b.69.4.1 (A:) Groucho/tle1, C-terminal domain {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Groucho/tle1, C-terminal domain species: Human (Homo sapiens) [TaxId: 9606]
Probab=98.20 E-value=0.00013 Score=42.72 Aligned_cols=111 Identities=15% Similarity=0.148 Sum_probs=50.2
Q ss_pred CEEEEEEC--CCEEEEEECCCEEEEEEECCC----EEEEEEECCCCCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEE
Q ss_conf 58999967--889999986986999998386----799997113882347997089899998663999998557789999
Q 001655 438 SVNVATAN--ASQVLLATGGGHLVYLEIGDG----ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511 (1036)
Q Consensus 438 ~I~~as~~--~~~ivi~~~~~~iv~l~~~~~----~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~ 511 (1036)
.|.+.+.+ +.+++.+ +++.+...++..+ .+.......-...|.|+++.+ .+.+++.|.. ++.+.+|
T Consensus 53 ~V~~v~fs~~g~~latg-~dg~V~iWd~~~~~~~~~~~~~~~~~h~~~I~~v~~s~------dg~~l~s~~~-dg~i~iw 124 (337)
T d1gxra_ 53 VVCAVTISNPTRHVYTG-GKGCVKVWDISHPGNKSPVSQLDCLNRDNYIRSCKLLP------DGCTLIVGGE-ASTLSIW 124 (337)
T ss_dssp CCCEEEECSSSSEEEEE-CBSEEEEEETTSTTCCSCSEEEECSCTTSBEEEEEECT------TSSEEEEEES-SSEEEEE
T ss_pred CEEEEEECCCCCEEEEE-ECCEEEEEECCCCCCCCEEEEEEECCCCCCEEEEEECC------CCCEEEEEEC-CCCCCCC
T ss_conf 28999998999999999-79988997736776331168764048899689999867------9988988612-3321111
Q ss_pred ECCCCCEEEEECCCCC-CCCEEEEEEEECCEEEEEEEECCCCEEEEEE
Q ss_conf 7799870688428996-6721589456449079999958983999999
Q 001655 512 SLPDLNLITKEHLGGE-IIPRSVLLCAFEGISYLLCALGDGHLLNFLL 558 (1036)
Q Consensus 512 ~l~~l~~v~~~~l~~~-~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~ 558 (1036)
++...+......+... .....+.+. .+..+++.+..++.+..+.+
T Consensus 125 d~~~~~~~~~~~~~~~~~~v~~~~~~--~~~~~l~s~~~d~~i~~~~~ 170 (337)
T d1gxra_ 125 DLAAPTPRIKAELTSSAPACYALAIS--PDSKVCFSCCSDGNIAVWDL 170 (337)
T ss_dssp ECCCC--EEEEEEECSSSCEEEEEEC--TTSSEEEEEETTSCEEEEET
T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCC
T ss_conf 11111111111111111111111111--11111111111111111111
|
| >d1nexb2 b.69.4.1 (B:370-744) Cdc4 propeller domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Cdc4 propeller domain species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=98.17 E-value=0.00015 Score=42.33 Aligned_cols=326 Identities=10% Similarity=0.113 Sum_probs=155.4
Q ss_pred CEEEEEECCCEEEEEECCCEEEEEEECCCEEEEEEECCCCCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECCCCC
Q ss_conf 58999967889999986986999998386799997113882347997089899998663999998557789999779987
Q 001655 438 SVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLN 517 (1036)
Q Consensus 438 ~I~~as~~~~~ivi~~~~~~iv~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~~l~ 517 (1036)
.|++.+.++.+++.+..++.+...++.++++...- ......|.|+++.+ ...++.|.. ++.+.++.+...+
T Consensus 15 vitc~~~~~~~l~tgs~Dg~i~vWd~~~~~~~~~l-~~H~~~V~~l~~s~-------~~~l~s~s~-D~~i~iw~~~~~~ 85 (355)
T d1nexb2 15 VITCLQFEDNYVITGADDKMIRVYDSINKKFLLQL-SGHDGGVWALKYAH-------GGILVSGST-DRTVRVWDIKKGC 85 (355)
T ss_dssp CEEEEEEETTEEEEEETTTEEEEEETTTTEEEEEE-ECCSSCEEEEEEET-------TTEEEEEET-TCCEEEEETTTTE
T ss_pred CEEEEEECCCEEEEEECCCEEEEEECCCCCEEEEE-ECCCCCEEEEEECC-------CCEEEEEEC-CCCCCCCCCCCCC
T ss_conf 69999988999999918990999989999399999-78999889999869-------999999964-5244321111111
Q ss_pred EEEEECCCCCCCCEEEEEEEECCEEEEEEEECCCCEEEEEEECCCCCCCCCEEEECCCCCEEEEEEEECCCEEEEEECCC
Q ss_conf 06884289966721589456449079999958983999999368876433214312854337999983992179997298
Q 001655 518 LITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDR 597 (1036)
Q Consensus 518 ~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~~ 597 (1036)
......... ............+..+++.+..||.+..|.+..... +.... .....+. .. ... ...
T Consensus 86 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~d~~i~iw~~~~~~~-~~~~~--~~~~~~~--~~-~~~--------~~~ 150 (355)
T d1nexb2 86 CTHVFEGHN-STVRCLDIVEYKNIKYIVTGSRDNTLHVWKLPKESS-VPDHG--EEHDYPL--VF-HTP--------EEN 150 (355)
T ss_dssp EEEEECCCS-SCEEEEEEEEETTEEEEEEEETTSEEEEEECCC--------------CCCE--EE-SCT--------TTC
T ss_pred CCCCCCCCC-CCCCCCCCCCCCCCCEEEEECCCCCEEEEECCCCCE-ECCCC--CCCEECC--CE-ECC--------CCC
T ss_conf 111110011-111111111112322045543888689998567730-01246--5200010--00-001--------123
Q ss_pred CEEEEECCCCEEEEEECCCCCEEEECCCCCCCCCEEEEEE-CCEEEEEEECCCCCEEEEEEE-CCCCCCEEEEECCCCEE
Q ss_conf 6599962990898220534541452545799999199985-891899998588875889970-89854668974699899
Q 001655 598 PTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK-EGELTIGTIDDIQKLHIRSIP-LGEHPRRICHQEQSRTF 675 (1036)
Q Consensus 598 p~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~li~~~-~~~l~i~~l~~~~~~~~~~i~-l~~tp~~i~y~~~~~~~ 675 (1036)
...+... ......+....+ .++.++... ++.+.+..+.... ..+... .......+.+.+..+.+
T Consensus 151 ~~~~~~~-------~~~~~~v~~~~~-----~~~~~~~~~~d~~i~~~d~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 216 (355)
T d1nexb2 151 PYFVGVL-------RGHMASVRTVSG-----HGNIVVSGSYDNTLIVWDVAQMK--CLYILSGHTDRIYSTIYDHERKRC 216 (355)
T ss_dssp TTEEEEE-------ECCSSCEEEEEE-----ETTEEEEEETTSCEEEEETTTTE--EEEEECCCSSCEEEEEEETTTTEE
T ss_pred CCEEEEE-------EECCCCCCCCCC-----CCCEEEEECCCCEEEEEECCCCC--CEEEEECCCCCCCCCCCCCCCEEE
T ss_conf 4012101-------100222100002-----56334421144204443013110--001100012332111111210021
Q ss_pred EEEEEECCCCCCCEEEEEEEEEECCCCEEEEEEECCCCCEEEEEEEEEECCCCCEEEEEEEEEECCCCCCCCCEEEEEEE
Q ss_conf 99983047888830357999996899539998977998707689999976899439999855437998877414999999
Q 001655 676 AICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 755 (1036)
Q Consensus 676 ~v~~~~~~~~~~~~~~~~i~lid~~t~~~i~~~~l~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~ 755 (1036)
+.++.+ +.++++|..+...+..++-. ...+.++. +. ..+++.|+ ..|.|.+++
T Consensus 217 ~~~~~d----------~~i~i~d~~~~~~~~~~~~h-~~~v~~~~---~~---~~~l~~~~----------~dg~i~iwd 269 (355)
T d1nexb2 217 ISASMD----------TTIRIWDLENGELMYTLQGH-TALVGLLR---LS---DKFLVSAA----------ADGSIRGWD 269 (355)
T ss_dssp EEEETT----------SCEEEEETTTCCEEEEECCC-SSCCCEEE---EC---SSEEEEEC----------TTSEEEEEE
T ss_pred ECCCCC----------CEEEEEECCCCCCCCCCCCC-CCCCCCCC---CC---CCEEEEEE----------CCCCCCCCC
T ss_conf 012456----------36876301221111111111-11111112---32---10033320----------111111111
Q ss_pred EECCEEEEEEEEEECCCEEEECCC--CCEE-EEEECCEEEEEEEEECCCCCCCHHCCCCCCCCEEEEEEEEECCEEEEEE
Q ss_conf 969969999998735940274434--6819-9998797999994525777200020024322218999998299999985
Q 001655 756 VEDGKLQLIAEKETKGAVYSLNAF--NGKL-LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832 (1036)
Q Consensus 756 i~~~kl~~~~~~~~~g~v~~i~~~--~g~l-i~~~g~~i~i~~~~~~~~~~~~L~~~~~~~~~~~~~~l~~~~n~IivgD 832 (1036)
+.+.+.+... ..+.+.+++.+ ++++ +++..++|++|+++ .+ ++...........+.++...++.++++-
T Consensus 270 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~l~~g~d~~i~vwd~~---tg--~~~~~~~~~~~~~V~~v~~~~~~~~~~~ 341 (355)
T d1nexb2 270 ANDYSRKFSY---HHTNLSAITTFYVSDNILVSGSENQFNIYNLR---SG--KLVHANILKDADQIWSVNFKGKTLVAAV 341 (355)
T ss_dssp TTTCCEEEEE---ECTTCCCCCEEEECSSEEEEEETTEEEEEETT---TC--CBCCSCTTTTCSEEEEEEEETTEEEEEE
T ss_pred CCCCCEECCC---CCCCCEEEEEECCCCCEEEEEECCEEEEEECC---CC--CEEEEEECCCCCCEEEEEECCCEEEEEE
T ss_conf 1111100012---46882299998499989999809979999999---99--7988884589998999998399199999
Q ss_pred CCCC
Q ss_conf 2440
Q 001655 833 LMKS 836 (1036)
Q Consensus 833 ~~~S 836 (1036)
..++
T Consensus 342 s~dg 345 (355)
T d1nexb2 342 EKDG 345 (355)
T ss_dssp ESSS
T ss_pred ECCC
T ss_conf 8989
|
| >d1nr0a1 b.69.4.1 (A:2-312) Actin interacting protein 1 {Nematode (Caenorhabditis elegans) [TaxId: 6239]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Actin interacting protein 1 species: Nematode (Caenorhabditis elegans) [TaxId: 6239]
Probab=97.73 E-value=0.00095 Score=37.26 Aligned_cols=63 Identities=16% Similarity=0.202 Sum_probs=29.9
Q ss_pred CCEEEEEECCCEEEEEEECCCEEEEEEECCCCCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECCCCC
Q ss_conf 889999986986999998386799997113882347997089899998663999998557789999779987
Q 001655 446 ASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLN 517 (1036)
Q Consensus 446 ~~~ivi~~~~~~iv~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~~l~ 517 (1036)
+..++.+. ++.++.+.+++.+...+- ......|+|++++| .+.+++.|.. ++.+.+|++.+.+
T Consensus 29 g~~l~~~~-~~~v~i~~~~~~~~~~~~-~~H~~~v~~~~~sp------~g~~latg~~-dg~i~iwd~~~~~ 91 (311)
T d1nr0a1 29 GDKIQYCN-GTSVYTVPVGSLTDTEIY-TEHSHQTTVAKTSP------SGYYCASGDV-HGNVRIWDTTQTT 91 (311)
T ss_dssp SSEEEEEE-TTEEEEEETTCSSCCEEE-CCCSSCEEEEEECT------TSSEEEEEET-TSEEEEEESSSTT
T ss_pred CCEEEEEE-CCEEEEEECCCCCEEEEE-CCCCCCEEEEEEEC------CCCEEECCCC-CCEEEEEEEECCC
T ss_conf 89999996-999999999999661797-47888889999948------9996722556-7367466310111
|
| >d1nr0a2 b.69.4.1 (A:313-611) Actin interacting protein 1 {Nematode (Caenorhabditis elegans) [TaxId: 6239]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Actin interacting protein 1 species: Nematode (Caenorhabditis elegans) [TaxId: 6239]
Probab=97.67 E-value=0.0012 Score=36.70 Aligned_cols=61 Identities=13% Similarity=0.117 Sum_probs=22.7
Q ss_pred EEEEEECCCEEEEEEECCCEEEEEEECCCCCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECCC
Q ss_conf 99999869869999983867999971138823479970898999986639999985577899997799
Q 001655 448 QVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515 (1036)
Q Consensus 448 ~ivi~~~~~~iv~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~~ 515 (1036)
.++++..++.+..+.+..+++.+.....-...|+|+++.+ ...+++.+.. ++.+.+|++.+
T Consensus 150 ~l~~g~~dg~i~~~d~~~~~~~~~~~~~~~~~i~~~~~~~------~~~~l~~~~~-d~~i~~~~~~~ 210 (299)
T d1nr0a2 150 FVAVGGQDSKVHVYKLSGASVSEVKTIVHPAEITSVAFSN------NGAFLVATDQ-SRKVIPYSVAN 210 (299)
T ss_dssp EEEEEETTSEEEEEEEETTEEEEEEEEECSSCEEEEEECT------TSSEEEEEET-TSCEEEEEGGG
T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC------CCCCCCCCCC-CCCCCCCCCCC
T ss_conf 1111111111111111111111111111111111111111------1111111111-11111111111
|
| >d1nexb2 b.69.4.1 (B:370-744) Cdc4 propeller domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Cdc4 propeller domain species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=97.60 E-value=0.0014 Score=36.14 Aligned_cols=333 Identities=12% Similarity=0.095 Sum_probs=135.7
Q ss_pred EEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCEEEEECCCCCCCCEEEEEEEECCEEEEEEEECCCCEEEEEE
Q ss_conf 34799708989999866399999855778999977998706884289966721589456449079999958983999999
Q 001655 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLL 558 (1036)
Q Consensus 479 ~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~ 558 (1036)
-++|+++. .++++.|.. |+.|.+|++.+.+.+....- ....+.++.+.+ ..+|+.|..||.+..+..
T Consensus 15 vitc~~~~--------~~~l~tgs~-Dg~i~vWd~~~~~~~~~l~~-H~~~V~~l~~s~---~~~l~s~s~D~~i~iw~~ 81 (355)
T d1nexb2 15 VITCLQFE--------DNYVITGAD-DKMIRVYDSINKKFLLQLSG-HDGGVWALKYAH---GGILVSGSTDRTVRVWDI 81 (355)
T ss_dssp CEEEEEEE--------TTEEEEEET-TTEEEEEETTTTEEEEEEEC-CSSCEEEEEEET---TTEEEEEETTCCEEEEET
T ss_pred CEEEEEEC--------CCEEEEEEC-CCEEEEEECCCCCEEEEEEC-CCCCEEEEEECC---CCEEEEEECCCCCCCCCC
T ss_conf 69999988--------999999918-99099998999939999978-999889999869---999999964524432111
Q ss_pred ECCCCCCCCCEEEECCCCCEE-EEEEEECCCEEEEEECCCCEEEEECCCCEEEEEECCCCCEEEECCCCCCCCCEEEEEE
Q ss_conf 368876433214312854337-9999839921799972986599962990898220534541452545799999199985
Q 001655 559 NMKTGELTDRKKVSLGTQPIT-LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637 (1036)
Q Consensus 559 ~~~~~~l~~~~~~~lG~~pv~-l~~~~~~~~~~v~~~~~~p~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~li~~~ 637 (1036)
.... .. ........+.. .......+...+...+. ++.+.+..+..... ........ ........
T Consensus 82 ~~~~--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------d~~i~iw~~~~~~~--~~~~~~~~-~~~~~~~~ 146 (355)
T d1nexb2 82 KKGC--CT--HVFEGHNSTVRCLDIVEYKNIKYIVTGSR--------DNTLHVWKLPKESS--VPDHGEEH-DYPLVFHT 146 (355)
T ss_dssp TTTE--EE--EEECCCSSCEEEEEEEEETTEEEEEEEET--------TSEEEEEECCC--------------CCCEEESC
T ss_pred CCCC--CC--CCCCCCCCCCCCCCCCCCCCCCEEEEECC--------CCCEEEEECCCCCE--ECCCCCCC-EECCCEEC
T ss_conf 1111--11--11100111111111111123220455438--------88689998567730--01246520-00100000
Q ss_pred CCEEEEEEECCCCCEEEEEEECCCCCCEEEEECCCCEEEEEEEECCCCCCCEEEEEEEEEECCCCEEEEEEECCCCCEEE
Q ss_conf 89189999858887588997089854668974699899999830478888303579999968995399989779987076
Q 001655 638 EGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGC 717 (1036)
Q Consensus 638 ~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~i~lid~~t~~~i~~~~l~~~E~v~ 717 (1036)
.............. ...-....+..+.++..+.+ ..+++.|..+.+.+....-.... +.
T Consensus 147 ---------~~~~~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~d----------~~i~~~d~~~~~~~~~~~~~~~~-~~ 205 (355)
T d1nexb2 147 ---------PEENPYFVGVLRGH-MASVRTVSGHGNIVVSGSYD----------NTLIVWDVAQMKCLYILSGHTDR-IY 205 (355)
T ss_dssp ---------TTTCTTEEEEEECC-SSCEEEEEEETTEEEEEETT----------SCEEEEETTTTEEEEEECCCSSC-EE
T ss_pred ---------CCCCCCEEEEEEEC-CCCCCCCCCCCCEEEEECCC----------CEEEEEECCCCCCEEEEECCCCC-CC
T ss_conf ---------11234012101100-22210000256334421144----------20444301311000110001233-21
Q ss_pred EEEEEEECCCCCEEEEEEEEEECCCCCCCCCEEEEEEEEECCEEEEEEEEEECCCEEEECCCCCEEEEEE-CCEEEEEEE
Q ss_conf 8999997689943999985543799887741499999996996999999873594027443468199998-797999994
Q 001655 718 SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAI-NQKIQLYKW 796 (1036)
Q Consensus 718 si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl~~~~~~~~~g~v~~i~~~~g~li~~~-g~~i~i~~~ 796 (1036)
++ .+.. ...++++|+ ..|.+++++...++.... .....++|+++..-+++|+.+. +..+++++.
T Consensus 206 ~~---~~~~-~~~~~~~~~----------~d~~i~i~d~~~~~~~~~-~~~h~~~v~~~~~~~~~l~~~~~dg~i~iwd~ 270 (355)
T d1nexb2 206 ST---IYDH-ERKRCISAS----------MDTTIRIWDLENGELMYT-LQGHTALVGLLRLSDKFLVSAAADGSIRGWDA 270 (355)
T ss_dssp EE---EEET-TTTEEEEEE----------TTSCEEEEETTTCCEEEE-ECCCSSCCCEEEECSSEEEEECTTSEEEEEET
T ss_pred CC---CCCC-CCEEEECCC----------CCCEEEEEECCCCCCCCC-CCCCCCCCCCCCCCCCEEEEEECCCCCCCCCC
T ss_conf 11---1112-100210124----------563687630122111111-11111111111232100333201111111111
Q ss_pred EECCCCCCCHHCCCCCCCCEE-EEEEEEECCEEEEEECCCCEEEEEEECCCCEEEEEECCCCCCEEEEEEEEECCEEEEE
Q ss_conf 525777200020024322218-9999982999999852440999999624891999712488610699999618619997
Q 001655 797 MLRDDGTRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875 (1036)
Q Consensus 797 ~~~~~~~~~L~~~~~~~~~~~-~~~l~~~~n~IivgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~v~~~~~l~~~~~l~~ 875 (1036)
. .+.+++.. ...... .......++++..|. -..+.+ |+.+.+++.........-.+.++.+-.+..++++
T Consensus 271 ~---~~~~~~~~---~~~~~~~~~~~~~~~~~l~~g~-d~~i~v--wd~~tg~~~~~~~~~~~~~V~~v~~~~~~~~~~~ 341 (355)
T d1nexb2 271 N---DYSRKFSY---HHTNLSAITTFYVSDNILVSGS-ENQFNI--YNLRSGKLVHANILKDADQIWSVNFKGKTLVAAV 341 (355)
T ss_dssp T---TCCEEEEE---ECTTCCCCCEEEECSSEEEEEE-TTEEEE--EETTTCCBCCSCTTTTCSEEEEEEEETTEEEEEE
T ss_pred C---CCCEECCC---CCCCCEEEEEECCCCCEEEEEE-CCEEEE--EECCCCCEEEEEECCCCCCEEEEEECCCEEEEEE
T ss_conf 1---11100012---4688229999849998999980-997999--9999997988884589998999998399199999
Q ss_pred CCCCCEEEE
Q ss_conf 068859999
Q 001655 876 ENNFNLFTV 884 (1036)
Q Consensus 876 D~~gnl~~l 884 (1036)
..+|.+++-
T Consensus 342 s~dg~~~l~ 350 (355)
T d1nexb2 342 EKDGQSFLE 350 (355)
T ss_dssp ESSSCEEEE
T ss_pred ECCCCEEEE
T ss_conf 898909999
|
| >d1nr0a1 b.69.4.1 (A:2-312) Actin interacting protein 1 {Nematode (Caenorhabditis elegans) [TaxId: 6239]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Actin interacting protein 1 species: Nematode (Caenorhabditis elegans) [TaxId: 6239]
Probab=97.45 E-value=0.0023 Score=34.87 Aligned_cols=18 Identities=17% Similarity=0.270 Sum_probs=6.5
Q ss_pred EEEEEEECCCEEEEEECCC
Q ss_conf 9999985577899997799
Q 001655 497 IAAVGMWTDISVRIFSLPD 515 (1036)
Q Consensus 497 ~~~v~~w~~~~i~i~~l~~ 515 (1036)
+++.|.. |+.++||++.+
T Consensus 252 ~l~tgs~-Dg~v~iwd~~t 269 (311)
T d1nr0a1 252 KIASASA-DKTIKIWNVAT 269 (311)
T ss_dssp EEEEEET-TSEEEEEETTT
T ss_pred EEEEEEC-CCEEEEEECCC
T ss_conf 9999937-99699999999
|
| >d1l0qa2 b.69.2.3 (A:1-301) Surface layer protein {Archaeon Methanosarcina mazei [TaxId: 2209]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: YVTN repeat-like/Quinoprotein amine dehydrogenase family: YVTN repeat domain: Surface layer protein species: Archaeon Methanosarcina mazei [TaxId: 2209]
Probab=97.42 E-value=0.0025 Score=34.59 Aligned_cols=286 Identities=14% Similarity=0.189 Sum_probs=118.4
Q ss_pred EEEEEE-CCCEEEEEEECCCEEEEEEECCCCCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCEEEEECCCC
Q ss_conf 999998-6986999998386799997113882347997089899998663999998557789999779987068842899
Q 001655 448 QVLLAT-GGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 526 (1036)
Q Consensus 448 ~ivi~~-~~~~iv~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~~l~~v~~~~l~~ 526 (1036)
++.|+. +++.+..+...++++. .+.++...+..++++| .+.+++++.+.++.+.+|++.+.+.+......
T Consensus 3 ~~yV~~~~~~~v~v~D~~t~~~~--~~i~~g~~p~~va~sp------dG~~l~v~~~~~~~i~v~d~~t~~~~~~~~~~- 73 (301)
T d1l0qa2 3 FAYIANSESDNISVIDVTSNKVT--ATIPVGSNPMGAVISP------DGTKVYVANAHSNDVSIIDTATNNVIATVPAG- 73 (301)
T ss_dssp EEEEEETTTTEEEEEETTTTEEE--EEEECSSSEEEEEECT------TSSEEEEEEGGGTEEEEEETTTTEEEEEEECS-
T ss_pred EEEEEECCCCEEEEEECCCCEEE--EEEECCCCCEEEEEEC------CCCEEEEEECCCCEEEEEECCCCCEEEEEECC-
T ss_conf 99999789998999999999599--9998899836999928------98999999789998999999989410320002-
Q ss_pred CCCCEEEEEEEECCEEEEEE-EECCCCEEEEEEECCCCCCCCCEEEECCCCCEEEEEEEECCCEEEEEECC-CCEEE-EE
Q ss_conf 66721589456449079999-95898399999936887643321431285433799998399217999729-86599-96
Q 001655 527 EIIPRSVLLCAFEGISYLLC-ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASD-RPTVI-YS 603 (1036)
Q Consensus 527 ~~~p~si~~~~~~~~~~L~v-gl~~G~l~~y~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~~~-~p~~i-~~ 603 (1036)
..|+.+.+.. +..++++ +..++.+..+. ..++. ..+.......|..+ .+ ..+...+++.+. ...+. ..
T Consensus 74 -~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~--~~~~~--~~~~~~~~~~~~~~-~~-~~dg~~~~~~~~~~~~~~~~~ 144 (301)
T d1l0qa2 74 -SSPQGVAVSP--DGKQVYVTNMASSTLSVID--TTSNT--VAGTVKTGKSPLGL-AL-SPDGKKLYVTNNGDKTVSVIN 144 (301)
T ss_dssp -SSEEEEEECT--TSSEEEEEETTTTEEEEEE--TTTTE--EEEEEECSSSEEEE-EE-CTTSSEEEEEETTTTEEEEEE
T ss_pred -CCCCCCCCCC--CCCCCCCCCCCCCEEEECC--CCCCE--EEEECCCCCCCEEE-EE-ECCCCEEEEEECCCCCEEEEE
T ss_conf -4643110001--1111111111110011001--24302--43202444442378-76-058971554201111001100
Q ss_pred CC-CCEEEEEECC-CCCEEEECCCCCCCCCEEEEEECCEEEEEEECCCCCEEEEEEECCCCCCEEEEECCCCEEEEEEEE
Q ss_conf 29-9089822053-454145254579999919998589189999858887588997089854668974699899999830
Q 001655 604 SN-KKLLYSNVNL-KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK 681 (1036)
Q Consensus 604 ~~-~~~~~~~~~~-~~i~~~~~~~~~~~~~~li~~~~~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~ 681 (1036)
.. +... ..+.. .....++ +.. -...++.+..+...+...+.......+.++.+..|..+++.++.+...+++..
T Consensus 145 ~~~~~~~-~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~ 220 (301)
T d1l0qa2 145 TVTKAVI-NTVSVGRSPKGIA-VTP--DGTKVYVANFDSMSISVIDTVTNSVIDTVKVEAAPSGIAVNPEGTKAYVTNVD 220 (301)
T ss_dssp TTTTEEE-EEEECCSSEEEEE-ECT--TSSEEEEEETTTTEEEEEETTTTEEEEEEECSSEEEEEEECTTSSEEEEEEEC
T ss_pred CCCCCEE-EECCCCCCCEEEE-EEC--CCCCEEEECCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCCCCCCCC
T ss_conf 0146303-5315678842888-604--65401310121111111111100011101335775031101111011110021
Q ss_pred CCCCCCCEEEEEEEEEECCCCEEEEEEECCCCCEEEEEEEEEECCCCCEEEEEEEEEECCCCCCCCCEEEEEEEEECCEE
Q ss_conf 47888830357999996899539998977998707689999976899439999855437998877414999999969969
Q 001655 682 NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKL 761 (1036)
Q Consensus 682 ~~~~~~~~~~~~i~lid~~t~~~i~~~~l~~~E~v~si~~~~l~~~~~~~i~VGT~~~~~~e~~~~~Gri~v~~i~~~kl 761 (1036)
. ....+.++|..+.+.+..++.. ..+.++ .+..+++ ++.|... ..|+|.++++..+++
T Consensus 221 ~-------~~~~v~v~D~~t~~~~~~~~~~--~~~~~v---a~spdg~-~l~va~~---------~~~~i~v~D~~t~~~ 278 (301)
T d1l0qa2 221 K-------YFNTVSMIDTGTNKITARIPVG--PDPAGI---AVTPDGK-KVYVALS---------FCNTVSVIDTATNTI 278 (301)
T ss_dssp S-------SCCEEEEEETTTTEEEEEEECC--SSEEEE---EECTTSS-EEEEEET---------TTTEEEEEETTTTEE
T ss_pred C-------EEEEEEEEECCCCEEEEEECCC--CCEEEE---EEECCCC-EEEEEEC---------CCCEEEEEECCCCEE
T ss_conf 0-------0002323656998199998489--987799---9918989-9999989---------999699999999959
Q ss_pred EEEEEEEECCCEEEECCC
Q ss_conf 999998735940274434
Q 001655 762 QLIAEKETKGAVYSLNAF 779 (1036)
Q Consensus 762 ~~~~~~~~~g~v~~i~~~ 779 (1036)
+....+...++++..|
T Consensus 279 --~~~~~vg~~P~~~g~f 294 (301)
T d1l0qa2 279 --TATMAVGKNPYASGQF 294 (301)
T ss_dssp --EEEEECSSSEECCSSC
T ss_pred --EEEEECCCCCCEEEEE
T ss_conf --9999689997493798
|
| >d1k8kc_ b.69.4.1 (C:) Arp2/3 complex 41 kDa subunit ARPC1 {Cow (Bos taurus) [TaxId: 9913]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Arp2/3 complex 41 kDa subunit ARPC1 species: Cow (Bos taurus) [TaxId: 9913]
Probab=97.05 E-value=0.0061 Score=32.13 Aligned_cols=114 Identities=11% Similarity=0.065 Sum_probs=49.1
Q ss_pred CEEEEEEC--CCEEEEEECCCEEEEEEECCCEEEEEEEC-CCCCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECC
Q ss_conf 58999967--88999998698699999838679999711-3882347997089899998663999998557789999779
Q 001655 438 SVNVATAN--ASQVLLATGGGHLVYLEIGDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP 514 (1036)
Q Consensus 438 ~I~~as~~--~~~ivi~~~~~~iv~l~~~~~~l~~~~~~-~l~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~ 514 (1036)
+|++.+.+ +..++++..++.+.+++.+.+++..+... .-...|.|++++| ...+++.+.. |+.+.+|++.
T Consensus 9 pIt~~~~s~dg~~la~~~~~~~i~iw~~~~~~~~~~~~l~gH~~~V~~l~fsp------~~~~l~s~s~-D~~i~vWd~~ 81 (371)
T d1k8kc_ 9 PISCHAWNKDRTQIAICPNNHEVHIYEKSGNKWVQVHELKEHNGQVTGVDWAP------DSNRIVTCGT-DRNAYVWTLK 81 (371)
T ss_dssp CCCEEEECTTSSEEEEECSSSEEEEEEEETTEEEEEEEEECCSSCEEEEEEET------TTTEEEEEET-TSCEEEEEEE
T ss_pred CEEEEEECCCCCEEEEEECCCEEEEEECCCCCEEEEEEECCCCCCEEEEEECC------CCCEEEEEEC-CCEEEEEEEC
T ss_conf 83899998999999999488989999888997899999558899888999979------9999999979-9939998620
Q ss_pred CCCEEEEECCCC-CCCCEEEEEEEECCEEEEEEEECCCCEEEEEEEC
Q ss_conf 987068842899-6672158945644907999995898399999936
Q 001655 515 DLNLITKEHLGG-EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNM 560 (1036)
Q Consensus 515 ~l~~v~~~~l~~-~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~~~ 560 (1036)
+.+......+.. ...+.++.+.+ +..++++|..||.+..+.++.
T Consensus 82 ~~~~~~~~~~~~~~~~v~~i~~~p--~~~~l~~~s~d~~i~i~~~~~ 126 (371)
T d1k8kc_ 82 GRTWKPTLVILRINRAARCVRWAP--NEKKFAVGSGSRVISICYFEQ 126 (371)
T ss_dssp TTEEEEEEECCCCSSCEEEEEECT--TSSEEEEEETTSSEEEEEEET
T ss_pred CCCCCCCCCCCCCCCCCCCCCCCC--CCCCCEEECCCCCCEEEEEEC
T ss_conf 332110012232211000111111--121100000257630254420
|
| >d1k8kc_ b.69.4.1 (C:) Arp2/3 complex 41 kDa subunit ARPC1 {Cow (Bos taurus) [TaxId: 9913]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Arp2/3 complex 41 kDa subunit ARPC1 species: Cow (Bos taurus) [TaxId: 9913]
Probab=96.99 E-value=0.0069 Score=31.79 Aligned_cols=75 Identities=15% Similarity=0.048 Sum_probs=46.7
Q ss_pred CCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCEEEEECCC-CCCCCEEEEEEEECCEEEEEEEECCCCEEE
Q ss_conf 8234799708989999866399999855778999977998706884289-966721589456449079999958983999
Q 001655 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG-GEIIPRSVLLCAFEGISYLLCALGDGHLLN 555 (1036)
Q Consensus 477 ~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~~l~~v~~~~l~-~~~~p~si~~~~~~~~~~L~vgl~~G~l~~ 555 (1036)
..+|+|+++++ ...++++|.. ++.|.||++..-+......+. ....+.++.+.+ +..+|+.+..||.+..
T Consensus 7 ~~pIt~~~~s~------dg~~la~~~~-~~~i~iw~~~~~~~~~~~~l~gH~~~V~~l~fsp--~~~~l~s~s~D~~i~v 77 (371)
T d1k8kc_ 7 VEPISCHAWNK------DRTQIAICPN-NHEVHIYEKSGNKWVQVHELKEHNGQVTGVDWAP--DSNRIVTCGTDRNAYV 77 (371)
T ss_dssp SSCCCEEEECT------TSSEEEEECS-SSEEEEEEEETTEEEEEEEEECCSSCEEEEEEET--TTTEEEEEETTSCEEE
T ss_pred CCCEEEEEECC------CCCEEEEEEC-CCEEEEEECCCCCEEEEEEECCCCCCEEEEEECC--CCCEEEEEECCCEEEE
T ss_conf 98838999989------9999999948-8989999888997899999558899888999979--9999999979993999
Q ss_pred EEEEC
Q ss_conf 99936
Q 001655 556 FLLNM 560 (1036)
Q Consensus 556 y~~~~ 560 (1036)
+.+..
T Consensus 78 Wd~~~ 82 (371)
T d1k8kc_ 78 WTLKG 82 (371)
T ss_dssp EEEET
T ss_pred EEECC
T ss_conf 86203
|
| >d1l0qa2 b.69.2.3 (A:1-301) Surface layer protein {Archaeon Methanosarcina mazei [TaxId: 2209]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: YVTN repeat-like/Quinoprotein amine dehydrogenase family: YVTN repeat domain: Surface layer protein species: Archaeon Methanosarcina mazei [TaxId: 2209]
Probab=96.96 E-value=0.0073 Score=31.63 Aligned_cols=49 Identities=18% Similarity=0.254 Sum_probs=22.6
Q ss_pred EEEEEECCCCCCEEEEECCCCEEEEEEEECCCCCCCEEEEEEEEEECCCCEEEEEEEC
Q ss_conf 8899708985466897469989999983047888830357999996899539998977
Q 001655 653 HIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPL 710 (1036)
Q Consensus 653 ~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~i~lid~~t~~~i~~~~l 710 (1036)
..++++.+..|+.++++|+.+.+.|++... ..|+++|..+.+.+.+++.
T Consensus 236 ~~~~~~~~~~~~~va~spdg~~l~va~~~~---------~~i~v~D~~t~~~~~~~~v 284 (301)
T d1l0qa2 236 ITARIPVGPDPAGIAVTPDGKKVYVALSFC---------NTVSVIDTATNTITATMAV 284 (301)
T ss_dssp EEEEEECCSSEEEEEECTTSSEEEEEETTT---------TEEEEEETTTTEEEEEEEC
T ss_pred EEEEECCCCCEEEEEEECCCCEEEEEECCC---------CEEEEEECCCCEEEEEEEC
T ss_conf 999984899877999918989999998999---------9699999999959999968
|
| >d1yfqa_ b.69.4.2 (A:) Cell cycle arrest protein BUB3 {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: Cell cycle arrest protein BUB3 domain: Cell cycle arrest protein BUB3 species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=96.75 E-value=0.011 Score=30.60 Aligned_cols=18 Identities=17% Similarity=0.538 Sum_probs=7.7
Q ss_pred CEEEEEEECCCEEEEEECC
Q ss_conf 3999998557789999779
Q 001655 496 QIAAVGMWTDISVRIFSLP 514 (1036)
Q Consensus 496 ~~~~v~~w~~~~i~i~~l~ 514 (1036)
.+++.|.+ |++++||++.
T Consensus 24 ~~L~s~s~-Dg~v~iwd~~ 41 (342)
T d1yfqa_ 24 SLLLITSW-DGSLTVYKFD 41 (342)
T ss_dssp TEEEEEET-TSEEEEEEEE
T ss_pred CEEEEEEC-CCEEEEEECC
T ss_conf 99999979-9929999756
|
| >d1jmxb_ b.69.2.2 (B:) Quinohemoprotein amine dehydrogenase B chain {Pseudomonas putida [TaxId: 303]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: YVTN repeat-like/Quinoprotein amine dehydrogenase family: Quinohemoprotein amine dehydrogenase B chain domain: Quinohemoprotein amine dehydrogenase B chain species: Pseudomonas putida [TaxId: 303]
Probab=96.72 E-value=0.011 Score=30.42 Aligned_cols=59 Identities=5% Similarity=0.080 Sum_probs=23.0
Q ss_pred CCEEEEEEECCCEEEEEECCCCCEEEEECCCCCCCCEEEEEEEECCEEEEEE-EECCCCEEEE
Q ss_conf 6399999855778999977998706884289966721589456449079999-9589839999
Q 001655 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLC-ALGDGHLLNF 556 (1036)
Q Consensus 495 ~~~~~v~~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~v-gl~~G~l~~y 556 (1036)
.++++++.. ++.|.+|++...+++....++....|.++.+.+ +..++++ +..+|.+..|
T Consensus 8 ~~~l~~~~~-~~~v~v~D~~t~~~~~t~~~~~~~~p~~l~~sp--DG~~l~v~~~~~~~v~~~ 67 (346)
T d1jmxb_ 8 HEYMIVTNY-PNNLHVVDVASDTVYKSCVMPDKFGPGTAMMAP--DNRTAYVLNNHYGDIYGI 67 (346)
T ss_dssp CEEEEEEET-TTEEEEEETTTTEEEEEEECSSCCSSCEEEECT--TSSEEEEEETTTTEEEEE
T ss_pred CCEEEEECC-CCEEEEEECCCCCEEEEEECCCCCCCCEEEECC--CCCEEEEEECCCCCEEEE
T ss_conf 969999869-997999999999899999948999704599978--989999997899939999
|
| >d1tbga_ b.69.4.1 (A:) beta1-subunit of the signal-transducing G protein heterotrimer {Cow (Bos taurus) [TaxId: 9913]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: beta1-subunit of the signal-transducing G protein heterotrimer species: Cow (Bos taurus) [TaxId: 9913]
Probab=96.67 E-value=0.012 Score=30.20 Aligned_cols=15 Identities=7% Similarity=-0.042 Sum_probs=6.1
Q ss_pred EEEEEECCCCEEEEE
Q ss_conf 999996899539998
Q 001655 693 FVRLLDDQTFEFIST 707 (1036)
Q Consensus 693 ~i~lid~~t~~~i~~ 707 (1036)
.++++|..+++.+..
T Consensus 293 ~i~iwd~~~~~~~~~ 307 (340)
T d1tbga_ 293 NCNVWDALKADRAGV 307 (340)
T ss_dssp CEEEEETTTCCEEEE
T ss_pred EEEEEECCCCCEEEE
T ss_conf 899999999939899
|
| >d1yfqa_ b.69.4.2 (A:) Cell cycle arrest protein BUB3 {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: Cell cycle arrest protein BUB3 domain: Cell cycle arrest protein BUB3 species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=96.67 E-value=0.012 Score=30.20 Aligned_cols=113 Identities=12% Similarity=-0.030 Sum_probs=56.8
Q ss_pred CEEEEEEC--CCEEEEEECCCEEEEEEECCC--EEEEEEECCCCCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEEC
Q ss_conf 58999967--889999986986999998386--79999711388234799708989999866399999855778999977
Q 001655 438 SVNVATAN--ASQVLLATGGGHLVYLEIGDG--ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL 513 (1036)
Q Consensus 438 ~I~~as~~--~~~ivi~~~~~~iv~l~~~~~--~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l 513 (1036)
.|+..+.+ +..++.+..++.+...+++.+ .............|.|+++.+- ...+++.|.. ++.+.++.+
T Consensus 13 ~I~~l~fsp~~~~L~s~s~Dg~v~iwd~~~~~~~~~~~~~~~h~~~V~~v~f~~~-----~~~~l~sg~~-d~~v~~w~~ 86 (342)
T d1yfqa_ 13 YISDIKIIPSKSLLLITSWDGSLTVYKFDIQAKNVDLLQSLRYKHPLLCCNFIDN-----TDLQIYVGTV-QGEILKVDL 86 (342)
T ss_dssp CEEEEEEEGGGTEEEEEETTSEEEEEEEETTTTEEEEEEEEECSSCEEEEEEEES-----SSEEEEEEET-TSCEEEECS
T ss_pred CEEEEEEECCCCEEEEEECCCEEEEEECCCCCCCEEEEEECCCCCCEEEEEEECC-----CCCEEEECCC-CCCEEEEEC
T ss_conf 7888999589999999979992999975699863689885589998899999589-----9978998126-531145420
Q ss_pred CCCCEEEEECCCCCCCCEEEEEEEECCEEEEEEEECCCCEEEEEE
Q ss_conf 998706884289966721589456449079999958983999999
Q 001655 514 PDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLL 558 (1036)
Q Consensus 514 ~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~ 558 (1036)
............. ....... ....+...++.+..+|.+..+..
T Consensus 87 ~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~wd~ 129 (342)
T d1yfqa_ 87 IGSPSFQALTNNE-ANLGICR-ICKYGDDKLIAASWDGLIEVIDP 129 (342)
T ss_dssp SSSSSEEECBSCC-CCSCEEE-EEEETTTEEEEEETTSEEEEECH
T ss_pred CCCCCCCCCCCCC-CCCCCCC-CCCCCCCCCCCCCCCCCCCEEEC
T ss_conf 4432000001111-1111111-11111111111012221110202
|
| >d1pgua1 b.69.4.1 (A:2-326) Actin interacting protein 1 {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Actin interacting protein 1 species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=96.44 E-value=0.017 Score=29.29 Aligned_cols=103 Identities=6% Similarity=-0.010 Sum_probs=42.2
Q ss_pred EEEEECCCEEEEEEECCCEEEEEEE--CCCCCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCEEEEECCCC
Q ss_conf 9999869869999983867999971--13882347997089899998663999998557789999779987068842899
Q 001655 449 VLLATGGGHLVYLEIGDGILTEVKH--AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 526 (1036)
Q Consensus 449 ivi~~~~~~iv~l~~~~~~l~~~~~--~~l~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~~l~~v~~~~l~~ 526 (1036)
++.+..++.+.+++....+...... ......|.|+++.|- ...+++.+.. ++.+.+|++.+.+.+....-..
T Consensus 175 ~~~~~~d~~v~~~d~~~~~~~~~~~~~~~~~~~v~~v~~~pd-----~~~~l~s~~~-d~~i~iwd~~~~~~~~~l~~~~ 248 (325)
T d1pgua1 175 SMTVGDDGSVVFYQGPPFKFSASDRTHHKQGSFVRDVEFSPD-----SGEFVITVGS-DRKISCFDGKSGEFLKYIEDDQ 248 (325)
T ss_dssp EEEEETTTEEEEEETTTBEEEEEECSSSCTTCCEEEEEECST-----TCCEEEEEET-TCCEEEEETTTCCEEEECCBTT
T ss_pred EEEEECCCCCCCCCCCCCCCCEECCCCCCCCCCCEEEEECCC-----CCEECCCCCC-CCCEEEEEECCCCCCCCCCCCC
T ss_conf 888621112211112211000000001577775277630345-----3100001123-3210134300122211111111
Q ss_pred CCCCEEEEEE-EECCEEEEEEEECCCCEEEEEE
Q ss_conf 6672158945-6449079999958983999999
Q 001655 527 EIIPRSVLLC-AFEGISYLLCALGDGHLLNFLL 558 (1036)
Q Consensus 527 ~~~p~si~~~-~~~~~~~L~vgl~~G~l~~y~~ 558 (1036)
....++.+. ...+..+|+.+..||.+..|.+
T Consensus 249 -~~v~~~~~s~~~~dg~~l~s~s~D~~i~iwd~ 280 (325)
T d1pgua1 249 -EPVQGGIFALSWLDSQKFATVGADATIRVWDV 280 (325)
T ss_dssp -BCCCSCEEEEEESSSSEEEEEETTSEEEEEET
T ss_pred -CCCCCCEEEEECCCCCEEEEEECCCEEEEEEC
T ss_conf -11111000000368999999958993999999
|
| >d1pgua2 b.69.4.1 (A:327-613) Actin interacting protein 1 {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Actin interacting protein 1 species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=96.30 E-value=0.021 Score=28.79 Aligned_cols=34 Identities=18% Similarity=0.193 Sum_probs=15.6
Q ss_pred CCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECCCCC
Q ss_conf 82347997089899998663999998557789999779987
Q 001655 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLN 517 (1036)
Q Consensus 477 ~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~~l~ 517 (1036)
...++|+++++ ...++++|.. ++.+.+|++.+.+
T Consensus 162 ~~~v~~~~~s~------~~~~l~~g~~-dg~i~i~d~~~~~ 195 (287)
T d1pgua2 162 RAKPSYISISP------SETYIAAGDV-MGKILLYDLQSRE 195 (287)
T ss_dssp SSCEEEEEECT------TSSEEEEEET-TSCEEEEETTTTE
T ss_pred CCCEEEEEECC------CCCCCCCCCC-CCCCCCEEECCCC
T ss_conf 78536999516------7652110111-1110000002332
|
| >d1p22a2 b.69.4.1 (A:253-545) F-box/WD-repeat protein 1 (beta-TRCP1) {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: F-box/WD-repeat protein 1 (beta-TRCP1) species: Human (Homo sapiens) [TaxId: 9606]
Probab=96.25 E-value=0.022 Score=28.60 Aligned_cols=23 Identities=22% Similarity=0.325 Sum_probs=10.6
Q ss_pred CEEEEEEECCCEEEEEECCCCCEE
Q ss_conf 399999855778999977998706
Q 001655 496 QIAAVGMWTDISVRIFSLPDLNLI 519 (1036)
Q Consensus 496 ~~~~v~~w~~~~i~i~~l~~l~~v 519 (1036)
.+++.|.. |+.|.+|++.+.+.+
T Consensus 26 ~~l~sgs~-Dg~i~vWd~~~~~~~ 48 (293)
T d1p22a2 26 QKIVSGLR-DNTIKIWDKNTLECK 48 (293)
T ss_dssp SEEEEEES-SSCEEEEESSSCCEE
T ss_pred CEEEEEEC-CCEEEEEECCCCCEE
T ss_conf 99999928-993999999999199
|
| >d1jmxb_ b.69.2.2 (B:) Quinohemoprotein amine dehydrogenase B chain {Pseudomonas putida [TaxId: 303]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: YVTN repeat-like/Quinoprotein amine dehydrogenase family: Quinohemoprotein amine dehydrogenase B chain domain: Quinohemoprotein amine dehydrogenase B chain species: Pseudomonas putida [TaxId: 303]
Probab=96.04 E-value=0.028 Score=27.95 Aligned_cols=68 Identities=7% Similarity=-0.046 Sum_probs=25.2
Q ss_pred CEEEEEECCCEEEEEEECCCEEEEEEECCCCCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCEEE
Q ss_conf 89999986986999998386799997113882347997089899998663999998557789999779987068
Q 001655 447 SQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT 520 (1036)
Q Consensus 447 ~~ivi~~~~~~iv~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~~l~~v~ 520 (1036)
.+++++.+++.+..+++..+++...-+.+-...+.+++++| ...++.|+.-.++.+.+|++...+.+.
T Consensus 9 ~~l~~~~~~~~v~v~D~~t~~~~~t~~~~~~~~p~~l~~sp------DG~~l~v~~~~~~~v~~~d~~t~~~~~ 76 (346)
T d1jmxb_ 9 EYMIVTNYPNNLHVVDVASDTVYKSCVMPDKFGPGTAMMAP------DNRTAYVLNNHYGDIYGIDLDTCKNTF 76 (346)
T ss_dssp EEEEEEETTTEEEEEETTTTEEEEEEECSSCCSSCEEEECT------TSSEEEEEETTTTEEEEEETTTTEEEE
T ss_pred CEEEEECCCCEEEEEECCCCCEEEEEECCCCCCCCEEEECC------CCCEEEEEECCCCCEEEEECCCCEEEE
T ss_conf 69999869997999999999899999948999704599978------989999997899939999675671312
|
| >d1pbyb_ b.69.2.2 (B:) Quinohemoprotein amine dehydrogenase B chain {Paracoccus denitrificans [TaxId: 266]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: YVTN repeat-like/Quinoprotein amine dehydrogenase family: Quinohemoprotein amine dehydrogenase B chain domain: Quinohemoprotein amine dehydrogenase B chain species: Paracoccus denitrificans [TaxId: 266]
Probab=96.03 E-value=0.028 Score=27.92 Aligned_cols=172 Identities=9% Similarity=0.029 Sum_probs=79.2
Q ss_pred CEEEEEEECCCEEEEEECCCCCEEEEECCCCC-CCCEEEEEEEECCEEEE-EEEECCCCEEEEEEECCCCCCCCCEEEEC
Q ss_conf 39999985577899997799870688428996-67215894564490799-99958983999999368876433214312
Q 001655 496 QIAAVGMWTDISVRIFSLPDLNLITKEHLGGE-IIPRSVLLCAFEGISYL-LCALGDGHLLNFLLNMKTGELTDRKKVSL 573 (1036)
Q Consensus 496 ~~~~v~~w~~~~i~i~~l~~l~~v~~~~l~~~-~~p~si~~~~~~~~~~L-~vgl~~G~l~~y~~~~~~~~l~~~~~~~l 573 (1036)
++++.+.. ++.|.+|++.+.+.+....++.. ..|.++.+.+ +..++ +.+..+|.+..|.+. ++.+. +....
T Consensus 2 ~~~vt~~~-d~~v~v~D~~s~~~~~~i~~~~~~~~~~~i~~sp--Dg~~l~v~~~~~~~v~v~D~~--t~~~~--~~~~~ 74 (337)
T d1pbyb_ 2 DYILAPAR-PDKLVVIDTEKMAVDKVITIADAGPTPMVPMVAP--GGRIAYATVNKSESLVKIDLV--TGETL--GRIDL 74 (337)
T ss_dssp EEEEEEET-TTEEEEEETTTTEEEEEEECTTCTTCCCCEEECT--TSSEEEEEETTTTEEEEEETT--TCCEE--EEEEC
T ss_pred EEEEEECC-CCEEEEEECCCCEEEEEEECCCCCCCCCEEEECC--CCCEEEEEECCCCEEEEEECC--CCCEE--EEEEC
T ss_conf 69999767-9989999999994999998778899823799999--989999997899949999999--99298--88724
Q ss_pred CCCCEEEEEEEECCCEEEEEECCCCEEEEECCCCEEEEEECCCCCEEEECCCCCCCCCEEEEEECCEEEEEEECCCCCEE
Q ss_conf 85433799998399217999729865999629908982205345414525457999991999858918999985888758
Q 001655 574 GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653 (1036)
Q Consensus 574 G~~pv~l~~~~~~~~~~v~~~~~~p~~i~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~li~~~~~~l~i~~l~~~~~~~ 653 (1036)
....... . ...++ .++.+++..+..-..... ... ........+. -.+....-.
T Consensus 75 ~~~~~~~-~----~~~~v---------~~s~dg~~l~~~~~~~~~---~~~--------~~~~~~~~~~--~~d~~~~~~ 127 (337)
T d1pbyb_ 75 STPEERV-K----SLFGA---------ALSPDGKTLAIYESPVRL---ELT--------HFEVQPTRVA--LYDAETLSR 127 (337)
T ss_dssp CBTTEEE-E----CTTCE---------EECTTSSEEEEEEEEEEE---CSS--------CEEECCCEEE--EEETTTTEE
T ss_pred CCCCCCC-C----CEEEE---------EECCCCCEEEEEECCCCC---EEE--------ECCCCCCCEE--ECCCCCCEE
T ss_conf 7773125-4----02548---------986877579995047762---034--------2034555212--035667759
Q ss_pred EEEEECCCCCCEEEEECCCCEEEEEEEECCCCCCCEEEEEEEEEECCCCEEEEEEECCCC
Q ss_conf 899708985466897469989999983047888830357999996899539998977998
Q 001655 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTF 713 (1036)
Q Consensus 654 ~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~i~lid~~t~~~i~~~~l~~~ 713 (1036)
.+.++.+..|+.+++.++.+.++++.. ...++|..+.+..........
T Consensus 128 ~~~~~~~~~~~~~~~s~dg~~l~~~~~------------~~~~~d~~~~~~~~~~~~~~~ 175 (337)
T d1pbyb_ 128 RKAFEAPRQITMLAWARDGSKLYGLGR------------DLHVMDPEAGTLVEDKPIQSW 175 (337)
T ss_dssp EEEEECCSSCCCEEECTTSSCEEEESS------------SEEEEETTTTEEEEEECSTTT
T ss_pred EEECCCCCCCEEEEECCCCCEEEEECC------------CCCEEEEECCCEEEEEECCCC
T ss_conf 884145687218998688888999717------------750566303727888614775
|
| >d1erja_ b.69.4.1 (A:) Tup1, C-terminal domain {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Tup1, C-terminal domain species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=95.58 E-value=0.043 Score=26.78 Aligned_cols=28 Identities=18% Similarity=0.297 Sum_probs=11.4
Q ss_pred EEEEEEEEECCEEEEEEEEEECCCEEEEC
Q ss_conf 49999999699699999987359402744
Q 001655 749 GRILVFIVEDGKLQLIAEKETKGAVYSLN 777 (1036)
Q Consensus 749 Gri~v~~i~~~kl~~~~~~~~~g~v~~i~ 777 (1036)
|.|+++++..++..... ..+.++|++++
T Consensus 330 g~i~vwd~~~~~~~~~l-~~H~~~V~~~~ 357 (388)
T d1erja_ 330 RGVLFWDKKSGNPLLML-QGHRNSVISVA 357 (388)
T ss_dssp SEEEEEETTTCCEEEEE-ECCSSCEEEEE
T ss_pred CEEEEEECCCCCEEEEE-ECCCCCEEEEE
T ss_conf 97999999999699999-68899789999
|
| >d1vyhc1 b.69.4.1 (C:92-408) Platelet-activating factor acetylhydrolase IB subunit alpha {Mouse (Mus musculus) [TaxId: 10090]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Platelet-activating factor acetylhydrolase IB subunit alpha species: Mouse (Mus musculus) [TaxId: 10090]
Probab=95.08 E-value=0.062 Score=25.76 Aligned_cols=20 Identities=5% Similarity=0.169 Sum_probs=8.2
Q ss_pred CEEEEEEECCCEEEEEECCCC
Q ss_conf 399999855778999977998
Q 001655 496 QIAAVGMWTDISVRIFSLPDL 516 (1036)
Q Consensus 496 ~~~~v~~w~~~~i~i~~l~~l 516 (1036)
.+++.|.. |+.|.||++++.
T Consensus 30 ~~l~s~s~-Dg~i~iWd~~~~ 49 (317)
T d1vyhc1 30 SVMVSASE-DATIKVWDYETG 49 (317)
T ss_dssp SEEEEEES-SSCEEEEETTTC
T ss_pred CEEEEEEC-CCEEEEEECCCC
T ss_conf 99999938-992999989999
|
| >d1ri6a_ b.69.11.1 (A:) Putative isomerase YbhE {Escherichia coli [TaxId: 562]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: Putative isomerase YbhE family: Putative isomerase YbhE domain: Putative isomerase YbhE species: Escherichia coli [TaxId: 562]
Probab=94.79 E-value=0.074 Score=25.29 Aligned_cols=54 Identities=13% Similarity=0.258 Sum_probs=23.8
Q ss_pred CCEEEEEEECC-CEEEEEEECCCCCEEEEEECCCCCCCCCCCCEEEEEEECCCEEEEEECC
Q ss_conf 98699999838-6799997113882347997089899998663999998557789999779
Q 001655 455 GGHLVYLEIGD-GILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP 514 (1036)
Q Consensus 455 ~~~iv~l~~~~-~~l~~~~~~~l~~~is~l~~~~~~~~~~~~~~~~v~~w~~~~i~i~~l~ 514 (1036)
++.+..+.++. +.+..+...+.+.++.+++++| ...+++++...++.+.+|++.
T Consensus 13 ~~~I~v~~~~~~~~l~~~~~~~~~~~v~~la~sp------DG~~L~v~~~~d~~i~~~~i~ 67 (333)
T d1ri6a_ 13 SQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSP------DKRYLYVGVRPEFRVLAYRIA 67 (333)
T ss_dssp GTEEEEEEECTTSCEEEEEEEECSSCCCCEEECT------TSSEEEEEETTTTEEEEEEEC
T ss_pred CCCEEEEEECCCCCEEEEEEECCCCCEEEEEEEC------CCCEEEEEECCCCEEEEEEEE
T ss_conf 9938999983999769999975799886899958------979999997789969999996
|
| >d1pgua2 b.69.4.1 (A:327-613) Actin interacting protein 1 {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Actin interacting protein 1 species: Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]
Probab=93.93 E-value=0.11 Score=24.10 Aligned_cols=14 Identities=21% Similarity=0.387 Sum_probs=4.9
Q ss_pred EEEEEECCCCEEEE
Q ss_conf 99999589839999
Q 001655 543 YLLCALGDGHLLNF 556 (1036)
Q Consensus 543 ~L~vgl~~G~l~~y 556 (1036)
+|++|..||.+..|
T Consensus 176 ~l~~g~~dg~i~i~ 189 (287)
T d1pgua2 176 YIAAGDVMGKILLY 189 (287)
T ss_dssp EEEEEETTSCEEEE
T ss_pred CCCCCCCCCCCCCE
T ss_conf 21101111110000
|
| >d1qksa2 b.70.2.1 (A:136-567) C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase {Paracoccus denitrificans [TaxId: 266]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 8-bladed beta-propeller superfamily: C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase family: C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase domain: C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase species: Paracoccus denitrificans [TaxId: 266]
Probab=93.76 E-value=0.12 Score=23.90 Aligned_cols=63 Identities=11% Similarity=0.138 Sum_probs=37.4
Q ss_pred CEEEEEEECCCCCEEEEEEECCCCCCEEEEECCCCEEEEEEEECCCCCCCEEEEEEEEEECCCCEEEEEEEC
Q ss_conf 918999985888758899708985466897469989999983047888830357999996899539998977
Q 001655 639 GELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPL 710 (1036)
Q Consensus 639 ~~l~i~~l~~~~~~~~~~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~i~lid~~t~~~i~~~~l 710 (1036)
+.+.+..........+.+++.+..|+.++++|+.+.+++++... ..+.++|..+.+.+..+..
T Consensus 185 ~~i~~~d~~~~~~~~~~~i~~g~~~~~~~~spdg~~~~va~~~~---------~~v~v~d~~~~~~~~~~~~ 247 (432)
T d1qksa2 185 GKILLVDYTDLNNLKTTEISAERFLHDGGLDGSHRYFITAANAR---------NKLVVIDTKEGKLVAIEDT 247 (432)
T ss_dssp TEEEEEETTCSSEEEEEEEECCSSEEEEEECTTSCEEEEEEGGG---------TEEEEEETTTTEEEEEEEC
T ss_pred CEEEEEECCCCCCCEEEEECCCCCCCCCEECCCCCEEEEECCCC---------CEEEEEECCCCEEEEEECC
T ss_conf 82999984378752279983367542653889887999951666---------3677761445268887214
|
| >d1ri6a_ b.69.11.1 (A:) Putative isomerase YbhE {Escherichia coli [TaxId: 562]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: Putative isomerase YbhE family: Putative isomerase YbhE domain: Putative isomerase YbhE species: Escherichia coli [TaxId: 562]
Probab=93.75 E-value=0.12 Score=23.90 Aligned_cols=63 Identities=8% Similarity=0.123 Sum_probs=23.0
Q ss_pred EEEEEECCCEEEEEECCCCC-EEEEECCCCCCCCEEEEEEEECCEEEEEEEE-CCCCEEEEEEECCC
Q ss_conf 99998557789999779987-0688428996672158945644907999995-89839999993688
Q 001655 498 AAVGMWTDISVRIFSLPDLN-LITKEHLGGEIIPRSVLLCAFEGISYLLCAL-GDGHLLNFLLNMKT 562 (1036)
Q Consensus 498 ~~v~~w~~~~i~i~~l~~l~-~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl-~~G~l~~y~~~~~~ 562 (1036)
+.|+.-.++.|++|+++.-. +...+.+.....|+++.+.+ +..+|+++. .+|.+..|.++...
T Consensus 6 v~v~~~~~~~I~v~~~~~~~~l~~~~~~~~~~~v~~la~sp--DG~~L~v~~~~d~~i~~~~i~~~~ 70 (333)
T d1ri6a_ 6 VYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSP--DKRYLYVGVRPEFRVLAYRIAPDD 70 (333)
T ss_dssp EEEEEGGGTEEEEEEECTTSCEEEEEEEECSSCCCCEEECT--TSSEEEEEETTTTEEEEEEECTTT
T ss_pred EEEECCCCCCEEEEEECCCCCEEEEEEECCCCCEEEEEEEC--CCCEEEEEECCCCEEEEEEEECCC
T ss_conf 99987899938999983999769999975799886899958--979999997789969999996898
|
| >d1q7fa_ b.68.9.1 (A:) Brain tumor cg10719-pa {Fruit fly (Drosophila melanogaster) [TaxId: 7227]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 6-bladed beta-propeller superfamily: NHL repeat family: NHL repeat domain: Brain tumor cg10719-pa species: Fruit fly (Drosophila melanogaster) [TaxId: 7227]
Probab=91.95 E-value=0.22 Score=22.30 Aligned_cols=73 Identities=16% Similarity=0.292 Sum_probs=38.2
Q ss_pred EEEEEEECCEEEEEECCCCEEEEEEECCCCEEEEEECCCCCCEEEEEEEEECCEEEEECCCCCEEEEEECCCCC
Q ss_conf 99999829999998524409999996248919997124886106999996186199970688599995189999
Q 001655 818 ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA 891 (1036)
Q Consensus 818 ~~~l~~~~n~IivgD~~~Sv~~~~~~~~~~~l~~~a~D~~~~~v~~~~~l~~~~~l~~D~~gnl~~l~~~~~~~ 891 (1036)
.++++..|+ |+|+|....-.+..|+++..-+..+..+........+.+-.++.++++|.+..+.+|+|..-++
T Consensus 204 giavD~~G~-i~Vad~~~~~~v~~f~~~G~~~~~~~~~~~~~~p~~vav~~dG~l~V~~~n~~v~~fr~~~~~~ 276 (279)
T d1q7fa_ 204 GVGINSNGE-ILIADNHNNFNLTIFTQDGQLISALESKVKHAQCFDVALMDDGSVVLASKDYRLYIYRYVQLAP 276 (279)
T ss_dssp EEEECTTCC-EEEEECSSSCEEEEECTTSCEEEEEEESSCCSCEEEEEEETTTEEEEEETTTEEEEEECSCCCC
T ss_pred CCCCCCCCE-EEEEECCCCCEEEEECCCCCEEEEEECCCCCCCEEEEEEECCCCEEEEECCCEEEEEEEEEECC
T ss_conf 232314786-9999789980899999999999999688888988379990899199991899699987220357
|
| >d1vyhc1 b.69.4.1 (C:92-408) Platelet-activating factor acetylhydrolase IB subunit alpha {Mouse (Mus musculus) [TaxId: 10090]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: Platelet-activating factor acetylhydrolase IB subunit alpha species: Mouse (Mus musculus) [TaxId: 10090]
Probab=91.57 E-value=0.24 Score=22.05 Aligned_cols=11 Identities=9% Similarity=0.540 Sum_probs=4.7
Q ss_pred CCEEEEEECCC
Q ss_conf 77899997799
Q 001655 505 DISVRIFSLPD 515 (1036)
Q Consensus 505 ~~~i~i~~l~~ 515 (1036)
++.+.++++..
T Consensus 122 d~~~~~~~~~~ 132 (317)
T d1vyhc1 122 DKTIKMWEVQT 132 (317)
T ss_dssp TSEEEEEETTT
T ss_pred CCCEEEEECCC
T ss_conf 75235751144
|
| >d1hzua2 b.70.2.1 (A:118-543) C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase {Pseudomonas aeruginosa [TaxId: 287]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 8-bladed beta-propeller superfamily: C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase family: C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase domain: C-terminal (heme d1) domain of cytochrome cd1-nitrite reductase species: Pseudomonas aeruginosa [TaxId: 287]
Probab=82.24 E-value=0.85 Score=18.55 Aligned_cols=102 Identities=14% Similarity=0.212 Sum_probs=48.9
Q ss_pred CCEEEEECCCCCCEEEEECCCCCCEEEEEEC--CCEEEEEECCCEEEEEEECCCEEEEEEECCCCCEEEEEECCCCCCCC
Q ss_conf 9189993588520123418999858999967--88999998698699999838679999711388234799708989999
Q 001655 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATAN--ASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENP 492 (1036)
Q Consensus 415 ~~i~l~~~~~~~~~~~~~~~~~~~I~~as~~--~~~ivi~~~~~~iv~l~~~~~~l~~~~~~~l~~~is~l~~~~~~~~~ 492 (1036)
..|+++|..+++.+..+.. +..+...+.+ +..++++..++.+.++.++.++..............-+.+++.. .
T Consensus 42 g~v~vwD~~t~~~~~~l~~--g~~~~~vafSPDGk~l~~~~~d~~v~vwd~~t~~~~~~~~i~~~~~~~~~~~s~~~--s 117 (426)
T d1hzua2 42 GQIALVDGDSKKIVKVIDT--GYAVHISRMSASGRYLLVIGRDARIDMIDLWAKEPTKVAEIKIGIEARSVESSKFK--G 117 (426)
T ss_dssp TEEEEEETTTCSEEEEEEC--CSSEEEEEECTTSCEEEEEETTSEEEEEETTSSSCEEEEEEECCSEEEEEEECCST--T
T ss_pred CEEEEEECCCCCEEEEEEC--CCCEEEEEECCCCCEEEEEECCCCEEEEECCCCCEEEEEEEECCCCCCCEEEEEEE--C
T ss_conf 9799999999959999968--99803899989999999995899889997568860489998678887645885002--6
Q ss_pred CCCCEEEEEEECCCEEEEEECCCCCEEE
Q ss_conf 8663999998557789999779987068
Q 001655 493 SYSQIAAVGMWTDISVRIFSLPDLNLIT 520 (1036)
Q Consensus 493 ~~~~~~~v~~w~~~~i~i~~l~~l~~v~ 520 (1036)
..+.+++++.+.++.+.+++....+.+.
T Consensus 118 pDG~~l~v~~~~~~~v~i~d~~~~~~~~ 145 (426)
T d1hzua2 118 YEDRYTIAGAYWPPQFAIMDGETLEPKQ 145 (426)
T ss_dssp CTTTEEEEEEEESSEEEEEETTTCCEEE
T ss_pred CCCCEEEEEECCCCEEEEECCCCCCEEE
T ss_conf 8898799963589769998577641257
|
| >d1mdah_ b.69.2.1 (H:) Methylamine dehydrogenase, H-chain {Paracoccus denitrificans [TaxId: 266]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: YVTN repeat-like/Quinoprotein amine dehydrogenase family: Methylamine dehydrogenase, H-chain domain: Methylamine dehydrogenase, H-chain species: Paracoccus denitrificans [TaxId: 266]
Probab=81.22 E-value=0.92 Score=18.32 Aligned_cols=26 Identities=12% Similarity=-0.009 Sum_probs=12.2
Q ss_pred CCEEEEEEECCCEEEEEECCCCCEEE
Q ss_conf 63999998557789999779987068
Q 001655 495 SQIAAVGMWTDISVRIFSLPDLNLIT 520 (1036)
Q Consensus 495 ~~~~~v~~w~~~~i~i~~l~~l~~v~ 520 (1036)
+.+++++.+.++.+.++++.+.+.+.
T Consensus 135 Gk~l~va~~~~~~v~~~d~~~~~~~~ 160 (368)
T d1mdah_ 135 SACLLFFLFGSSAAAGLSVPGASDDQ 160 (368)
T ss_dssp SSCEEEEECSSSCEEEEEETTTEEEE
T ss_pred CCEEEEEECCCCEEEEEECCCCCEEE
T ss_conf 98999996899859999899893867
|
| >d2ovrb2 b.69.4.1 (B:2365-2706) F-box/WD repeat-containing protein 7, FBXW7 {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: 7-bladed beta-propeller superfamily: WD40 repeat-like family: WD40-repeat domain: F-box/WD repeat-containing protein 7, FBXW7 species: Human (Homo sapiens) [TaxId: 9606]
Probab=81.10 E-value=0.93 Score=18.29 Aligned_cols=58 Identities=10% Similarity=-0.070 Sum_probs=26.8
Q ss_pred CCEEEEEEECCCEEEEEECCCCCEEEEECCCCCCCCEEEEEEEECCEEEEEEEECCCCEEEEEE
Q ss_conf 6399999855778999977998706884289966721589456449079999958983999999
Q 001655 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLL 558 (1036)
Q Consensus 495 ~~~~~v~~w~~~~i~i~~l~~l~~v~~~~l~~~~~p~si~~~~~~~~~~L~vgl~~G~l~~y~~ 558 (1036)
+++++-|.. |+.|.||++.+.+.+....- ....+.++.+. ..+|+.|..||.+..+..
T Consensus 27 g~~l~sgs~-Dg~i~vWd~~~~~~~~~~~~-h~~~V~~v~~~----~~~l~s~s~D~~~~~~~~ 84 (342)
T d2ovrb2 27 GNRIVSGSD-DNTLKVWSAVTGKCLRTLVG-HTGGVWSSQMR----DNIIISGSTDRTLKVWNA 84 (342)
T ss_dssp TTEEEEEET-TSCEEEEETTTCCEEEECCC-CSSCEEEEEEE----TTEEEEEETTSCEEEEET
T ss_pred CCEEEEEEC-CCEEEEEECCCCCEEEEEEC-CCCCEEEEEEC----CCCCCCCEECCCCCCCCC
T ss_conf 999999918-99099998999979999948-89998999947----986321000001111111
|