BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 008269
(571 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9SZD1|TCX5_ARATH Protein tesmin/TSO1-like CXC 5 OS=Arabidopsis thaliana GN=TCX5 PE=1
SV=1
Length = 603
Score = 428 bits (1100), Expect = e-119, Method: Compositional matrix adjust.
Identities = 241/485 (49%), Positives = 300/485 (61%), Gaps = 51/485 (10%)
Query: 42 ESPRSR-PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
ESP++R PR N +G+DGTP+K+KQCNC++SRCLKL
Sbjct: 110 ESPKARGPRPNVEGRDGTPQKKKQCNCKHSRCLKL------------------------- 144
Query: 101 NLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
YCECFA+G YCDGCNCVNC NNV++E AR+EAV ATLERNP AFRPKIASSPHG +
Sbjct: 145 ----YCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRPKIASSPHGGR 200
Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
D RED E L KHNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSEER+A
Sbjct: 201 DKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEERQA 260
Query: 221 LFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN-P 278
LFHG+H N +A++Q+AANAAI GAVGSSG+ +RK +E+LF A KD S + + P
Sbjct: 261 LFHGEHSNHMAYLQQAANAAITGAVGSSGFAPSPAPKRRKGQEILFNQAIKDSSRLSHFP 320
Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
Q G + PVS+ + +K RS LADI+QP D++ LCS+LV V
Sbjct: 321 QVNNGRT--GGPTSGTSPSPVSRAGGNASSVPSKFVYRSLLADIIQPHDVRALCSVLVTV 378
Query: 339 SSEATKTLTKKVGAMQTEREG-SRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVD 397
+ EA KT T K ++ + + +S+AS Q D +G+ + NQ D
Sbjct: 379 AGEAAKTSTDKRNEIENRVDDQTETSLASSAQ---DQPQGNNNAADVEMVATDHNQADKS 435
Query: 398 GTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRI 457
G S SDG D PLSP T ALMCDE+D+ FM A P + + N+ + I
Sbjct: 436 GPEESNSDGVDASKVTPLSPATLALMCDEQDTIFMVAAPSPNGSVDPNGCRPNSQGQSEI 495
Query: 458 YAEQERLILTRFRDFLNRLITCGSIKETMCSPL------------VKSETRSQQKA-VED 504
YAEQERL+LT+FRD LNRLI+ IKE+ C L VK+E QQ+ + +
Sbjct: 496 YAEQERLVLTKFRDCLNRLISYAEIKESKCLSLARMHIQPPAIATVKTENGIQQQVPIVN 555
Query: 505 GTSRS 509
G SR+
Sbjct: 556 GASRT 560
>sp|Q9SL70|TCX6_ARATH Protein tesmin/TSO1-like CXC 6 OS=Arabidopsis thaliana GN=TCX6 PE=1
SV=1
Length = 571
Score = 394 bits (1011), Expect = e-108, Method: Compositional matrix adjust.
Identities = 217/450 (48%), Positives = 282/450 (62%), Gaps = 66/450 (14%)
Query: 40 KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
+ ESP S PR + +DGTP+K+KQCNC++SRCLKL
Sbjct: 96 RPESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKL------------------------ 131
Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFA+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG
Sbjct: 132 -----YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGG 186
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+D RE+ + + A+HNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSE R+
Sbjct: 187 RDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEVRQ 246
Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
+LFHG+H + +A++Q ANAAI GA+GSSG+ + +RK +E+ F KD S R
Sbjct: 247 SLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG 305
Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
Q+ G + S S+ + LG +K +S LA+I++P D+K LCS+LV V
Sbjct: 306 QANNGRT--------TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAV 357
Query: 339 SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG 398
+ EA KTLT+K A Q E +SVAS +Q+ +GH N+ + G
Sbjct: 358 AGEAAKTLTEKRLANQKE-----TSVASSVQD-----QGH-----------VNNKAEKSG 396
Query: 399 TSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIY 458
S +DG R LSP T ALMCDE D+ M A + + T Q N ++Y
Sbjct: 397 LEDSNADGS---KGRSLSPETLALMCDERDTMLMVAA-SPNCSVEPTSQLPN--GQDQVY 450
Query: 459 AEQERLILTRFRDFLNRLITCGSIKETMCS 488
AEQE+++LT+FRD LNR+I+CG +KE+ CS
Sbjct: 451 AEQEKVVLTKFRDCLNRIISCGEVKESNCS 480
>sp|F4JY84|TCX7_ARATH Protein tesmin/TSO1-like CXC 7 OS=Arabidopsis thaliana GN=TCX7 PE=1
SV=1
Length = 459
Score = 314 bits (804), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 184/437 (42%), Positives = 245/437 (56%), Gaps = 68/437 (15%)
Query: 41 QESPRSRPRANTDGKDGTP-KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
Q+ SR + K+ TP K+QK CNC+NS+CLKL
Sbjct: 72 QDPSTSRRHNEVESKENTPNKQQKHCNCKNSKCLKL------------------------ 107
Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
YCECFA+G YC+GCNCVNCHN +E+E +RQ A+ LERNP+AF+PKIA SPHG
Sbjct: 108 -----YCECFASGSYCNGCNCVNCHNKLENESSRQVAISGILERNPDAFKPKIAGSPHGM 162
Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
+D +E+ ++ L KH+KGCHC+KSGCLKKYCEC+QANILCSENCRC DCKNFEGSEER+
Sbjct: 163 KDLQENVQQVLLIGKHSKGCHCRKSGCLKKYCECYQANILCSENCRCQDCKNFEGSEERK 222
Query: 220 ALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQ 279
AL HG ++Q+ NAA+ A+ S Y S KRKS+++ V + Q
Sbjct: 223 ALLHGSQVSDTYIQQMTNAAVNRAIDMSAYLYPPESRKRKSKDISDSVVSS-----YGVQ 277
Query: 280 SQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVS 339
Q+ N+V+ + S+P N +V GST SA RS ++ QP ++ELCSLLV S
Sbjct: 278 YQRANHVRRNGENSLFSLP----NNKAVSGSTTSAYRSSWSNTFQPHHVRELCSLLVSNS 333
Query: 340 SEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGT 399
+ L+ K R + P R +E+ N PD N+MD
Sbjct: 334 VDVANKLSDK----------GRKNDKGPSSLDGAQRVANEI-NDSPDCVLDANRMD---- 378
Query: 400 SGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYA 459
E+P+SP TRALMCDEE + + +T K++ +K + S+ IY
Sbjct: 379 ------------EKPISPATRALMCDEEHE--IGSEKETSARVKTSQEKEDTDTSSGIYL 424
Query: 460 EQERLILTRFRDFLNRL 476
EQER IL+ FRD+L +L
Sbjct: 425 EQERQILSIFRDYLIQL 441
>sp|Q700D0|TCX8_ARATH Protein tesmin/TSO1-like CXC 8 OS=Arabidopsis thaliana GN=TCX8 PE=1
SV=1
Length = 368
Score = 140 bits (352), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 137/457 (29%), Positives = 190/457 (41%), Gaps = 134/457 (29%)
Query: 27 PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYS 86
PV +LPL PP +E S + TD + T +K K C C+ S+CLKL
Sbjct: 34 PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKL----------- 78
Query: 87 NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
YC+CFA+G+ C C+CV+CHNN E AR+ A+ L RNPN
Sbjct: 79 ------------------YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPN 120
Query: 147 AFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
AF K S Q A D + L+ +GC CK++ CLKKYCECFQAN+LCS+NC+
Sbjct: 121 AFSEKALGSLTDNQCKAAPDTKPGLLS----RGCKCKRTRCLKKYCECFQANLLCSDNCK 176
Query: 206 CLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
C++CKN SE + HN +R + + LF
Sbjct: 177 CINCKNV--SEAFQPPAFSAHNSPQVYRRRRDRELTEWNSCPAP--------------LF 220
Query: 266 GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQP 325
+ D S+ N S + CS +P + S++G T + L
Sbjct: 221 SIP--DNSI---------QNALGSPMSCSPKLPYRK--KRSLMGYTSTLL---------- 257
Query: 326 QDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIP 385
D+ +LCSLLV S AT T ED + P
Sbjct: 258 PDLGDLCSLLVAASESATTT-------------------------AEDQNRIF----TKP 288
Query: 386 DDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCD-----EEDSAFMQAGPQTGL 440
DD+++ ++ +S S S + E S G + D EEDS
Sbjct: 289 DDKEA-----IELSSESESRN---VEEEIQSRGRLIELIDVQYNGEEDSQ---------C 331
Query: 441 ASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLI 477
+K++V + T IY EQER +L FRD L + I
Sbjct: 332 KTKTSVNE------TDIYMEQERAVLETFRDCLQKFI 362
>sp|A1Z9E2|LIN54_DROME Protein lin-54 homolog OS=Drosophila melanogaster GN=mip120 PE=1
SV=1
Length = 950
Score = 138 bits (347), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 99/194 (51%), Gaps = 54/194 (27%)
Query: 33 LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
LPTLPPG + + ++P D G ++K CNC S+CLKL
Sbjct: 696 LPTLPPGVKANVPAKPLFEVLKPPATAAAAGAVDPLGGMTSRRKHCNCSKSQCLKL---- 751
Query: 80 IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
YC+CFA G +C C C +C NN+++EV R+ A+ +
Sbjct: 752 -------------------------YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRS 786
Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
L+RNP+AF+PKI + G D R HNKGC+CK+SGCLK YCEC++A I
Sbjct: 787 CLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCECYEAKIP 834
Query: 200 CSENCRCLDCKNFE 213
CS C+C+ C+N E
Sbjct: 835 CSSICKCVGCRNME 848
>sp|Q95QD7|LIN54_CAEEL Protein lin-54 OS=Caenorhabditis elegans GN=lin-54 PE=1 SV=1
Length = 435
Score = 135 bits (339), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 77/213 (36%), Positives = 107/213 (50%), Gaps = 46/213 (21%)
Query: 46 SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKY 105
S R T K P ++K CNC S+CLKL Y
Sbjct: 158 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKL-----------------------------Y 188
Query: 106 CECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
C+CFA G +C CNC +CHNN+E++ R +A+ +LERNPNAF+PKI + G D
Sbjct: 189 CDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKPKIGIARGGITD---- 244
Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG- 224
+ H KGCHCKKSGCLK YCEC++A + C++ C+C C+N E R G
Sbjct: 245 -----IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKGCQNTETYRMTRYKNSGG 299
Query: 225 ---DHNGIAFMQRAANAAIIGAVGSSGYGTLMT 254
+ N + + A++ A SG G+++T
Sbjct: 300 AVSNTNALMSLTNASST----ATPDSGPGSVVT 328
>sp|Q571G4|LIN54_MOUSE Protein lin-54 homolog OS=Mus musculus GN=Lin54 PE=2 SV=2
Length = 749
Score = 132 bits (333), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 77/182 (42%), Positives = 100/182 (54%), Gaps = 41/182 (22%)
Query: 56 DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
+ T + +K CNC S CLKL YC+CFA G +C
Sbjct: 516 ESTSRPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFC 546
Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H
Sbjct: 547 NNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------H 595
Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQR 234
+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+
Sbjct: 596 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQ 655
Query: 235 AA 236
A
Sbjct: 656 TA 657
>sp|Q6MZP7|LIN54_HUMAN Protein lin-54 homolog OS=Homo sapiens GN=LIN54 PE=1 SV=3
Length = 749
Score = 132 bits (331), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
Score = 36.2 bits (82), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 172 AAKHNKGCHCKKSGCLKKYCECF 194
A++ K C+C KS CLK YC+CF
Sbjct: 518 ASRPRKPCNCTKSLCLKLYCDCF 540
>sp|Q0IHV2|LIN54_XENTR Protein lin-54 homolog OS=Xenopus tropicalis GN=lin54 PE=2 SV=1
Length = 741
Score = 131 bits (330), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/182 (41%), Positives = 101/182 (55%), Gaps = 43/182 (23%)
Query: 45 RSRPRANTDG---KDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
+++PR +G + + +K CNC S CLKL
Sbjct: 494 QNQPRGPLNGIISSESASRPRKPCNCTKSLCLKL-------------------------- 527
Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
YC+CFA G +C+ CNC NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D
Sbjct: 528 ---YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESD 584
Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L
Sbjct: 585 RR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTL 633
Query: 222 FH 223
H
Sbjct: 634 MH 635
>sp|Q641Z1|LIN54_RAT Protein lin-54 homolog OS=Rattus norvegicus GN=Lin54 PE=2 SV=2
Length = 749
Score = 131 bits (330), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
Score = 35.8 bits (81), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 172 AAKHNKGCHCKKSGCLKKYCECF 194
A++ K C+C KS CLK YC+CF
Sbjct: 518 ASRPRKPCNCTKSLCLKLYCDCF 540
>sp|Q5RBN8|LIN54_PONAB Protein lin-54 homolog OS=Pongo abelii GN=LIN54 PE=2 SV=1
Length = 528
Score = 129 bits (323), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S CLKL YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
C NC+NN+EHE RQ+A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CK+SGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
Score = 35.0 bits (79), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 172 AAKHNKGCHCKKSGCLKKYCECF 194
A++ K C+C KS CLK YC+CF
Sbjct: 297 ASRPRKPCNCTKSLCLKLYCDCF 319
>sp|Q08CM4|LIN54_DANRE Protein lin-54 homolog OS=Danio rerio GN=lin54 PE=1 SV=1
Length = 771
Score = 127 bits (319), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)
Query: 60 KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
+ +K CNC S+CLKL YC+CFA G +C+ CN
Sbjct: 543 RPRKPCNCTRSQCLKL-----------------------------YCDCFANGEFCNNCN 573
Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
CVNC NN++HE R +A+ A L+RNP AF+PKI G D R H+KGC
Sbjct: 574 CVNCFNNLDHESERLKAIKACLDRNPVAFKPKIGKGKEGESDRR-----------HSKGC 622
Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
+CKKSGCLK YCEC++A I+CS C+C+ CKNFE S ER+ L H D + Q+ A
Sbjct: 623 NCKKSGCLKNYCECYEAKIMCSSICKCMGCKNFEESPERKTLMHLADAAEVRVQQQTA 680
>sp|Q9WTJ6|MTL5_MOUSE Tesmin OS=Mus musculus GN=Mtl5 PE=2 SV=2
Length = 475
Score = 125 bits (315), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 83/136 (61%), Gaps = 15/136 (11%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G L K L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 264 GPALQGPPKITLSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 319
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
K+ GA R H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 320 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACK 368
Query: 211 NFEGSEERRALFHGDH 226
N+E S ER+ L H
Sbjct: 369 NYEESPERKMLMSTPH 384
>sp|Q5XHX9|MTL5_RAT Tesmin OS=Rattus norvegicus GN=Mtl5 PE=2 SV=2
Length = 475
Score = 124 bits (311), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 60/136 (44%), Positives = 82/136 (60%), Gaps = 15/136 (11%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G L K L YC+CF++G +C+ C+C NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 264 GPALQGPPKIALAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 319
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
K+ GA R H+KGC+CK+SGCLK YCEC++A I CS C+C+ CK
Sbjct: 320 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACK 368
Query: 211 NFEGSEERRALFHGDH 226
N+E S ER+ L H
Sbjct: 369 NYEESPERKMLMSTPH 384
>sp|F4JIF5|TCX2_ARATH Protein tesmin/TSO1-like CXC 2 OS=Arabidopsis thaliana GN=TCX2 PE=1
SV=1
Length = 674
Score = 123 bits (309), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 70/184 (38%), Positives = 93/184 (50%), Gaps = 35/184 (19%)
Query: 38 PGKQESPRSRP---RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
P QE S P R D +G + K+CNC+ S+CLKL
Sbjct: 349 PALQELNLSSPKKKRVKLDSGEG--ESCKRCNCKKSKCLKL------------------- 387
Query: 95 NTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
YCECFAAG+YC + C+C++C N HE RNP AF PK+
Sbjct: 388 ----------YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVI 437
Query: 154 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
+ Q+ +DA + +A+H +GC+CKKS CLKKYCEC+Q + CS NCRC CKN
Sbjct: 438 RNSDSVQETGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAF 497
Query: 214 GSEE 217
G ++
Sbjct: 498 GRKD 501
>sp|Q9LUI3|TSO1_ARATH CRC domain-containing protein TSO1 OS=Arabidopsis thaliana GN=TSO1
PE=1 SV=1
Length = 695
Score = 122 bits (306), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 88/164 (53%), Gaps = 38/164 (23%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG+YC + C+C+
Sbjct: 401 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCI 431
Query: 122 NCHNNVEHEVARQEAVGATLE----RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
+C N HE E V AT + RNP AF PK+ + +A +DA + +A+H +
Sbjct: 432 DCFNKPIHE----ETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPASARHKR 487
Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
GC+CKKS C+KKYCEC+Q + CS NCRC C N G ++ L
Sbjct: 488 GCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLL 531
>sp|Q8L548|TCX3_ARATH Protein tesmin/TSO1-like CXC 3 OS=Arabidopsis thaliana GN=TCX3 PE=1
SV=1
Length = 609
Score = 121 bits (304), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 85/164 (51%), Gaps = 31/164 (18%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+CNC+ S+CLKL YCECFAAG YC + C+C+
Sbjct: 328 KRCNCKKSKCLKL-----------------------------YCECFAAGFYCIEPCSCI 358
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC N H+ RNP AF PK+ + + EDA + +A+H +GC+C
Sbjct: 359 NCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNC 418
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD 225
KKS CLKKYCEC+Q + CS NCRC CKN G ++ +LF D
Sbjct: 419 KKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQD 461
Score = 54.3 bits (129), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 8/130 (6%)
Query: 89 SPGAHLNT---QFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQ----EAVGATL 141
S G HLNT KD V F+ I + + + +H++ R+ + G +
Sbjct: 234 SIGLHLNTIAMSSKDKNVANEYSFSGNIKVGVQSSLTPVLHSQHDIVRENESGKDSGQII 293
Query: 142 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 201
E P + + +P + R + ++ K C+CKKS CLK YCECF A C
Sbjct: 294 EVVPKSLA-SVDLTPISPKKKRRKSEQSGEGDSSCKRCNCKKSKCLKLYCECFAAGFYCI 352
Query: 202 ENCRCLDCKN 211
E C C++C N
Sbjct: 353 EPCSCINCFN 362
>sp|Q84JZ8|TCX4_ARATH Protein tesmin/TSO1-like CXC 4 OS=Arabidopsis thaliana GN=TCX4 PE=1
SV=1
Length = 639
Score = 113 bits (282), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 59/153 (38%), Positives = 79/153 (51%), Gaps = 30/153 (19%)
Query: 63 KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
K+C CR S+CLKL YCECF+AG++C + C+C
Sbjct: 453 KRCKCRKSQCLKL-----------------------------YCECFSAGLFCGEPCSCQ 483
Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
NC N HE ++ RNP AF PK+ S+ D + + +A+H +GC+C
Sbjct: 484 NCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLWVENSKTPASARHKRGCNC 543
Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
+KSGC KKYCECF + CS NCRC+ CKN G
Sbjct: 544 RKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFG 576
Score = 33.1 bits (74), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 13/27 (48%), Positives = 15/27 (55%), Gaps = 1/27 (3%)
Query: 104 KYCECFAAGIYCDG-CNCVNCHNNVEH 129
KYCECF G+ C C C+ C N H
Sbjct: 551 KYCECFMMGVGCSSNCRCMGCKNTFGH 577
>sp|Q9Y4I5|MTL5_HUMAN Tesmin OS=Homo sapiens GN=MTL5 PE=1 SV=2
Length = 508
Score = 108 bits (270), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)
Query: 91 GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
G+ L K L YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 293 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 352
Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
KI G + +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CK
Sbjct: 353 KIGKGQLG-----------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCK 401
Query: 211 NFEGSEERRALF 222
N+E S ER+ L
Sbjct: 402 NYEESPERKTLM 413
>sp|P93831|CLF_ARATH Histone-lysine N-methyltransferase CLF OS=Arabidopsis thaliana
GN=CLF PE=1 SV=2
Length = 902
Score = 36.2 bits (82), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 5/91 (5%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCS----ENCRCLDCKNFEGSEERRALFHGDHNGIAFM 232
+GCHC KS C + C CF A+ C NC + G +R + N + +
Sbjct: 689 RGCHCAKSQCRSRQCPCFAADRECDPDVCRNCWVIGGDGSLGVPSQRGDNYECRN-MKLL 747
Query: 233 QRAANAAIIGAVGSSGYGTLMTSNKRKSEEL 263
+ ++G SG+G + ++ K E L
Sbjct: 748 LKQQQRVLLGISDVSGWGAFLKNSVSKHEYL 778
>sp|Q9ZSM8|EZA1_ARATH Histone-lysine N-methyltransferase EZA1 OS=Arabidopsis thaliana
GN=EZA1 PE=2 SV=1
Length = 856
Score = 36.2 bits (82), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 6/91 (6%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSEN-CR--CLDCKNFE-GSEERRALFHGDHNGIAFM 232
+GCHC KS C + C CF A C + CR + C + G RR G + +
Sbjct: 645 RGCHCAKSQCRSRQCPCFAAGRECDPDVCRNCWVSCGDGSLGEAPRRG--EGQCGNMRLL 702
Query: 233 QRAANAAIIGAVGSSGYGTLMTSNKRKSEEL 263
R ++G +G+G + ++ K+E L
Sbjct: 703 LRQQQRILLGKSDVAGWGAFLKNSVSKNEYL 733
>sp|Q8S4P6|EZ1_MAIZE Histone-lysine N-methyltransferase EZ1 OS=Zea mays GN=EZ1 PE=2 SV=1
Length = 931
Score = 32.7 bits (73), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 40/92 (43%), Gaps = 7/92 (7%)
Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDC-----KNFEGSEERRALFHGDHNGIAF 231
+GCHC KS C + C CF A+ C + C +C G +R + N +
Sbjct: 715 RGCHCAKSQCRSRQCPCFAADRECDPDV-CRNCWVGCGDGTLGVPNQRGDNYECRN-MKL 772
Query: 232 MQRAANAAIIGAVGSSGYGTLMTSNKRKSEEL 263
+ + ++G SG+G + ++ K E L
Sbjct: 773 LLKQQQRVLLGRSDVSGWGAFLKNSVSKHEYL 804
>sp|A1DP40|ATG2_NEOFI Autophagy-related protein 2 OS=Neosartorya fischeri (strain ATCC
1020 / DSM 3700 / FGSC A1164 / NRRL 181) GN=atg2 PE=3
SV=1
Length = 2137
Score = 32.7 bits (73), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 26/55 (47%), Gaps = 4/55 (7%)
Query: 111 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
A IY G C NV ++ +E+ GA + NPN+ RP A G D+ D
Sbjct: 82 ADIYSSGIICEASGINVHLQLPSEESFGAAKDENPNSRRPSQA----GTHDSSSD 132
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.314 0.128 0.377
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 208,111,309
Number of Sequences: 539616
Number of extensions: 8787794
Number of successful extensions: 18896
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 26
Number of HSP's successfully gapped in prelim test: 33
Number of HSP's that attempted gapping in prelim test: 18712
Number of HSP's gapped (non-prelim): 142
length of query: 571
length of database: 191,569,459
effective HSP length: 123
effective length of query: 448
effective length of database: 125,196,691
effective search space: 56088117568
effective search space used: 56088117568
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 64 (29.3 bits)