BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 008269
         (571 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9SZD1|TCX5_ARATH Protein tesmin/TSO1-like CXC 5 OS=Arabidopsis thaliana GN=TCX5 PE=1
           SV=1
          Length = 603

 Score =  428 bits (1100), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 241/485 (49%), Positives = 300/485 (61%), Gaps = 51/485 (10%)

Query: 42  ESPRSR-PRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKD 100
           ESP++R PR N +G+DGTP+K+KQCNC++SRCLKL                         
Sbjct: 110 ESPKARGPRPNVEGRDGTPQKKKQCNCKHSRCLKL------------------------- 144

Query: 101 NLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQ 160
               YCECFA+G YCDGCNCVNC NNV++E AR+EAV ATLERNP AFRPKIASSPHG +
Sbjct: 145 ----YCECFASGTYCDGCNCVNCFNNVDNEPARREAVEATLERNPFAFRPKIASSPHGGR 200

Query: 161 DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRA 220
           D RED  E  L  KHNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSEER+A
Sbjct: 201 DKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEERQA 260

Query: 221 LFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRN-P 278
           LFHG+H N +A++Q+AANAAI GAVGSSG+       +RK +E+LF  A KD S + + P
Sbjct: 261 LFHGEHSNHMAYLQQAANAAITGAVGSSGFAPSPAPKRRKGQEILFNQAIKDSSRLSHFP 320

Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
           Q   G          +   PVS+    +    +K   RS LADI+QP D++ LCS+LV V
Sbjct: 321 QVNNGRT--GGPTSGTSPSPVSRAGGNASSVPSKFVYRSLLADIIQPHDVRALCSVLVTV 378

Query: 339 SSEATKTLTKKVGAMQTEREG-SRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVD 397
           + EA KT T K   ++   +  + +S+AS  Q   D  +G+     +       NQ D  
Sbjct: 379 AGEAAKTSTDKRNEIENRVDDQTETSLASSAQ---DQPQGNNNAADVEMVATDHNQADKS 435

Query: 398 GTSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRI 457
           G   S SDG D     PLSP T ALMCDE+D+ FM A P    +      + N+   + I
Sbjct: 436 GPEESNSDGVDASKVTPLSPATLALMCDEQDTIFMVAAPSPNGSVDPNGCRPNSQGQSEI 495

Query: 458 YAEQERLILTRFRDFLNRLITCGSIKETMCSPL------------VKSETRSQQKA-VED 504
           YAEQERL+LT+FRD LNRLI+   IKE+ C  L            VK+E   QQ+  + +
Sbjct: 496 YAEQERLVLTKFRDCLNRLISYAEIKESKCLSLARMHIQPPAIATVKTENGIQQQVPIVN 555

Query: 505 GTSRS 509
           G SR+
Sbjct: 556 GASRT 560


>sp|Q9SL70|TCX6_ARATH Protein tesmin/TSO1-like CXC 6 OS=Arabidopsis thaliana GN=TCX6 PE=1
           SV=1
          Length = 571

 Score =  394 bits (1011), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 217/450 (48%), Positives = 282/450 (62%), Gaps = 66/450 (14%)

Query: 40  KQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           + ESP S PR   + +DGTP+K+KQCNC++SRCLKL                        
Sbjct: 96  RPESPNSMPRPAGETRDGTPQKKKQCNCKHSRCLKL------------------------ 131

Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
                YCECFA+G YCDGCNCVNC NNVE+E AR++AV +TLERNPNAFRPKIA+SPHG 
Sbjct: 132 -----YCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNAFRPKIAASPHGG 186

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           +D RE+  +  + A+HNKGCHCKKSGCLKKYCECFQANILCSENC+CLDCKNFEGSE R+
Sbjct: 187 RDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCLDCKNFEGSEVRQ 246

Query: 220 ALFHGDH-NGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNP 278
           +LFHG+H + +A++Q  ANAAI GA+GSSG+ +     +RK +E+ F    KD S  R  
Sbjct: 247 SLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG 305

Query: 279 QSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMV 338
           Q+  G          + S   S+    + LG +K   +S LA+I++P D+K LCS+LV V
Sbjct: 306 QANNGRT--------TSSQTGSRAGGNASLGPSKVVYKSLLANIIKPMDVKALCSVLVAV 357

Query: 339 SSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDG 398
           + EA KTLT+K  A Q E     +SVAS +Q+     +GH             N+ +  G
Sbjct: 358 AGEAAKTLTEKRLANQKE-----TSVASSVQD-----QGH-----------VNNKAEKSG 396

Query: 399 TSGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIY 458
              S +DG      R LSP T ALMCDE D+  M A      + + T Q  N     ++Y
Sbjct: 397 LEDSNADGS---KGRSLSPETLALMCDERDTMLMVAA-SPNCSVEPTSQLPN--GQDQVY 450

Query: 459 AEQERLILTRFRDFLNRLITCGSIKETMCS 488
           AEQE+++LT+FRD LNR+I+CG +KE+ CS
Sbjct: 451 AEQEKVVLTKFRDCLNRIISCGEVKESNCS 480


>sp|F4JY84|TCX7_ARATH Protein tesmin/TSO1-like CXC 7 OS=Arabidopsis thaliana GN=TCX7 PE=1
           SV=1
          Length = 459

 Score =  314 bits (804), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 184/437 (42%), Positives = 245/437 (56%), Gaps = 68/437 (15%)

Query: 41  QESPRSRPRANTDGKDGTP-KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFK 99
           Q+   SR     + K+ TP K+QK CNC+NS+CLKL                        
Sbjct: 72  QDPSTSRRHNEVESKENTPNKQQKHCNCKNSKCLKL------------------------ 107

Query: 100 DNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGA 159
                YCECFA+G YC+GCNCVNCHN +E+E +RQ A+   LERNP+AF+PKIA SPHG 
Sbjct: 108 -----YCECFASGSYCNGCNCVNCHNKLENESSRQVAISGILERNPDAFKPKIAGSPHGM 162

Query: 160 QDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERR 219
           +D +E+ ++  L  KH+KGCHC+KSGCLKKYCEC+QANILCSENCRC DCKNFEGSEER+
Sbjct: 163 KDLQENVQQVLLIGKHSKGCHCRKSGCLKKYCECYQANILCSENCRCQDCKNFEGSEERK 222

Query: 220 ALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLFGVAAKDQSVIRNPQ 279
           AL HG      ++Q+  NAA+  A+  S Y     S KRKS+++   V +         Q
Sbjct: 223 ALLHGSQVSDTYIQQMTNAAVNRAIDMSAYLYPPESRKRKSKDISDSVVSS-----YGVQ 277

Query: 280 SQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQPQDMKELCSLLVMVS 339
            Q+ N+V+ +      S+P     N +V GST SA RS  ++  QP  ++ELCSLLV  S
Sbjct: 278 YQRANHVRRNGENSLFSLP----NNKAVSGSTTSAYRSSWSNTFQPHHVRELCSLLVSNS 333

Query: 340 SEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIPDDRQSGNQMDVDGT 399
            +    L+ K           R +   P       R  +E+ N  PD     N+MD    
Sbjct: 334 VDVANKLSDK----------GRKNDKGPSSLDGAQRVANEI-NDSPDCVLDANRMD---- 378

Query: 400 SGSGSDGGDMLNERPLSPGTRALMCDEEDSAFMQAGPQTGLASKSTVQKSNAYASTRIYA 459
                       E+P+SP TRALMCDEE    + +  +T    K++ +K +   S+ IY 
Sbjct: 379 ------------EKPISPATRALMCDEEHE--IGSEKETSARVKTSQEKEDTDTSSGIYL 424

Query: 460 EQERLILTRFRDFLNRL 476
           EQER IL+ FRD+L +L
Sbjct: 425 EQERQILSIFRDYLIQL 441


>sp|Q700D0|TCX8_ARATH Protein tesmin/TSO1-like CXC 8 OS=Arabidopsis thaliana GN=TCX8 PE=1
           SV=1
          Length = 368

 Score =  140 bits (352), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 137/457 (29%), Positives = 190/457 (41%), Gaps = 134/457 (29%)

Query: 27  PVHKLPLPTLPPGKQESPRSRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYS 86
           PV +LPL   PP  +E   S  +  TD +  T +K K C C+ S+CLKL           
Sbjct: 34  PVKELPL--FPPISRE--HSEAKDKTDEEGITSRKHKGCRCKQSKCLKL----------- 78

Query: 87  NTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPN 146
                             YC+CFA+G+ C  C+CV+CHNN E   AR+ A+   L RNPN
Sbjct: 79  ------------------YCDCFASGVVCTDCDCVDCHNNSEKCDAREAAMVNVLGRNPN 120

Query: 147 AFRPKIASSPHGAQ-DAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCR 205
           AF  K   S    Q  A  D +   L+    +GC CK++ CLKKYCECFQAN+LCS+NC+
Sbjct: 121 AFSEKALGSLTDNQCKAAPDTKPGLLS----RGCKCKRTRCLKKYCECFQANLLCSDNCK 176

Query: 206 CLDCKNFEGSEERRALFHGDHNGIAFMQRAANAAIIGAVGSSGYGTLMTSNKRKSEELLF 265
           C++CKN   SE  +      HN     +R  +  +                       LF
Sbjct: 177 CINCKNV--SEAFQPPAFSAHNSPQVYRRRRDRELTEWNSCPAP--------------LF 220

Query: 266 GVAAKDQSVIRNPQSQQGNNVQNSAVPCSQSVPVSQTANASVLGSTKSALRSPLADILQP 325
            +   D S+          N   S + CS  +P  +    S++G T + L          
Sbjct: 221 SIP--DNSI---------QNALGSPMSCSPKLPYRK--KRSLMGYTSTLL---------- 257

Query: 326 QDMKELCSLLVMVSSEATKTLTKKVGAMQTEREGSRSSVASPIQEGEDSRKGHEVHNGIP 385
            D+ +LCSLLV  S  AT T                          ED  +        P
Sbjct: 258 PDLGDLCSLLVAASESATTT-------------------------AEDQNRIF----TKP 288

Query: 386 DDRQSGNQMDVDGTSGSGSDGGDMLNERPLSPGTRALMCD-----EEDSAFMQAGPQTGL 440
           DD+++     ++ +S S S     + E   S G    + D     EEDS           
Sbjct: 289 DDKEA-----IELSSESESRN---VEEEIQSRGRLIELIDVQYNGEEDSQ---------C 331

Query: 441 ASKSTVQKSNAYASTRIYAEQERLILTRFRDFLNRLI 477
            +K++V +      T IY EQER +L  FRD L + I
Sbjct: 332 KTKTSVNE------TDIYMEQERAVLETFRDCLQKFI 362


>sp|A1Z9E2|LIN54_DROME Protein lin-54 homolog OS=Drosophila melanogaster GN=mip120 PE=1
           SV=1
          Length = 950

 Score =  138 bits (347), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 99/194 (51%), Gaps = 54/194 (27%)

Query: 33  LPTLPPGKQESPRSRP-------------RANTDGKDGTPKKQKQCNCRNSRCLKLDFVE 79
           LPTLPPG + +  ++P                 D   G   ++K CNC  S+CLKL    
Sbjct: 696 LPTLPPGVKANVPAKPLFEVLKPPATAAAAGAVDPLGGMTSRRKHCNCSKSQCLKL---- 751

Query: 80  IWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGA 139
                                    YC+CFA G +C  C C +C NN+++EV R+ A+ +
Sbjct: 752 -------------------------YCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRS 786

Query: 140 TLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANIL 199
            L+RNP+AF+PKI +   G  D R           HNKGC+CK+SGCLK YCEC++A I 
Sbjct: 787 CLDRNPSAFKPKITAPNSG--DMR----------LHNKGCNCKRSGCLKNYCECYEAKIP 834

Query: 200 CSENCRCLDCKNFE 213
           CS  C+C+ C+N E
Sbjct: 835 CSSICKCVGCRNME 848


>sp|Q95QD7|LIN54_CAEEL Protein lin-54 OS=Caenorhabditis elegans GN=lin-54 PE=1 SV=1
          Length = 435

 Score =  135 bits (339), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 77/213 (36%), Positives = 107/213 (50%), Gaps = 46/213 (21%)

Query: 46  SRPRANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKY 105
           S  R  T  K   P ++K CNC  S+CLKL                             Y
Sbjct: 158 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKL-----------------------------Y 188

Query: 106 CECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
           C+CFA G +C  CNC +CHNN+E++  R +A+  +LERNPNAF+PKI  +  G  D    
Sbjct: 189 CDCFANGEFCRDCNCKDCHNNIEYDSQRSKAIRQSLERNPNAFKPKIGIARGGITD---- 244

Query: 166 AREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHG- 224
                +   H KGCHCKKSGCLK YCEC++A + C++ C+C  C+N E     R    G 
Sbjct: 245 -----IERLHQKGCHCKKSGCLKNYCECYEAKVPCTDRCKCKGCQNTETYRMTRYKNSGG 299

Query: 225 ---DHNGIAFMQRAANAAIIGAVGSSGYGTLMT 254
              + N +  +  A++     A   SG G+++T
Sbjct: 300 AVSNTNALMSLTNASST----ATPDSGPGSVVT 328


>sp|Q571G4|LIN54_MOUSE Protein lin-54 homolog OS=Mus musculus GN=Lin54 PE=2 SV=2
          Length = 749

 Score =  132 bits (333), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 77/182 (42%), Positives = 100/182 (54%), Gaps = 41/182 (22%)

Query: 56  DGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC 115
           + T + +K CNC  S CLKL                             YC+CFA G +C
Sbjct: 516 ESTSRPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFC 546

Query: 116 DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKH 175
           + CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H
Sbjct: 547 NNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------H 595

Query: 176 NKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQR 234
           +KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+
Sbjct: 596 SKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQ 655

Query: 235 AA 236
            A
Sbjct: 656 TA 657


>sp|Q6MZP7|LIN54_HUMAN Protein lin-54 homolog OS=Homo sapiens GN=LIN54 PE=1 SV=3
          Length = 749

 Score =  132 bits (331), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657



 Score = 36.2 bits (82), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 13/23 (56%), Positives = 17/23 (73%)

Query: 172 AAKHNKGCHCKKSGCLKKYCECF 194
           A++  K C+C KS CLK YC+CF
Sbjct: 518 ASRPRKPCNCTKSLCLKLYCDCF 540


>sp|Q0IHV2|LIN54_XENTR Protein lin-54 homolog OS=Xenopus tropicalis GN=lin54 PE=2 SV=1
          Length = 741

 Score =  131 bits (330), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/182 (41%), Positives = 101/182 (55%), Gaps = 43/182 (23%)

Query: 45  RSRPRANTDG---KDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDN 101
           +++PR   +G    +   + +K CNC  S CLKL                          
Sbjct: 494 QNQPRGPLNGIISSESASRPRKPCNCTKSLCLKL-------------------------- 527

Query: 102 LVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQD 161
              YC+CFA G +C+ CNC NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D
Sbjct: 528 ---YCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESD 584

Query: 162 AREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
            R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L
Sbjct: 585 RR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTL 633

Query: 222 FH 223
            H
Sbjct: 634 MH 635


>sp|Q641Z1|LIN54_RAT Protein lin-54 homolog OS=Rattus norvegicus GN=Lin54 PE=2 SV=2
          Length = 749

 Score =  131 bits (330), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 520 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 550

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 551 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 599

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 600 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657



 Score = 35.8 bits (81), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 13/23 (56%), Positives = 17/23 (73%)

Query: 172 AAKHNKGCHCKKSGCLKKYCECF 194
           A++  K C+C KS CLK YC+CF
Sbjct: 518 ASRPRKPCNCTKSLCLKLYCDCF 540


>sp|Q5RBN8|LIN54_PONAB Protein lin-54 homolog OS=Pongo abelii GN=LIN54 PE=2 SV=1
          Length = 528

 Score =  129 bits (323), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S CLKL                             YC+CFA G +C+ CN
Sbjct: 299 RPRKPCNCTKSLCLKL-----------------------------YCDCFANGEFCNNCN 329

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           C NC+NN+EHE  RQ+A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 330 CTNCYNNLEHENERQKAIKACLDRNPEAFKPKIGKGKEGESDRR-----------HSKGC 378

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CK+SGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 379 NCKRSGCLKNYCECYEAKIMCSSICKCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436



 Score = 35.0 bits (79), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 13/23 (56%), Positives = 17/23 (73%)

Query: 172 AAKHNKGCHCKKSGCLKKYCECF 194
           A++  K C+C KS CLK YC+CF
Sbjct: 297 ASRPRKPCNCTKSLCLKLYCDCF 319


>sp|Q08CM4|LIN54_DANRE Protein lin-54 homolog OS=Danio rerio GN=lin54 PE=1 SV=1
          Length = 771

 Score =  127 bits (319), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 76/178 (42%), Positives = 98/178 (55%), Gaps = 41/178 (23%)

Query: 60  KKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYCDGCN 119
           + +K CNC  S+CLKL                             YC+CFA G +C+ CN
Sbjct: 543 RPRKPCNCTRSQCLKL-----------------------------YCDCFANGEFCNNCN 573

Query: 120 CVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGC 179
           CVNC NN++HE  R +A+ A L+RNP AF+PKI     G  D R           H+KGC
Sbjct: 574 CVNCFNNLDHESERLKAIKACLDRNPVAFKPKIGKGKEGESDRR-----------HSKGC 622

Query: 180 HCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFH-GDHNGIAFMQRAA 236
           +CKKSGCLK YCEC++A I+CS  C+C+ CKNFE S ER+ L H  D   +   Q+ A
Sbjct: 623 NCKKSGCLKNYCECYEAKIMCSSICKCMGCKNFEESPERKTLMHLADAAEVRVQQQTA 680


>sp|Q9WTJ6|MTL5_MOUSE Tesmin OS=Mus musculus GN=Mtl5 PE=2 SV=2
          Length = 475

 Score =  125 bits (315), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 83/136 (61%), Gaps = 15/136 (11%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G  L    K  L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 264 GPALQGPPKITLSGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 319

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           K+     GA   R           H+KGC+CK+SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 320 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACK 368

Query: 211 NFEGSEERRALFHGDH 226
           N+E S ER+ L    H
Sbjct: 369 NYEESPERKMLMSTPH 384


>sp|Q5XHX9|MTL5_RAT Tesmin OS=Rattus norvegicus GN=Mtl5 PE=2 SV=2
          Length = 475

 Score =  124 bits (311), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 82/136 (60%), Gaps = 15/136 (11%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G  L    K  L  YC+CF++G +C+ C+C    NN+ HE+ R +A+ A L+RNP AF+P
Sbjct: 264 GPALQGPPKIALAGYCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQP 319

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           K+     GA   R           H+KGC+CK+SGCLK YCEC++A I CS  C+C+ CK
Sbjct: 320 KMGKGRLGAAKLR-----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACK 368

Query: 211 NFEGSEERRALFHGDH 226
           N+E S ER+ L    H
Sbjct: 369 NYEESPERKMLMSTPH 384


>sp|F4JIF5|TCX2_ARATH Protein tesmin/TSO1-like CXC 2 OS=Arabidopsis thaliana GN=TCX2 PE=1
           SV=1
          Length = 674

 Score =  123 bits (309), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 70/184 (38%), Positives = 93/184 (50%), Gaps = 35/184 (19%)

Query: 38  PGKQESPRSRP---RANTDGKDGTPKKQKQCNCRNSRCLKLDFVEIWINIYSNTSPGAHL 94
           P  QE   S P   R   D  +G  +  K+CNC+ S+CLKL                   
Sbjct: 349 PALQELNLSSPKKKRVKLDSGEG--ESCKRCNCKKSKCLKL------------------- 387

Query: 95  NTQFKDNLVKYCECFAAGIYC-DGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIA 153
                     YCECFAAG+YC + C+C++C N   HE            RNP AF PK+ 
Sbjct: 388 ----------YCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIESRNPLAFAPKVI 437

Query: 154 SSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFE 213
            +    Q+  +DA +   +A+H +GC+CKKS CLKKYCEC+Q  + CS NCRC  CKN  
Sbjct: 438 RNSDSVQETGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNAF 497

Query: 214 GSEE 217
           G ++
Sbjct: 498 GRKD 501


>sp|Q9LUI3|TSO1_ARATH CRC domain-containing protein TSO1 OS=Arabidopsis thaliana GN=TSO1
           PE=1 SV=1
          Length = 695

 Score =  122 bits (306), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 65/164 (39%), Positives = 88/164 (53%), Gaps = 38/164 (23%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG+YC + C+C+
Sbjct: 401 KRCNCKKSKCLKL-----------------------------YCECFAAGVYCIEPCSCI 431

Query: 122 NCHNNVEHEVARQEAVGATLE----RNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNK 177
           +C N   HE    E V AT +    RNP AF PK+  +     +A +DA +   +A+H +
Sbjct: 432 DCFNKPIHE----ETVLATRKQIESRNPLAFAPKVIRNADSIMEASDDASKTPASARHKR 487

Query: 178 GCHCKKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRAL 221
           GC+CKKS C+KKYCEC+Q  + CS NCRC  C N  G ++   L
Sbjct: 488 GCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLL 531


>sp|Q8L548|TCX3_ARATH Protein tesmin/TSO1-like CXC 3 OS=Arabidopsis thaliana GN=TCX3 PE=1
           SV=1
          Length = 609

 Score =  121 bits (304), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 65/164 (39%), Positives = 85/164 (51%), Gaps = 31/164 (18%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+CNC+ S+CLKL                             YCECFAAG YC + C+C+
Sbjct: 328 KRCNCKKSKCLKL-----------------------------YCECFAAGFYCIEPCSCI 358

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC N   H+            RNP AF PK+  +     +  EDA +   +A+H +GC+C
Sbjct: 359 NCFNKPIHKDVVLATRKQIESRNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNC 418

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEGSEERRALFHGD 225
           KKS CLKKYCEC+Q  + CS NCRC  CKN  G ++  +LF  D
Sbjct: 419 KKSNCLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQD 461



 Score = 54.3 bits (129), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 8/130 (6%)

Query: 89  SPGAHLNT---QFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQ----EAVGATL 141
           S G HLNT     KD  V     F+  I     + +    + +H++ R+    +  G  +
Sbjct: 234 SIGLHLNTIAMSSKDKNVANEYSFSGNIKVGVQSSLTPVLHSQHDIVRENESGKDSGQII 293

Query: 142 ERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCS 201
           E  P +    +  +P   +  R  + ++       K C+CKKS CLK YCECF A   C 
Sbjct: 294 EVVPKSLA-SVDLTPISPKKKRRKSEQSGEGDSSCKRCNCKKSKCLKLYCECFAAGFYCI 352

Query: 202 ENCRCLDCKN 211
           E C C++C N
Sbjct: 353 EPCSCINCFN 362


>sp|Q84JZ8|TCX4_ARATH Protein tesmin/TSO1-like CXC 4 OS=Arabidopsis thaliana GN=TCX4 PE=1
           SV=1
          Length = 639

 Score =  113 bits (282), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 59/153 (38%), Positives = 79/153 (51%), Gaps = 30/153 (19%)

Query: 63  KQCNCRNSRCLKLDFVEIWINIYSNTSPGAHLNTQFKDNLVKYCECFAAGIYC-DGCNCV 121
           K+C CR S+CLKL                             YCECF+AG++C + C+C 
Sbjct: 453 KRCKCRKSQCLKL-----------------------------YCECFSAGLFCGEPCSCQ 483

Query: 122 NCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDAREDAREAQLAAKHNKGCHC 181
           NC N   HE    ++      RNP AF PK+ S+     D   +  +   +A+H +GC+C
Sbjct: 484 NCFNKPIHEDLVMKSREVIKARNPLAFAPKVVSTSDTVIDLWVENSKTPASARHKRGCNC 543

Query: 182 KKSGCLKKYCECFQANILCSENCRCLDCKNFEG 214
           +KSGC KKYCECF   + CS NCRC+ CKN  G
Sbjct: 544 RKSGCSKKYCECFMMGVGCSSNCRCMGCKNTFG 576



 Score = 33.1 bits (74), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 13/27 (48%), Positives = 15/27 (55%), Gaps = 1/27 (3%)

Query: 104 KYCECFAAGIYCDG-CNCVNCHNNVEH 129
           KYCECF  G+ C   C C+ C N   H
Sbjct: 551 KYCECFMMGVGCSSNCRCMGCKNTFGH 577


>sp|Q9Y4I5|MTL5_HUMAN Tesmin OS=Homo sapiens GN=MTL5 PE=1 SV=2
          Length = 508

 Score =  108 bits (270), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 61/132 (46%), Positives = 84/132 (63%), Gaps = 11/132 (8%)

Query: 91  GAHLNTQFKDNLVKYCECFAAGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRP 150
           G+ L    K  L  YC+CFA+G +C+ CNC NC NN+ H++ R +A+ A L RNP AF+P
Sbjct: 293 GSTLPGPPKITLAGYCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQP 352

Query: 151 KIASSPHGAQDAREDAREAQLAAKHNKGCHCKKSGCLKKYCECFQANILCSENCRCLDCK 210
           KI     G            +  +HNKGC+C++SGCLK YCEC++A I+CS  C+C+ CK
Sbjct: 353 KIGKGQLG-----------NVKPQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCK 401

Query: 211 NFEGSEERRALF 222
           N+E S ER+ L 
Sbjct: 402 NYEESPERKTLM 413


>sp|P93831|CLF_ARATH Histone-lysine N-methyltransferase CLF OS=Arabidopsis thaliana
           GN=CLF PE=1 SV=2
          Length = 902

 Score = 36.2 bits (82), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 5/91 (5%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCS----ENCRCLDCKNFEGSEERRALFHGDHNGIAFM 232
           +GCHC KS C  + C CF A+  C      NC  +      G   +R   +   N +  +
Sbjct: 689 RGCHCAKSQCRSRQCPCFAADRECDPDVCRNCWVIGGDGSLGVPSQRGDNYECRN-MKLL 747

Query: 233 QRAANAAIIGAVGSSGYGTLMTSNKRKSEEL 263
            +     ++G    SG+G  + ++  K E L
Sbjct: 748 LKQQQRVLLGISDVSGWGAFLKNSVSKHEYL 778


>sp|Q9ZSM8|EZA1_ARATH Histone-lysine N-methyltransferase EZA1 OS=Arabidopsis thaliana
           GN=EZA1 PE=2 SV=1
          Length = 856

 Score = 36.2 bits (82), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 6/91 (6%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSEN-CR--CLDCKNFE-GSEERRALFHGDHNGIAFM 232
           +GCHC KS C  + C CF A   C  + CR   + C +   G   RR    G    +  +
Sbjct: 645 RGCHCAKSQCRSRQCPCFAAGRECDPDVCRNCWVSCGDGSLGEAPRRG--EGQCGNMRLL 702

Query: 233 QRAANAAIIGAVGSSGYGTLMTSNKRKSEEL 263
            R     ++G    +G+G  + ++  K+E L
Sbjct: 703 LRQQQRILLGKSDVAGWGAFLKNSVSKNEYL 733


>sp|Q8S4P6|EZ1_MAIZE Histone-lysine N-methyltransferase EZ1 OS=Zea mays GN=EZ1 PE=2 SV=1
          Length = 931

 Score = 32.7 bits (73), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 24/92 (26%), Positives = 40/92 (43%), Gaps = 7/92 (7%)

Query: 177 KGCHCKKSGCLKKYCECFQANILCSENCRCLDC-----KNFEGSEERRALFHGDHNGIAF 231
           +GCHC KS C  + C CF A+  C  +  C +C         G   +R   +   N +  
Sbjct: 715 RGCHCAKSQCRSRQCPCFAADRECDPDV-CRNCWVGCGDGTLGVPNQRGDNYECRN-MKL 772

Query: 232 MQRAANAAIIGAVGSSGYGTLMTSNKRKSEEL 263
           + +     ++G    SG+G  + ++  K E L
Sbjct: 773 LLKQQQRVLLGRSDVSGWGAFLKNSVSKHEYL 804


>sp|A1DP40|ATG2_NEOFI Autophagy-related protein 2 OS=Neosartorya fischeri (strain ATCC
           1020 / DSM 3700 / FGSC A1164 / NRRL 181) GN=atg2 PE=3
           SV=1
          Length = 2137

 Score = 32.7 bits (73), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 26/55 (47%), Gaps = 4/55 (7%)

Query: 111 AGIYCDGCNCVNCHNNVEHEVARQEAVGATLERNPNAFRPKIASSPHGAQDARED 165
           A IY  G  C     NV  ++  +E+ GA  + NPN+ RP  A    G  D+  D
Sbjct: 82  ADIYSSGIICEASGINVHLQLPSEESFGAAKDENPNSRRPSQA----GTHDSSSD 132


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.314    0.128    0.377 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 208,111,309
Number of Sequences: 539616
Number of extensions: 8787794
Number of successful extensions: 18896
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 26
Number of HSP's successfully gapped in prelim test: 33
Number of HSP's that attempted gapping in prelim test: 18712
Number of HSP's gapped (non-prelim): 142
length of query: 571
length of database: 191,569,459
effective HSP length: 123
effective length of query: 448
effective length of database: 125,196,691
effective search space: 56088117568
effective search space used: 56088117568
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 64 (29.3 bits)