BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 007469
(602 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9SZD1|TCX5_ARATH Protein tesmin/TSO1-like CXC 5 OS=Arabidopsis thaliana GN=TCX5 PE=1
SV=1
Length = 603
Score = 608 bits (1569), Expect = e-173, Method: Compositional matrix adjust.
Identities = 339/579 (58%), Positives = 397/579 (68%), Gaps = 46/579 (7%)
Query: 1 MGE-GEGGEFPPKKSQSESGGDIPAKKLARQLDFTGGFVGSVILPEHPQSQAVTANSHPP 59
MGE G GGEFPPKK E G P KK ARQLDFTGG + P + V A S P
Sbjct: 1 MGEDGGGGEFPPKKDGVEEG--FPTKKPARQLDFTGGSDEHSL--SKPAAPTVVATSVKP 56
Query: 60 PQPQPKP----------------VVPPVPPPP----PPLPGQATQPPVRVPKPESPKSRS 99
P V P +P PP Q P+R P PESPK+R
Sbjct: 57 IISSSVPSTIRPGMTIAIGQVTQVRPTLPMATTMSNPPSQSQIVNAPIRHPIPESPKARG 116
Query: 100 -RSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREA 158
R NVE ++ TP+K+KQCNCKHSRCLKLYCECFASG YCDGCNCVNC NNV+NE ARREA
Sbjct: 117 PRPNVEGRDGTPQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVDNEPARREA 176
Query: 159 VEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQA 218
VEATLERNP AFRPKIASSPHG RD +E+ GEV++LGKHNKGCHCKKSGCLKKYCECFQA
Sbjct: 177 VEATLERNPFAFRPKIASSPHGGRDKREDIGEVVLLGKHNKGCHCKKSGCLKKYCECFQA 236
Query: 219 NILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVS 278
NILCSENC+C+DCKNFEGSEERQALFHG+H+N+MAY+QQAANAAITGA+GSSG+A P
Sbjct: 237 NILCSENCKCLDCKNFEGSEERQALFHGEHSNHMAYLQQAANAAITGAVGSSGFAPSPAP 296
Query: 279 KKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRT-SPSSTLSSIPVARASSTAAVGSSKFT 337
K+RKGQE+ F KD S RL HF Q N+ RT P+S S PV+RA A+ SKF
Sbjct: 297 KRRKGQEILFNQAIKDSS--RLSHFPQVNNGRTGGPTSGTSPSPVSRAGGNASSVPSKFV 354
Query: 338 YRSLLADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQ 397
YRSLLADIIQP D++ LCSVLV ++ EAAK+ +RN E + +D+ TS+ASSAQ + Q
Sbjct: 355 YRSLLADIIQPHDVRALCSVLVTVAGEAAKTSTDKRNEIENRVDDQTETSLASSAQDQPQ 414
Query: 398 -SHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFM 456
++ DV+ TD NQADK GP++S SDG D K P+SP TLALMCDEQDTIFM
Sbjct: 415 GNNNAADVEMVATDH----NQADKSGPEESNSDGVDASKVTPLSPATLALMCDEQDTIFM 470
Query: 457 -AASSPN-RLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKY 514
AA SPN + +GC P QG +E+YAEQER+VLTKFRDCLNRLI++ EIKE K
Sbjct: 471 VAAPSPNGSVDPNGCR-----PNSQGQSEIYAEQERLVLTKFRDCLNRLISYAEIKESKC 525
Query: 515 SSLARTEVGSQKDALSNGTASSRTGTGNQPATVSNGVSK 553
SLAR + Q A++ T + G Q + NG S+
Sbjct: 526 LSLARMHI--QPPAIA--TVKTENGIQQQ-VPIVNGASR 559
>sp|Q9SL70|TCX6_ARATH Protein tesmin/TSO1-like CXC 6 OS=Arabidopsis thaliana GN=TCX6 PE=1
SV=1
Length = 571
Score = 553 bits (1425), Expect = e-156, Method: Compositional matrix adjust.
Identities = 316/574 (55%), Positives = 383/574 (66%), Gaps = 69/574 (12%)
Query: 1 MGEGE-GGEFPPKKSQSESGGDIPAKKLARQLDFTGGFVGSVILPEHP-QSQAVTANSHP 58
MGEGE G +FPPK + + K ARQLDFTGG S + H Q+ ++ A S P
Sbjct: 1 MGEGEEGDKFPPKTDEVTQ----ESMKSARQLDFTGG--SSDVEHSHSNQASSMAAASIP 54
Query: 59 PP---------QPQPKPVVPPVPPPPPPLPGQATQPPVRVPKPESPKSRSRSNVEVKENT 109
P Q P V PPPP G P+R P+PESP S R E ++ T
Sbjct: 55 SPIVTVTRPIITSQAPPTVATPIPPPPQSQGIILHVPIRHPRPESPNSMPRPAGETRDGT 114
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNA 169
P+K+KQCNCKHSRCLKLYCECFASG YCDGCNCVNC NNVENE ARR+AVE+TLERNPNA
Sbjct: 115 PQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNVENEPARRQAVESTLERNPNA 174
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
FRPKIA+SPHG RDN+EE G+V+ML +HNKGCHCKKSGCLKKYCECFQANILCSENC+C+
Sbjct: 175 FRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCLKKYCECFQANILCSENCKCL 234
Query: 230 DCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFG 289
DCKNFEGSE RQ+LFHG+H++N+AY+Q ANAAITGAIGSSG+AS P K+RKGQE+FF
Sbjct: 235 DCKNFEGSEVRQSLFHGEHSHNLAYLQH-ANAAITGAIGSSGFASAPPPKRRKGQEIFFN 293
Query: 290 STAKDPSIHRLGHFQQGNHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQ 349
KD S HRLG Q N+ RT+ S T S RA A++G SK Y+SLLA+II+P
Sbjct: 294 QGTKDSSTHRLG---QANNGRTTSSQTGS-----RAGGNASLGPSKVVYKSLLANIIKPM 345
Query: 350 DLKELCSVLVVLSSEAAKSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVT 409
D+K LCSVLV ++ EAAK++ TEK+ ++ TSVASS Q
Sbjct: 346 DVKALCSVLVAVAGEAAKTL------TEKRLANQKETSVASSVQ---------------- 383
Query: 410 DDCSSANQADKVGPDDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGC 469
D N+A+K G +DS +DGS G R +SP TLALMCDE+DT+ M A+SPN C
Sbjct: 384 DQGHVNNKAEKSGLEDSNADGSKG---RSLSPETLALMCDERDTMLMVAASPN------C 434
Query: 470 NT--SSQLPYGQGMTEVYAEQERIVLTKFRDCLNRLITFGEIKEMKYSSLARTEVGSQKD 527
+ +SQLP GQ +VYAEQE++VLTKFRDCLNR+I+ GE+KE S S+ D
Sbjct: 435 SVEPTSQLPNGQD--QVYAEQEKVVLTKFRDCLNRIISCGEVKESNCSM-------SRMD 485
Query: 528 ALSNGTASSRTGTGNQPATVSNGVSK-AYGPSAL 560
+ + R Q A V+NGVS+ A PS L
Sbjct: 486 LDTPVQTTVRIDPVVQQAPVANGVSQTAKQPSQL 519
>sp|F4JY84|TCX7_ARATH Protein tesmin/TSO1-like CXC 7 OS=Arabidopsis thaliana GN=TCX7 PE=1
SV=1
Length = 459
Score = 349 bits (896), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 200/446 (44%), Positives = 263/446 (58%), Gaps = 54/446 (12%)
Query: 71 VPPPPPPLP--GQATQPPVRVPKPESPKSRSRSNVEVKENTP-KKQKQCNCKHSRCLKLY 127
PP PPL G RV + SR + VE KENTP K+QK CNCK+S+CLKLY
Sbjct: 49 TPPQKPPLHQFGHNQVQKNRVSNQDPSTSRRHNEVESKENTPNKQQKHCNCKNSKCLKLY 108
Query: 128 CECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEE 187
CECFASG YC+GCNCVNCHN +ENE++R+ A+ LERNP+AF+PKIA SPHG +D +E
Sbjct: 109 CECFASGSYCNGCNCVNCHNKLENESSRQVAISGILERNPDAFKPKIAGSPHGMKDLQEN 168
Query: 188 TGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGD 247
+VL++GKH+KGCHC+KSGCLKKYCEC+QANILCSENCRC DCKNFEGSEER+AL HG
Sbjct: 169 VQQVLLIGKHSKGCHCRKSGCLKKYCECYQANILCSENCRCQDCKNFEGSEERKALLHGS 228
Query: 248 HANNMAYIQQAANAAITGAIGSSGYASPPVSKKRKGQELFFGSTAKDPSIHRLG-HFQQG 306
++ YIQQ NAA+ AI S Y PP S+KRK +++ D + G +Q+
Sbjct: 229 QVSD-TYIQQMTNAAVNRAIDMSAYLYPPESRKRKSKDI------SDSVVSSYGVQYQRA 281
Query: 307 NHIRTSPSSTLSSIPVARASSTAAVGSSKFTYRSLLADIIQPQDLKELCSVLVVLSSEAA 366
NH+R + ++L S+P +A S GS+ YRS ++ QP ++ELCS+LV +
Sbjct: 282 NHVRRNGENSLFSLPNNKAVS----GSTTSAYRSSWSNTFQPHHVRELCSLLV------S 331
Query: 367 KSVAVQRNTTEKQAEDRAGTSVASSAQGRLQSHKEPDVDRTVTD--DCS-SANQADKVGP 423
SV V ++K ++ G S AQ V + D DC AN+ D+
Sbjct: 332 NSVDVANKLSDKGRKNDKGPSSLDGAQR---------VANEINDSPDCVLDANRMDE--- 379
Query: 424 DDSGSDGSDGQKARPMSPGTLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTE 483
+P+SP T ALMCDE+ I + R+ TS + +
Sbjct: 380 -------------KPISPATRALMCDEEHEIGSEKETSARVK-----TSQEKEDTDTSSG 421
Query: 484 VYAEQERIVLTKFRDCLNRLITFGEI 509
+Y EQER +L+ FRD L +L I
Sbjct: 422 IYLEQERQILSIFRDYLIQLSNRARI 447
>sp|Q700D0|TCX8_ARATH Protein tesmin/TSO1-like CXC 8 OS=Arabidopsis thaliana GN=TCX8 PE=1
SV=1
Length = 368
Score = 155 bits (392), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 131/422 (31%), Positives = 186/422 (44%), Gaps = 114/422 (27%)
Query: 95 PKSRSRSNVEVKEN----TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVE 150
P SR S + K + T +K K C CK S+CLKLYC+CFASG+ C C+CV+CHNN E
Sbjct: 43 PISREHSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSE 102
Query: 151 NEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLK 210
AR A+ L RNPNAF K S DN+ + G ++GC CK++ CLK
Sbjct: 103 KCDAREAAMVNVLGRNPNAFSEKALGS---LTDNQCKAAPDTKPGLLSRGCKCKRTRCLK 159
Query: 211 KYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGAIGSS 270
KYCECFQAN+LCS+NC+C++CKN + + A S
Sbjct: 160 KYCECFQANLLCSDNCKCINCKNVSEAFQPPAF--------------------------S 193
Query: 271 GYASPPVSKKRKGQELFFGSTAKDPSIHRLGHFQQGNHIRTSPSSTLSSIP---VARASS 327
+ SP V ++R+ +EL ++ P L SIP + A
Sbjct: 194 AHNSPQVYRRRRDRELTEWNSCPAP---------------------LFSIPDNSIQNALG 232
Query: 328 TAAVGSSKFTYR---SLL--ADIIQPQDLKELCSVLVVLSSEAAKSVAVQRNTTEKQAED 382
+ S K YR SL+ + P DL +LCS+LV +SE+A + A +N + +D
Sbjct: 233 SPMSCSPKLPYRKKRSLMGYTSTLLP-DLGDLCSLLVA-ASESATTTAEDQNRIFTKPDD 290
Query: 383 RAGTSVASSAQGRLQSHKEPDVDRTVTDDCSSANQADKVGPDDSGSDGSDGQKARPMSPG 442
+ ++S ++ S N +++ S G
Sbjct: 291 KEAIELSSESE--------------------SRNVEEEI-----------------QSRG 313
Query: 443 TLALMCDEQDTIFMAASSPNRLTGHGCNTSSQLPYGQGMTEVYAEQERIVLTKFRDCLNR 502
L + D Q N C T + + T++Y EQER VL FRDCL +
Sbjct: 314 RLIELIDVQ---------YNGEEDSQCKTKTSV----NETDIYMEQERAVLETFRDCLQK 360
Query: 503 LI 504
I
Sbjct: 361 FI 362
>sp|Q95QD7|LIN54_CAEEL Protein lin-54 OS=Caenorhabditis elegans GN=lin-54 PE=1 SV=1
Length = 435
Score = 150 bits (378), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 72/170 (42%), Positives = 100/170 (58%), Gaps = 9/170 (5%)
Query: 97 SRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARR 156
S R + K P ++K CNC S+CLKLYC+CFA+G +C CNC +CHNN+E ++ R
Sbjct: 158 SHIRLKTKKKVFAPGQRKPCNCTKSQCLKLYCDCFANGEFCRDCNCKDCHNNIEYDSQRS 217
Query: 157 EAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECF 216
+A+ +LERNPNAF+PKI + G D + H KGCHCKKSGCLK YCEC+
Sbjct: 218 KAIRQSLERNPNAFKPKIGIARGGITD---------IERLHQKGCHCKKSGCLKNYCECY 268
Query: 217 QANILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAANAAITGA 266
+A + C++ C+C C+N E + G +N + NA+ T
Sbjct: 269 EAKVPCTDRCKCKGCQNTETYRMTRYKNSGGAVSNTNALMSLTNASSTAT 318
>sp|Q571G4|LIN54_MOUSE Protein lin-54 homolog OS=Mus musculus GN=Lin54 PE=2 SV=2
Length = 749
Score = 148 bits (374), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 103/153 (67%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+T + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESTSRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>sp|A1Z9E2|LIN54_DROME Protein lin-54 homolog OS=Drosophila melanogaster GN=mip120 PE=1
SV=1
Length = 950
Score = 147 bits (370), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 74/187 (39%), Positives = 110/187 (58%), Gaps = 14/187 (7%)
Query: 50 QAVTANSHPPPQPQPKPVVPPVPPPPPPLPGQA-TQPPVRVPKPESPKSRSRSNVEVKEN 108
Q + P Q P V P +P PP + +P V KP + + + V+
Sbjct: 675 QRIRQQQLPTEQSTPIKVEPKLPTLPPGVKANVPAKPLFEVLKPPA-TAAAAGAVDPLGG 733
Query: 109 TPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPN 168
++K CNC S+CLKLYC+CFA+G +C C C +C NN++ E R A+ + L+RNP+
Sbjct: 734 MTSRRKHCNCSKSQCLKLYCDCFANGEFCQDCTCKDCFNNLDYEVERERAIRSCLDRNPS 793
Query: 169 AFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRC 228
AF+PKI ++P+ +G++ + HNKGC+CK+SGCLK YCEC++A I CS C+C
Sbjct: 794 AFKPKI-TAPN--------SGDMRL---HNKGCNCKRSGCLKNYCECYEAKIPCSSICKC 841
Query: 229 MDCKNFE 235
+ C+N E
Sbjct: 842 VGCRNME 848
>sp|Q6MZP7|LIN54_HUMAN Protein lin-54 homolog OS=Homo sapiens GN=LIN54 PE=1 SV=3
Length = 749
Score = 146 bits (369), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>sp|Q641Z1|LIN54_RAT Protein lin-54 homolog OS=Rattus norvegicus GN=Lin54 PE=2 SV=2
Length = 749
Score = 146 bits (369), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 516 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 575
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 576 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 624
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 625 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 657
>sp|Q08CM4|LIN54_DANRE Protein lin-54 homolog OS=Danio rerio GN=lin54 PE=1 SV=1
Length = 771
Score = 146 bits (368), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 78/161 (48%), Positives = 105/161 (65%), Gaps = 11/161 (6%)
Query: 99 SRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREA 158
S + + E + +K CNC S+CLKLYC+CFA+G +C+ CNCVNC NN+++E+ R +A
Sbjct: 531 SLNGLSTSEANSRPRKPCNCTRSQCLKLYCDCFANGEFCNNCNCVNCFNNLDHESERLKA 590
Query: 159 VEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQA 218
++A L+RNP AF+PKI G D +H+KGC+CKKSGCLK YCEC++A
Sbjct: 591 IKACLDRNPVAFKPKIGKGKEGESDR-----------RHSKGCNCKKSGCLKNYCECYEA 639
Query: 219 NILCSENCRCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
I+CS C+CM CKNFE S ER+ L H A + QQ A
Sbjct: 640 KIMCSSICKCMGCKNFEESPERKTLMHLADAAEVRVQQQTA 680
>sp|Q0IHV2|LIN54_XENTR Protein lin-54 homolog OS=Xenopus tropicalis GN=lin54 PE=2 SV=1
Length = 741
Score = 145 bits (367), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 508 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 567
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 568 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 616
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 617 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 649
>sp|Q5RBN8|LIN54_PONAB Protein lin-54 homolog OS=Pongo abelii GN=LIN54 PE=2 SV=1
Length = 528
Score = 143 bits (361), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 11/153 (7%)
Query: 107 ENTPKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERN 166
E+ + +K CNC S CLKLYC+CFA+G +C+ CNC NC+NN+E+E R++A++A L+RN
Sbjct: 295 ESASRPRKPCNCTKSLCLKLYCDCFANGEFCNNCNCTNCYNNLEHENERQKAIKACLDRN 354
Query: 167 PNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENC 226
P AF+PKI G D +H+KGC+CK+SGCLK YCEC++A I+CS C
Sbjct: 355 PEAFKPKIGKGKEGESDR-----------RHSKGCNCKRSGCLKNYCECYEAKIMCSSIC 403
Query: 227 RCMDCKNFEGSEERQALFHGDHANNMAYIQQAA 259
+C+ CKNFE S ER+ L H A + QQ A
Sbjct: 404 KCIGCKNFEESPERKTLMHLADAAEVRVQQQTA 436
>sp|Q8L548|TCX3_ARATH Protein tesmin/TSO1-like CXC 3 OS=Arabidopsis thaliana GN=TCX3 PE=1
SV=1
Length = 609
Score = 131 bits (330), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 9/163 (5%)
Query: 92 PESPKSRSRSNVEVKENTPKKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN--- 147
P SPK + R + + E K+CNCK S+CLKLYCECFA+G YC + C+C+NC N
Sbjct: 307 PISPKKKRRKSEQSGEGD-SSCKRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPI 365
Query: 148 NVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSG 207
+ + A R+ +E+ RNP AF PK+ + + E+ + +H +GC+CKKS
Sbjct: 366 HKDVVLATRKQIES---RNPLAFAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSN 422
Query: 208 CLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHGDHAN 250
CLKKYCEC+Q + CS NCRC CKN G ++ +LF D N
Sbjct: 423 CLKKYCECYQGGVGCSINCRCEGCKNAFGRKD-GSLFEQDEEN 464
>sp|F4JIF5|TCX2_ARATH Protein tesmin/TSO1-like CXC 2 OS=Arabidopsis thaliana GN=TCX2 PE=1
SV=1
Length = 674
Score = 129 bits (325), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 60/130 (46%), Positives = 86/130 (66%), Gaps = 7/130 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENE---AARREAVEATLERNPNA 169
K+CNCK S+CLKLYCECFA+G+YC + C+C++C N +E A R+ +E+ RNP A
Sbjct: 375 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIES---RNPLA 431
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + ++ ++ + +H +GC+CKKS CLKKYCEC+Q + CS NCRC
Sbjct: 432 FAPKVIRNSDSVQETGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCE 491
Query: 230 DCKNFEGSEE 239
CKN G ++
Sbjct: 492 GCKNAFGRKD 501
>sp|Q9LUI3|TSO1_ARATH CRC domain-containing protein TSO1 OS=Arabidopsis thaliana GN=TSO1
PE=1 SV=1
Length = 695
Score = 128 bits (322), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 85/134 (63%), Gaps = 7/134 (5%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN---NVENEAARREAVEATLERNPNA 169
K+CNCK S+CLKLYCECFA+G+YC + C+C++C N + E A R+ +E+ RNP A
Sbjct: 401 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIES---RNPLA 457
Query: 170 FRPKIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCM 229
F PK+ + + ++ + +H +GC+CKKS C+KKYCEC+Q + CS NCRC
Sbjct: 458 FAPKVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCE 517
Query: 230 DCKNFEGSEERQAL 243
C N G ++ L
Sbjct: 518 GCTNVFGRKDGSLL 531
Score = 52.0 bits (123), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCD-GCNCVNCHNNV-ENEAARREAVEATLERNPN 168
+ ++ CNCK S C+K YCEC+ G+ C C C C N + + +E+ LE N
Sbjct: 484 RHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCTNVFGRKDGSLLVIMESKLEENQE 543
Query: 169 AFRPKIASSPHGTRDNKE 186
+ +IA H +KE
Sbjct: 544 TYEKRIAKIQHNVEVSKE 561
>sp|Q84JZ8|TCX4_ARATH Protein tesmin/TSO1-like CXC 4 OS=Arabidopsis thaliana GN=TCX4 PE=1
SV=1
Length = 639
Score = 125 bits (315), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 85/151 (56%), Gaps = 14/151 (9%)
Query: 114 KQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHNNVENEAARREAVEATLERNPNAFRP 172
K+C C+ S+CLKLYCECF++G++C + C+C NC N +E ++ E RNP AF P
Sbjct: 453 KRCKCRKSQCLKLYCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAP 512
Query: 173 KIASSPHGTRDNKEETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCK 232
K+ S+ D E + +H +GC+C+KSGC KKYCECF + CS NCRCM CK
Sbjct: 513 KVVSTSDTVIDLWVENSKTPASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCK 572
Query: 233 NFEGSEERQ-------------ALFHGDHAN 250
N G Q A +GDH +
Sbjct: 573 NTFGHTNEQCAGDSDVVTINDEAKHYGDHGD 603
Score = 41.6 bits (96), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYC-DGCNCVNCHN 147
+ ++ CNC+ S C K YCECF G+ C C C+ C N
Sbjct: 536 RHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCKN 573
>sp|Q9WTJ6|MTL5_MOUSE Tesmin OS=Mus musculus GN=Mtl5 PE=2 SV=2
Length = 475
Score = 119 bits (297), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 54/122 (44%), Positives = 78/122 (63%), Gaps = 15/122 (12%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CF+SG +C+ C+C NN+ +E R +A++A L+RNP AF+PK+ G +
Sbjct: 278 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 332
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
H+KGC+CK+SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 333 ----------HSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIACKNYEESPERKMLMST 382
Query: 247 DH 248
H
Sbjct: 383 PH 384
Score = 46.2 bits (108), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 8/83 (9%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVENEAARREAVEATLERNPNA 169
+ K CNCK S CLK YCEC+ + I C C C+ C N E R+ + P
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKIMCSSICKCIAC-KNYEESPERKMLMSTPHYMEPGD 390
Query: 170 FR------PKIASSPHGTRDNKE 186
F P S P R N++
Sbjct: 391 FESSHYLSPAKFSGPPKLRKNRQ 413
>sp|Q5XHX9|MTL5_RAT Tesmin OS=Rattus norvegicus GN=Mtl5 PE=2 SV=2
Length = 475
Score = 117 bits (293), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 54/122 (44%), Positives = 77/122 (63%), Gaps = 15/122 (12%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CF+SG +C+ C+C NN+ +E R +A++A L+RNP AF+PK+ G +
Sbjct: 278 YCDCFSSGDFCNSCSC----NNLRHELERFKAIKACLDRNPEAFQPKMGKGRLGAAKLR- 332
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALFHG 246
H+KGC+CK+SGCLK YCEC++A I CS C+C+ CKN+E S ER+ L
Sbjct: 333 ----------HSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEESPERKMLMST 382
Query: 247 DH 248
H
Sbjct: 383 PH 384
Score = 45.4 bits (106), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 111 KKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
+ K CNCK S CLK YCEC+ + I C C C+ C N E+
Sbjct: 332 RHSKGCNCKRSGCLKNYCECYEAKITCSSICKCIACKNYEES 373
>sp|Q9Y4I5|MTL5_HUMAN Tesmin OS=Homo sapiens GN=MTL5 PE=1 SV=2
Length = 508
Score = 102 bits (255), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 59/118 (50%), Positives = 81/118 (68%), Gaps = 11/118 (9%)
Query: 127 YCECFASGIYCDGCNCVNCHNNVENEAARREAVEATLERNPNAFRPKIASSPHGTRDNKE 186
YC+CFASG +C+ CNC NC NN+ ++ R +A++A L RNP AF+PKI K
Sbjct: 307 YCDCFASGDFCNNCNCNNCCNNLHHDIERFKAIKACLGRNPEAFQPKIG---------KG 357
Query: 187 ETGEVLMLGKHNKGCHCKKSGCLKKYCECFQANILCSENCRCMDCKNFEGSEERQALF 244
+ G V +HNKGC+C++SGCLK YCEC++A I+CS C+C+ CKN+E S ER+ L
Sbjct: 358 QLGNVK--PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEESPERKTLM 413
Score = 46.6 bits (109), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 110 PKKQKQCNCKHSRCLKLYCECFASGIYCDG-CNCVNCHNNVEN 151
P+ K CNC+ S CLK YCEC+ + I C C C+ C N E+
Sbjct: 364 PQHNKGCNCRRSGCLKNYCECYEAQIMCSSICKCIGCKNYEES 406
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.311 0.127 0.376
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 233,025,492
Number of Sequences: 539616
Number of extensions: 10330122
Number of successful extensions: 109319
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 966
Number of HSP's successfully gapped in prelim test: 988
Number of HSP's that attempted gapping in prelim test: 61630
Number of HSP's gapped (non-prelim): 23897
length of query: 602
length of database: 191,569,459
effective HSP length: 123
effective length of query: 479
effective length of database: 125,196,691
effective search space: 59969214989
effective search space used: 59969214989
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 64 (29.3 bits)