BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000955
(1210 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q55FD6|Y8158_DICDI PHD finger-containing protein DDB_G0268158 OS=Dictyostelium
discoideum GN=DDB_G0268158 PE=4 SV=1
Length = 688
Score = 92.0 bits (227), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 93/192 (48%), Gaps = 23/192 (11%)
Query: 15 HDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTK- 72
++T +D+ + ++VC+FC+ G+LL C+G CLRSFH + + A + +S ++
Sbjct: 121 NETNDDDKPKRPRKNEAVCTFCEKPGELLMCDGLCLRSFHISCLKARNLYNPSSSSISPV 180
Query: 73 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 132
++ + + C +C Q+ CF+C K G G ++ C CG FYH C
Sbjct: 181 TTIDGTVRWECNDCVSSQNSCFSCKKRGI----IGIDLMKCKVHQCGKFYHYKC------ 230
Query: 133 RDDEVAADQLAKSI-IAGESFTCPLHKCCICK-QGENKADSDLQFAVCRRCPKAYHRKCL 190
VA +LAK I F CPLH C +C+ G+ K Q C RCP AYH C+
Sbjct: 231 ----VADYKLAKLINTKTPRFNCPLHYCSVCEVSGDGK-----QSVHCFRCPTAYHVICM 281
Query: 191 PRKIAFEDKLEE 202
+ K E
Sbjct: 282 QPGVKMLTKTRE 293
>sp|Q6P2L6|NSD3_MOUSE Histone-lysine N-methyltransferase NSD3 OS=Mus musculus GN=Whsc1l1
PE=1 SV=2
Length = 1439
Score = 81.3 bits (199), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 68/159 (42%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C+ GD L+ CEG C R FH LGLT +F C+ CE
Sbjct: 701 DTVCQVCEKAGDCLVACEGECCRHFHVEC----------LGLT---AVPEGHFTCEECET 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C + CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 827
>sp|O88491|NSD1_MOUSE Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific OS=Mus musculus GN=Nsd1 PE=1 SV=1
Length = 2588
Score = 80.5 bits (197), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 66/161 (40%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGL + F C C
Sbjct: 1439 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLPE---MPRGKFICNECH 1485
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K ++
Sbjct: 1486 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1526
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F CPLH C C N + S + C RCP AYH
Sbjct: 1527 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1567
>sp|Q96L73|NSD1_HUMAN Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific OS=Homo sapiens GN=NSD1 PE=1 SV=1
Length = 2696
Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1541 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1587
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1588 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1628
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1629 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1669
>sp|Q8BVE8|NSD2_MOUSE Histone-lysine N-methyltransferase NSD2 OS=Mus musculus GN=Whsc1
PE=1 SV=2
Length = 1365
Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ EV CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>sp|O96028|NSD2_HUMAN Histone-lysine N-methyltransferase NSD2 OS=Homo sapiens GN=WHSC1
PE=1 SV=1
Length = 1365
Score = 78.6 bits (192), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>sp|Q9BZ95|NSD3_HUMAN Histone-lysine N-methyltransferase NSD3 OS=Homo sapiens GN=WHSC1L1
PE=1 SV=1
Length = 1437
Score = 77.8 bits (190), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>sp|Q8BRH4|MLL3_MOUSE Histone-lysine N-methyltransferase MLL3 OS=Mus musculus GN=Mll3
PE=2 SV=2
Length = 4903
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/201 (25%), Positives = 75/201 (37%), Gaps = 55/201 (27%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 88
C+FC + G + CCE +C + +H AG G +D +FF E+
Sbjct: 284 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------AGTFQD----FSHFFLLCPEHIDQ 332
Query: 89 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
+ C C G + F C TCG YH C+ ++A L
Sbjct: 333 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 377
Query: 143 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 201
++ + CP K C CKQ + D + VC C K YH CL +
Sbjct: 378 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 421
Query: 202 EGIITRAWEGLLPNHRILIYC 222
+ + T W+ N RI I C
Sbjct: 422 KSVPTNGWK--CKNCRICIEC 440
Score = 42.0 bits (97), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 42/102 (41%), Gaps = 20/102 (19%)
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 949 QQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVL 991
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 992 SKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1030
>sp|Q8NEZ4|MLL3_HUMAN Histone-lysine N-methyltransferase MLL3 OS=Homo sapiens GN=MLL3
PE=1 SV=3
Length = 4911
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 75/197 (38%), Gaps = 47/197 (23%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAG-----EESHCASLGLTKDEVEAMLNFFCKNC 86
C+FC + G + CCE +C + +H AG + SH L + + NC
Sbjct: 285 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDFSHIFLLCPEHIDQAPERSKEDANC 344
Query: 87 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 146
C + G L + F C TCG YH C+ ++A L ++
Sbjct: 345 AV----CDSPGDL--------LDQFFCT--TCGQHYHGMCL--------DIAVTPLKRA- 381
Query: 147 IAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGII 205
+ CP K C CKQ + D + VC C K YH CL + + +
Sbjct: 382 ----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVMKSVP 426
Query: 206 TRAWEGLLPNHRILIYC 222
T W+ N RI I C
Sbjct: 427 TNGWK--CKNCRICIEC 441
Score = 42.0 bits (97), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 957 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 999
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 1000 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1037
>sp|Q9UPN9|TRI33_HUMAN E3 ubiquitin-protein ligase TRIM33 OS=Homo sapiens GN=TRIM33 PE=1
SV=3
Length = 1127
Score = 42.7 bits (99), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>sp|Q64127|TIF1A_MOUSE Transcription intermediary factor 1-alpha OS=Mus musculus GN=Trim24
PE=1 SV=1
Length = 1051
Score = 41.2 bits (95), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
E+ +ED+ + C+ C NGG+LLCCE +C + FH T
Sbjct: 818 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 852
>sp|Q99PP7|TRI33_MOUSE E3 ubiquitin-protein ligase TRIM33 OS=Mus musculus GN=Trim33 PE=1
SV=2
Length = 1142
Score = 40.8 bits (94), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 892 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 927
>sp|O14686|MLL2_HUMAN Histone-lysine N-methyltransferase MLL2 OS=Homo sapiens GN=MLL2
PE=1 SV=2
Length = 5537
Score = 40.4 bits (93), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 41/160 (25%), Positives = 61/160 (38%), Gaps = 30/160 (18%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY-KQ 90
CS C G + C C R +H S L+ ++ + + Y ++
Sbjct: 172 CSHCTRLGASIPCRSPGCPRLYHFPCATASGSF-----LSMKTLQLLCPEHSEGAAYLEE 226
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
+C C G ++F C S CGH YH C+ D + A + A
Sbjct: 227 ARCAVCEGPGEL-----CDLFFCTS--CGHHYHGACL------DTALTARKRA------- 266
Query: 151 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ CP +C +C Q K +D + VC C K YH CL
Sbjct: 267 GWQCP--ECKVC-QACRKPGNDSKMLVCETCDKGYHTFCL 303
Score = 36.2 bits (82), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/101 (23%), Positives = 39/101 (38%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + C YHP+CV+ + + + + + I+
Sbjct: 1377 QDMCVVCGSFGR-----GAEGHLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVECIV-- 1429
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
C +C Q + + + +C C +YH CL
Sbjct: 1430 ---------CEVCGQASDPS----RLLLCDDCDISYHTYCL 1457
>sp|O15164|TIF1A_HUMAN Transcription intermediary factor 1-alpha OS=Homo sapiens GN=TRIM24
PE=1 SV=3
Length = 1050
Score = 40.0 bits (92), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 875
Query: 73 DEVE 76
EVE
Sbjct: 876 PEVE 879
>sp|Q56R14|TRI33_XENLA E3 ubiquitin-protein ligase TRIM33 OS=Xenopus laevis GN=trim33 PE=1
SV=1
Length = 1091
Score = 39.3 bits (90), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 15/24 (62%), Positives = 18/24 (75%), Gaps = 1/24 (4%)
Query: 33 CSFCDNGGDLLCCEGRCLRSFHAT 56
C+ C NGGDLLCCE +C + FH T
Sbjct: 853 CAVCQNGGDLLCCE-KCPKVFHLT 875
>sp|Q6PDK2|MLL2_MOUSE Histone-lysine N-methyltransferase MLL2 OS=Mus musculus GN=Mll2
PE=1 SV=2
Length = 5588
Score = 38.9 bits (89), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 37/82 (45%), Gaps = 18/82 (21%)
Query: 117 TCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFA 176
+CGH YH C+ D + A + A S+ CP +C +C Q K +D +
Sbjct: 246 SCGHHYHGACL------DTALTARKRA-------SWQCP--ECKVC-QSCRKPGNDSKML 289
Query: 177 VCRRCPKAYHRKCLPRKIAFED 198
VC C K YH CL K ED
Sbjct: 290 VCETCDKGYHTFCL--KPPMED 309
Score = 36.2 bits (82), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 24/101 (23%), Positives = 39/101 (38%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + C YHP+CV+ + + + + + I+
Sbjct: 1333 QDMCVVCGSFGR-----GAEGHLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVECIV-- 1385
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
C +C Q + + + +C C +YH CL
Sbjct: 1386 ---------CEVCGQASDPS----RLLLCDDCDISYHTYCL 1413
>sp|O15016|TRI66_HUMAN Tripartite motif-containing protein 66 OS=Homo sapiens GN=TRIM66 PE=2
SV=4
Length = 1216
Score = 37.7 bits (86), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 33/69 (47%), Gaps = 13/69 (18%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTKDEVEAMLNF 81
+ C+ C NGG+LLCC+ RC + FH + GE LT+ E+E +
Sbjct: 970 EDFCAVCLNGGELLCCD-RCPKVFHLSCHVPALLSFPGGEWVCTLCRSLTQPEME----Y 1024
Query: 82 FCKNCEYKQ 90
C+N Y Q
Sbjct: 1025 DCENACYNQ 1033
>sp|Q6E2N3|TRI33_DANRE E3 ubiquitin-protein ligase TRIM33 OS=Danio rerio GN=trim33 PE=2
SV=1
Length = 1163
Score = 37.0 bits (84), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 1/33 (3%)
Query: 24 EEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
+ED+ + C+ C NGG+LLCC+ C + FH T
Sbjct: 915 KEDDPNEDWCAVCQNGGELLCCD-HCPKVFHIT 946
>sp|Q9SD34|C3H44_ARATH Zinc finger CCCH domain-containing protein 44 OS=Arabidopsis
thaliana GN=At3g51120 PE=2 SV=3
Length = 1292
Score = 37.0 bits (84), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 34/91 (37%), Gaps = 20/91 (21%)
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
CF C G ++ C C YHP C+ + DE AK +
Sbjct: 113 CFICFD--------GGDLVLCDRRNCPKAYHPACIKR-----DEAFFRTTAK-------W 152
Query: 153 TCPLHKCCICKQGENKADSDLQFAVCRRCPK 183
C H C C++ + F+VC+RC K
Sbjct: 153 NCGWHICGTCQKASSYMCYTCTFSVCKRCIK 183
>sp|Q9RFD5|BCHH_RHOS4 Magnesium-chelatase subunit H OS=Rhodobacter sphaeroides (strain ATCC
17023 / 2.4.1 / NCIB 8253 / DSM 158) GN=bchH PE=3 SV=1
Length = 1193
Score = 37.0 bits (84), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 50/98 (51%), Gaps = 8/98 (8%)
Query: 332 GNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKF--GKQDNSDRETSRTVTVK 389
G V++L Q+ +++++ SL + Y + G E + + N+ ++ T V+
Sbjct: 1066 GTVRTLKDQI----ALETRARSLNPKYYEGLLKHGAEGVRQIEAQVTNTLGWSATTQQVE 1121
Query: 390 PLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEIL 427
P + +SE LDE +RRL+ L +A SVRM E L
Sbjct: 1122 PWVYQRLSETFVLDEAMRRRLAELNPEA--SVRMAERL 1157
>sp|Q924W6|TRI66_MOUSE Tripartite motif-containing protein 66 OS=Mus musculus GN=Trim66 PE=1
SV=3
Length = 1242
Score = 36.2 bits (82), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 13/67 (19%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTKDEVEAMLNF 81
+ C+ C NGG+LLCC+ RC + +H + GE LT+ E+E +
Sbjct: 996 EDFCAVCINGGELLCCD-RCPKVYHLSCHVPALLSFPGGEWVCTLCRSLTQPEME----Y 1050
Query: 82 FCKNCEY 88
C+N Y
Sbjct: 1051 DCENARY 1057
>sp|Q65S78|AROA_MANSM 3-phosphoshikimate 1-carboxyvinyltransferase OS=Mannheimia
succiniciproducens (strain MBEL55E) GN=aroA PE=3 SV=1
Length = 433
Score = 35.8 bits (81), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 30/52 (57%), Gaps = 1/52 (1%)
Query: 457 VEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMR 508
V + ALK+L V+ S ED K+VCE + L + F W+N L ++L MR
Sbjct: 50 VRHMLNALKQLGVNYSLSED-KSVCEVQGLGKAFAWQNGLALFLGNAGTAMR 100
>sp|O43918|AIRE_HUMAN Autoimmune regulator OS=Homo sapiens GN=AIRE PE=1 SV=1
Length = 545
Score = 35.4 bits (80), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 24/44 (54%), Gaps = 12/44 (27%)
Query: 11 EDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH 54
+ +H EDE C+ C +GG+L+CC+G C R+FH
Sbjct: 288 DPQLHQKNEDE-----------CAVCRDGGELICCDG-CPRAFH 319
>sp|Q9Z0E3|AIRE_MOUSE Autoimmune regulator OS=Mus musculus GN=Aire PE=1 SV=1
Length = 552
Score = 34.7 bits (78), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 12/22 (54%), Positives = 18/22 (81%), Gaps = 1/22 (4%)
Query: 33 CSFCDNGGDLLCCEGRCLRSFH 54
C+ C +GG+L+CC+G C R+FH
Sbjct: 301 CAVCHDGGELICCDG-CPRAFH 321
>sp|Q3ED65|MENG_ARATH 2-phytyl-1,4-beta-naphthoquinone methyltransferase, chloroplastic
OS=Arabidopsis thaliana GN=MENG PE=1 SV=2
Length = 261
Score = 34.7 bits (78), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 536 GDMIVDFCCGANDFSCLMKKKLDETGK 562
GD ++D CCG+ D + L+ +K+ TGK
Sbjct: 74 GDYVLDLCCGSGDLAFLLSEKVGSTGK 100
>sp|Q04570|AROA_PASMU 3-phosphoshikimate 1-carboxyvinyltransferase OS=Pasteurella
multocida (strain Pm70) GN=aroA PE=3 SV=2
Length = 440
Score = 33.9 bits (76), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 44/85 (51%), Gaps = 8/85 (9%)
Query: 431 KIPSTHAYASKSAVDKAITLGKVEGS-------VEAIRTALKKLDVDGSSIEDAKAVCEP 483
++P + + ++++ + A+ GK + V + ALK+L V ED K+VCE
Sbjct: 21 RLPGSKSLSNRALLLSALAKGKTTLTNLLDSDDVRHMLNALKELGVTYQLSED-KSVCEI 79
Query: 484 EVLSQIFKWKNKLKVYLAPFLHGMR 508
E L + F+W++ L ++L MR
Sbjct: 80 EGLGRAFEWQSGLALFLGNAGTAMR 104
>sp|Q17551|RPM1_CAEEL E3 ubiquitin-protein ligase rpm-1 OS=Caenorhabditis elegans GN=rpm-1
PE=1 SV=1
Length = 3766
Score = 33.9 bits (76), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 22/81 (27%), Positives = 31/81 (38%), Gaps = 13/81 (16%)
Query: 65 CASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK-ETGAEVF-----------P 112
C + T +V N FC + +C C D + G +V P
Sbjct: 3471 CVNYAKTACQVMHTCNHFCGGIR-NEEECLPCMTCKREDAAQDGDDVCVICFTERLGAAP 3529
Query: 113 CVSATCGHFYHPHCVSKLLLR 133
C+ CGH +H HCV +L R
Sbjct: 3530 CIRLGCGHMFHFHCVRMILER 3550
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.315 0.132 0.397
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 483,873,652
Number of Sequences: 539616
Number of extensions: 22237557
Number of successful extensions: 63381
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 17
Number of HSP's successfully gapped in prelim test: 123
Number of HSP's that attempted gapping in prelim test: 62972
Number of HSP's gapped (non-prelim): 437
length of query: 1210
length of database: 191,569,459
effective HSP length: 129
effective length of query: 1081
effective length of database: 121,958,995
effective search space: 131837673595
effective search space used: 131837673595
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 67 (30.4 bits)