BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 028018
(215 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9SE96|GEML1_ARATH GEM-like protein 1 OS=Arabidopsis thaliana GN=FIP1 PE=1 SV=1
Length = 259
Score = 246 bits (627), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 130/188 (69%), Positives = 151/188 (80%), Gaps = 14/188 (7%)
Query: 28 HSSDYAPYPKIDPNDVAPPPQNWSNVSTATGPAPINESAATTMPAESNPYVSPSPIEPSS 87
HSSDYAPYPK+DP DV PP TG +AATTMPAESNPYVSPSP +
Sbjct: 28 HSSDYAPYPKLDPTDVTPP----PPQPIPTG------AAATTMPAESNPYVSPSP----A 73
Query: 88 SKNKLESVKGALSNWGKKAAEATKRAEDLAGNMWQHLKTGPSLADAAVGRIAQRTKVLAE 147
+N ++SVK L WGK AA+ATK+AEDLAGN WQHLKTGPS+ADAAV RIAQ TK+LAE
Sbjct: 74 PRNTMDSVKDTLGKWGKMAADATKKAEDLAGNFWQHLKTGPSVADAAVSRIAQGTKILAE 133
Query: 148 GGYEKIFRQSFDTVPEEQLLKTYACYLSTSAGPVVGVLYLSTAKLAFCSDDPLSYKVGDE 207
GGYEK+F+Q+FD +P+E+LLKTYACYLSTSAGPV+GV+YLST KLAF SD+PLSYK G++
Sbjct: 134 GGYEKVFKQTFDCLPDEKLLKTYACYLSTSAGPVLGVMYLSTHKLAFSSDNPLSYKEGEQ 193
Query: 208 TQWSYYKV 215
T WSYYKV
Sbjct: 194 TLWSYYKV 201
>sp|Q8S8F8|GEM_ARATH GLABRA2 expression modulator OS=Arabidopsis thaliana GN=GEM PE=1
SV=1
Length = 299
Score = 220 bits (560), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 101/143 (70%), Positives = 121/143 (84%), Gaps = 2/143 (1%)
Query: 75 NPYVSPSPIEPS--SSKNKLESVKGALSNWGKKAAEATKRAEDLAGNMWQHLKTGPSLAD 132
NPY++ SP E S S K+ +E+VKG L WGK+ AEA K+ E LAGN WQHL+T PS AD
Sbjct: 96 NPYIARSPAETSDASLKDTMETVKGVLGRWGKRVAEAAKKTESLAGNTWQHLRTAPSFAD 155
Query: 133 AAVGRIAQRTKVLAEGGYEKIFRQSFDTVPEEQLLKTYACYLSTSAGPVVGVLYLSTAKL 192
AA+GRIAQ TKV AEGGYEKIFRQ+F+T PEEQLL ++ACYLSTSAGPV+GVLY+S+AKL
Sbjct: 156 AAMGRIAQSTKVFAEGGYEKIFRQTFETDPEEQLLNSFACYLSTSAGPVMGVLYISSAKL 215
Query: 193 AFCSDDPLSYKVGDETQWSYYKV 215
A+CSD+PLSYK GD+T+WSYYKV
Sbjct: 216 AYCSDNPLSYKNGDQTEWSYYKV 238
>sp|Q9M063|GEML3_ARATH Putative GEM-like protein 3 OS=Arabidopsis thaliana GN=At4g40100
PE=3 SV=2
Length = 239
Score = 170 bits (430), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 90/155 (58%), Positives = 113/155 (72%), Gaps = 10/155 (6%)
Query: 49 NWSNVSTATGPAPINESAATTMPAESNPYVSPSPIEPS--SSKNKLESVKGALSNWGKKA 106
+W+ + PAP ++ +++ A SNPYV+ +P E S S K +ESVKG L WG++
Sbjct: 29 HWNPELVSESPAPDEKALSSSSAARSNPYVARAPTETSDASLKETMESVKGVLGRWGRRV 88
Query: 107 AEATKRAEDLAGNMWQH-LKTGPSLADAAVGRIAQRTKVLAEGGYEKIFRQSFDTVPEEQ 165
EA +AE LAGN WQH L+ AA+GRIAQ TKVLAEGGYEKIFRQ+F+TVPEEQ
Sbjct: 89 GEAAMKAESLAGNTWQHPLR-------AAMGRIAQSTKVLAEGGYEKIFRQTFETVPEEQ 141
Query: 166 LLKTYACYLSTSAGPVVGVLYLSTAKLAFCSDDPL 200
L ++ACYLSTSAGPV+GVLY+STAKLA+CSD L
Sbjct: 142 LQNSFACYLSTSAGPVMGVLYVSTAKLAYCSDTSL 176
>sp|Q9LYV6|GEML5_ARATH GEM-like protein 5 OS=Arabidopsis thaliana GN=At5g13200 PE=1 SV=1
Length = 272
Score = 140 bits (353), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 76/188 (40%), Positives = 109/188 (57%), Gaps = 19/188 (10%)
Query: 39 DPNDVAPPPQNWSNVSTATGPAPI----NESAATTMPAESN-----PYVSPSPIEPSSSK 89
+P+ +P + W AP+ N+ AA + ++ PYV SP+E ++
Sbjct: 28 EPSSSSPDQKKWGTHVMGAPAAPVAHPDNQQAAAWVAGDNQQTQYQPYVIYSPVEHPTTN 87
Query: 90 NKLESVKGALSNWGKKAAEATKRAEDLAGNMWQHLKTGPSLADAAVGRIAQRTKVLAEGG 149
N LE V G W +KA E +A N+W +LKTGPS+++ A G++ K + +GG
Sbjct: 88 NPLEPVIGMFHTWSRKA-------ETVARNLWHNLKTGPSMSETAWGKVNLTAKAITKGG 140
Query: 150 YEKIFRQSFDTVPEEQLLKTYACYLSTSAGPVVGVLYLSTAKLAFCSDDPLSYKV--GDE 207
+E +FRQ F T P E L KT+ACYLST+ GPV G +YLS A++AFCSD PL + G E
Sbjct: 141 FESLFRQIFGTEPNETLKKTFACYLSTTTGPVAGTVYLSNARVAFCSDRPLYFTAPSGQE 200
Query: 208 TQWSYYKV 215
+ WSYY+V
Sbjct: 201 S-WSYYRV 207
>sp|Q9M122|GEML2_ARATH GEM-like protein 2 OS=Arabidopsis thaliana GN=At4g01600 PE=2 SV=1
Length = 233
Score = 123 bits (309), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 59/143 (41%), Positives = 91/143 (63%), Gaps = 2/143 (1%)
Query: 75 NPYVSPSPIEPSSSKNKLESVKGALSNWGKKAAEATKRAEDLAGNMWQHLKTGPSLADAA 134
NPYV + +S K + V L+ GKK +AT++AE L G + HLK PS++DAA
Sbjct: 29 NPYVHITSPTSASDKRSKDKVLEVLNRCGKKVEDATRKAEALVGGLKDHLKFSPSISDAA 88
Query: 135 VGRIAQRTKVLAEGGYEKIFRQSFDTVPEEQLLKTYACYLSTSAGPVVGVLYLSTAKLAF 194
+ R++Q TK++ EGG E++F++ F + E+LL ++ CY+ST++GPV GV+Y+S ++AF
Sbjct: 89 MARLSQGTKMIVEGGPERVFQREFGVLAVEKLLDSFVCYISTTSGPVTGVIYISNRRIAF 148
Query: 195 CSDDPLSY--KVGDETQWSYYKV 215
CSD + G +YYKV
Sbjct: 149 CSDYAIRLPSSAGGNGVAAYYKV 171
>sp|Q9FMW4|GEML8_ARATH Putative GEM-like protein 8 OS=Arabidopsis thaliana GN=At5g23370
PE=3 SV=1
Length = 219
Score = 82.0 bits (201), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 72/134 (53%), Gaps = 12/134 (8%)
Query: 85 PSSSKNKLESVKGALSNWGKKAAEATKRAEDLAGNMWQHLKTGPSLADAAVGRIAQRTKV 144
P+SSK + KG K+ K+ + + K GP L + +++ ++
Sbjct: 36 PTSSKFSFLTGKG-------KSMLRKKKNDSFTNGVRDQDKLGPKLTETVKRKLSLGARI 88
Query: 145 LAEGGYEKIFRQSFDTVPEEQLLKTYACYLSTSAGPVVGVLYLSTAKLAFCSDDPLSYKV 204
L GG EKI+++ F EE+L K Y CYLST+AGP+ G+L++S+ K+AFCS+ S KV
Sbjct: 89 LQMGGLEKIYKRLFKVSDEEKLFKAYQCYLSTTAGPIAGLLFISSKKIAFCSER--SIKV 146
Query: 205 GD---ETQWSYYKV 215
E +YKV
Sbjct: 147 ASPQGELNRVHYKV 160
>sp|Q9FMW6|GEML6_ARATH GEM-like protein 6 OS=Arabidopsis thaliana GN=At5g23350 PE=2 SV=2
Length = 218
Score = 81.3 bits (199), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 64/116 (55%), Gaps = 7/116 (6%)
Query: 85 PSSSKNKLESVKGALSNWGKKAAEATKRAEDLAGNMWQHLKTGPSLADAAVGRIAQRTKV 144
P+SSK + KG KK T A D K GP L + +++ K+
Sbjct: 36 PTSSKVSYLTGKGKSMLRKKKTDSFTNGARD-------QDKLGPKLTETVKRKLSLGAKI 88
Query: 145 LAEGGYEKIFRQSFDTVPEEQLLKTYACYLSTSAGPVVGVLYLSTAKLAFCSDDPL 200
L GG EKI+++ F +E+L K Y CYLST+AGP+ G+L++S+ K+AFCS+ +
Sbjct: 89 LQMGGLEKIYKRLFKVCDQEKLFKAYQCYLSTTAGPIAGLLFISSKKIAFCSERSI 144
>sp|Q9FTA0|GEML4_ARATH GEM-like protein 4 OS=Arabidopsis thaliana GN=At5g08350 PE=2 SV=1
Length = 222
Score = 79.0 bits (193), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/130 (32%), Positives = 68/130 (52%), Gaps = 13/130 (10%)
Query: 71 PAESNPYVSPSPIEPSSSKNKLESVKGALSNWGKKAAEATKRAEDLAGNMWQHLKTGPSL 130
PA N + P+ SS K++ +VK L K+ + + K P L
Sbjct: 27 PASFNKFRVPA----SSKKSEQSNVKSILKR---------KKTDGFTNGVRDQSKIRPKL 73
Query: 131 ADAAVGRIAQRTKVLAEGGYEKIFRQSFDTVPEEQLLKTYACYLSTSAGPVVGVLYLSTA 190
+ +++ ++L GG EKIF++ F E+L K Y CYLST+AGP+ G+L++S+
Sbjct: 74 TETVKRKLSLGARILQVGGLEKIFKRLFRVSEGEKLFKMYQCYLSTTAGPIAGLLFISSK 133
Query: 191 KLAFCSDDPL 200
K+AFCS+ +
Sbjct: 134 KMAFCSERSI 143
>sp|Q9FMW5|GEML7_ARATH GEM-like protein 7 OS=Arabidopsis thaliana GN=At5g23360 PE=2 SV=1
Length = 210
Score = 75.9 bits (185), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 56/92 (60%), Gaps = 6/92 (6%)
Query: 125 KTGPSLADAAVGRIAQRTKVLAEGGYEKIFRQSFDTVPEEQLLKTYACYLSTSAGPVVGV 184
K GP L + +++ K+L GG EKI+++ F +E+L K Y CYLST+ G + G+
Sbjct: 61 KLGPKLTETVKRKLSLGAKILQMGGLEKIYKRLFKVCDKEKLFKAYQCYLSTTEGSIAGL 120
Query: 185 LYLSTAKLAFCSDDPLSYKV----GDETQWSY 212
L++S+ K+AFCS+ S KV GD T+ Y
Sbjct: 121 LFISSKKIAFCSER--SIKVTSPQGDLTRVHY 150
>sp|Q4WXX4|RNA14_ASPFU mRNA 3'-end-processing protein rna14 OS=Neosartorya fumigata
(strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100)
GN=rna14 PE=3 SV=1
Length = 1029
Score = 37.4 bits (85), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 70/162 (43%), Gaps = 25/162 (15%)
Query: 28 HSSDYAPYPKIDPNDVAPPPQNWSNVSTATGPAPINESAATTMPAESNPYVSPSPIEPSS 87
H+S Y+P P + P S+V+ A G + N A + + + +P+P PS+
Sbjct: 193 HNS-YSPAPAVAPK---------SDVAVAAGQSLYNSHALQSGNVQDSATATPTPDSPST 242
Query: 88 SKNKL-------------ESVKGALSNWGKKAAEATKRAE-DLAGNMWQH-LKTGPSLAD 132
SK +L E +G + W + E R D A ++++ LK P A+
Sbjct: 243 SKGRLPHDRVGILEDRIQEDPRGDIPAWLELINEHRSRNRFDSARDVFERFLKVFPFAAE 302
Query: 133 AAVGRIAQRTKVLAEGGYEKIFRQSFDTVPEEQLLKTYACYL 174
V +++ E+IF ++ T+P+ QL Y Y+
Sbjct: 303 QWVAYAKMESELNDLYRLEQIFNRTLLTIPDVQLWSVYLDYV 344
>sp|Q9UKF5|ADA29_HUMAN Disintegrin and metalloproteinase domain-containing protein 29
OS=Homo sapiens GN=ADAM29 PE=1 SV=3
Length = 820
Score = 36.6 bits (83), Expect = 0.14, Method: Composition-based stats.
Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 2/67 (2%)
Query: 21 MQQQQEPHSSDYAPYPKID--PNDVAPPPQNWSNVSTATGPAPINESAATTMPAESNPYV 78
+++QQ+ + KI P+++ P Q W S + P ++S MP++S P V
Sbjct: 704 IKKQQDVQTPSAKEEEKIQRRPHELPPQSQPWVMPSQSQPPVTPSQSHPQVMPSQSQPPV 763
Query: 79 SPSPIEP 85
+PS +P
Sbjct: 764 TPSQSQP 770
Score = 33.1 bits (74), Expect = 1.5, Method: Composition-based stats.
Identities = 22/89 (24%), Positives = 40/89 (44%), Gaps = 14/89 (15%)
Query: 11 ETKASSETHHMQQQQEPHSSDYAPYPKIDPNDVAPP---PQNWSNVSTATGPAPINESAA 67
E S+ M Q +P + +P++ P+ PP Q+ V + P+
Sbjct: 727 ELPPQSQPWVMPSQSQPPVTPSQSHPQVMPSQSQPPVTPSQSQPRVMPSQSQPPV----- 781
Query: 68 TTMPAESNPYVSPS----PIEPSSSKNKL 92
MP++S+P ++PS P+ PS + +L
Sbjct: 782 --MPSQSHPQLTPSQSQPPVTPSQRQPQL 808
Score = 32.0 bits (71), Expect = 3.5, Method: Composition-based stats.
Identities = 21/86 (24%), Positives = 38/86 (44%), Gaps = 7/86 (8%)
Query: 5 QADHQIETKASSETHHMQQQQEPHSSDYAPYPKIDPNDVAPP---PQNWSNVSTATGPAP 61
+ ++T ++ E +Q++ PH P + P+ PP Q+ V + P
Sbjct: 705 KKQQDVQTPSAKEEEKIQRR--PHELPPQSQPWVMPSQSQPPVTPSQSHPQVMPSQSQPP 762
Query: 62 I--NESAATTMPAESNPYVSPSPIEP 85
+ ++S MP++S P V PS P
Sbjct: 763 VTPSQSQPRVMPSQSQPPVMPSQSHP 788
>sp|Q9FGS8|C2GR2_ARATH C2 and GRAM domain-containing protein At5g50170 OS=Arabidopsis
thaliana GN=At5g50170 PE=2 SV=1
Length = 1027
Score = 35.0 bits (79), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 154 FRQSFDTVPEEQLLKTYACYLSTSAGPVVGVLYLSTAKLAFCSD 197
F++ F EE LLK Y CYL PV G L+LS +AF S+
Sbjct: 694 FQKLFGLPHEEFLLKEYTCYLKRKL-PVQGKLFLSARIVAFYSN 736
>sp|P54265|DMPK_MOUSE Myotonin-protein kinase OS=Mus musculus GN=Dmpk PE=1 SV=1
Length = 631
Score = 32.3 bits (72), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 20/80 (25%), Positives = 34/80 (42%)
Query: 43 VAPPPQNWSNVSTATGPAPINESAATTMPAESNPYVSPSPIEPSSSKNKLESVKGALSNW 102
V+PP Q PAP+ E+ T + + + S +LE+++ A N+
Sbjct: 442 VSPPDQVAEEADLVAVPAPVAEAETTVTLQQLQEALEEEVLTRQSLSRELEAIRTANQNF 501
Query: 103 GKKAAEATKRAEDLAGNMWQ 122
+ EA R DL ++ Q
Sbjct: 502 SSQLQEAEVRNRDLEAHVRQ 521
>sp|Q5B3I8|RNA14_EMENI mRNA 3'-end-processing protein rna14 OS=Emericella nidulans (strain
FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139)
GN=rna14 PE=3 SV=2
Length = 1075
Score = 32.3 bits (72), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 57/142 (40%), Gaps = 25/142 (17%)
Query: 48 QNWSNVSTATGPAPINESAATTMPAESNPYVSPSPIEPSSSKNKL-------------ES 94
QN N T+ P ESA T P+P PS++K +L E
Sbjct: 202 QNLYNSRTSLQPEIARESATAT----------PAPDSPSTAKGRLPHDRVGILEDRIRED 251
Query: 95 VKGALSNWGKKAAEATKRAE-DLAGNMWQH-LKTGPSLADAAVGRIAQRTKVLAEGGYEK 152
+G + W + E R D A ++++ LK P A+ V +++ E+
Sbjct: 252 PRGDIPAWLELINEHRSRNRIDSARDVYERFLKVFPLSAEMWVAYATMESELNELFRLEQ 311
Query: 153 IFRQSFDTVPEEQLLKTYACYL 174
IF ++ T+P QL Y Y+
Sbjct: 312 IFNRTLLTIPAVQLWTVYLDYV 333
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.308 0.123 0.362
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 81,571,351
Number of Sequences: 539616
Number of extensions: 3275814
Number of successful extensions: 13909
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 48
Number of HSP's successfully gapped in prelim test: 261
Number of HSP's that attempted gapping in prelim test: 13319
Number of HSP's gapped (non-prelim): 802
length of query: 215
length of database: 191,569,459
effective HSP length: 113
effective length of query: 102
effective length of database: 130,592,851
effective search space: 13320470802
effective search space used: 13320470802
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 59 (27.3 bits)