BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 039195
         (195 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|357439857|ref|XP_003590206.1| hypothetical protein MTR_1g045930 [Medicago truncatula]
 gi|355479254|gb|AES60457.1| hypothetical protein MTR_1g045930 [Medicago truncatula]
          Length = 1019

 Score =  165 bits (418), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 78/133 (58%), Positives = 98/133 (73%), Gaps = 11/133 (8%)

Query: 63  KGYSVCCFCKTKDAEIDKIEDKEQDERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDV 122
           +G+S+CC  KT D++I+K  +++ DERPPFDINLAVILAGFAFEAYTTPPE++GR+EVD 
Sbjct: 50  QGFSICCSSKT-DSQIEKSSNEKNDERPPFDINLAVILAGFAFEAYTTPPENLGRREVDA 108

Query: 123 AGCKTVYLSESFAHEMYDGQLFIKLEKCFNLPAMDPWITHACSTGFLGTSDPYVVMELDG 182
           AG KT+YLSE F  E+YDGQ+FIKL+K    PAMDPW          GTSDPYVV+++  
Sbjct: 109 AGTKTIYLSEEFFREIYDGQIFIKLKKGSRFPAMDPW----------GTSDPYVVVQMGS 158

Query: 183 QEIRMKCWWSWKE 195
           Q  +    W  KE
Sbjct: 159 QTAKSNIKWGTKE 171


>gi|224107575|ref|XP_002314524.1| predicted protein [Populus trichocarpa]
 gi|222863564|gb|EEF00695.1| predicted protein [Populus trichocarpa]
          Length = 552

 Score =  152 bits (383), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 72/96 (75%), Positives = 79/96 (82%), Gaps = 10/96 (10%)

Query: 88  ERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKL 147
           ERPPFDINLAV+LAGFAFEAYT+ PE+VG++EVD A CKTVYLSESF  E+YDGQLFIKL
Sbjct: 2   ERPPFDINLAVVLAGFAFEAYTSLPENVGKREVDAADCKTVYLSESFVREIYDGQLFIKL 61

Query: 148 EKCFNLPAMDPWITHACSTGFLGTSDPYVVMELDGQ 183
            K F+LPAMDPW          GTSDPYVVMELDGQ
Sbjct: 62  TKGFDLPAMDPW----------GTSDPYVVMELDGQ 87


>gi|218195187|gb|EEC77614.1| hypothetical protein OsI_16594 [Oryza sativa Indica Group]
          Length = 889

 Score =  138 bits (348), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 63/107 (58%), Positives = 80/107 (74%), Gaps = 10/107 (9%)

Query: 89  RPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKLE 148
           RPPFD+NLAV+LAGFAFEAYT+PPE VG +E+D A C+TV+LS+SF  E+YDGQL ++L+
Sbjct: 75  RPPFDLNLAVVLAGFAFEAYTSPPEDVGWREIDAAECQTVFLSDSFLREVYDGQLVVRLK 134

Query: 149 KCFNLPAMDPWITHACSTGFLGTSDPYVVMELDGQEIRMKCWWSWKE 195
           K  NLP MDPW          GTSDPYVV++L+GQ  +    W+ KE
Sbjct: 135 KGVNLPVMDPW----------GTSDPYVVLQLNGQTAKSNIKWATKE 171


>gi|242073674|ref|XP_002446773.1| hypothetical protein SORBIDRAFT_06g022160 [Sorghum bicolor]
 gi|241937956|gb|EES11101.1| hypothetical protein SORBIDRAFT_06g022160 [Sorghum bicolor]
          Length = 895

 Score =  136 bits (343), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 61/107 (57%), Positives = 79/107 (73%), Gaps = 10/107 (9%)

Query: 89  RPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKLE 148
           RPPFDINLAV+LAGFAFEAYT+PP  VG +E D A C+TV+LS+ F  E+YDGQL ++L+
Sbjct: 89  RPPFDINLAVVLAGFAFEAYTSPPADVGWRETDAADCQTVFLSDVFLREVYDGQLVVRLK 148

Query: 149 KCFNLPAMDPWITHACSTGFLGTSDPYVVMELDGQEIRMKCWWSWKE 195
           K  NLPAMDPW          GTSDPYV+++L+GQ  + +  W+ K+
Sbjct: 149 KGINLPAMDPW----------GTSDPYVILQLNGQTAKSQIKWATKD 185


>gi|413918883|gb|AFW58815.1| hypothetical protein ZEAMMB73_298032 [Zea mays]
          Length = 697

 Score =  136 bits (342), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 61/108 (56%), Positives = 79/108 (73%), Gaps = 10/108 (9%)

Query: 88  ERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKL 147
            RPPFDINLAV+LAGFAFEAYT+PP  VG +E D A C+TV+LS+ F  E+YDG+L ++L
Sbjct: 88  RRPPFDINLAVVLAGFAFEAYTSPPADVGWRETDAADCQTVFLSDVFLREVYDGRLVVRL 147

Query: 148 EKCFNLPAMDPWITHACSTGFLGTSDPYVVMELDGQEIRMKCWWSWKE 195
           +K  NLPAMDPW          GTSDPYV+++L+GQ  + +  W+ KE
Sbjct: 148 KKGINLPAMDPW----------GTSDPYVILQLNGQTAKSQIKWATKE 185


>gi|326504440|dbj|BAJ91052.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 859

 Score =  130 bits (326), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 63/120 (52%), Positives = 83/120 (69%), Gaps = 11/120 (9%)

Query: 76  AEIDKIEDKEQDERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFA 135
           A+   + D+EQ   P FD+NLAV+LAGFAFEAY+TPP   G +E D A C+TV+LS+ F 
Sbjct: 61  ADDGSVGDEEQ-PHPQFDLNLAVVLAGFAFEAYSTPPADAGWRETDAAECQTVFLSDVFL 119

Query: 136 HEMYDGQLFIKLEKCFNLPAMDPWITHACSTGFLGTSDPYVVMELDGQEIRMKCWWSWKE 195
            E+YDGQL ++L+K  NLPAMDPW          GTSDPYVV++L+GQ  + +  W+ KE
Sbjct: 120 REVYDGQLVVRLKKGNNLPAMDPW----------GTSDPYVVLQLNGQTAKSQIKWATKE 169


>gi|357164616|ref|XP_003580112.1| PREDICTED: uncharacterized protein LOC100834372 [Brachypodium
           distachyon]
          Length = 879

 Score =  129 bits (324), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 60/112 (53%), Positives = 78/112 (69%), Gaps = 10/112 (8%)

Query: 84  KEQDERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQL 143
           +E   RP FD+NLAV+LAGFAFEAY++PP   G +E D A C+TV+LS  F HE+YDGQL
Sbjct: 80  EEDRPRPQFDLNLAVVLAGFAFEAYSSPPVDAGWRETDAAECQTVFLSNVFLHEVYDGQL 139

Query: 144 FIKLEKCFNLPAMDPWITHACSTGFLGTSDPYVVMELDGQEIRMKCWWSWKE 195
            ++L+K  +LPAMDPW          GTSDPYVV++L+GQ  +    W+ KE
Sbjct: 140 VVRLKKGTSLPAMDPW----------GTSDPYVVLQLNGQTTKSNIKWATKE 181


>gi|224100007|ref|XP_002311707.1| predicted protein [Populus trichocarpa]
 gi|222851527|gb|EEE89074.1| predicted protein [Populus trichocarpa]
          Length = 158

 Score =  119 bits (299), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 68/157 (43%), Positives = 92/157 (58%), Gaps = 22/157 (14%)

Query: 18  AMLARATPSHSTVSPLSVSQLPQKPFSGKVELVFICPRSYQSKGDKGYSVCCFC------ 71
           A   R TP+      L+  +     FSG+  ++     +Y+  G   + VC         
Sbjct: 10  AFYHRFTPTSYKHRNLNFYRRFPSSFSGRFRVL-----TYRGSG---HGVCSSSTSSISK 61

Query: 72  --KTKDAEIDKI----EDKEQDERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGC 125
                + E +K+    E +  +ERPPFDINLAV LAGFAFEAYT+ P++VG+ E D AGC
Sbjct: 62  ASNANNTETEKVAVQQEKENGNERPPFDINLAVALAGFAFEAYTSSPKNVGKWEADAAGC 121

Query: 126 KTVYLSESFAHEMYDGQLFIKLEKCFNLPAMDPWITH 162
           KTVYLSE    E+YDGQLFIKL+K F+ PA+DPW+ +
Sbjct: 122 KTVYLSE--LREIYDGQLFIKLKKGFDFPALDPWVRY 156


>gi|115459352|ref|NP_001053276.1| Os04g0509100 [Oryza sativa Japonica Group]
 gi|32488294|emb|CAE03360.1| OSJNBb0065L13.3 [Oryza sativa Japonica Group]
 gi|113564847|dbj|BAF15190.1| Os04g0509100 [Oryza sativa Japonica Group]
          Length = 888

 Score =  118 bits (295), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 50/72 (69%), Positives = 62/72 (86%)

Query: 89  RPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKLE 148
           RPPFD+NLAV+LAGFAFEAYT+PPE VG +E+D A C+TV+LS+SF  E+YDGQL ++L+
Sbjct: 87  RPPFDLNLAVVLAGFAFEAYTSPPEDVGWREIDAAECQTVFLSDSFLREVYDGQLVVRLK 146

Query: 149 KCFNLPAMDPWI 160
           K  NLP MDPWI
Sbjct: 147 KGVNLPVMDPWI 158


>gi|168061670|ref|XP_001782810.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162665712|gb|EDQ52387.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 909

 Score =  104 bits (259), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 46/98 (46%), Positives = 66/98 (67%), Gaps = 2/98 (2%)

Query: 85  EQDERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLF 144
           E ++RPPF+++LAV+LAGFAFEAY TP E+VG +E D   C+T +L++ F  E+YDGQ+F
Sbjct: 129 ENEDRPPFNLDLAVVLAGFAFEAYNTPAENVGVQEKDAGDCETTFLAQHFLREVYDGQVF 188

Query: 145 IKLEKCFNLPAMDPWITHACSTGFL--GTSDPYVVMEL 180
           + L+K  + P +D W++    TG     T DP    EL
Sbjct: 189 VNLKKGMHFPGLDFWVSTRAETGLFSSATKDPIWNEEL 226


>gi|302785770|ref|XP_002974656.1| hypothetical protein SELMODRAFT_415006 [Selaginella moellendorffii]
 gi|300157551|gb|EFJ24176.1| hypothetical protein SELMODRAFT_415006 [Selaginella moellendorffii]
          Length = 713

 Score = 99.8 bits (247), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 47/110 (42%), Positives = 69/110 (62%), Gaps = 10/110 (9%)

Query: 86  QDERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFI 145
           +D+R  F+++LAV+LAGFAFE+Y +PP +VG +E D   C+T Y+S+    E+Y GQL +
Sbjct: 108 EDDRAKFNLDLAVMLAGFAFESYNSPPPNVGIRETDGQSCETTYMSDGLMRELYHGQLTV 167

Query: 146 KLEKCFNLPAMDPWITHACSTGFLGTSDPYVVMELDGQEIRMKCWWSWKE 195
           KLEK  +LP +D W          GTSDPYV + +     + K  W+ K+
Sbjct: 168 KLEKGVSLPGLDLW----------GTSDPYVWLRIGNCAKQSKTAWATKD 207


>gi|302759891|ref|XP_002963368.1| hypothetical protein SELMODRAFT_405179 [Selaginella moellendorffii]
 gi|300168636|gb|EFJ35239.1| hypothetical protein SELMODRAFT_405179 [Selaginella moellendorffii]
          Length = 736

 Score = 96.7 bits (239), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 46/104 (44%), Positives = 64/104 (61%), Gaps = 10/104 (9%)

Query: 92  FDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKLEKCF 151
            D++LAV+LAGFAFE+Y +PP +VG +E D   C+T Y+S+    E+Y GQL +KLEK  
Sbjct: 81  LDLDLAVMLAGFAFESYNSPPPNVGIRETDGQSCETTYMSDGLIRELYQGQLTVKLEKGV 140

Query: 152 NLPAMDPWITHACSTGFLGTSDPYVVMELDGQEIRMKCWWSWKE 195
           NLP +D W          GTSDPYV + +     + K  W+ K+
Sbjct: 141 NLPGLDLW----------GTSDPYVWLRIGNCAKQSKTAWATKD 174


>gi|222629183|gb|EEE61315.1| hypothetical protein OsJ_15419 [Oryza sativa Japonica Group]
          Length = 894

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 43/90 (47%), Positives = 53/90 (58%), Gaps = 18/90 (20%)

Query: 89  RPPFDINLAVILAGFAFEAYTTPPESV----GRK--------------EVDVAGCKTVYL 130
           RPPFD+NLAV+LAGFAFEAYT+PP+ +    G K               V+   C     
Sbjct: 75  RPPFDLNLAVVLAGFAFEAYTSPPKYLPSRCGAKLEREWNSERIDMVITVNFVNCTVPMY 134

Query: 131 SESFAHEMYDGQLFIKLEKCFNLPAMDPWI 160
             SF  E+YDGQL ++L+K  NLP MDPWI
Sbjct: 135 CSSFLREVYDGQLVVRLKKGVNLPVMDPWI 164


>gi|255554308|ref|XP_002518194.1| triacylglycerol lipase, putative [Ricinus communis]
 gi|223542790|gb|EEF44327.1| triacylglycerol lipase, putative [Ricinus communis]
          Length = 810

 Score = 77.0 bits (188), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 43/79 (54%), Positives = 49/79 (62%), Gaps = 7/79 (8%)

Query: 37  QLPQKPFSGKVELVFICPRSYQSKGDKGYSVCCFCKTKDAEIDKIEDKEQD---ERPPFD 93
           Q P + FSG  EL     RS  S  D    + CFCK  D  ++K+  +EQ    ERP FD
Sbjct: 33  QFPVR-FSGVFEL---KTRSVLSSRDGANLIGCFCKVNDGAVEKVSIEEQQNEIERPRFD 88

Query: 94  INLAVILAGFAFEAYTTPP 112
           INLAVILAGFAFEAYTTPP
Sbjct: 89  INLAVILAGFAFEAYTTPP 107


>gi|356577602|ref|XP_003556913.1| PREDICTED: uncharacterized protein LOC100777995 [Glycine max]
          Length = 809

 Score = 69.3 bits (168), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 33/70 (47%), Positives = 43/70 (61%), Gaps = 6/70 (8%)

Query: 43  FSGKVELVFICPRSYQSKGDKGYSVCCFCKTKDAEIDKIEDKEQDERPPFDINLAVILAG 102
           F GK+    +  R       +  S+CC      +++ ++   E D+RPPFDINLAVILAG
Sbjct: 33  FPGKLRAFSLARRG------RVLSICCGSSKTGSQLQRVAVPEDDDRPPFDINLAVILAG 86

Query: 103 FAFEAYTTPP 112
           FAFEAYTTPP
Sbjct: 87  FAFEAYTTPP 96


>gi|255084970|ref|XP_002504916.1| predicted protein [Micromonas sp. RCC299]
 gi|226520185|gb|ACO66174.1| predicted protein [Micromonas sp. RCC299]
          Length = 690

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 37/84 (44%), Positives = 48/84 (57%), Gaps = 13/84 (15%)

Query: 92  FDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKLEKCF 151
           FD++ AV+L GFAFE+Y+TP   V   + DV GC T YLS+ F  E+Y G L + + +  
Sbjct: 68  FDLDTAVLLGGFAFESYSTPEGGV--VDADVNGCSTAYLSD-FVREVYSGVLEVTMVRGE 124

Query: 152 NLPAMDPWITHACSTGFLGTSDPY 175
           NLP  D           LG SDPY
Sbjct: 125 NLPRGDK----------LGLSDPY 138


>gi|303282807|ref|XP_003060695.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458166|gb|EEH55464.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 669

 Score = 61.6 bits (148), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 37/99 (37%), Positives = 53/99 (53%), Gaps = 11/99 (11%)

Query: 90  PPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKLEK 149
           P FD++ +++LAGFAFE+Y +P    G  + DV G  T YLS+ F  E++ G L   + +
Sbjct: 62  PHFDLDTSILLAGFAFESYLSPEG--GLVDRDVRGGSTAYLSD-FVREVFAGVLEATVVR 118

Query: 150 CFNLPAMDPWITHACSTGFLGTSDPYVVMELDGQEIRMK 188
           C  LP  D  +          +SDPYVV+ L     R K
Sbjct: 119 CDGLPKTDVLVG--------ASSDPYVVLALGASSHRTK 149


>gi|159484662|ref|XP_001700373.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158272414|gb|EDO98215.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 135

 Score = 59.7 bits (143), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 48/98 (48%), Gaps = 13/98 (13%)

Query: 92  FDINLAVILAGFAFEAYTTPPESVGRK---EVDVAGCKTVYLSESFAHEMYDGQLFIKLE 148
           FD+ LAV+LAG AFE+Y  P  + G     +  V G    Y  +SF  E+Y G L ++L+
Sbjct: 14  FDLPLAVVLAGAAFESYLQPQAAEGGAAFLQRSVGGPTVTYTDKSFLTEVYQGVLVVELQ 73

Query: 149 KCFNLPAMDPWITHACSTGFLGTSDPYVVMELDGQEIR 186
              NL   DP           G SDPY V+ + G   R
Sbjct: 74  SAANLRPADP----------NGQSDPYAVLSMGGATHR 101


>gi|4678381|emb|CAB41113.1| putative protein [Arabidopsis thaliana]
 gi|7268058|emb|CAB78397.1| putative protein [Arabidopsis thaliana]
          Length = 805

 Score = 58.5 bits (140), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 27/47 (57%), Positives = 35/47 (74%), Gaps = 3/47 (6%)

Query: 66  SVCCFCKTKDAEIDKIEDKEQDERPPFDINLAVILAGFAFEAYTTPP 112
           S+ CF +++  E+      E+DER PFDINLAVILAGFAFE+Y +PP
Sbjct: 46  SLRCFAQSETKEVSL---PEKDERSPFDINLAVILAGFAFESYASPP 89


>gi|302844632|ref|XP_002953856.1| hypothetical protein VOLCADRAFT_94625 [Volvox carteri f.
           nagariensis]
 gi|300260964|gb|EFJ45180.1| hypothetical protein VOLCADRAFT_94625 [Volvox carteri f.
           nagariensis]
          Length = 1517

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 34/102 (33%), Positives = 51/102 (50%), Gaps = 11/102 (10%)

Query: 86  QDERPPFDINLAVILAGFAFEAYTTPPES-VGRKEVDVAGCKTVYLSESFAHEMYDGQLF 144
           Q +   FD+ LAV+LAG +FEAY  P E+ +  ++ ++ G    +  ++F  E Y G L 
Sbjct: 24  QHQAAKFDLPLAVVLAGASFEAYLQPQETGMAFQQRNIKGPTVTFTDKAFLTETYTGVLI 83

Query: 145 IKLEKCFNLPAMDPWITHACSTGFLGTSDPYVVMELDGQEIR 186
           ++L    NL A D            G+SDPY V+ L     R
Sbjct: 84  VQLMSATNLRAAD----------VTGSSDPYAVLSLGESSFR 115


>gi|449449168|ref|XP_004142337.1| PREDICTED: uncharacterized protein LOC101204368 [Cucumis sativus]
          Length = 808

 Score = 56.2 bits (134), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 27/47 (57%), Positives = 33/47 (70%), Gaps = 1/47 (2%)

Query: 66  SVCCFCKTKDAEIDKIEDKEQDERPPFDINLAVILAGFAFEAYTTPP 112
           S  CFC +   E+     +++ E  PFDINLAVILAGFAFEAYT+PP
Sbjct: 56  SFRCFCSS-GTELQNASLQQRTEPRPFDINLAVILAGFAFEAYTSPP 101


>gi|297804988|ref|XP_002870378.1| lipase class 3 family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316214|gb|EFH46637.1| lipase class 3 family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 783

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 39/67 (58%), Gaps = 10/67 (14%)

Query: 53  CPRSYQSKGDKGY-------SVCCFCKTKDAEIDKIEDKEQDERPPFDINLAVILAGFAF 105
            PR++   G   +       S  CF ++   E+   E   ++ER PFDINLAVILAGFAF
Sbjct: 26  TPRTFHFPGKINFRIKAKPNSFRCFTQSDAKEVSLAE---KEERSPFDINLAVILAGFAF 82

Query: 106 EAYTTPP 112
           E+Y +PP
Sbjct: 83  ESYASPP 89


>gi|224005184|ref|XP_002296243.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209586275|gb|ACI64960.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 860

 Score = 52.4 bits (124), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 39/69 (56%)

Query: 89  RPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKLE 148
           +PPF ++ ++ LAG AF+AYT PP +  R E   +G    +LS S+   +Y G + +   
Sbjct: 80  QPPFSLSTSLFLAGLAFDAYTEPPPNSSRWERGSSGLNVAFLSTSYTRSLYKGIIEVTPL 139

Query: 149 KCFNLPAMD 157
           +  +LP  D
Sbjct: 140 RASDLPDED 148


>gi|397578524|gb|EJK50906.1| hypothetical protein THAOC_29977 [Thalassiosira oceanica]
          Length = 895

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 38/80 (47%)

Query: 88  ERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKL 147
            +PPF +  A+ LAG AF+AY  PP+   R E    G    +LS ++   +Y G + +  
Sbjct: 101 NQPPFSLPTALFLAGLAFDAYVEPPQDSSRWEKGSTGTNVAFLSTAYTRSLYKGIIEVTP 160

Query: 148 EKCFNLPAMDPWITHACSTG 167
            +  +LP  D       S G
Sbjct: 161 IRASDLPDEDDAAESMLSGG 180


>gi|449018387|dbj|BAM81789.1| similar to triacylglycerol lipase [Cyanidioschyzon merolae strain
           10D]
          Length = 778

 Score = 49.3 bits (116), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 76  AEIDKIEDKEQDERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFA 135
           AE   I +  +  R PFD  LA+++AGF F+AY  P    G K   +   +TVYLS  F 
Sbjct: 278 AESSYIAEPLRGRRVPFDAQLALLMAGFTFDAYRDPEPHEG-KRCALGDLQTVYLSLDFV 336

Query: 136 HEMYDGQL 143
            E ++G L
Sbjct: 337 RECFEGVL 344


>gi|219130275|ref|XP_002185294.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403209|gb|EEC43163.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 891

 Score = 47.0 bits (110), Expect = 0.005,   Method: Composition-based stats.
 Identities = 19/63 (30%), Positives = 33/63 (52%)

Query: 92  FDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKLEKCF 151
            ++  A+   G AF+AY  PP +  R E    G +  + S ++  ++Y G L +++  C 
Sbjct: 135 LELGTALFAGGLAFDAYVEPPSNSSRWEKGSQGLRVAFCSVAYTRQLYKGVLEVRVRTCT 194

Query: 152 NLP 154
           NLP
Sbjct: 195 NLP 197


>gi|299473333|emb|CBN77732.1| triacylglycerol lipase, putative (Partial) [Ectocarpus siliculosus]
          Length = 842

 Score = 46.6 bits (109), Expect = 0.006,   Method: Composition-based stats.
 Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 5/74 (6%)

Query: 84  KEQDERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSE-SFAHEMYDGQ 142
           +E  ER PFD++ AV LA FAFEAY  P    G+   +V G  +V   E +F  +++ G 
Sbjct: 13  REPPERRPFDLDAAVFLADFAFEAYREP----GKVAWEVPGPSSVRFKEGAFVSKVFPGL 68

Query: 143 LFIKLEKCFNLPAM 156
           L + L     LP +
Sbjct: 69  LTVSLLAAEGLPVV 82


>gi|384245351|gb|EIE18845.1| alpha/beta-hydrolase [Coccomyxa subellipsoidea C-169]
          Length = 646

 Score = 45.8 bits (107), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 46/95 (48%), Gaps = 12/95 (12%)

Query: 92  FDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKLEKCF 151
           F++ LAV L   +FEAY  P    G KE+   G +T+Y+  ++  E+ +G L + +   F
Sbjct: 4   FNLPLAVALGACSFEAYNLPYHMYGIKELHPCGAETLYMDPAYLEEIVEGLLRVHVRHAF 63

Query: 152 NLPAM------DPWITHACSTG------FLGTSDP 174
            L         +P++T A   G        G+SDP
Sbjct: 64  GLQPQGEKQECNPFVTVALGPGAGRTHAVKGSSDP 98


>gi|428174990|gb|EKX43883.1| lipase [Guillardia theta CCMP2712]
          Length = 910

 Score = 44.3 bits (103), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 46/90 (51%), Gaps = 7/90 (7%)

Query: 70  FCKTKDAEIDKI---ED-KEQDERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGC 125
           F K +D+ +  I   ED  E+ + P FD+  AV LAG AF++Y  P   V +K  D  G 
Sbjct: 136 FVKARDSVLKNIGIGEDLVEEVKGPNFDLEKAVWLAGAAFDSYNDPVGGVVKKYED--GV 193

Query: 126 KTVYLSESFAHEMYDGQLFIKLEKCFNLPA 155
           K  Y S     E + G L +KL K  NL A
Sbjct: 194 KVSYTSSKMLAETHSGVLMLKL-KFANLTA 222


>gi|415885226|ref|ZP_11547154.1| DNA topoisomerase III [Bacillus methanolicus MGA3]
 gi|387590895|gb|EIJ83214.1| DNA topoisomerase III [Bacillus methanolicus MGA3]
          Length = 727

 Score = 42.4 bits (98), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 12/102 (11%)

Query: 64  GYSVCCFCKTKDAEIDKIEDKEQDERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVA 123
            Y +  FCK K AEI KIE + ++ +PP+  NL+ + A  A + +  PP+    K +D+A
Sbjct: 255 AYKIAQFCKGKSAEIQKIEKERKEFQPPYFFNLSALQAT-ANKMFKYPPQ----KTLDIA 309

Query: 124 GCKTVYLSESFAHEMYDGQLFIKLEKCFNLPAMDPWITHACS 165
             + +YL    ++   D     K E      AM P I    S
Sbjct: 310 --QKLYLKGMISYPRSDSNFVTKEE-----AAMFPEILQKLS 344


>gi|307102546|gb|EFN50817.1| hypothetical protein CHLNCDRAFT_141758 [Chlorella variabilis]
          Length = 671

 Score = 41.2 bits (95), Expect = 0.21,   Method: Composition-based stats.
 Identities = 28/97 (28%), Positives = 41/97 (42%), Gaps = 9/97 (9%)

Query: 85  EQDERPPFDINLAVILAGFAFEAYT-TPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQL 143
           + D    FD +L + LAG AFEAY      +   K   + G +  Y+ + F    + G +
Sbjct: 25  DDDPSTAFDWSLGLALAGCAFEAYNDIEAGAPSLKMTSMGGTEITYVDKDFLRRKFAGLM 84

Query: 144 FIKLEKCFNLPAMDPWITHACSTGFLGTSDPYVVMEL 180
            +   K   LPA D W            SDPY V+ +
Sbjct: 85  EVTAVKAAGLPAADWWPG--------SKSDPYAVLNI 113


>gi|323451439|gb|EGB07316.1| hypothetical protein AURANDRAFT_71849 [Aureococcus anophagefferens]
          Length = 3487

 Score = 40.0 bits (92), Expect = 0.45,   Method: Composition-based stats.
 Identities = 28/88 (31%), Positives = 37/88 (42%), Gaps = 5/88 (5%)

Query: 89   RPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKLE 148
            R  F +  A + AGFAFE+Y  P  +  R E    GC   +   SF    Y G L  ++ 
Sbjct: 1232 RTSFSLGAASLCAGFAFESYNDPDGA--RWERGADGCDVAFRDASFVRSCYAGALIARVV 1289

Query: 149  KCFNLPAMDPWITHACSTGFLGTSDPYV 176
            +   L      +     TG    SDPYV
Sbjct: 1290 EARGLEGQQD-LQEIALTG--SASDPYV 1314


>gi|296088265|emb|CBI35773.3| unnamed protein product [Vitis vinifera]
          Length = 91

 Score = 37.4 bits (85), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 23/40 (57%), Gaps = 10/40 (25%)

Query: 156 MDPWITHACSTGFLGTSDPYVVMELDGQEIRMKCWWSWKE 195
           MDPW          GTSDPYVV++LDGQ ++    W  KE
Sbjct: 1   MDPW----------GTSDPYVVIQLDGQVVKSNVKWGTKE 30


>gi|302844323|ref|XP_002953702.1| hypothetical protein VOLCADRAFT_106050 [Volvox carteri f.
           nagariensis]
 gi|300261111|gb|EFJ45326.1| hypothetical protein VOLCADRAFT_106050 [Volvox carteri f.
           nagariensis]
          Length = 550

 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 33/67 (49%)

Query: 92  FDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKLEKCF 151
           FD+  AV LAG AF AY  P      ++ DV     + + + F  + Y G L + L +  
Sbjct: 9   FDLPRAVGLAGAAFGAYRDPKPVNRFEDFDVKDTTVIVMEKEFLSQAYWGILEVYLVRAD 68

Query: 152 NLPAMDP 158
           +LPA  P
Sbjct: 69  DLPAGVP 75


>gi|410964109|ref|XP_003988598.1| PREDICTED: synaptotagmin-10 [Felis catus]
          Length = 523

 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 12/68 (17%)

Query: 113 ESVGRKEVDVAGCKTVYLSESFAHEMYDGQLFIKLEKCFNLPAMDPWITHACSTGFLGTS 172
           +S G +E DV  C  +  +  + +E  +  L +K+ K  +LPA D          F GTS
Sbjct: 220 DSEGNREEDVKTCGKLNFTLQYDYE--NELLIVKIIKALDLPAKD----------FTGTS 267

Query: 173 DPYVVMEL 180
           DPYV M L
Sbjct: 268 DPYVKMYL 275


>gi|387929442|ref|ZP_10132119.1| DNA topoisomerase III [Bacillus methanolicus PB1]
 gi|387586260|gb|EIJ78584.1| DNA topoisomerase III [Bacillus methanolicus PB1]
          Length = 723

 Score = 37.0 bits (84), Expect = 4.2,   Method: Composition-based stats.
 Identities = 29/96 (30%), Positives = 46/96 (47%), Gaps = 12/96 (12%)

Query: 70  FCKTKDAEIDKIEDKEQDERPPFDINLAVILAGFAFEAYTTPPESVGRKEVDVAGCKTVY 129
           FC+ K AEI  +E + ++ +PP+  NL+ + A  A + Y  PP+    K +D+A  + +Y
Sbjct: 261 FCEGKPAEIQSVEKERKEFQPPYFFNLSSLQAT-ANKRYKYPPQ----KTLDIA--QKLY 313

Query: 130 LSESFAHEMYDGQLFIKLEKCFNLPAMDPWITHACS 165
           L    ++   D     K E      AM P I    S
Sbjct: 314 LKGIISYPRSDSNFVTKEEA-----AMIPEILQKLS 344


>gi|302142349|emb|CBI19552.3| unnamed protein product [Vitis vinifera]
          Length = 467

 Score = 36.2 bits (82), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 26/129 (20%), Positives = 52/129 (40%), Gaps = 1/129 (0%)

Query: 3   LEEFDYYL-NVSMVHIAMLARATPSHSTVSPLSVSQLPQKPFSGKVELVFICPRSYQSKG 61
           L++  +Y  ++ +  ++ L+  T  H++++P  + + P+ P         I   S    G
Sbjct: 22  LQKLQFYSKSLEITSLSFLSSITQVHTSIAPSLMGEGPKSPLQPPTYGNLITILSIDGGG 81

Query: 62  DKGYSVCCFCKTKDAEIDKIEDKEQDERPPFDINLAVILAGFAFEAYTTPPESVGRKEVD 121
            +G          ++E+ K++ ++      FD+       G      TTP E+ GR    
Sbjct: 82  IRGLIPGTVLGFLESELQKLDGEDARISDYFDVIAGTSTGGLVTAMLTTPNENTGRPLFS 141

Query: 122 VAGCKTVYL 130
               K  YL
Sbjct: 142 AKDIKDFYL 150


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.136    0.431 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,298,682,420
Number of Sequences: 23463169
Number of extensions: 129623982
Number of successful extensions: 231431
Number of sequences better than 100.0: 37
Number of HSP's better than 100.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 231375
Number of HSP's gapped (non-prelim): 45
length of query: 195
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 60
effective length of database: 9,191,667,552
effective search space: 551500053120
effective search space used: 551500053120
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)