Mystery solved! The secret of Excel curved line interpolation

Excel Interpolation: A Black Box No Longer

Have you ever wondered what formula Excel uses to make its “curved line” scatter plots? I have!

Admittedly, this sounds like an odd thing to wonder about. But it’s actually a super-practical question: what if you want to approximate an intermediate point along the curve you have just plotted? For some reason, Excel hides this formula from you, and according to my source post, Microsoft has left this formula a black-box and given users no access to it. WTF, mate.

An intrepid engineer named Brian Murphy at XLRotor seems to have “cracked the code” – he has a worksheet that can reproduce Excel’s curved line with a custom formula. It seems Excel uses a spline (as one might expect), but there are many different kinds of splines and he has found the right one. As I confirmed today, the kind of spline it uses is not a simple cubic spline like the one described in the Wikipedia spline article; Excel uses a (third-order) Bezier Spline, which is commonly used in computer graphics (like PostScript).

You can download Brian’s Bezier example file (deep link) and play with it yourself to convince yourself he has the right solution. Unfortunately, his workbook is an Excel 2003 sheet with some macros. The way I am dealing with this is putting my interpolations into his example sheet, and then pasting them back out as I need them.

If you want to do this as well, you need to understand his argument structure, which I think could use a bit more documentation. For those interested in using his example file, the syntax is:

FEvaluate_Bezier(x-range,y-range,i1,i2,i3,i4,delta, chart_name)

Where the x-range and y-ranges are the data points, i1-i4 are integers representing the indices of the x and y arrays to use as the points to interpolate between. You need four points instead of two in order to get the curviness right – this is also how you know it’s a cubic spline. For the interpolations at the edge of the curve, just use the first or last point twice.

Delta is a dimensionless number representing the increment along the path from {x[i2],y[i2]} and {x[i3],y[i3]}. When delta=0.0, the function gives {x[i2],y[i2]}, and delta=1.0 gives {x[i3],y[i3]}, so you can sample the intermediate curve as densely as you care to. Pretty nifty!

I have no idea why you need the name of the chart, so I just left the example chart in there.

One potential confusion – since the interpolation can return a fully parametric path, not just a function, the formula returns an array (as opposed to, say, just a y-value). If you’re not familiar with array formulas in Excel, the up-shot of this is that you should highlight two adjacent cells before you type in the formula, and then press Ctrl+Enter instead of just Enter to get both the x and y values to appear. Happy interpolating!

Published by

Jon Bittner

Splitwise helps you and your friends keep track of shared expenses, so that bills (and friends) get paid on time.

17 thoughts on “Mystery solved! The secret of Excel curved line interpolation”

    1. Sorry for the wicked slow reply to your comment Emiel – awesome! I had a look at your article, and so glad I could help out by documenting just a bit more what Brian Murphy had done.

  1. Using Excell 2007 (Windows 7)
    I can put my own data into this example it works fine with manually inserting my numbers into the base four point data array.
    When I try to add this macro code into my own worksheet I cannot get it to work – is the full macro function hidden? Keeps saying it cannot find the variable:
    Set cht = ActiveSheet.DrawingObjects(chtname).Chart
    Am I missing something?

    1. Further to above I am not quite explaining the problem. I can change/use the existing example as much as I like. If I try to copy/use the same spread sheet formula array anywhere elsewhere the:
      Set cht = ActiveSheet.DrawingObjects(chtname).Chart code fails every time with VALUE.
      I cannot find where it is trying to reference for the failure. The real chart name I have is correct. Any advice welcome ???

      1. Hi Chris,
        I struggled with this too, and had much more success by just building my whole sheet around the example and modifying it carefully, undoing if I hit any VALUE errors. I know this is weird, but Excel macros are very brittle and are totally hell to debug. I’m sorry I can’t offer more specific advice – you might want to try contacting Brian at XLRotor? (see above)

      2. Thanks for the reply! – After much pain I have already found that I could do the same thing that you suggest by building my work into the “example” but my main spreadsheet is very complex and I still have not got all of it into the “example” sheet.
        What I have done so far appears to work along the llines you have found but It does not like array references achieved from cells in other sheets in the same document e.g. array in Sheet1 A1 getting data from Sheet2!A1 (more pain).
        The macro code for the “FEvaluate function” does not display anywhere that I can find directly if I ask to view it from the “example”. I found the macro code (accidentally) and copied it into my own sheet VBA but it does not work there (original problem) but only in the example itself!

  2. Further to above . . . Viewing via the Excel “VBA Project Explorer”, If I take (copy the code from) all three function code modules from the example and add then to my own project and keep my own code in a separate module I can “escape” from being tied to the example as I was before. Previously I had tried to copy ALL the example module function code into my own module. Sorry – not very good at explaining this but I have mine working now completely independent of the example.

  3. Hi,
    I am trying to use this for a semi-logarithmic X-Y plot and its not acurately predicting the spline in the semi-log space. Any quick fixes i can apply?

Leave a reply to Emiel Nijhuis Cancel reply