• Posts
  • RSS
  • ◂◂RSS
  • Contact

  • Lilacs Blooming

    October 5th, 2011
    prediction  [html]
    Around 1848, Adolph Quetlet decided to try to predict when lilacs would bloom. [1] He found that the lilacs waited to bloom until it had been a while since the last frost. Specifically, when:

         sum(t^2 for t in mean_daily_temperatures(
           days since last frost)) > 4264C^2
    

    This is actually pretty strange: how does it make sense to be squaring temperatures? [2] We are talking about predicting frost, however, and it happens that 0C is set to the temperature of frost. So we could rewrite this as:

       sqrt(sum((t-t_freezing)^2
                for t in mean_daily_temperatures(
                  days since last frost))) > 65.3C
    

    This now makes sense, at least in terms of units. It also looks a lot like the common mean squared error loss function (though without 'sum' instead of 'mean'), and there might be an interpretation in which we see the lilacs as trying predicting "it's going to freeze" and then deciding "it's not going to freeze" when their total error since their last successful prediction crosses a threshold. This might be the same thing as the Sum of Squared Errors of Prediction?

    [1] This comes from Seeing Like A State p313 (James Scott, 1998), which sources it to Ian Hacking's, "The Taming of Chance", p 62 (1990). Scott claims that "the calculations must begin with an unpredictable event: the 'last frost'. Since the date of the last frost can be known only in retrospect, Quetelet's formula fails as a useful guide to action." This is probably a misinterpretation of either Quetelet or Hacking: instead of "the last frost of the season" the formula probably takes as an input "the most recent frost".

    Scott actually has the right hand side of the inequality as "(4264C) squared", and Hacking has it as "(4264C)^2", but this is clearly impossible as it would require lilacs to wait through 40 years of 35C days before blooming. While 4264 square centigrades just requires 10 days of 20C. This errant squared might be supposed to refer only to the C, however, in which case it would make sense.

    Looking now, this is also in the 1918 Publications of the American Statistical Association, Volume 15, p389 and the 1850 Edinburgh review, Volume 92, p39. Here they have as the right side of the inequality just "4264 centigrade", not noting that the units are square at all.

    [2] The particular choice of units matters; squaring acts very differently on temperatures measured in Kelvin or Farenheight because of their different zero points. Adding squared temperatures especially doesn't make sense. Imagine I chose a temperature scale (J) with 0J = 100C. Then if you add squared temperatures your total is lower for a series of 20C (-80J) days than 10C (-90J) days which would not work with this formula. (edit: was 80J and 90J)

    Comment via: google plus, facebook

    Recent posts on blogs I like:

    More on the Deutschlandtakt

    The Deutschlandtakt plans are out now. They cover investment through 2040, but even beforehand, there’s a plan for something like a national integrated timetable by 2030, with trains connecting the major cities every 30 minutes rather than hourly. But the…

    via Pedestrian Observations July 1, 2020

    How do cars fare in crash tests they're not specifically optimized for?

    Any time you have a benchmark that gets taken seriously, some people will start gaming the benchmark. Some famous examples in computing are the CPU benchmark specfp and video game benchmarks. With specfp, Sun managed to increase its score on 179.art (a su…

    via Posts on Dan Luu June 30, 2020

    Quick note on the name of this blog

    When I was 21 a friend introduced me to a volume of poems by the 14th-century Persian poet Hafiz, translated by Daniel Ladinsky. I loved them, and eventually named this blog for one of my favorite ones. At some point I read more and found that Ladinsky’s …

    via The whole sky June 21, 2020

    more     (via openring)


  • Posts
  • RSS
  • ◂◂RSS
  • Contact