• Posts
  • RSS
  • ◂◂RSS
  • Contact

  • Rent Map Data Sources

    October 29th, 2016
    housing, map  [html]
    I just finished updating my rent map to handle the recent Padmapper UI refresh, and someone asked how Padmapper not including Craigslist listings affected the map. This confused me: I had thought Padmapper got its data by buying it from 3Taps who scraped Google's cache who crawled Craigslist. But it turns out that Padmapper and 3Taps settled the lawsuit, and Padmapper has only gotten its listings from other sources since then.

    One issue, though, is it could be that the cheapest apartments are listed only on Craigslist [1] and not on the other services that Padmapper pulls from. To get a rough check this, I took ten random listings from the Boston Craigslist page, and tried to figure out which Padmapper listing it went with.

    Predictions summary:
    Listing Estimate Error In Padmapper
    $1700 $2605 +53% no
    $1700 $2065 +21% no
    $1495 $1775 +19% no
    $3000 $3155 +5% yes
    $3000 $2995 -0% no
    $1575 $1590 -1% yes
    $2900 $2800 -3% yes
    $6000 $3650 -64% no (dubious)
    While this is a small sample, it looks like the predictions are pretty good for the ones in Padmapper (which is what you would expect) and consistently too high (0%, 19%, 21%, 53%, avg=23%) for the ones not in Padmapper.

    Fixing this is pretty tricky. I could do a larger sample to try to get a better sense of what the error is, and then adust my map down by the combination of how much lower the non-Padmapper apartments are and what fraction aren't in padmapper. In this case, ignoring the dubious listing, 4 of 9 weren't on padmapper, with an average error of 23%, that would mean adjusting all my estimates down by 10%. On the other hand, as people's listing behavior changes this could get obsolete pretty quickly, and it's a pain to calculate the first time let alone on an ongoing basis. Ideas?


    [1] Or, worse for my map, listed only with signs in windows or something else not available online.

    Comment via: google plus, facebook

    Recent posts on blogs I like:

    Austerity is Inefficient

    Working on an emergency timetable for regional rail has made it clear how an environment of austerity requires tradeoffs that reduce efficiency. I already talked about how the Swiss electronics before concrete slogan is not about not spending money but ab…

    via Pedestrian Observations February 27, 2021

    Fireside Friday, February 26, 2021

    Fireside this week, but next week we are diving into our long awaited series on pre-modern textile production, though we will be particularly focused on the most important clothing fibers in the Mediterranean world, wool and linen (rather than, say, silk …

    via A Collection of Unmitigated Pedantry February 26, 2021

    The Troubling Ethics of Writing (A Speech from Ancient Sumer)

    (Translated from a transcript of an ancient Sumerian speech by Uruk's most well-respected Scriptological Ethicist) Writing is a profoundly dangerous technology: Access to writing was initially, and still remains, uneven. What's worse, the rich are m…

    via BLOG - Cullen O'Keefe February 15, 2021

    more     (via openring)


  • Posts
  • RSS
  • ◂◂RSS
  • Contact