Rent Map Data Sources

October 29th, 2016
housing, map
I just finished updating my rent map to handle the recent Padmapper UI refresh, and someone asked how Padmapper not including Craigslist listings affected the map. This confused me: I had thought Padmapper got its data by buying it from 3Taps who scraped Google's cache who crawled Craigslist. But it turns out that Padmapper and 3Taps settled the lawsuit, and Padmapper has only gotten its listings from other sources since then.

One issue, though, is it could be that the cheapest apartments are listed only on Craigslist [1] and not on the other services that Padmapper pulls from. To get a rough check this, I took ten random listings from the Boston Craigslist page, and tried to figure out which Padmapper listing it went with.

Predictions summary:
Listing Estimate Error In Padmapper
$1700 $2605 +53% no
$1700 $2065 +21% no
$1495 $1775 +19% no
$3000 $3155 +5% yes
$3000 $2995 -0% no
$1575 $1590 -1% yes
$2900 $2800 -3% yes
$6000 $3650 -64% no (dubious)
While this is a small sample, it looks like the predictions are pretty good for the ones in Padmapper (which is what you would expect) and consistently too high (0%, 19%, 21%, 53%, avg=23%) for the ones not in Padmapper.

Fixing this is pretty tricky. I could do a larger sample to try to get a better sense of what the error is, and then adust my map down by the combination of how much lower the non-Padmapper apartments are and what fraction aren't in padmapper. In this case, ignoring the dubious listing, 4 of 9 weren't on padmapper, with an average error of 23%, that would mean adjusting all my estimates down by 10%. On the other hand, as people's listing behavior changes this could get obsolete pretty quickly, and it's a pain to calculate the first time let alone on an ongoing basis. Ideas?

[1] Or, worse for my map, listed only with signs in windows or something else not available online.

Comment via: google plus, facebook

Recent posts on blogs I like:

Ideological Abuse, Busyness, and the Importance of Rest

ONE WEIRD TRICK to protect yourself against cults

via Thing of Things June 18, 2024

A discussion of discussions on AI bias

There've been regular viral stories about ML/AI bias with LLMs and generative AI for the past couple years. One thing I find interesting about discussions of bias is how different the reaction is in the LLM and generative AI case when compared to "…

via Posts on June 16, 2024

Conversations I often have about parenting

Topics that come up a lot The post Conversations I often have about parenting appeared first on Otherwise.

via Otherwise June 4, 2024

more     (via openring)