Related papers: A Comparison of Statistical and Machine Learning Algorithms for Predicting Rents in the San Francisco Bay Area

A Comparison of Statistical and Machine Learning Algorithms for Predicting Rents in the San Francisco Bay Area

URL: http://arxiv.org/abs/2011.14924v1
Date: Thu, 26 Nov 2020 08:50:45 GMT
Title: A Comparison of Statistical and Machine Learning Algorithms for Predicting Rents in the San Francisco Bay Area
Authors: Paul Waddell and Arezoo Besharati-Zadeh
Abstract summary: We present a use case in which predictive accuracy is of primary importance, and compare the use of random forest regression to multiple regression. We find that we are able to obtain useful predictions from both models using almost exclusively local accessibility variables.
Score: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Urban transportation and land use models have used theory and statistical modeling methods to develop model systems that are useful in planning applications. Machine learning methods have been considered too 'black box', lacking interpretability, and their use has been limited within the land use and transportation modeling literature. We present a use case in which predictive accuracy is of primary importance, and compare the use of random forest regression to multiple regression using ordinary least squares, to predict rents per square foot in the San Francisco Bay Area using a large volume of rental listings scraped from the Craigslist website. We find that we are able to obtain useful predictions from both models using almost exclusively local accessibility variables, though the predictive accuracy of the random forest model is substantially higher.

Related papers

Adaptive Location Hierarchy Learning for Long-Tailed Mobility Prediction [37.930452438916795]
We propose a plug-and-play framework for long-tailed mobility prediction in an exploitation and exploration manner.<n>First, we construct city-tailored location hierarchy based on Large Language Models (LLMs) by exploiting Maslow's theory of human motivation.<n>Experiments on state-of-the-art models across six datasets demonstrate the framework's consistent effectiveness and generalizability.
arXiv Detail & Related papers (2025-05-26T13:26:35Z)
Uncertainty-aware Bayesian machine learning modelling of land cover classification [0.0]
We propose a Bayesian classification framework using generative modelling to take account of input measurement uncertainty. We benchmark the performance of the model against more popular classification models used in land cover maps such as random forests and neural networks.
arXiv Detail & Related papers (2025-03-27T13:59:19Z)
Radio Map Prediction from Aerial Images and Application to Coverage Optimization [46.870065000932016]
We focus on predicting path loss radio maps using convolutional neural networks. We show that state-of-the-art models developed for existing radio map datasets can be effectively adapted to this task. We introduce a new model that slightly exceeds the performance of the present state-of-the-art with reduced complexity.
arXiv Detail & Related papers (2024-10-07T09:19:20Z)
OPUS: Occupancy Prediction Using a Sparse Set [64.60854562502523]
We present a framework to simultaneously predict occupied locations and classes using a set of learnable queries. OPUS incorporates a suite of non-trivial strategies to enhance model performance. Our lightest model achieves superior RayIoU on the Occ3D-nuScenes dataset at near 2x FPS, while our heaviest model surpasses previous best results by 6.1 RayIoU.
arXiv Detail & Related papers (2024-09-14T07:44:22Z)
BayesBlend: Easy Model Blending using Pseudo-Bayesian Model Averaging, Stacking and Hierarchical Stacking in Python [0.0]
We introduce the BayesBlend Python package to estimate weights and blend multiple (Bayesian) models' predictive distributions. BayesBlend implements pseudo-Bayesian model averaging, stacking and, uniquely, hierarchical Bayesian stacking to estimate model weights. We demonstrate the usage of BayesBlend with examples of insurance loss modeling.
arXiv Detail & Related papers (2024-04-30T19:15:33Z)
Towards Generalizable and Interpretable Motion Prediction: A Deep Variational Bayes Approach [54.429396802848224]
This paper proposes an interpretable generative model for motion prediction with robust generalizability to out-of-distribution cases. For interpretability, the model achieves the target-driven motion prediction by estimating the spatial distribution of long-term destinations. Experiments on motion prediction datasets validate that the fitted model can be interpretable and generalizable.
arXiv Detail & Related papers (2024-03-10T04:16:04Z)
Predictive Churn with the Set of Good Models [64.05949860750235]
We study the effect of conflicting predictions over the set of near-optimal machine learning models. We present theoretical results on the expected churn between models within the Rashomon set. We show how our approach can be used to better anticipate, reduce, and avoid churn in consumer-facing applications.
arXiv Detail & Related papers (2024-02-12T16:15:25Z)
A step towards the integration of machine learning and small area estimation [0.0]
We propose a predictor supported by machine learning algorithms which can be used to predict any population or subpopulation characteristics. We study only small departures from the assumed model, to show that our proposal is a good alternative in this case as well. What is more, we propose the method of the accuracy estimation of machine learning predictors, giving the possibility of the accuracy comparison with classic methods.
arXiv Detail & Related papers (2024-02-12T09:43:17Z)
Improving Heterogeneous Model Reuse by Density Estimation [105.97036205113258]
This paper studies multiparty learning, aiming to learn a model using the private data of different participants. Model reuse is a promising solution for multiparty learning, assuming that a local model has been trained for each party.
arXiv Detail & Related papers (2023-05-23T09:46:54Z)
LOPR: Latent Occupancy PRediction using Generative Models [49.15687400958916]
LiDAR generated occupancy grid maps (L-OGMs) offer a robust bird's eye-view scene representation. We propose a framework that decouples occupancy prediction into: representation learning and prediction within the learned latent space.
arXiv Detail & Related papers (2022-10-03T22:04:00Z)
Heterogeneous Ensemble Learning for Enhanced Crash Forecasts -- A Frequentest and Machine Learning based Stacking Framework [0.803552105641624]
In this study, we apply one of the key HEM methods, Stacking, to model crash frequency on five lane undivided segments (5T) of urban and suburban arterials. The prediction performance of Stacking is compared with parametric statistical models (Poisson and negative binomial) and three state of the art machine learning techniques (Decision tree, random forest, and gradient boosting)
arXiv Detail & Related papers (2022-07-21T19:15:53Z)
ALT-MAS: A Data-Efficient Framework for Active Testing of Machine Learning Algorithms [58.684954492439424]
We propose a novel framework to efficiently test a machine learning model using only a small amount of labeled test data. The idea is to estimate the metrics of interest for a model-under-test using Bayesian neural network (BNN)
arXiv Detail & Related papers (2021-04-11T12:14:04Z)
On Statistical Efficiency in Learning [37.08000833961712]
We address the challenge of model selection to strike a balance between model fitting and model complexity. We propose an online algorithm that sequentially expands the model complexity to enhance selection stability and reduce cost. Experimental studies show that the proposed method has desirable predictive power and significantly less computational cost than some popular methods.
arXiv Detail & Related papers (2020-12-24T16:08:29Z)
BusTime: Which is the Right Prediction Model for My Bus Arrival Time? [3.1761486589684975]
This paper tries to fill this gap by proposing a general and practical evaluation framework for analysing various widely used prediction models. In particular, this framework contains a raw bus GPS data pre-processing method that needs much less number of input data points. We also present preliminary results for city managers by analysing the practical strengths and weaknesses in both training and predicting stages of commonly used prediction models.
arXiv Detail & Related papers (2020-03-20T17:03:36Z)

This list is automatically generated from the titles and abstracts of the papers in this site.