MugRep: A Multi-Task Hierarchical Graph Representation Learning Framework for Real Estate Appraisal
Abstract.
Real estate appraisal refers to the process of developing an unbiased opinion for real property’s market value, which plays a vital role in decision-making for various players in the marketplace (e.g., real estate agents, appraisers, lenders, and buyers). However, it is a non-trivial task for accurate real estate appraisal because of three major challenges: (1) The complicated influencing factors for property value; (2) The asynchronously spatiotemporal dependencies among real estate transactions; (3) The diversified correlations between residential communities. To this end, we propose a Multi-Task Hierarchical Graph Representation Learning (MugRep) framework for accurate real estate appraisal. Specifically, by acquiring and integrating multi-source urban data, we first construct a rich feature set to comprehensively profile the real estate from multiple perspectives (e.g., geographical distribution, human mobility distribution, and resident demographics distribution). Then, an evolving real estate transaction graph and a corresponding event graph convolution module are proposed to incorporate asynchronously spatiotemporal dependencies among real estate transactions. Moreover, to further incorporate valuable knowledge from the view of residential communities, we devise a hierarchical heterogeneous community graph convolution module to capture diversified correlations between residential communities. Finally, an urban district partitioned multi-task learning module is introduced to generate differently distributed value opinions for real estate. Extensive experiments on two real-world datasets demonstrate the effectiveness of MugRep and its components and features.
† The research was done when the first author was an intern in Baidu Research under the supervision of the second author.
1. Introduction
Real estate appraisal provides an opinion of real property’s market value, which is the probable sales price it would bring in an open and competitive real estate market. Real estate appraisal is required and implemented by various players in the marketplace, such as real estate agents, appraisers, property developers, investors, lenders, and buyers (Pagourtzi et al., 2003). An accurate appraisal of real estate is of great importance to help buyers or sellers for negotiation and closing, help mortgage owners for lending and investigation, and help governments for urban planning.
Prior studies on real estate appraisal can be categorized into two classes: (1) Empirical Appraisal Methods (EAMs), such as sales comparison approach (Mccluskey and Borst, 1997), cost approach (Guo et al., 2014), and income approach (Baum et al., 2017), either heavily depend on the accuracy, availability, and timeliness of sale transaction data, or require strong domain knowledge to perform (Pagourtzi et al., 2003); (2) Automated Valuation Methods (AVMs) estimate the market value of a property based on automatic machine learning techniques, such as linear regression (Şipoş et al., 2008; Ahn et al., 2012), support vector regression (Lin and Chen, 2011), boosted regression trees (Graczyk et al., 2010; Park and Bae, 2015) and artificial neural networks (Peterson and Flanagan, 2009; Poursaeed et al., 2018; Law et al., 2019). Compared with EAMs, AVMs are easy-to-use even for non-domain experts and are widely used in practice.
However, with the prevalence of mobile devices and the proliferation of ubiquitous computing techniques, existing AVMs can be significantly improved from the following three aspects. First, existing AVMs are mainly based on fundamental real-estate information, such as the apartment attributes, residential community features, and surrounding geographical facilities (e.g., distance to mall, number of transport stations), but overlook the influence of demographic characteristics (e.g., mobility patterns and demographics of community residents), which contains important clues for real estate appraisal. For instance, real estates in a residential community which is in close relationship (e.g., similar human mobility patterns or resident demographics) with other high-end residential communities tend to have a higher market value. Second, existing AVMs usually regard the real estate transactions as a pile of independent samples, but ignores spatiotemporal dependencies among real estate transactions. Indeed, the value of real estate significantly depends on its spatially proximal estates (Fu et al., 2014b). Incorporating such spatiotemporal dependencies in asynchronous real estate transactions can further improve appraisal effectiveness. Third, the value of a real estate highly depends on the corresponding residential community. Beyond statistic attributes, the diversified correlations among residential communities can also be exploited to achieve higher appraisal accuracy.
Along these lines, in this paper, we present a Multi-Task Hierarchical Graph Representation Learning (MugRep) framework for real estate appraisal. Our major contributions can be summarized as follows: (1) We construct a rich set of features from multi-source user data, which provides a comprehensive real estate profile from multiple social demography views, e.g., geographical population visits, human mobility, and resident demographics. (2) By regarding each real estate transaction as an individual event, we construct an evolving transaction event graph based on both place and period proximity of transaction events, and introduce graph neural networks (GNN) for asynchronously spatiotemporal event-level dependencies modeling to enhance appraisal effectiveness. To the best of our knowledge, this is the first attempt to employ GNN techniques to improve real estate appraisal. (3) We further propose a hierarchical heterogeneous community graph convolution module to capture the diversified community-level correlations. Specifically, we devise a dynamic intra-community graph convolution block to obtain time-dependent community representation, and design a heterogeneous inter-community graph convolution block to propagate valuable knowledge from the correlated residential communities. (4) We conduct extensive experiments on two real-world datasets, the results demonstrate the effectiveness of our framework and its components and features.
Description | Beijing | Chengdu | |
---|---|---|---|
Real Estate Marketing Data | # of transactions | 185,151 | 134,781 |
# of communities | 6,267 | 3,995 | |
Geographical Data | # of POIs | 1,315,353 | 1,116,989 |
|
44,513 | 30,564 | |
Check-in Data | # of check-ins | 4,078,723,999 | 5,130,847,589 |
User Trip Data | # of trip queries | 73,307,426 | 66,960,348 |
User Profile Data | # of users | 2,042,718 | 1,426,860 |






2. Data description and analysis
In this section, we present the datasets to be used in our framework, with a preliminary data analysis. We use two datasets, i.e., Beijing and Chengdu, which represent two major metropolises in China. Except basic Real Estate Marketing Data, we exploit four additional multi-source urban datasets, i.e., Geographical Data, Check-in Data, User Trip Data, and User Profile Data, to improve the accuracy of real estate appraisal. Table 1 summarizes the statistics of the datasets.
2.1. Real Estate Marketing Data
In this paper, the real estate marketing datasets of Beijing and Chengdu are collected from a major commercial real estate agency111https://bj.lianjia.com. The scope of Beijing ranges from January 2018 to December 2019, and that of Chengdu ranges from January 2018 to December 2020. Each dataset consists of two kinds of data, i.e., real estate transaction data and residential community data.
We take Beijing as an example. Beijing totally contains 185,151 real estate transaction records and 6,267 residential communities. Each transaction record contains the attributes of the transaction estate, e.g., transaction date, transaction price, location, room number, area, transaction ownership, whether it is free of sales tax, the residential community real estate belongs to. Figure 1(a) shows the distribution of real estate unit price in Beijing. Figure 1(b) and Figure 1(c) show the fluctuation of unit price and transaction volume over time. Figure 1(d) and Figure 1(e) show the spatial distributions of unit price and transaction volume. Overall, the variation of real estate unit price in spatial domain is greater than that in temporal domain, whereas the transaction volume is distributed more evenly across the city with a notable periodical pattern. For each residential community, our datasets contain its developer, completion year, number of estates, property fee, etc. Figure 1(f) shows the spatial distribution of residential communities in Beijing, which is positively correlated with spatial distribution of transaction volume.
2.2. Geographical Data
2.3. Check-in Data
2.4. User Trip Data
The User Trip Data are collected from Baidu Maps, which records the mobility pattern of a city. The user trip data includes the origin (geographical location) and destination of a trip, user’s travel mode (e.g., drive, taxi, bus, cycle, walk), and the type (e.g., enterprise, shopping places) of trip destination. Overall, there are 73,307,426 and 66,960,348 trip records in Beijing and Chengdu, respectively.
2.5. User Profile Data
The User Profile Data contain user profile attributes (e.g., gender, age, income level, education level), which is collected from multiple Baidu applications including Baidu Search, Baidu App and Baidu Maps. There are 2,042,718 and 1,426,860 distinct user records in Beijing and Chengdu, respectively. Each record contains a user’s demographic attributes including hometown, gender, age, and social attributes such as the industry, income level, educational level, consumption level, and whether the user is a car owner. All user profile records are anonymous and cannot be associated with sensitive personal information such as names and phone numbers.
3. Preliminaries
We first introduce some important definitions and formally define the real estate appraisal problem.
Definition 1.
Subject Property. The subject property is the real estate that will be appraised.
Definition 2.
Real Estate Transaction Event. Consider a set of real estate transactions , a real estate transaction event is defined as the -th chronological real estate transaction in the dataset. Specifically, is the location of , is transaction date, is the feature input associated with the real estate in , and is the transaction unit price of .
Problem 1.
Real Estate Appraisal. Given the subject property , our task is to estimate the transaction unit price of .
4. Framework
Then we introduce our framework in detail, including the processes of feature construction, the event-level and community-level representation learning, and multi-task learning based valuation.
4.1. Overview
Figure 2 shows the framework overview of MugRep, which consists of five major components: Data Warehouse, Feature Construction, Graph Construction, Representation Learning, and Multi-Task Learning. The Data Warehouse stores Real Estate Marketing Data and four additional multi-source urban data. The Feature Construction module generates seven groups of features based on the corresponding datasets in Data Warehouse. Besides, the Graph Construction module builds evolving transaction event graph based on the place and period proximity of transaction events, and constructs hierarchical heterogeneous community graph based on intra-community real estate transactions and inter-community similarities. In particular, the community similarities are quantified by additional diversified urban feature groups. After that, the Representation Learning module is introduced to derive the event-level and community-level representation of subject property, of which the integration is processed in a Multi-Task Learning module, where each task corresponds to an urban district. The corresponding task block produces the final result of Real Estate Appraisal.
4.2. Feature Construction
In this subsection, we present the process of constructing various features based on the aforementioned multi-source urban datasets. The details of used features are listed in Table 3 of Appendix A.
4.2.1. Real Estate Profile Features.
The real estate’s profiles are the most fundamental factors for the real estate appraisal. We extract real estate profile features from real estate transaction data, including estate’s number of rooms, area, decoration, orientation, structure, free of tax, transaction ownership, etc. The features of estate also include some profiles of the building where it is located, such as floor number, building type, elevator household ratio.
4.2.2. Residential Community Profile Features.
The residential community where the estate is located is another crucial factor that determines the value of estate. The residential community profile features include its developer, completion year, number of buildings and estates, property fee and district. All the above features are extracted from residential community data. In addition, we identify each community by a unique identifier.
4.2.3. Temporal Features.
The temporal features include the valuation date of estate and the price distribution of historical transactions of the same residential community. Historical estate transaction prices in the same community are important reference factors for subject property appraisal, because these estates usually have many similar attributes (e.g., community attributes, geographical attributes), therefore, have similar price distribution. We quantify the distribution of historical prices by some statistic features (e.g., mean, variance, maximum, minimum) for the unit prices of estate transactions that have been closed in previous quarter of the valuation date in the same residential community.

4.2.4. Geographical Features.
The geographical features are of much importance when appraising a real estate as well. The estate in an area that has complete facilities, which provide more convenience and enjoyment for living, usually has a higher price. Except counting the number of all POIs and transport stations close to the estate to reflect the completeness of surrounding facilities, we further consider several kinds of important geographical factors, including transportation, education, medical treatment, shopping, living, entertainment, and unpleasantness. We count the number of facilities and places that correspond to above factors nearby the estate, and calculate the distance from the estate to the nearest ones. The spatial distribution of facilities number feature is shown in Figure 3(a), where we can observe a positive correlation between this feature and real estate unit price. More details of geographical features can be found in Table 3.
4.2.5. Population Visit Features.
The geographical visitation volume can reflect the popularity and prosperity of an area, which has a significant impact on real estate prices. We first aggregate the check-in records of each user by every 10 minutes as one visit of a location. Then we construct population visit features by accumulating the visiting frequency of population nearby the estate in work hours (10:00-18:00), break hours (18:00-23:00), and all day on workdays and weekends, respectively. By comparing Figure 1(d) and Figure 3(b), we observe the areas with high visiting frequency of population are usually of high unit prices of real estates.
4.2.6. Mobility Features.
Human mobility is also an important factor to estimate the real estate prices of a residential community. For example, if residents of a community frequently visit high-consumption or high-income places, then we can infer higher real estate prices for the residential community. Thus, we construct abundant mobility features based on User Trip Data to reflect mobility patterns of community residents, including the human volume of inflow and outflow of community, the distribution of travel modes (e.g., driver, taxi, bus, cycle, walk) and the distribution of travel destination types (e.g., enterprise, administration, shopping places, entertainment venues) of community residents on workdays and weekends, respectively.
4.2.7. Resident Population Profile Features.
The resident population profile also has strong correlation with real estate prices of the residential community. For example, the residential communities where high-income and high-consumption residents live indicates the brownstone districts, and are supposed to have high real estate prices. To this end, we construct valuable resident population profile features of the community based on User Profile Data to comprehensively profile residents’ demographic attributes and social attributes. These features include the resident population number, the distributions of residents’ hometown, life stage, industry, income level, education level, consumption level, etc. The details of these features are listed in Table 3. We depict the spatial distribution of community residents’ income in Figure 3(c), which further confirms that the communities with high-income residents usually correspond to high real estate prices.



4.3. Event-level Representation Learning
The price of real estate transactions in proximal places and periods have strong dependencies. For example, for a place with a planned subway station, the real estate prices of surrounding areas usually increase synchronously. The transaction price dependencies can also be validated by Figure 1(d), i.e., the nearby real estate prices tend to be similar. However, these transactions are dispersedly distributed in spatial and temporal domains, which induces spatiotemporal asynchronism. Such asynchronously spatiotemporal dependencies among real estate transactions also distinguish our task from existing works on predicting regional future house prices (Tan et al., 2017; Ge et al., 2019), where the input are more regular time series data. It also prevents us to adopt existing spatiotemporal prediction approaches (Li et al., 2018) for our task.
To tackle above problem, we first formulate each real estate transaction as a transaction event, which is defined in Section 3. Then, we model the continuously emerging transaction events as an evolving graph , where is a set of real estate transaction events, is a set of edges indicating connectivity among transaction events, and denotes the proximity matrix of . Specifically, we define connectivity constraint as
(1) |
where is the geographical distance between the locations of events, and are physical distance threshold and time interval threshold, respectively. To balance the connected events from different communities, we restrict a node at most to connect last events from each community. With real estate transaction events occurring over time, the graph evolves accordingly.
Inspired by recent success of GNN (Kipf and Welling, 2017; Velickovic et al., 2018) on processing non-Euclidean graph structures and its inductive ability to effectively generate representation for previous unseen node (Hamilton et al., 2017), we adopt graph attention networks (Velickovic et al., 2018), an effective variant of GNN, to capture event-level dependencies among real estate transactions on the evolving graph.
Specifically, to quantify the sophisticated influence of historical transaction events to subject property , we introduce the attention mechanism (Zhang et al., 2021) to automatically learn the coefficient between transaction events,
(2) |
where and are learnable parameters, denotes the concatenation operation. The proximity weight between event and is further defined as
(3) |
where is a set of adjacent events of in .
Once is obtained, we derive the representation that integrates nearby previous transaction events by aggregating and transforming the adjacent events of subject property, defined as
(4) |
where indicates the -th graph convolution layer in evolving graph, are learnable parameters for -th graph convolution, , and is an indicator function that equals one if and zero otherwise. By stacking convolution layers, we can aggregate -hop adjacent events to mine spatio-temporally extensive knowledge from historical real estate transaction data for more accurate subject property valuation.
4.4. Community-level Representation Learning
As aforementioned, the real estate value is also highly correlated with the residential community it belongs to (Fu et al., 2015). Therefore, an expressive representation of community can be useful in real estate appraisal. In this work, we devise a hierarchical heterogeneous community graph convolution module, including the dynamic intra-community graph convolution block and the heterogeneous inter-community graph convolution block.
4.4.1. Dynamic Intra-Community Graph Convolution
The representation of a residential community should be updated dynamically once there is a new transaction event, defined as , happening within it. The transaction events happened in each community can make up an individual impact graph: , where consists of the community and transaction events happening in , is a set of edges constraining what transaction events have impacts on the representation of , and denotes the impact matrix of , indicating the impacts of previous transaction events to the community representation. The connectivity between community and transaction event is defined as
(5) |
where denotes the number of days from the last -th transaction event to the latest transaction event in . Note the transaction event nodes set and the edges set connecting between community and transaction events change dynamically as the new transaction events occur.
Then the attention mechanism is further used to quantify the impact of each previous transaction within ,
(6) |
where and are learnable parameters. Similar to Eq. (3), we can finally derive the impact weight .
Once is obtained, the representation of each community is updated by performing the graph convolution operation
(7) |
where are learnable parameters.
4.4.2. Heterogeneous Inter-Community Graph Convolution
The diversified correlations between residential communities are induced by their various similarities. For example, the residential communities located in similar functional areas with similar surrounding geographical facilities distribution usually tend to be positively correlated in real estate value. Thus, we define four similarity metrics between residential communities based on four types of features, i.e., geographical features, population visit features, mobility features, and resident population profile features.
Then, we construct the heterogeneous inter-community graph to model the diversified correlations between residential communities, which is defined as , where is a set of residential communities. We construct four types of edges based on four kinds of similarities between residential communities. Next, we will take geographical edges set as a representative for detailed explanation.
We define the geographical features of community as . Then, we can compute the euclidean distance between and
(8) |
Smaller euclidean distance of features indicates higher similarity between communities. Thus, the geographical edge is defined as
(9) |
where is the distance threshold. Other types of edge sets can be derived in the same way but with respective thresholds.
We define as the subject property belonging to community . With the latest representation of each community and the heterogeneous edges set , the coefficient between communities and is computed by
(10) |
where and are learnable parameters, is a one-hot vector to denote the type of edge. Once coefficients are derived, the proximity weight can be obtained similar to Eq. (3). Then, we derive the overall representation of residential communities by the graph convolution operation
(11) |
where indicates the -th graph convolution layer in inter-community graph, are learnable parameters for -th graph convolution, . By stacking convolution layers, can integrate -hop diversified correlations between communities.
Finally, we obtain the overall representation of subject property through concatenation and multi-layer perceptron operations
(12) |
4.5. Multi-Task Learning Based Valuation


In general, a city is partitioned into several urban districts for administration. Each urban district can have distinctive urban functionalities and leads to diversified real estate price distributions, as illustrated in Figure 4. Inspire by (Zhu et al., 2016), we partition the tasks via urban districts, and each urban district corresponds to a learning task. These tasks share most of parameters of the model except have a unique fully-connected output layer to generate appraisal of distinctive distribution. Therefore, the real estate located in each urban district is valuated by
(13) |
where denotes the unique fully-connected layer of -th task associated with -th urban district.
Overall, our model aims to minimize the Mean Square Error (MSE) loss between the estimated unit price and the ground truth transaction unit price of real estate,
(14) |
where denotes the real estate transactions set happening in -th urban district, denotes the cardinality of .
5. Experiments
5.1. Experimental setup
We evaluate the performance of MugRep on both Beijing and Chengdu datasets. We chronologically order each dataset based on the transaction date. For Beijing, we take the data of which the transaction date ranges from January 2018 to June 2019, as the training set, the following one month as validation set, and the rest of data for testing. For Chengdu, the training set is set from January 2018 to June 2020, the other setting are the same as Beijing. Our experiments mainly focus on (1) the overall performance comparison, (2) the ablation studies of model and features, (3) the feature importance analysis, (4) and the effectiveness check for spatially different communities. Furthermore, please refer to Appendix C for the prototype system of real estate appraisal.
5.1.1. Implementation details.
We take m, , and for event-level representation learning. We choose , set the distance thresholds (e.g., ) of several types of edge as the 0.001 quantile of all pair-wise euclidean distance values, and select for community-level representation learning. The dimensions of , and are fixed to 32. The hidden dimension of is fixed to 64. We employ Adam optimizer, set learning rate as 0.01 to train our model, and early stop if the loss does not decrease on validation data set over 30 epochs.
5.1.2. Evaluation metrics.
We adopt Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and Root Mean Square Error (RMSE), three widely used metrics (Wang et al., 2020) for evaluation. Note that the estimated unit price and ground truth transaction unit price are in 10,000 CNY.
5.1.3. Baselines.
We compare MugRep with one statistical baseline (HA), three classic machine learning based baselines (LR (Pedregosa et al., 2011), SVR (Pedregosa et al., 2011) and GBRT (Ke et al., 2017)), and two artificial neural networks (ANN) based baselines (DNN, PDVM (Bin et al., 2019)). The input features of all learning based methods are the same. The details of these baselines are introduced in Appendix B.
Dataset | Algorithm | MAE | MAPE | RMSE |
---|---|---|---|---|
Beijing | HA | 0.6313 | 11.33% | 1.1008 |
LR | 0.4776 | 8.94% | 0.7041 | |
SVR | 0.4427 | 8.10% | 0.6840 | |
GBRT | 0.3640 | 6.70% | 0.5515 | |
DNN | 0.3550 | 6.35% | 0.5505 | |
PDVM | 0.3469 | 6.17% | 0.5373 | |
MugRep | 0.3244 | 5.76% | 0.5097 | |
Chengdu | HA | 0.1456 | 10.65% | 0.2487 |
LR | 0.1515 | 9.73% | 0.2150 | |
SVR | 0.1339 | 8.57% | 0.1888 | |
GBRT | 0.1133 | 7.27% | 0.1708 | |
DNN | 0.1090 | 7.26% | 0.1651 | |
PDVM | 0.1051 | 6.95% | 0.1583 | |
MugRep | 0.0916 | 6.20% | 0.1404 |
5.2. Overall Performance
Table 2 reports overall results of our methods and all compared baselines on two datasets with respect to three metrics. Overall, our model achieves the best performance among all the baselines. Moreover, we observe all ANN based algorithms (DNN, PDVM, MugRep) outperform the statistical algorithm (HA) and machine learning based algorithms (LR, SVR, GBRT), which consistently verifies the advantages of applying ANN to real estate appraisal for its extraordinary non-linear processing ability. Particularly, MugRep achieves and improvements beyond the state-of-the-art baseline PDVM for (MAE, MAPE, RMSE) on Beijing and Chengdu, respectively. The results demonstrate the effectiveness of MugRep.
5.3. Ablation Study
In this section, we conduct ablation studies on MugRep, including model ablation and feature ablation, to further verify the effectiveness of each component and feature group. The experiments are finished for three metrics on both Beijing and Chengdu datasets.
5.3.1. Model Ablation.
We evaluate the performance of MugRep and it’s three variants, which are (1) noEvt removes the event-level representation learning module; (2) noCom removes the community-level representation learning module; (3) noMT removes the multi-task learning module. The ablation results are reported in Figure 5. As can be seen, removing any of the components leads to remarkable performance degradation. Among these components, we find the event-level and community-level representation modules are more important, especially the event-level representation module. This is because the nearby real estates have strong similarities and dependencies. The close historical real estate transactions can be a very valuable reference for subject property valuation. All the results demonstrate the effectiveness of MugRep and its each component.
5.3.2. Feature Ablation.
To examine the performance impact of feature groups that constructed based on four additional multi-source urban datasets, we evaluate MugRep with complete features and its five variants: (1) Complete is MugRep with complete features; (2) Basic excludes all features from four additional multi-source urban datasets and the community-level representation learning module; (3) noGeo excludes geographical features and geographical edge set in the heterogeneous inter-community graph; (4) noVis excludes population visit features and corresponding edge set ; (5) noMob excludes mobility features and corresponding edge set ; (6) noPop excludes resident population profile features and corresponding edge set . The experimental results are reported in Figure 6. There is a consistent performance degradation by excluding any of additional urban feature groups. If we exclude all these additional urban features (i.e., basic), MugRep will have significant and performance degradation for (MAE, MAPE, RMSE) on Beijing and Chengdu, which demonstrate the effectiveness of these urban features and community-level representation learning module. Besides, we observe noMob and noPop lead to notable performance degradation compared to MugRep with complete features. The observation verifies that considering the characteristics and correlations of community residents are very useful for real estate appraisal.






5.4. Feature Importance Analysis
To further evaluate the effect of our constructed features, we illustrate the top-30 most important features in Figure 7. The features are ranked by logarithmic information gain (Ke et al., 2017). As can be seen, the distribution of historical price are the most important features. The subsequent one is the district of residential community, which indicates the large difference between districts. Furthermore, we observe half of the top-30 features are from the four multi-source urban datasets, which demonstrates the effectiveness of these urban features. Among these urban features, the geographical features (Living, Entertainment, Shopping), mobility features (Travel destination) and resident population profile features (Income level, Consumption level, Industry) are ranked very high (in top-15). For the reasons that these geographical features are closely related to living quality, these mobility features are greatly relevant to travel preferences of community residents, and these resident population profile features are strongly associated with the wealth of community residents. The living quality, travel preferences, and wealth are three very important factors to reflect the real estate prices.

5.5. Effectiveness in Different Community
To evaluate the performance of MugRep on spatially diverse communities, we compute the separate MAPE for each residential community. Figure 8(a) and Figure 8(b) show the spatial distribution of MAPE and inverse transaction volume on Beijing. As can be seen, some communities with high MAPE (bright color) always have high inverse transaction volume (i.e., less transaction volume) nearby. This makes sense for two reasons: first, the historical price features of same community are important for subject property valuation, less transaction volume in the community means inaccurate or missing historical price features; second, the evolving transaction event graph convolution module of MugRep is highly correlated with nearby previous transactions, the absence of nearby transactions decreases the effectiveness of event-level representation. This result indicates further optimization can be applied to these residential communities with scarce transactions to improve the overall performance.
6. Related work
Real Estate Appraisal. Traditional real estate appraisal methodologies can be mainly categorized into three classes, i.e., sales comparison based approaches (Mccluskey and Borst, 1997), cost based approaches (Guo et al., 2014), and income based approaches (Baum et al., 2017). The sales comparison approach estimates the market value of real estate depending on some forms of comparison. The cost approach is based on the theory that the summation of land value and depreciated value of any improvements can be as the estimation for the real estate. And the income approach estimates the real estate market value based on its income. Hedonic price model (Rosen, 1974; Cheshire and Sheppard, 1995) is also commonly used in real estate appraisal. It assumes that the real estate can be viewed as an aggregation of individual attributes, which implicitly reflect the real estate market value. However, it does not consider attributes interactions and is incompetent in non-linear data (Limsombunchai, 2004). Besides, automated valuation methods (AVMs) have arisen researchers’ interests for they can automatically estimate the market value of an real estate based on its available attributes. Many AVMs such as linear regression (Şipoş et al., 2008; Ahn et al., 2012), support vector regression (Lin and Chen, 2011), boosted regression trees (Graczyk et al., 2010; Park and Bae, 2015) and artificial neural networks methods (Peterson and Flanagan, 2009; Peter et al., 2020; Poursaeed et al., 2018; You et al., 2017; Law et al., 2019) are widely applied into the real estate appraisal. Moreover, some works (Fu et al., 2014b, a; Fu et al., 2015) investigate how to rank real estates via various viewpoints, such as individual, peer and zone dependency, online user reviews and offline moving behaviors, and diverse mixed land use. Furthermore, there are few works that try to capture the peer-dependency among nearby estates. Fu et al. (2014b) use the generative likelihood of each edge to model peer-dependency, which does not adequately integrate the attributes knowledge of nearby estates. Works (Bin et al., 2019; You et al., 2017) sample fixed number of similar estates by selecting k-nearest similar estates or the random walk algorithm, and then feed these samples to recurrent neural networks. The sampling process could lead to the loss of valuable information. Overall, these prior studies all leave out community residents characteristics, and are incapable to fully model the spatiotemporal dependencies among real estate transactions. Besides, none of them attempt to capture the diversified correlations between residential communities.


Real Estate Forecasting. This work is also related to real estate forecasting tasks. For example, Tan et al. (2017) proposes a time-aware latent hierarchical model and Ge et al. (2019) proposes an integrated framework that improving the DenseNet to predict future house prices of regions. Zhu et al. (2016) proposes a multi-task linear regression model for real estate’s days-on-market prediction. Li et al. (2019) estimates the housing demand based on express delivery data.
Graph Neural Network. Graph neural networks (GNN) is designed to extend the well-known convolution neural network to non-Euclidean graph structures. GNN is usually used to obtain the expressive representation of each node by aggregating and transforming the representations of its neighbors in the graph (Kipf and Welling, 2017; Hamilton et al., 2017; Velickovic et al., 2018). Many previous studies have demonstrated the effectiveness of GNN in the graph-based representation learning tasks (Wu et al., 2020). Because of its effectiveness, GNN has been successfully applied to various fields, such as physics (Santoro et al., 2017), chemistry (Gilmer et al., 2017), biology (Fout et al., 2017), recommendation systems (Ying et al., 2018; Wang et al., 2019; Xin et al., 2021), and smart city systems (Zhang et al., 2020; Liu et al., 2020a; Han et al., 2021). However, to the best of our knowledge, this is the first work applying GNN to real estate appraisal.
7. Conclusion
In this paper, we presented MugRep, a multi-task hierarchical graph representation learning framework for real estate appraisal. We first constructed abundant valuable features with respect to the fundamental attributes and community residents characteristics to comprehensively profile the real estate value. Then we designed an evolving transaction event graph convolution module to model the asynchronously spatiotemporal dependencies among real estate transactions, and devised a hierarchical heterogeneous community graph convolution module to capture diversified correlations between residential communities. After that, an urban district partitioned multi-task learning module was introduced to perform the real estate appraisal of distinctive distribution. Extensive experiments on two real-world datasets demonstrated that MugRep achieves the best performance compared with six baselines.
Acknowledgements.
This research is supported in part by grants from the National Natural Science Foundation of China (Grant No.91746301, 71531001).References
- (1)
- Ahn et al. (2012) Jae Joon Ahn, Hyun Woo Byun, Kyong Joo Oh, and Tae Yoon Kim. 2012. Using ridge regression with genetic algorithm to enhance real estate appraisal forecasting. Expert Systems with Applications 39, 9 (2012), 8369–8379.
- Baum et al. (2017) Andrew Baum, David Mackmin, and Nick Nunnington. 2017. The income approach to property valuation. Routledge.
- Bin et al. (2019) Junchi Bin, Bryan Gardiner, Eric Li, and Zheng Liu. 2019. Peer-Dependence Valuation Model for Real Estate Appraisal. Data-Enabled Discovery and Applications 3, 1 (2019), 2.
- Cheshire and Sheppard (1995) Paul Cheshire and Stephen Sheppard. 1995. On the price of land and the value of amenities. Economica (1995), 247–267.
- Fout et al. (2017) Alex Fout, Jonathon Byrd, Basir Shariat, and Asa Ben-Hur. 2017. Protein interface prediction using graph convolutional networks. In Advances in Neural Information Processing Systems. 6533–6542.
- Fu et al. (2014a) Yanjie Fu, Yong Ge, Yu Zheng, Zijun Yao, Yanchi Liu, Hui Xiong, and Jing Yuan. 2014a. Sparse real estate ranking with online user reviews and offline moving behaviors. In IEEE International Conference on Data Mining. 120–129.
- Fu et al. (2015) Yanjie Fu, Guannan Liu, Spiros Papadimitriou, Hui Xiong, Yong Ge, Hengshu Zhu, and Chen Zhu. 2015. Real estate ranking via mixed land-use latent models. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 299–308.
- Fu et al. (2014b) Yanjie Fu, Hui Xiong, Yong Ge, Zijun Yao, Yu Zheng, and Zhi-Hua Zhou. 2014b. Exploiting geographic dependencies for real estate appraisal: A mutual perspective of ranking and clustering. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1047–1056.
- Ge et al. (2019) Chuancai Ge, Yang Wang, Xike Xie, Hengchang Liu, and Zhengyang Zhou. 2019. An integrated model for urban subregion house price forecasting: A multi-source data perspective. In IEEE International Conference on Data Mining. 1054–1059.
- Gilmer et al. (2017) Justin Gilmer, Samuel S Schoenholz, Patrick F Riley, Oriol Vinyals, and George E Dahl. 2017. Neural message passing for quantum chemistry. In International Conference on Machine Learning. 1263–1272.
- Graczyk et al. (2010) Magdalena Graczyk, Tadeusz Lasota, Bogdan Trawiński, and Krzysztof Trawiński. 2010. Comparison of bagging, boosting and stacking ensembles applied to real estate appraisal. In Asian Conference on Intelligent Information and Database Systems. 340–350.
- Guo et al. (2014) Jingjuan Guo, Shoubo Xu, and Zhuming Bi. 2014. An integrated cost-based approach for real estate appraisals. Information Technology and Management 15, 2 (2014), 131–139.
- Hamilton et al. (2017) Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. In Advances in Neural Information Processing Systems. 1024–1034.
- Han et al. (2021) Jindong Han, Hao Liu, Hengshu Zhu, Hui Xiong, and Dejing Dou. 2021. Joint Air Quality and Weather Prediction Based on Multi-Adversarial Spatiotemporal Networks. In Proceedings of the 35th AAAI Conference on Artificial Intelligence.
- Ke et al. (2017) Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, and Tie-Yan Liu. 2017. Lightgbm: A highly efficient gradient boosting decision tree. In Advances in Neural Information Processing Systems. 3146–3154.
- Kipf and Welling (2017) Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In International Conference on Learning Representations, ICLR.
- Law et al. (2019) Stephen Law, Brooks Paige, and Chris Russell. 2019. Take a look around: using street view and satellite images to estimate house prices. ACM Transactions on Intelligent Systems and Technology (TIST) 10, 5 (2019), 1–19.
- Li et al. (2019) Qingyang Li, Zhiwen Yu, Bin Guo, Huang Xu, and Xinjiang Lu. 2019. Housing Demand Estimation Based on Express Delivery Data. ACM Transactions on Knowledge Discovery from Data (TKDD) 13, 4 (2019), 1–25.
- Li et al. (2020) Shuangli Li, Jingbo Zhou, Tong Xu, Hao Liu, Xinjiang Lu, and Hui Xiong. 2020. Competitive Analysis for Points of Interest. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1265–1274.
- Li et al. (2018) Yaguang Li, Rose Yu, Cyrus Shahabi, and Yan Liu. 2018. Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting. In International Conference on Learning Representations, ICLR.
- Limsombunchai (2004) Visit Limsombunchai. 2004. House price prediction: hedonic price model vs. artificial neural network. In New Zealand Agricultural and Resource Economics Society Conference. 25–26.
- Lin and Chen (2011) Hongyu Lin and Kuentai Chen. 2011. Predicting price of Taiwan real estates by neural networks and support vector regression. In Proceedings of the 15th WSEAS International Conference on Systems. 220–225.
- Liu et al. (2020a) Hao Liu, Jindong Han, Yanjie Fu, Jingbo Zhou, Xinjiang Lu, and Hui Xiong. 2020a. Multi-modal transportation recommendation with unified route representation learning. Proceedings of the VLDB Endowment 14, 3 (2020), 342–350.
- Liu et al. (2020b) Hao Liu, Yongxin Tong, Jindong Han, Panpan Zhang, Xinjiang Lu, and Hui Xiong. 2020b. Incorporating Multi-Source Urban Data for Personalized and Context-Aware Multi-Modal Transportation Recommendation. IEEE Transactions on Knowledge and Data Engineering (2020).
- Mccluskey and Borst (1997) W. Mccluskey and R. Borst. 1997. An evaluation of MRA, comparable sales analysis, and artificial neural networks (ANNs) for the mass appraisal of residential properties in Northern Ireland. Assessment 4 (1997), 47–55.
- Pagourtzi et al. (2003) Elli Pagourtzi, Vassilis Assimakopoulos, Thomas Hatzichristos, and Nick French. 2003. Real estate appraisal: a review of valuation methods. Journal of Property Investment & Finance (2003).
- Park and Bae (2015) Byeonghwa Park and Jae Kwon Bae. 2015. Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data. Expert Systems with Applications 42, 6 (2015), 2928–2934.
- Pedregosa et al. (2011) Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, et al. 2011. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12 (2011), 2825–2830.
- Peter et al. (2020) Nkolika J Peter, Hilary I Okagbue, Emmanuela CM Obasi, and Adedotun O Akinola. 2020. Review on the Application of Artificial Neural Networks in Real Estate Valuation. International Journal 9, 3 (2020).
- Peterson and Flanagan (2009) Steven Peterson and Albert Flanagan. 2009. Neural network hedonic pricing models in mass real estate appraisal. Journal of Real Estate Research 31, 2 (2009), 147–164.
- Poursaeed et al. (2018) Omid Poursaeed, Tomáš Matera, and Serge Belongie. 2018. Vision-based real estate price estimation. Machine Vision and Applications 29, 4 (2018), 667–676.
- Rosen (1974) Sherwin Rosen. 1974. Hedonic prices and implicit markets: product differentiation in pure competition. Journal of Political Economy 82, 1 (1974), 34–55.
- Santoro et al. (2017) Adam Santoro, David Raposo, David GT Barrett, Mateusz Malinowski, Razvan Pascanu, Peter Battaglia, and Timothy Lillicrap. 2017. A simple neural network module for relational reasoning. In Advances in Neural Information Processing Systems. 4974–4983.
- Şipoş et al. (2008) Ciprian Şipoş, Eng Adrian Crivii, and MBA FRICS. 2008. A linear regression model for real estate appraisal. In WAVO Valuation Congress Valuation in Diversified and Emerging Economies. 17–18.
- Tan et al. (2017) Fei Tan, Chaoran Cheng, and Zhi Wei. 2017. Time-aware latent hierarchical model for predicting house prices. In IEEE International Conference on Data Mining. 1111–1116.
- Velickovic et al. (2018) Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, and Yoshua Bengio. 2018. Graph attention networks. In International Conference on Learning Representations, ICLR.
- Wang et al. (2019) Hongwei Wang, Fuzheng Zhang, Mengdi Zhang, Jure Leskovec, Miao Zhao, Wenjie Li, and Zhongyuan Wang. 2019. Knowledge-aware graph neural networks with label smoothness regularization for recommender systems. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 968–977.
- Wang et al. (2020) Jun Wang, Hefu Zhang, Qi Liu, Zhen Pan, and Hanqing Tao. 2020. Crowdfunding Dynamics Tracking: A Reinforcement Learning Approach. In Proceedings of the 34th AAAI Conference on Artificial Intelligence. 6210–6218.
- Wu et al. (2020) Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and S Yu Philip. 2020. A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems (2020).
- Xin et al. (2021) Haoran Xin, Xinjiang Lu, Tong Xu, Hao Liu, Jingjing Gu, Dejing Dou, and Hui Xiong. 2021. Out-of-Town Recommendation with Travel Intention Modeling. In Proceedings of the 35th AAAI Conference on Artificial Intelligence.
- Ying et al. (2018) Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L Hamilton, and Jure Leskovec. 2018. Graph convolutional neural networks for web-scale recommender systems. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 974–983.
- You et al. (2017) Quanzeng You, Ran Pang, Liangliang Cao, and Jiebo Luo. 2017. Image-based appraisal of real estate properties. IEEE Transactions on Multimedia 19, 12 (2017), 2751–2759.
- Zhang et al. (2020) Weijia Zhang, Hao Liu, Yanchi Liu, Jingbo Zhou, Tong Xu, and Hui Xiong. 2020. Semi-Supervised City-Wide Parking Availability Prediction via Hierarchical Recurrent Graph Neural Network. IEEE Transactions on Knowledge and Data Engineering (2020).
- Zhang et al. (2021) Weijia Zhang, Hao Liu, Fan Wang, Tong Xu, Haoran Xin, Dejing Dou, and Hui Xiong. 2021. Intelligent Electric Vehicle Charging Recommendation Based on Multi-Agent Reinforcement Learning. In The World Wide Web Conference.
- Zhu et al. (2020) Hengshu Zhu, Ying Sun, Wenjia Zhao, Fuzhen Zhuang, Baoshan Wang, and Hui Xiong. 2020. Rapid Learning of Earthquake Felt Area and Intensity Distribution with Real-time Search Engine Queries. Scientific Reports 10, 1 (2020), 1–9.
- Zhu et al. (2016) Hengshu Zhu, Hui Xiong, Fangshuang Tang, Qi Liu, Yong Ge, Enhong Chen, and Yanjie Fu. 2016. Days on market: Measuring liquidity in real estate markets. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 393–402.
Feature Type | Feature | Description | |
Real Estate Profile | Number of rooms | Number of rooms (e.g., bedroom, living-room, kitchen, bathroom) in this estate. | |
Area | Number of square meters of the estate. | ||
Decoration | Type of decoration (e.g., simply-decorated, well-decorated, not decorated). | ||
Orientation | Orientation of the estate (e.g., north, south, west, east). | ||
Estate structure | Structure of estate (e.g., flat layer, jump layer, duplex). | ||
Heating method | Method of estate’s heating (e.g., central heating, self-heating, without heating). | ||
Floor type | Type of the floor of the estate (e.g., high, medium, low, basement). | ||
Free of tax | Whether the estate is free of sales tax. | ||
Transaction ownership | Transaction ownership of the estate (i.e., commercial housing, affordable housing, purchased public housing). | ||
Floor number | Number of floors of the building where the estate is located. | ||
Building type | Type of the building (e.g., tower building, slab-type building, mixed-type building). | ||
Elevator household ratio | Ratio of the number of elevators to households of the building. | ||
Residential Community Profile | Developer | Developer of the community. | |
Completion year | Completion year of the community. | ||
Number of buildings | The total number of buildings in the community. | ||
Number of estates | The total number of estates in the community. | ||
Property fee | Property fee of the community. | ||
District | District of the community (e.g., Xicheng, Haidian, Chaoyang). | ||
Community index | Index of community. | ||
Temporal | Valuation date | Date of making valuation for the estate. | |
Historical price |
|
||
Geographical | Transportation | Number of the transport facilities (e.g., subway and bus stations) nearby and the distances of the nearest ones. | |
Education |
|
||
Medical | Number of the medical facilities (e.g., hospital, clinic, pharmacy) nearby and the distances of the nearest ones. | ||
Shopping |
|
||
Living | Number of the living places (e.g., restaurant, barbershop, bank) nearby and the distances of the nearest ones. | ||
Entertainment | Number of the entertainment venues (e.g., cinema, gym, park) nearby and the distances of the nearest ones. | ||
Unpleasantness | Number of the unpleasant facilities (e.g., factory, cemetery) nearby and the distances of the nearest ones. | ||
Number of facilities | Number of all the POIs and transport stations nearby. | ||
Population Visit | Work hours visits | Visiting frequency of population nearby in work hours (10:00-18:00) on workdays and weekends. | |
Break hours visits | Visiting frequency of population nearby in break hours (18:00-23:00) on workdays and weekends. | ||
All day visits | Visiting frequency of population nearby in all day on workdays and weekends. | ||
Mobility | Inflow volume | Human volume of the inflow of community on workdays and weekends. | |
Outflow volume | Human volume of the outflow of community on workdays and weekends. | ||
Travel mode |
|
||
Travel destination |
|
||
Resident Population Profile | Resident population | Number of resident population in the community. | |
Hometown | Distribution of the hometowns (e.g., Beijing, Shandong) of resident population in the community. | ||
Gender | Distribution of the gender of resident population in the community. | ||
Age | Distribution of the ages (e.g., teenager, youth, the middle-aged, the old) of resident population in the community. | ||
Life stage | Distribution of the life stages (e.g., student, working, parent, retire) of resident population in the community. | ||
Industry | Distribution of the industries (e.g., education, catering, IT, finance) of resident population in the community. | ||
Car owner | Distribution of owning cars of resident population in the community. | ||
Income level | Distribution of the income levels (e.g., low, medium, high, very high) of resident population in the community. | ||
Education level | Distribution of the education levels (e.g., undergraduate, college, senior) of resident population in the community. | ||
Consumption level | Distribution of the consumption level (e.g., low, medium, high) of resident population in the community. | ||
Consumption wish |
|
Appendix A Details of used features
Table 3 is the feature list used in MugRep, including Real Estate Profile features, Residential Community Profile features, Temporal features, Geographical features, Population Visit features, Mobility features, and Resident Population Profile features.
Appendix B Details of baselines
We compare our MugRep with the following six baselines. We carefully tune major hyper-parameters of each baseline based on their recommended settings. GBRT, DNN, and PDVM employ the same early stop training strategy as MugRep.
-
•
HA uses the average previous 90 days’ price of transactions closed in the same residential community as estimated value.
-
•
LR (Pedregosa et al., 2011) makes appraisal via the well-known linear regression model.
-
•
SVR (Pedregosa et al., 2011) makes appraisal via the support vector regression model. We use the Radial Basis Function (RBF) kernel.
-
•
GBRT makes appraisal via gradient boosted regression tree model. We use the version in LightGBM (Ke et al., 2017), set learning rate to 0.1, set maximal tree depth to 10, and maximal leaves number to 31.
-
•
DNN is a type of ANN method contains two fully-connected 64 dimensions hidden layers with ReLU activation functions, and employ Adam for optimization. The learning rate is set to 0.005.
-
•
PDVM (Bin et al., 2019) is a state-of-the-art ANN method for real estate appraisal. It models estates peer-dependency by using the k-nearest similar estate sampling to sample fixed number of real estates, and feed them to a bidirectional LSTM to generate final real estate appraisal. We slightly modify PDVM to sample historical real estate transactions to fit our dataset. We employ one layer bidirectional LSTM, and its input sequence length is 7. The hidden dimension is set to 64, learning rate is 0.005.

Appendix C Prototype system
We have implemented a prototype system for supporting users to make real estate appraisal decisions. We use angularJS (JavaScript MVW framework), bootstrap (front-end framework for web development), and Django (a web framework in Python) along with MySQL to build our prototype system, of which Figure 9 shows a screenshot. Specifically, once the user enters a residential community name to search, and select the community from a list of returned candidate items, the system will show its position on the map and other detailed information, including developer, property fee, completion year, etc. Then the user is expected to input real estate profile attributes, such as estate’s number of rooms, area, decoration, orientation, and click the ”Valuate” button to generate estimated price of the subject property.