MugRep: A Multi-Task Hierarchical Graph Representation Learning Framework for Real Estate Appraisal

Weijia Zhang^1,2†, Hao Liu^2∗, Lijun Zha¹, Hengshu Zhu³, Ji Liu², Dejing Dou², Hui Xiong^4∗ ¹School of Computer Science, University of Science and Technology of China,
²Baidu Research, ³Baidu Talent Intelligence Center, Baidu Inc., ⁴Rutgers University
{wjzhang3,zlj160}@mail.ustc.edu.com, {liuhao30,zhuhengshu,liuji04,doudejing}@baidu.com, [email protected]

(2021)

Abstract.

Real estate appraisal refers to the process of developing an unbiased opinion for real property’s market value, which plays a vital role in decision-making for various players in the marketplace (e.g., real estate agents, appraisers, lenders, and buyers). However, it is a non-trivial task for accurate real estate appraisal because of three major challenges: (1) The complicated influencing factors for property value; (2) The asynchronously spatiotemporal dependencies among real estate transactions; (3) The diversified correlations between residential communities. To this end, we propose a Multi-Task Hierarchical Graph Representation Learning (MugRep) framework for accurate real estate appraisal. Specifically, by acquiring and integrating multi-source urban data, we first construct a rich feature set to comprehensively profile the real estate from multiple perspectives (e.g., geographical distribution, human mobility distribution, and resident demographics distribution). Then, an evolving real estate transaction graph and a corresponding event graph convolution module are proposed to incorporate asynchronously spatiotemporal dependencies among real estate transactions. Moreover, to further incorporate valuable knowledge from the view of residential communities, we devise a hierarchical heterogeneous community graph convolution module to capture diversified correlations between residential communities. Finally, an urban district partitioned multi-task learning module is introduced to generate differently distributed value opinions for real estate. Extensive experiments on two real-world datasets demonstrate the effectiveness of MugRep and its components and features.

Real estate appraisal; graph neural networks; multi-task learning

^∗ Corresponding author.
^† The research was done when the first author was an intern in Baidu Research under the supervision of the second author.

^†^†journalyear: 2021^†^†copyright: acmcopyright^†^†conference: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining; August 14–18, 2021; Virtual Event, Singapore^†^†booktitle: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD ’21), August 14–18, 2021, Virtual Event, Singapore^†^†price: 15.00^†^†doi: 10.1145/3447548.3467187^†^†isbn: 978-1-4503-8332-5/21/08^†^†ccs: Information systems Data mining^†^†ccs: Information systems Spatial-temporal systems^†^†ccs: Applied computing Marketing

1. Introduction

Real estate appraisal provides an opinion of real property’s market value, which is the probable sales price it would bring in an open and competitive real estate market. Real estate appraisal is required and implemented by various players in the marketplace, such as real estate agents, appraisers, property developers, investors, lenders, and buyers (Pagourtzi et al., 2003). An accurate appraisal of real estate is of great importance to help buyers or sellers for negotiation and closing, help mortgage owners for lending and investigation, and help governments for urban planning.

Prior studies on real estate appraisal can be categorized into two classes: (1) Empirical Appraisal Methods (EAMs), such as sales comparison approach (Mccluskey and Borst, 1997), cost approach (Guo et al., 2014), and income approach (Baum et al., 2017), either heavily depend on the accuracy, availability, and timeliness of sale transaction data, or require strong domain knowledge to perform (Pagourtzi et al., 2003); (2) Automated Valuation Methods (AVMs) estimate the market value of a property based on automatic machine learning techniques, such as linear regression (Şipoş et al., 2008; Ahn et al., 2012), support vector regression (Lin and Chen, 2011), boosted regression trees (Graczyk et al., 2010; Park and Bae, 2015) and artificial neural networks (Peterson and Flanagan, 2009; Poursaeed et al., 2018; Law et al., 2019). Compared with EAMs, AVMs are easy-to-use even for non-domain experts and are widely used in practice.

However, with the prevalence of mobile devices and the proliferation of ubiquitous computing techniques, existing AVMs can be significantly improved from the following three aspects. First, existing AVMs are mainly based on fundamental real-estate information, such as the apartment attributes, residential community features, and surrounding geographical facilities (e.g., distance to mall, number of transport stations), but overlook the influence of demographic characteristics (e.g., mobility patterns and demographics of community residents), which contains important clues for real estate appraisal. For instance, real estates in a residential community which is in close relationship (e.g., similar human mobility patterns or resident demographics) with other high-end residential communities tend to have a higher market value. Second, existing AVMs usually regard the real estate transactions as a pile of independent samples, but ignores spatiotemporal dependencies among real estate transactions. Indeed, the value of real estate significantly depends on its spatially proximal estates (Fu et al., 2014b). Incorporating such spatiotemporal dependencies in asynchronous real estate transactions can further improve appraisal effectiveness. Third, the value of a real estate highly depends on the corresponding residential community. Beyond statistic attributes, the diversified correlations among residential communities can also be exploited to achieve higher appraisal accuracy.

Along these lines, in this paper, we present a Multi-Task Hierarchical Graph Representation Learning (MugRep) framework for real estate appraisal. Our major contributions can be summarized as follows: (1) We construct a rich set of features from multi-source user data, which provides a comprehensive real estate profile from multiple social demography views, e.g., geographical population visits, human mobility, and resident demographics. (2) By regarding each real estate transaction as an individual event, we construct an evolving transaction event graph based on both place and period proximity of transaction events, and introduce graph neural networks (GNN) for asynchronously spatiotemporal event-level dependencies modeling to enhance appraisal effectiveness. To the best of our knowledge, this is the first attempt to employ GNN techniques to improve real estate appraisal. (3) We further propose a hierarchical heterogeneous community graph convolution module to capture the diversified community-level correlations. Specifically, we devise a dynamic intra-community graph convolution block to obtain time-dependent community representation, and design a heterogeneous inter-community graph convolution block to propagate valuable knowledge from the correlated residential communities. (4) We conduct extensive experiments on two real-world datasets, the results demonstrate the effectiveness of our framework and its components and features.

Table 1. Statistics of datasets.

Description

Beijing

Chengdu

Real Estate Marketing Data

# of transactions

185,151

134,781

# of communities

6,267

3,995

Geographical Data

# of POIs

1,315,353

1,116,989

# of transport stations

44,513

30,564

Check-in Data

# of check-ins

4,078,723,999

5,130,847,589

User Trip Data

# of trip queries

73,307,426

66,960,348

User Profile Data

# of users

2,042,718

1,426,860

Refer to caption — (a) Unit price distribution.

2. Data description and analysis

In this section, we present the datasets to be used in our framework, with a preliminary data analysis. We use two datasets, i.e., Beijing and Chengdu, which represent two major metropolises in China. Except basic Real Estate Marketing Data, we exploit four additional multi-source urban datasets, i.e., Geographical Data, Check-in Data, User Trip Data, and User Profile Data, to improve the accuracy of real estate appraisal. Table 1 summarizes the statistics of the datasets.

2.1. Real Estate Marketing Data

In this paper, the real estate marketing datasets of Beijing and Chengdu are collected from a major commercial real estate agency¹¹1https://bj.lianjia.com. The scope of Beijing ranges from January 2018 to December 2019, and that of Chengdu ranges from January 2018 to December 2020. Each dataset consists of two kinds of data, i.e., real estate transaction data and residential community data.

We take Beijing as an example. Beijing totally contains 185,151 real estate transaction records and 6,267 residential communities. Each transaction record contains the attributes of the transaction estate, e.g., transaction date, transaction price, location, room number, area, transaction ownership, whether it is free of sales tax, the residential community real estate belongs to. Figure 1(a) shows the distribution of real estate unit price in Beijing. Figure 1(b) and Figure 1(c) show the fluctuation of unit price and transaction volume over time. Figure 1(d) and Figure 1(e) show the spatial distributions of unit price and transaction volume. Overall, the variation of real estate unit price in spatial domain is greater than that in temporal domain, whereas the transaction volume is distributed more evenly across the city with a notable periodical pattern. For each residential community, our datasets contain its developer, completion year, number of estates, property fee, etc. Figure 1(f) shows the spatial distribution of residential communities in Beijing, which is positively correlated with spatial distribution of transaction volume.

2.2. Geographical Data

Then we construct large-scale geographical datasets, including point of interest (POI) data (Li et al., 2020) and transport station data (Liu et al., 2020b). There are 1,315,353 POIs and 44,513 transport stations in Beijing, 1,116,989 POIs and 30,564 transport stations in Chengdu.

2.3. Check-in Data

Each check-in record corresponds to a GPS request from a mobile user, which is collected through Baidu location SDK (Zhang et al., 2020; Zhu et al., 2020). There are 4,078,723,999 and 5,130,847,589 users’ check-ins in Beijing and Chengdu respectively, to support the real estate appraisal task.

2.4. User Trip Data

The User Trip Data are collected from Baidu Maps, which records the mobility pattern of a city. The user trip data includes the origin (geographical location) and destination of a trip, user’s travel mode (e.g., drive, taxi, bus, cycle, walk), and the type (e.g., enterprise, shopping places) of trip destination. Overall, there are 73,307,426 and 66,960,348 trip records in Beijing and Chengdu, respectively.

2.5. User Profile Data

The User Profile Data contain user profile attributes (e.g., gender, age, income level, education level), which is collected from multiple Baidu applications including Baidu Search, Baidu App and Baidu Maps. There are 2,042,718 and 1,426,860 distinct user records in Beijing and Chengdu, respectively. Each record contains a user’s demographic attributes including hometown, gender, age, and social attributes such as the industry, income level, educational level, consumption level, and whether the user is a car owner. All user profile records are anonymous and cannot be associated with sensitive personal information such as names and phone numbers.

3. Preliminaries

We first introduce some important definitions and formally define the real estate appraisal problem.

Definition 1.

Subject Property. The subject property is the real estate that will be appraised.

Definition 2.

Real Estate Transaction Event. Consider a set of real estate transactions $S$ , a real estate transaction event $s_{t}=\langle l_{t},T_{t},x_{t},y_{t}\rangle\in S$ is defined as the $t$ -th chronological real estate transaction in the dataset. Specifically, $l_{t}$ is the location of $s_{t}$ , $T_{t}$ is transaction date, $x_{t}$ is the feature input associated with the real estate in $s_{t}$ , and $y_{t}$ is the transaction unit price of $s_{t}$ .

Problem 1.

Real Estate Appraisal. Given the subject property $s_{t+1}$ , our task is to estimate the transaction unit price of $s_{t+1}$ .

4. Framework

Then we introduce our framework in detail, including the processes of feature construction, the event-level and community-level representation learning, and multi-task learning based valuation.

4.1. Overview

Figure 2 shows the framework overview of MugRep, which consists of five major components: Data Warehouse, Feature Construction, Graph Construction, Representation Learning, and Multi-Task Learning. The Data Warehouse stores Real Estate Marketing Data and four additional multi-source urban data. The Feature Construction module generates seven groups of features based on the corresponding datasets in Data Warehouse. Besides, the Graph Construction module builds evolving transaction event graph based on the place and period proximity of transaction events, and constructs hierarchical heterogeneous community graph based on intra-community real estate transactions and inter-community similarities. In particular, the community similarities are quantified by additional diversified urban feature groups. After that, the Representation Learning module is introduced to derive the event-level and community-level representation of subject property, of which the integration is processed in a Multi-Task Learning module, where each task corresponds to an urban district. The corresponding task block produces the final result of Real Estate Appraisal.

4.2. Feature Construction

In this subsection, we present the process of constructing various features based on the aforementioned multi-source urban datasets. The details of used features are listed in Table 3 of Appendix A.

4.2.1. Real Estate Profile Features.

The real estate’s profiles are the most fundamental factors for the real estate appraisal. We extract real estate profile features from real estate transaction data, including estate’s number of rooms, area, decoration, orientation, structure, free of tax, transaction ownership, etc. The features of estate also include some profiles of the building where it is located, such as floor number, building type, elevator household ratio.

4.2.2. Residential Community Profile Features.

The residential community where the estate is located is another crucial factor that determines the value of estate. The residential community profile features include its developer, completion year, number of buildings and estates, property fee and district. All the above features are extracted from residential community data. In addition, we identify each community by a unique identifier.

4.2.3. Temporal Features.

The temporal features include the valuation date of estate and the price distribution of historical transactions of the same residential community. Historical estate transaction prices in the same community are important reference factors for subject property appraisal, because these estates usually have many similar attributes (e.g., community attributes, geographical attributes), therefore, have similar price distribution. We quantify the distribution of historical prices by some statistic features (e.g., mean, variance, maximum, minimum) for the unit prices of estate transactions that have been closed in previous quarter of the valuation date in the same residential community.

4.2.4. Geographical Features.

The geographical features are of much importance when appraising a real estate as well. The estate in an area that has complete facilities, which provide more convenience and enjoyment for living, usually has a higher price. Except counting the number of all POIs and transport stations close to the estate to reflect the completeness of surrounding facilities, we further consider several kinds of important geographical factors, including transportation, education, medical treatment, shopping, living, entertainment, and unpleasantness. We count the number of facilities and places that correspond to above factors nearby the estate, and calculate the distance from the estate to the nearest ones. The spatial distribution of facilities number feature is shown in Figure 3(a), where we can observe a positive correlation between this feature and real estate unit price. More details of geographical features can be found in Table 3.

4.2.5. Population Visit Features.

The geographical visitation volume can reflect the popularity and prosperity of an area, which has a significant impact on real estate prices. We first aggregate the check-in records of each user by every 10 minutes as one visit of a location. Then we construct population visit features by accumulating the visiting frequency of population nearby the estate in work hours (10:00-18:00), break hours (18:00-23:00), and all day on workdays and weekends, respectively. By comparing Figure 1(d) and Figure 3(b), we observe the areas with high visiting frequency of population are usually of high unit prices of real estates.

4.2.6. Mobility Features.

Human mobility is also an important factor to estimate the real estate prices of a residential community. For example, if residents of a community frequently visit high-consumption or high-income places, then we can infer higher real estate prices for the residential community. Thus, we construct abundant mobility features based on User Trip Data to reflect mobility patterns of community residents, including the human volume of inflow and outflow of community, the distribution of travel modes (e.g., driver, taxi, bus, cycle, walk) and the distribution of travel destination types (e.g., enterprise, administration, shopping places, entertainment venues) of community residents on workdays and weekends, respectively.

4.2.7. Resident Population Profile Features.

The resident population profile also has strong correlation with real estate prices of the residential community. For example, the residential communities where high-income and high-consumption residents live indicates the brownstone districts, and are supposed to have high real estate prices. To this end, we construct valuable resident population profile features of the community based on User Profile Data to comprehensively profile residents’ demographic attributes and social attributes. These features include the resident population number, the distributions of residents’ hometown, life stage, industry, income level, education level, consumption level, etc. The details of these features are listed in Table 3. We depict the spatial distribution of community residents’ income in Figure 3(c), which further confirms that the communities with high-income residents usually correspond to high real estate prices.

4.3. Event-level Representation Learning

The price of real estate transactions in proximal places and periods have strong dependencies. For example, for a place with a planned subway station, the real estate prices of surrounding areas usually increase synchronously. The transaction price dependencies can also be validated by Figure 1(d), i.e., the nearby real estate prices tend to be similar. However, these transactions are dispersedly distributed in spatial and temporal domains, which induces spatiotemporal asynchronism. Such asynchronously spatiotemporal dependencies among real estate transactions also distinguish our task from existing works on predicting regional future house prices (Tan et al., 2017; Ge et al., 2019), where the input are more regular time series data. It also prevents us to adopt existing spatiotemporal prediction approaches (Li et al., 2018) for our task.

To tackle above problem, we first formulate each real estate transaction as a transaction event, which is defined in Section 3. Then, we model the continuously emerging transaction events as an evolving graph $G_{e}=(V_{e},E_{e},A_{e})$ , where $V_{e}=S$ is a set of real estate transaction events, $E_{e}$ is a set of edges indicating connectivity among transaction events, and $A_{e}$ denotes the proximity matrix of $G_{e}$ . Specifically, we define connectivity constraint $e_{(t+1)t^{\prime}}\in E_{e}$ as

(1)

e_{(t+1)t^{\prime}}=\left\{\begin{aligned} &1,\quad dist(s_{t+1},s_{t^{\prime}})\leq\epsilon_{d},\ 0<(T_{t+1}-T_{t^{\prime}})\leq\epsilon_{\tau}\\ &0,\quad otherwise\end{aligned},\right.

where $dist(\cdot)$ is the geographical distance between the locations of events, $\epsilon_{d}$ and $\epsilon_{\tau}$ are physical distance threshold and time interval threshold, respectively. To balance the connected events from different communities, we restrict a node at most to connect last $N_{e}$ events from each community. With real estate transaction events occurring over time, the graph $G_{e}$ evolves accordingly.

Inspired by recent success of GNN (Kipf and Welling, 2017; Velickovic et al., 2018) on processing non-Euclidean graph structures and its inductive ability to effectively generate representation for previous unseen node (Hamilton et al., 2017), we adopt graph attention networks (Velickovic et al., 2018), an effective variant of GNN, to capture event-level dependencies among real estate transactions on the evolving graph.

Specifically, to quantify the sophisticated influence of historical transaction events $s_{t^{\prime}}$ to subject property $s_{t+1}$ , we introduce the attention mechanism (Zhang et al., 2021) to automatically learn the coefficient between transaction events,

(2)

\beta_{(t+1)t^{\prime}}=\mathbf{v}_{e}^{\top}\text{tanh}\left(\mathbf{W}_{e}[x_{t+1}\oplus x_{t^{\prime}}\oplus y_{t^{\prime}}]\right),

where $\mathbf{v}_{e}$ and $\mathbf{W}_{e}$ are learnable parameters, $\oplus$ denotes the concatenation operation. The proximity weight between event $s_{t+1}$ and $s_{t^{\prime}}$ is further defined as

(3)

\alpha_{(t+1)t^{\prime}}=\frac{exp(\beta_{(t+1)t^{\prime}})}{\sum_{k\in\mathcal{N}_{t+1}}exp(\beta_{(t+1)k})},

where $\mathcal{N}_{t+1}$ is a set of adjacent events of $s_{t+1}$ in $G_{e}$ .

Once $\alpha_{(t+1)t^{\prime}}\in A_{e}$ is obtained, we derive the representation that integrates nearby previous transaction events by aggregating and transforming the adjacent events of subject property, defined as

(4)

h_{e,t+1}^{l}=\text{ReLU}\left(\mathbf{W}_{he}^{l}\left(\sum_{{t^{\prime}}\in\mathcal{N}_{t+1}}\alpha_{(t+1)t^{\prime}}{h}_{e,t^{\prime}}^{l-1}+I(l>1){h}_{e,t+1}^{l-1}\right)\right),

where $l$ indicates the $l$ -th graph convolution layer in evolving graph, $\mathbf{W}_{he}^{l}$ are learnable parameters for $l$ -th graph convolution, $h_{e,t+1}^{0}=x_{t+1}$ , and $I(l>1)$ is an indicator function that equals one if $l>1$ and zero otherwise. By stacking $L_{e}$ convolution layers, we can aggregate $L_{e}$ -hop adjacent events to mine spatio-temporally extensive knowledge from historical real estate transaction data for more accurate subject property valuation.

4.4. Community-level Representation Learning

As aforementioned, the real estate value is also highly correlated with the residential community it belongs to (Fu et al., 2015). Therefore, an expressive representation of community can be useful in real estate appraisal. In this work, we devise a hierarchical heterogeneous community graph convolution module, including the dynamic intra-community graph convolution block and the heterogeneous inter-community graph convolution block.

4.4.1. Dynamic Intra-Community Graph Convolution

The representation of a residential community $c^{i}$ should be updated dynamically once there is a new transaction event, defined as $s_{t}^{i}$ , happening within it. The transaction events happened in each community $c^{i}$ can make up an individual impact graph: $G^{i}=(V^{i},E^{i},A^{i})$ , where $V^{i}$ consists of the community $c^{i}$ and transaction events happening in $c^{i}$ , $E^{i}$ is a set of edges constraining what transaction events have impacts on the representation of $c^{i}$ , and $A^{i}$ denotes the impact matrix of $G^{i}$ , indicating the impacts of previous transaction events to the community representation. The connectivity $e_{t^{\prime}}^{i}\in E^{i}$ between community $c^{i}$ and transaction event $s_{t^{\prime}}^{i}$ is defined as

(5)

e_{t^{\prime}}^{i}=\left\{\begin{aligned} &1,\quad 0\leq(T_{t}-T_{t^{\prime}})\leq max(\epsilon_{\tau},D^{i}_{t,N_{c}})\\ &0,\quad otherwise\end{aligned},\right.

where $D^{i}_{t,N_{c}}$ denotes the number of days from the last $N_{c}$ -th transaction event to the latest transaction event $s_{t}^{i}$ in $c^{i}$ . Note the transaction event nodes set and the edges set connecting between community and transaction events change dynamically as the new transaction events occur.

Then the attention mechanism is further used to quantify the impact of each previous transaction within $c^{i}$ ,

(6)

\beta_{t^{\prime}}^{i}=\mathbf{v}_{u}^{\top}\text{tanh}\left(\mathbf{W}_{u}[x_{t^{\prime}}^{i}\oplus y_{t^{\prime}}^{i}]\right),

where $\mathbf{v}_{u}$ and $\mathbf{W}_{u}$ are learnable parameters. Similar to Eq. (3), we can finally derive the impact weight $\alpha_{t^{\prime}}^{i}$ .

Once $\alpha_{t^{\prime}}^{i}\in A^{i}$ is obtained, the representation of each community $c^{i}$ is updated by performing the graph convolution operation

(7)

h_{u}^{i}=\text{ReLU}\left(\mathbf{W}_{hu}\left(\sum_{{t^{\prime}}\in\mathcal{N}_{i}}\alpha_{t^{\prime}}^{i}{x}_{t^{\prime}}^{i}\right)\right),

where $\mathbf{W}_{hu}$ are learnable parameters.

4.4.2. Heterogeneous Inter-Community Graph Convolution

The diversified correlations between residential communities are induced by their various similarities. For example, the residential communities located in similar functional areas with similar surrounding geographical facilities distribution usually tend to be positively correlated in real estate value. Thus, we define four similarity metrics between residential communities based on four types of features, i.e., geographical features, population visit features, mobility features, and resident population profile features.

Then, we construct the heterogeneous inter-community graph to model the diversified correlations between residential communities, which is defined as $G_{c}=(V_{c},E_{c},A_{c})$ , where $V_{c}$ is a set of residential communities. We construct four types of edges $E_{c}=\{\mathcal{E}_{g},\mathcal{E}_{v},\mathcal{E}_{m},\mathcal{E}_{p}\}$ based on four kinds of similarities between residential communities. Next, we will take geographical edges set $\mathcal{E}_{g}$ as a representative for detailed explanation.

We define the geographical features of community $c^{i}$ as $f^{i}_{g}$ . Then, we can compute the euclidean distance between $f^{i}_{g}$ and $f^{j}_{g}$

(8)

dist_{g}(c^{i},c^{j})=\lVert f^{i}_{g}-f^{j}_{g}\rVert.

Smaller euclidean distance of features indicates higher similarity between communities. Thus, the geographical edge $e_{ij}^{g}\in\mathcal{E}_{g}$ is defined as

(9)

e_{ij}^{g}=\left\{\begin{aligned} &1,\quad dist_{g}(c^{i},c^{j})\leq\epsilon_{g}\\ &0,\quad otherwise\end{aligned},\right.

where $\epsilon_{g}$ is the distance threshold. Other types of edge sets can be derived in the same way but with respective thresholds.

We define $s_{t+1}^{i}$ as the subject property belonging to community $c^{i}$ . With the latest representation of each community and the heterogeneous edges set $E_{c}$ , the coefficient between communities $c^{i}$ and $c^{j}$ is computed by

(10)

\beta_{ij}=\mathbf{v}_{c}^{\top}\text{tanh}\left(\mathbf{W}_{c}[x_{t+1}^{i}\oplus h_{u}^{j}\oplus p_{ij}]\right),

where $\mathbf{v}_{c}$ and $\mathbf{W}_{c}$ are learnable parameters, $p_{ij}$ is a one-hot vector to denote the type of edge. Once coefficients are derived, the proximity weight $\alpha_{ij}\in A_{c}$ can be obtained similar to Eq. (3). Then, we derive the overall representation of residential communities by the graph convolution operation

(11)

h_{c}^{i,l}=\text{ReLU}\left(\mathbf{W}_{hc}^{l}\left(\sum_{{j}\in\mathcal{N}_{i}}\alpha_{ij}{h}_{c}^{j,l-1}+I(l>1){h}_{c}^{i,l-1}\right)\right),

where $l$ indicates the $l$ -th graph convolution layer in inter-community graph, $\mathbf{W}_{hc}^{l}$ are learnable parameters for $l$ -th graph convolution, ${h}_{c}^{i,0}=h_{u}^{i}$ . By stacking $L_{c}$ convolution layers, $h_{c}^{i,L_{c}}$ can integrate $L_{c}$ -hop diversified correlations between communities.

Finally, we obtain the overall representation of subject property $s_{t+1}^{i}$ through concatenation and multi-layer perceptron operations

(12)

h_{t+1}^{o}=MLP([x_{t+1}^{i}\oplus h_{e,t+1}^{L_{e}}\oplus h_{c}^{i,L_{c}}]).

4.5. Multi-Task Learning Based Valuation

In general, a city is partitioned into several urban districts for administration. Each urban district can have distinctive urban functionalities and leads to diversified real estate price distributions, as illustrated in Figure 4. Inspire by (Zhu et al., 2016), we partition the tasks via urban districts, and each urban district corresponds to a learning task. These tasks share most of parameters of the model except have a unique fully-connected output layer to generate appraisal of distinctive distribution. Therefore, the real estate located in each urban district is valuated by

(13)

\hat{y}_{t+1}=FC_{m}(h_{t+1}^{o}),

where $FC_{m}$ denotes the unique fully-connected layer of $m$ -th task associated with $m$ -th urban district.

Overall, our model aims to minimize the Mean Square Error (MSE) loss between the estimated unit price and the ground truth transaction unit price of real estate,

(14)

L=\frac{1}{|S|}\sum_{S_{m}\subset S}\sum_{s_{t+1}\in S_{m}}\left(\hat{y}_{t+1}-y_{t+1}\right)^{2},

where $S_{m}$ denotes the real estate transactions set happening in $m$ -th urban district, $|S|$ denotes the cardinality of $S$ .

5. Experiments

5.1. Experimental setup

We evaluate the performance of MugRep on both Beijing and Chengdu datasets. We chronologically order each dataset based on the transaction date. For Beijing, we take the data of which the transaction date ranges from January 2018 to June 2019, as the training set, the following one month as validation set, and the rest of data for testing. For Chengdu, the training set is set from January 2018 to June 2020, the other setting are the same as Beijing. Our experiments mainly focus on (1) the overall performance comparison, (2) the ablation studies of model and features, (3) the feature importance analysis, (4) and the effectiveness check for spatially different communities. Furthermore, please refer to Appendix C for the prototype system of real estate appraisal.

5.1.1. Implementation details.

We take $\epsilon_{d}=500$ m, $\epsilon_{\tau}=90$ , $N_{e}=5$ and $L_{e}=2$ for event-level representation learning. We choose $N_{c}=5$ , set the distance thresholds (e.g., $\epsilon_{g}$ ) of several types of edge as the 0.001 quantile of all pair-wise euclidean distance values, and select $L_{c}=1$ for community-level representation learning. The dimensions of $h_{e,t+1}^{L_{e}}$ , $h_{u}^{i}$ and $h_{c}^{i,L_{c}}$ are fixed to 32. The hidden dimension of $MLP$ is fixed to 64. We employ Adam optimizer, set learning rate as 0.01 to train our model, and early stop if the loss does not decrease on validation data set over 30 epochs.

5.1.2. Evaluation metrics.

We adopt Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and Root Mean Square Error (RMSE), three widely used metrics (Wang et al., 2020) for evaluation. Note that the estimated unit price and ground truth transaction unit price are in 10,000 CNY.

5.1.3. Baselines.

We compare MugRep with one statistical baseline (HA), three classic machine learning based baselines (LR (Pedregosa et al., 2011), SVR (Pedregosa et al., 2011) and GBRT (Ke et al., 2017)), and two artificial neural networks (ANN) based baselines (DNN, PDVM (Bin et al., 2019)). The input features of all learning based methods are the same. The details of these baselines are introduced in Appendix B.

Table 2. Overall performance evaluated by MAE, MAPE, RMSE on Beijing and Chengdu datasets.

Dataset	Algorithm	MAE	MAPE	RMSE
Beijing	HA	0.6313	11.33%	1.1008
	LR	0.4776	8.94%	0.7041
	SVR	0.4427	8.10%	0.6840
	GBRT	0.3640	6.70%	0.5515
	DNN	0.3550	6.35%	0.5505
	PDVM	0.3469	6.17%	0.5373
	MugRep	0.3244	5.76%	0.5097
Chengdu	HA	0.1456	10.65%	0.2487
	LR	0.1515	9.73%	0.2150
	SVR	0.1339	8.57%	0.1888
	GBRT	0.1133	7.27%	0.1708
	DNN	0.1090	7.26%	0.1651
	PDVM	0.1051	6.95%	0.1583
	MugRep	0.0916	6.20%	0.1404

5.2. Overall Performance

Table 2 reports overall results of our methods and all compared baselines on two datasets with respect to three metrics. Overall, our model achieves the best performance among all the baselines. Moreover, we observe all ANN based algorithms (DNN, PDVM, MugRep) outperform the statistical algorithm (HA) and machine learning based algorithms (LR, SVR, GBRT), which consistently verifies the advantages of applying ANN to real estate appraisal for its extraordinary non-linear processing ability. Particularly, MugRep achieves $(6.5\%,6.6\%,5.1\%)$ and $(12.8\%,10.8\%,11.3\%)$ improvements beyond the state-of-the-art baseline PDVM for (MAE, MAPE, RMSE) on Beijing and Chengdu, respectively. The results demonstrate the effectiveness of MugRep.

5.3. Ablation Study

In this section, we conduct ablation studies on MugRep, including model ablation and feature ablation, to further verify the effectiveness of each component and feature group. The experiments are finished for three metrics on both Beijing and Chengdu datasets.

5.3.1. Model Ablation.

We evaluate the performance of MugRep and it’s three variants, which are (1) noEvt removes the event-level representation learning module; (2) noCom removes the community-level representation learning module; (3) noMT removes the multi-task learning module. The ablation results are reported in Figure 5. As can be seen, removing any of the components leads to remarkable performance degradation. Among these components, we find the event-level and community-level representation modules are more important, especially the event-level representation module. This is because the nearby real estates have strong similarities and dependencies. The close historical real estate transactions can be a very valuable reference for subject property valuation. All the results demonstrate the effectiveness of MugRep and its each component.

5.3.2. Feature Ablation.

To examine the performance impact of feature groups that constructed based on four additional multi-source urban datasets, we evaluate MugRep with complete features and its five variants: (1) Complete is MugRep with complete features; (2) Basic excludes all features from four additional multi-source urban datasets and the community-level representation learning module; (3) noGeo excludes geographical features and geographical edge set $\mathcal{E}_{g}$ in the heterogeneous inter-community graph; (4) noVis excludes population visit features and corresponding edge set $\mathcal{E}_{v}$ ; (5) noMob excludes mobility features and corresponding edge set $\mathcal{E}_{m}$ ; (6) noPop excludes resident population profile features and corresponding edge set $\mathcal{E}_{p}$ . The experimental results are reported in Figure 6. There is a consistent performance degradation by excluding any of additional urban feature groups. If we exclude all these additional urban features (i.e., basic), MugRep will have significant $(10.8\%,9.4\%,11.4\%)$ and $(9.1\%,7.9\%,8.0\%)$ performance degradation for (MAE, MAPE, RMSE) on Beijing and Chengdu, which demonstrate the effectiveness of these urban features and community-level representation learning module. Besides, we observe noMob and noPop lead to notable performance degradation compared to MugRep with complete features. The observation verifies that considering the characteristics and correlations of community residents are very useful for real estate appraisal.

5.4. Feature Importance Analysis

To further evaluate the effect of our constructed features, we illustrate the top-30 most important features in Figure 7. The features are ranked by logarithmic information gain (Ke et al., 2017). As can be seen, the distribution of historical price are the most important features. The subsequent one is the district of residential community, which indicates the large difference between districts. Furthermore, we observe half of the top-30 features are from the four multi-source urban datasets, which demonstrates the effectiveness of these urban features. Among these urban features, the geographical features (Living, Entertainment, Shopping), mobility features (Travel destination) and resident population profile features (Income level, Consumption level, Industry) are ranked very high (in top-15). For the reasons that these geographical features are closely related to living quality, these mobility features are greatly relevant to travel preferences of community residents, and these resident population profile features are strongly associated with the wealth of community residents. The living quality, travel preferences, and wealth are three very important factors to reflect the real estate prices.

5.5. Effectiveness in Different Community

To evaluate the performance of MugRep on spatially diverse communities, we compute the separate MAPE for each residential community. Figure 8(a) and Figure 8(b) show the spatial distribution of MAPE and inverse transaction volume on Beijing. As can be seen, some communities with high MAPE (bright color) always have high inverse transaction volume (i.e., less transaction volume) nearby. This makes sense for two reasons: first, the historical price features of same community are important for subject property valuation, less transaction volume in the community means inaccurate or missing historical price features; second, the evolving transaction event graph convolution module of MugRep is highly correlated with nearby previous transactions, the absence of nearby transactions decreases the effectiveness of event-level representation. This result indicates further optimization can be applied to these residential communities with scarce transactions to improve the overall performance.

6. Related work

Real Estate Appraisal. Traditional real estate appraisal methodologies can be mainly categorized into three classes, i.e., sales comparison based approaches (Mccluskey and Borst, 1997), cost based approaches (Guo et al., 2014), and income based approaches (Baum et al., 2017). The sales comparison approach estimates the market value of real estate depending on some forms of comparison. The cost approach is based on the theory that the summation of land value and depreciated value of any improvements can be as the estimation for the real estate. And the income approach estimates the real estate market value based on its income. Hedonic price model (Rosen, 1974; Cheshire and Sheppard, 1995) is also commonly used in real estate appraisal. It assumes that the real estate can be viewed as an aggregation of individual attributes, which implicitly reflect the real estate market value. However, it does not consider attributes interactions and is incompetent in non-linear data (Limsombunchai, 2004). Besides, automated valuation methods (AVMs) have arisen researchers’ interests for they can automatically estimate the market value of an real estate based on its available attributes. Many AVMs such as linear regression (Şipoş et al., 2008; Ahn et al., 2012), support vector regression (Lin and Chen, 2011), boosted regression trees (Graczyk et al., 2010; Park and Bae, 2015) and artificial neural networks methods (Peterson and Flanagan, 2009; Peter et al., 2020; Poursaeed et al., 2018; You et al., 2017; Law et al., 2019) are widely applied into the real estate appraisal. Moreover, some works (Fu et al., 2014b, a; Fu et al., 2015) investigate how to rank real estates via various viewpoints, such as individual, peer and zone dependency, online user reviews and offline moving behaviors, and diverse mixed land use. Furthermore, there are few works that try to capture the peer-dependency among nearby estates. Fu et al. (2014b) use the generative likelihood of each edge to model peer-dependency, which does not adequately integrate the attributes knowledge of nearby estates. Works (Bin et al., 2019; You et al., 2017) sample fixed number of similar estates by selecting k-nearest similar estates or the random walk algorithm, and then feed these samples to recurrent neural networks. The sampling process could lead to the loss of valuable information. Overall, these prior studies all leave out community residents characteristics, and are incapable to fully model the spatiotemporal dependencies among real estate transactions. Besides, none of them attempt to capture the diversified correlations between residential communities.

Real Estate Forecasting. This work is also related to real estate forecasting tasks. For example, Tan et al. (2017) proposes a time-aware latent hierarchical model and Ge et al. (2019) proposes an integrated framework that improving the DenseNet to predict future house prices of regions. Zhu et al. (2016) proposes a multi-task linear regression model for real estate’s days-on-market prediction. Li et al. (2019) estimates the housing demand based on express delivery data.

Graph Neural Network. Graph neural networks (GNN) is designed to extend the well-known convolution neural network to non-Euclidean graph structures. GNN is usually used to obtain the expressive representation of each node by aggregating and transforming the representations of its neighbors in the graph (Kipf and Welling, 2017; Hamilton et al., 2017; Velickovic et al., 2018). Many previous studies have demonstrated the effectiveness of GNN in the graph-based representation learning tasks (Wu et al., 2020). Because of its effectiveness, GNN has been successfully applied to various fields, such as physics (Santoro et al., 2017), chemistry (Gilmer et al., 2017), biology (Fout et al., 2017), recommendation systems (Ying et al., 2018; Wang et al., 2019; Xin et al., 2021), and smart city systems (Zhang et al., 2020; Liu et al., 2020a; Han et al., 2021). However, to the best of our knowledge, this is the first work applying GNN to real estate appraisal.

7. Conclusion

In this paper, we presented MugRep, a multi-task hierarchical graph representation learning framework for real estate appraisal. We first constructed abundant valuable features with respect to the fundamental attributes and community residents characteristics to comprehensively profile the real estate value. Then we designed an evolving transaction event graph convolution module to model the asynchronously spatiotemporal dependencies among real estate transactions, and devised a hierarchical heterogeneous community graph convolution module to capture diversified correlations between residential communities. After that, an urban district partitioned multi-task learning module was introduced to perform the real estate appraisal of distinctive distribution. Extensive experiments on two real-world datasets demonstrated that MugRep achieves the best performance compared with six baselines.

Acknowledgements.

This research is supported in part by grants from the National Natural Science Foundation of China (Grant No.91746301, 71531001).

References

(1)
Ahn et al. (2012) Jae Joon Ahn, Hyun Woo Byun, Kyong Joo Oh, and Tae Yoon Kim. 2012. Using ridge regression with genetic algorithm to enhance real estate appraisal forecasting. Expert Systems with Applications 39, 9 (2012), 8369–8379.
Baum et al. (2017) Andrew Baum, David Mackmin, and Nick Nunnington. 2017. The income approach to property valuation. Routledge.
Bin et al. (2019) Junchi Bin, Bryan Gardiner, Eric Li, and Zheng Liu. 2019. Peer-Dependence Valuation Model for Real Estate Appraisal. Data-Enabled Discovery and Applications 3, 1 (2019), 2.
Cheshire and Sheppard (1995) Paul Cheshire and Stephen Sheppard. 1995. On the price of land and the value of amenities. Economica (1995), 247–267.
Fout et al. (2017) Alex Fout, Jonathon Byrd, Basir Shariat, and Asa Ben-Hur. 2017. Protein interface prediction using graph convolutional networks. In Advances in Neural Information Processing Systems. 6533–6542.
Fu et al. (2014a) Yanjie Fu, Yong Ge, Yu Zheng, Zijun Yao, Yanchi Liu, Hui Xiong, and Jing Yuan. 2014a. Sparse real estate ranking with online user reviews and offline moving behaviors. In IEEE International Conference on Data Mining. 120–129.
Fu et al. (2015) Yanjie Fu, Guannan Liu, Spiros Papadimitriou, Hui Xiong, Yong Ge, Hengshu Zhu, and Chen Zhu. 2015. Real estate ranking via mixed land-use latent models. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 299–308.
Fu et al. (2014b) Yanjie Fu, Hui Xiong, Yong Ge, Zijun Yao, Yu Zheng, and Zhi-Hua Zhou. 2014b. Exploiting geographic dependencies for real estate appraisal: A mutual perspective of ranking and clustering. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1047–1056.
Ge et al. (2019) Chuancai Ge, Yang Wang, Xike Xie, Hengchang Liu, and Zhengyang Zhou. 2019. An integrated model for urban subregion house price forecasting: A multi-source data perspective. In IEEE International Conference on Data Mining. 1054–1059.
Gilmer et al. (2017) Justin Gilmer, Samuel S Schoenholz, Patrick F Riley, Oriol Vinyals, and George E Dahl. 2017. Neural message passing for quantum chemistry. In International Conference on Machine Learning. 1263–1272.
Graczyk et al. (2010) Magdalena Graczyk, Tadeusz Lasota, Bogdan Trawiński, and Krzysztof Trawiński. 2010. Comparison of bagging, boosting and stacking ensembles applied to real estate appraisal. In Asian Conference on Intelligent Information and Database Systems. 340–350.
Guo et al. (2014) Jingjuan Guo, Shoubo Xu, and Zhuming Bi. 2014. An integrated cost-based approach for real estate appraisals. Information Technology and Management 15, 2 (2014), 131–139.
Hamilton et al. (2017) Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. In Advances in Neural Information Processing Systems. 1024–1034.
Han et al. (2021) Jindong Han, Hao Liu, Hengshu Zhu, Hui Xiong, and Dejing Dou. 2021. Joint Air Quality and Weather Prediction Based on Multi-Adversarial Spatiotemporal Networks. In Proceedings of the 35th AAAI Conference on Artificial Intelligence.
Ke et al. (2017) Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, and Tie-Yan Liu. 2017. Lightgbm: A highly efficient gradient boosting decision tree. In Advances in Neural Information Processing Systems. 3146–3154.
Kipf and Welling (2017) Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In International Conference on Learning Representations, ICLR.
Law et al. (2019) Stephen Law, Brooks Paige, and Chris Russell. 2019. Take a look around: using street view and satellite images to estimate house prices. ACM Transactions on Intelligent Systems and Technology (TIST) 10, 5 (2019), 1–19.
Li et al. (2019) Qingyang Li, Zhiwen Yu, Bin Guo, Huang Xu, and Xinjiang Lu. 2019. Housing Demand Estimation Based on Express Delivery Data. ACM Transactions on Knowledge Discovery from Data (TKDD) 13, 4 (2019), 1–25.
Li et al. (2020) Shuangli Li, Jingbo Zhou, Tong Xu, Hao Liu, Xinjiang Lu, and Hui Xiong. 2020. Competitive Analysis for Points of Interest. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1265–1274.
Li et al. (2018) Yaguang Li, Rose Yu, Cyrus Shahabi, and Yan Liu. 2018. Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting. In International Conference on Learning Representations, ICLR.
Limsombunchai (2004) Visit Limsombunchai. 2004. House price prediction: hedonic price model vs. artificial neural network. In New Zealand Agricultural and Resource Economics Society Conference. 25–26.
Lin and Chen (2011) Hongyu Lin and Kuentai Chen. 2011. Predicting price of Taiwan real estates by neural networks and support vector regression. In Proceedings of the 15th WSEAS International Conference on Systems. 220–225.
Liu et al. (2020a) Hao Liu, Jindong Han, Yanjie Fu, Jingbo Zhou, Xinjiang Lu, and Hui Xiong. 2020a. Multi-modal transportation recommendation with unified route representation learning. Proceedings of the VLDB Endowment 14, 3 (2020), 342–350.
Liu et al. (2020b) Hao Liu, Yongxin Tong, Jindong Han, Panpan Zhang, Xinjiang Lu, and Hui Xiong. 2020b. Incorporating Multi-Source Urban Data for Personalized and Context-Aware Multi-Modal Transportation Recommendation. IEEE Transactions on Knowledge and Data Engineering (2020).
Mccluskey and Borst (1997) W. Mccluskey and R. Borst. 1997. An evaluation of MRA, comparable sales analysis, and artificial neural networks (ANNs) for the mass appraisal of residential properties in Northern Ireland. Assessment 4 (1997), 47–55.
Pagourtzi et al. (2003) Elli Pagourtzi, Vassilis Assimakopoulos, Thomas Hatzichristos, and Nick French. 2003. Real estate appraisal: a review of valuation methods. Journal of Property Investment & Finance (2003).
Park and Bae (2015) Byeonghwa Park and Jae Kwon Bae. 2015. Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data. Expert Systems with Applications 42, 6 (2015), 2928–2934.
Pedregosa et al. (2011) Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, et al. 2011. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12 (2011), 2825–2830.
Peter et al. (2020) Nkolika J Peter, Hilary I Okagbue, Emmanuela CM Obasi, and Adedotun O Akinola. 2020. Review on the Application of Artificial Neural Networks in Real Estate Valuation. International Journal 9, 3 (2020).
Peterson and Flanagan (2009) Steven Peterson and Albert Flanagan. 2009. Neural network hedonic pricing models in mass real estate appraisal. Journal of Real Estate Research 31, 2 (2009), 147–164.
Poursaeed et al. (2018) Omid Poursaeed, Tomáš Matera, and Serge Belongie. 2018. Vision-based real estate price estimation. Machine Vision and Applications 29, 4 (2018), 667–676.
Rosen (1974) Sherwin Rosen. 1974. Hedonic prices and implicit markets: product differentiation in pure competition. Journal of Political Economy 82, 1 (1974), 34–55.
Santoro et al. (2017) Adam Santoro, David Raposo, David GT Barrett, Mateusz Malinowski, Razvan Pascanu, Peter Battaglia, and Timothy Lillicrap. 2017. A simple neural network module for relational reasoning. In Advances in Neural Information Processing Systems. 4974–4983.
Şipoş et al. (2008) Ciprian Şipoş, Eng Adrian Crivii, and MBA FRICS. 2008. A linear regression model for real estate appraisal. In WAVO Valuation Congress Valuation in Diversified and Emerging Economies. 17–18.
Tan et al. (2017) Fei Tan, Chaoran Cheng, and Zhi Wei. 2017. Time-aware latent hierarchical model for predicting house prices. In IEEE International Conference on Data Mining. 1111–1116.
Velickovic et al. (2018) Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, and Yoshua Bengio. 2018. Graph attention networks. In International Conference on Learning Representations, ICLR.
Wang et al. (2019) Hongwei Wang, Fuzheng Zhang, Mengdi Zhang, Jure Leskovec, Miao Zhao, Wenjie Li, and Zhongyuan Wang. 2019. Knowledge-aware graph neural networks with label smoothness regularization for recommender systems. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 968–977.
Wang et al. (2020) Jun Wang, Hefu Zhang, Qi Liu, Zhen Pan, and Hanqing Tao. 2020. Crowdfunding Dynamics Tracking: A Reinforcement Learning Approach. In Proceedings of the 34th AAAI Conference on Artificial Intelligence. 6210–6218.
Wu et al. (2020) Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and S Yu Philip. 2020. A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems (2020).
Xin et al. (2021) Haoran Xin, Xinjiang Lu, Tong Xu, Hao Liu, Jingjing Gu, Dejing Dou, and Hui Xiong. 2021. Out-of-Town Recommendation with Travel Intention Modeling. In Proceedings of the 35th AAAI Conference on Artificial Intelligence.
Ying et al. (2018) Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L Hamilton, and Jure Leskovec. 2018. Graph convolutional neural networks for web-scale recommender systems. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 974–983.
You et al. (2017) Quanzeng You, Ran Pang, Liangliang Cao, and Jiebo Luo. 2017. Image-based appraisal of real estate properties. IEEE Transactions on Multimedia 19, 12 (2017), 2751–2759.
Zhang et al. (2020) Weijia Zhang, Hao Liu, Yanchi Liu, Jingbo Zhou, Tong Xu, and Hui Xiong. 2020. Semi-Supervised City-Wide Parking Availability Prediction via Hierarchical Recurrent Graph Neural Network. IEEE Transactions on Knowledge and Data Engineering (2020).
Zhang et al. (2021) Weijia Zhang, Hao Liu, Fan Wang, Tong Xu, Haoran Xin, Dejing Dou, and Hui Xiong. 2021. Intelligent Electric Vehicle Charging Recommendation Based on Multi-Agent Reinforcement Learning. In The World Wide Web Conference.
Zhu et al. (2020) Hengshu Zhu, Ying Sun, Wenjia Zhao, Fuzhen Zhuang, Baoshan Wang, and Hui Xiong. 2020. Rapid Learning of Earthquake Felt Area and Intensity Distribution with Real-time Search Engine Queries. Scientific Reports 10, 1 (2020), 1–9.
Zhu et al. (2016) Hengshu Zhu, Hui Xiong, Fangshuang Tang, Qi Liu, Yong Ge, Enhong Chen, and Yanjie Fu. 2016. Days on market: Measuring liquidity in real estate markets. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 393–402.

Table 3. Detailed feature list.

Feature Type

Feature

Description

Real Estate Profile

Number of rooms

Number of rooms (e.g., bedroom, living-room, kitchen, bathroom) in this estate.

Area

Number of square meters of the estate.

Decoration

Type of decoration (e.g., simply-decorated, well-decorated, not decorated).

Orientation

Orientation of the estate (e.g., north, south, west, east).

Estate structure

Structure of estate (e.g., flat layer, jump layer, duplex).

Heating method

Method of estate’s heating (e.g., central heating, self-heating, without heating).

Floor type

Type of the floor of the estate (e.g., high, medium, low, basement).

Free of tax

Whether the estate is free of sales tax.

Transaction ownership

Transaction ownership of the estate (i.e., commercial housing, affordable housing, purchased public housing).

Floor number

Number of floors of the building where the estate is located.

Building type

Type of the building (e.g., tower building, slab-type building, mixed-type building).

Elevator household ratio

Ratio of the number of elevators to households of the building.

Residential Community Profile

Developer

Developer of the community.

Completion year

Completion year of the community.

Number of buildings

The total number of buildings in the community.

Number of estates

The total number of estates in the community.

Property fee

Property fee of the community.

District

District of the community (e.g., Xicheng, Haidian, Chaoyang).

Community index

Index of community.

Temporal

Valuation date

Date of making valuation for the estate.

Historical price

Statistics (e.g., mean, variance, maximum, minimum) for the unit prices of transactions closed in previous

quarter in the community.

Geographical

Transportation

Number of the transport facilities (e.g., subway and bus stations) nearby and the distances of the nearest ones.

Education

Number of the educational facilities (e.g., kindergarten, primary and middle schools, college) nearby

and the distances of the nearest ones.

Medical

Number of the medical facilities (e.g., hospital, clinic, pharmacy) nearby and the distances of the nearest ones.

Shopping

Number of the shopping places (e.g., mall, supermarkte, convenience store) nearby and the distances

of the nearest ones.

Living

Number of the living places (e.g., restaurant, barbershop, bank) nearby and the distances of the nearest ones.

Entertainment

Number of the entertainment venues (e.g., cinema, gym, park) nearby and the distances of the nearest ones.

Unpleasantness

Number of the unpleasant facilities (e.g., factory, cemetery) nearby and the distances of the nearest ones.

Number of facilities

Number of all the POIs and transport stations nearby.

Population Visit

Work hours visits

Visiting frequency of population nearby in work hours (10:00-18:00) on workdays and weekends.

Break hours visits

Visiting frequency of population nearby in break hours (18:00-23:00) on workdays and weekends.

All day visits

Visiting frequency of population nearby in all day on workdays and weekends.

Mobility

Inflow volume

Human volume of the inflow of community on workdays and weekends.

Outflow volume

Human volume of the outflow of community on workdays and weekends.

Travel mode

Distribution of the residents’ travel modes (e.g., drive, taxi, bus, cycle, walk) from the community on

workdays and weekends.

Travel destination

Distribution of the types of residents’ travel destinations (e.g., enterprise, administration, shopping places,

entertainment venues) from the community on workdays and weekends.

Resident Population Profile

Resident population

Number of resident population in the community.

Hometown

Distribution of the hometowns (e.g., Beijing, Shandong) of resident population in the community.

Gender

Distribution of the gender of resident population in the community.

Age

Distribution of the ages (e.g., teenager, youth, the middle-aged, the old) of resident population in the community.

Life stage

Distribution of the life stages (e.g., student, working, parent, retire) of resident population in the community.

Industry

Distribution of the industries (e.g., education, catering, IT, finance) of resident population in the community.

Car owner

Distribution of owning cars of resident population in the community.

Income level

Distribution of the income levels (e.g., low, medium, high, very high) of resident population in the community.

Education level

Distribution of the education levels (e.g., undergraduate, college, senior) of resident population in the community.

Consumption level

Distribution of the consumption level (e.g., low, medium, high) of resident population in the community.

Consumption wish

Distribution of the consumption wishes (e.g., daily supplies, education, healthcare, travel, finance, technology)

of resident population in the community.

Appendix A Details of used features

Table 3 is the feature list used in MugRep, including Real Estate Profile features, Residential Community Profile features, Temporal features, Geographical features, Population Visit features, Mobility features, and Resident Population Profile features.

Appendix B Details of baselines

We compare our MugRep with the following six baselines. We carefully tune major hyper-parameters of each baseline based on their recommended settings. GBRT, DNN, and PDVM employ the same early stop training strategy as MugRep.

•

HA uses the average previous 90 days’ price of transactions closed in the same residential community as estimated value.
•

LR (Pedregosa et al., 2011) makes appraisal via the well-known linear regression model.
•

SVR (Pedregosa et al., 2011) makes appraisal via the support vector regression model. We use the Radial Basis Function (RBF) kernel.
•

GBRT makes appraisal via gradient boosted regression tree model. We use the version in LightGBM (Ke et al., 2017), set learning rate to 0.1, set maximal tree depth to 10, and maximal leaves number to 31.
•

DNN is a type of ANN method contains two fully-connected 64 dimensions hidden layers with ReLU activation functions, and employ Adam for optimization. The learning rate is set to 0.005.
•

PDVM (Bin et al., 2019) is a state-of-the-art ANN method for real estate appraisal. It models estates peer-dependency by using the k-nearest similar estate sampling to sample fixed number of real estates, and feed them to a bidirectional LSTM to generate final real estate appraisal. We slightly modify PDVM to sample historical real estate transactions to fit our dataset. We employ one layer bidirectional LSTM, and its input sequence length is 7. The hidden dimension is set to 64, learning rate is 0.005.

Appendix C Prototype system

We have implemented a prototype system for supporting users to make real estate appraisal decisions. We use angularJS (JavaScript MVW framework), bootstrap (front-end framework for web development), and Django (a web framework in Python) along with MySQL to build our prototype system, of which Figure 9 shows a screenshot. Specifically, once the user enters a residential community name to search, and select the community from a list of returned candidate items, the system will show its position on the map and other detailed information, including developer, property fee, completion year, etc. Then the user is expected to input real estate profile attributes, such as estate’s number of rooms, area, decoration, orientation, and click the ”Valuate” button to generate estimated price of the subject property.