Stability of China’s Stock Market: Measure and Forecast by Ricci Curvature on Network

Xinyu Wang School of Science, China Agricultural University, Beijing, 100091, China Liang Zhao [ Corresponding author, School of Mathematical Sciences,
Key Laboratory of Mathematics and Complex Systems of MOE,
Beijing Normal University, Beijing 100875, China Ning Zhang School of Finance, Chinese Fintech Research Center,
Central University of Finance and Economics, Beijing, 102206, China Liu Feng Haibo Lin

Abstract

The systemic stability of a stock market is one of the core issues in the financial field. The market can be regarded as a complex network whose nodes are stocks connected by edges that signify their correlation strength. Since the market is a strongly nonlinear system, it is difficult to measure the macroscopic stability and depict market fluctuations in time. In this paper, we use a geometric measure derived from discrete Ricci curvature to capture the higher-order nonlinear architecture of financial networks. In order to confirm the effectiveness of our method, we use it to analyze the CSI 300 constituents of China’s stock market from 2005–2020 and the systemic stability of the market is quantified through the network’s Ricci type curvatures. Furthermore, we use a hybrid model to analyze the curvature time series and predict the future trends of the market accurately. As far as we know, this is the first paper to apply Ricci curvature to forecast the systemic stability of domestic stock market, and our results show that Ricci curvature has good explanatory power for the market stability and can be a good indicator to judge the future risk and volatility of the domestic market.

keywords:

stability, Ricci curvature, network, stock market

^†^†journal:

\ast\ast\ast

label2] [email protected]

1 Introduction

Through more than thirty years of development, China’s capital market has grown continuously. With improvements of the trading mechanism, the market stability has been gradually enhanced and the market plays a more and more important role in optimizing the social financing structure and promoting the allocation of resources. On the other hand, China’s financial market is in its infancy, and abnormal market fluctuations still occur occasionally. For example, from 2007 to 2008, the Shanghai Composite Index fell from $6124$ , the highest point, to $1664$ , a drop of $70\%$ . During the market crash in 2015, the market experienced significant abnormal fluctuations which lasted for half a year. As the key factors of derivative pricing and financial risk management, it is of great significance to study how to measure and forecast the market stability reasonably and accurately. This kind of ability to analyze and predict the market is conducive to the objective and quantifiable evaluation of China’s financial market, to the analysis of the market stability factors and the formulation of targeted policies, so as to realize the early warning and prevention of financial risks and the maintenance of financial stability.

The stock market is a nonlinear and non-stationary system with strong volatility, tight coupling and asymmetry. Individual stocks in the market interact each other and the abnormal fluctuations of individuals may quickly enlarge to the whole market. To better understand the highly correlated market, as well as to achieve monitoring and adjustment of it, economists advocate the use of many new tools and interdisciplinary approaches, such as trigger points, feedback, contagion and complexity theory[1, 2, 3, 4, 5, 6]. In particular, to describe the stability macroscopically, we should not consider each individual separately, but should regard the market as a whole system, which coincides with the nature of complex networks[7, 8]. Empirical cross-correlation among stock prices has been extensively studied and explored more than two decades[9, 10, 11, 12, 13, 14]. The correlation between stock returns allows us to construct a variety of correlation-based networks, such as minimum spanning trees (MST)[10, 16, 17, 15] or threshold networks[18], where nodes represent stocks and edges represent correlation strength (or converted to a distance metric). In recent years, correlation-based networks become one of the common tools for modeling and analyzing complex financial systems[14, 15, 19, 20, 21].

Since there are interactions that occur among groups of more nodes besides pairwise interactions, to reveal the higher-order nonlinear relationship in a network [22, 23, 24, 25], curvature, which is a key concept in geometry proposed by Gauss and Riemann[26], can be an appropriate and powerful tool, and it has been increasingly used as network metrics in recent years[24, 25, 28, 27]. In 2015, Sandhu et al.[29] applied the graph curvature to cancer networks for the first time. Sandhu et al.[28] also studied the evolution of Ollivier-Ricci curvature in the financial threshold network and showed that Ollivier-Ricci curvature can be used to determine the stability of USA S $\&$ P-500 over the period 1998-2013. A recent study by Samal et al.[30] confirms that discrete Ricci curvature can be an excellent indicator of stability and volatility for financial markets of USA and Japan. For the financial market in China, relevant studies have confirmed that it has significant small-world effect and scale-free feature[31, 32, 33], which provides us a theoretical basis for the combination of network geometry and domestic financial market. In summary, the description of the stability of the domestic stock market through geometric measurement is the first motivation of the research work in this paper.

In addition to measure the stability, prediction of trends of the market is also an exciting research area and this is another main purpose of this paper. We will use a hybrid machine learning model combing deep neural network and wavelet decomposition to achieve this goal. We remark that, because the financial curvature time series are complex, non-stationary and very noisy, the classic time series models, such as ARIMA, GARCH, et al., are not suitable for this task.

Since deep learning models can successfully extract features of real-world data, combining deep learning with financial market forecasting is regarded as a charming strategy[34]. Among them, recurrent neural network (RNN)[35, 36] is a kind of recursive neural network that is input from sequence data, recursive in the direction of the evolution of sequence, and chained by all nodes. To overcome gradient disappearance and gradient explosion of RNN, a specific kind of RNN named Long Short–Term Memory (LSTM)[37, 38], which takes into account the long-term dependence of time series, is gradually used in time series forecasting. Kumar and Ningombam[39] evaluated the effectiveness of LSTM for making predictions about stock prices of APPL(Apple Inc./NASDAQ). Liu[40] applied LSTM to the large interval volatility forecasting of S $\&$ P 500 and AAPL, and finally concluded that LSTM can achieve a better forecasting result than GARCH(1,1). Huang, et al.[41] decomposed financial data into long-term and short-term trends by variational mode decomposition and then utilized LSTM to predict the future trends of the sequences.

Wavelet decomposition (WD) is an approach that describes the relationship between the time series in time and frequency domains simultaneously. Through wavelet decomposition, the noise feature of time series can be fixed. Therefore, it is natural to combine wavelet decomposition and forecasting models to improve the prediction accuracy of time series. In the research of impact of COVID-19 on the global economy, Štifanić, et al.[42] integrated the stationary wavelet transform and bidirectional long short-term memory neural network to forecast Crude Oil and stock prices and achieved satisfactory results. Peng, et al.[43] applied a LSTM-based model into energy consumption forecasting, which also combined wavelet decomposition and LSTM, and achieved better prediction accuracy compared with the basic LSTM model.

In the present paper, according to the above work and our two main purposes, we first construct a threshold network based on the daily returns of the constituents of CSI 300 index over 16 years. A main objective of this study is to confirm that discrete Ricci curvature can be applied to networks of China’s stock market and can accurately describe its systemic stability. We find that Ricci curvature provides a good response to the systemic characteristic of the financial market in China and we can use this tool to to identify important events (good or bad) in the market. As another main contribution, we develop a hybrid forecasting model which provides a good response to the future trends of the market.

2 Preliminaries

2.1 Graph and Minimum Spanning Tree

In mathematics, we usually call a network a graph, which is composed of a finite set of nodes and a set of edges between nodes, denoted as $G(V,E)$ , where $G$ is denoted as a graph, $V$ is the set of nodes in $G$ , and $E$ is the set of edges in $G$ . Table 1 list some of the concepts related to graph.

Professional terminology	Definition
Directed Edge	The edge has directions
Undirected Edge	The edge has no direction
Directed graph	All edges of the graph are directed edges
Undirected graph	All edges in the graph are undirected
Directed complete graph	A directed graph with edges between any two nodes
Undirected complete graph	An undirected graph with edges between any two nodes
Weight	Edge–related numbers

Table 1: Basic concepts of graphs

For brevity, we only discuss undirected graphs. Two nodes of a graph are said to be connected if there is a path between them. If any two nodes in the graph are connected, the graph is called a connected graph. The spanning tree of a connected graph with $n$ vertices is a connected subgraph that contains all $n$ vertices, but has only $n-1$ edges. If an edge is added to a spanning tree, it necessarily forms a ring, and if an edge is reduced, it is no longer a connected graph.

Minimum Spanning Tree(MST): In a given undirected graph $G=(V,E)$ , $e_{uv}$ represents the edge connecting nodes $u$ and $v$ , and $\omega_{uv}$ represents the weight of this edge. If there exists $T$ which is a spanning tree of $G$ and $\omega(T)$ is minimal, $T$ is called a minimal spanning tree of $G$ . We usually use Prim’s algorithm[44] to implement the construction of minimum spanning trees of a graph.

2.2 Ricci-type Curvatures for Network Analysis

As an important geometric quantity, the classical Ricci curvature quantifies the deviation for the tangent direction and requires a smooth manifold as well as a tensor and higher order derivatives[26]. This requirement is not applicable to discrete graphs or networks, so it is necessary to discretize it to apply in networks. In this work, we apply four different types of discrete Ricci curvatures to the threshold network of China’s stock market. Their definitions and applications can be found in many relevant literatures. For completeness, we briefly describe their definitions here.

Ollivier-Ricci Curvature: This is a widely used discretization[25, 27, 28] of the classical Ricci curvature raised by Olliver[45, 46]. In recent years it has also been applied to financial networks[29, 30]. In a space with positive curvature, the average distance between balls is less than the center distance, while in a negative curved space, the opposite conclusion is reached. Ollivier-Ricci(OR) curvature extends the above observations from balls (volumes) to measures (probabilities), and the OR curvature of the edge $e$ connecting nodes $u$ and $v$ is defined as

\displaystyle O(e)=1-\frac{W_{1}(m_{u},m_{v})}{d(u,v)}.

(2.1)

In (2.1), $m_{u}$ and $m_{v}$ represent measures concentrated at nodes $u$ and $v$ , $d(u,v)$ is the distance between $u$ and $v$ , and $W_{1}$ is the Wasserstein distance[47] between the discrete probability measures $m_{u}$ and $m_{v}$ . The Wasserstein distance is given by

\displaystyle W_{1}(m_{u},m_{v})=\underset{\mu_{u,v}\in\Pi(m_{u},m_{v})}{\inf}\sum_{(u^{\prime},v^{\prime})\in V\times V}d(u^{\prime},v^{\prime})\mu_{u,v}(u^{\prime},v^{\prime}),

where $\Pi(m_{u},m_{v})$ is the set of probability measures $\mu_{u,v}$ that satisfy

\displaystyle\sum_{u^{\prime}\in V}\mu_{u,v}(u^{\prime},v^{\prime})=m_{v}(v^{\prime}),~{}~{}~{}\sum_{v^{\prime}\in V}\mu_{u,v}(u^{\prime},v^{\prime})=m_{u}(u^{\prime})

In addition, the probability distribution $m_{u}$ for $u\in V$ must be specified, which is chosen to be uniform over the neighboring nodes of $u$ [48].

Forman-Ricci Curvature: Forman-Ricci(FR) Curvature is based on the relationship between the Riemannian Laplace operator and the Ricci curvature[49]. It has been shown that FR curvature and edge betweenness centrality are highly correlated[25, 50]. In the undirected network, the FR curvature of edge $e$ connecting nodes $u$ and $v$ is defined as[24]

\displaystyle F(e)=\omega_{e}\left(\frac{\omega_{u}}{\omega_{e}}+\frac{\omega_{v}}{\omega_{e}}-\sum_{e_{u}\sim e,e_{v}\sim e}\left[\frac{\omega_{u}}{\sqrt{\omega_{e}\omega_{e_{u}}}}+\frac{\omega_{v}}{\sqrt{\omega_{e}\omega_{e_{v}}}}\right]\right),

(2.2)

where $\omega_{e}$ , $\omega_{u}$ and $\omega_{v}$ denote the weights of the edge $e$ , the nodes $u$ and $v$ respectively. In addition, $e_{u}\sim e$ and $e_{v}\sim e$ denote the set of edges connecting $u$ and $v$ , respectively, but excluding the edge $e$ .

Menger-Ricci Curvature: Menger’s approach[51] is based on viewing the graph as a metric space, and the path length between two nodes is treated as the distance between two points in the metric space. Suppose $T$ is a triangle in the metric space with sides $a$ , $b$ and $c$ , then Menger curvature of $T$ is given by

\displaystyle M(T)=\frac{1}{R(T)}=\frac{\sqrt{p(p-a)(p-b)(p-c)}}{a\cdot b\cdot c},

where $p=(a+b+c)/2$ and $R(T)$ is the radius of the circumscribed circle of the triangle $T$ . Then, Menger-Ricci(MR) curvature of an edge $e$ in a network can be defined as[52]

\displaystyle M(e)=\sum_{T_{e}\sim e}M(T_{e}),

(2.3)

where $T_{e}\sim e$ denotes the set of triangles formed by side $e$ .

Haantjes-Ricci Curvature: Haantjes[53] defined the curvature of a curve in a metric space as the ratio of the arc length to the chord length of the curve. For a discrete network, suppose that $\pi=v_{0},v_{1},\cdots v_{n}$ is a simple path between nodes $v_{0}$ and $v_{n}$ , $l(\pi)$ is the length of the path and $d(v_{0},v_{n})$ is the shortest distance between nodes $v_{0}$ and $v_{n}$ . Haantjes-Ricci(HR) curvature of the simple path $\pi$ is

\displaystyle H^{2}(\pi)=\frac{l(\pi)-d(v_{0},v_{n})}{d(v_{0},v_{n})^{3}}.

Then, HR curvature of an edge $e$ can be defined as

\displaystyle H(e)=\sum_{\pi\sim e}H(\pi),

(2.4)

where $\pi\sim e$ denote the paths that connect the nodes anchoring the edge $e$ .

The above four discretizations focus on capturing different geometric properties portrayed by the classical Ricci curvature. OR curvature can well capture the aspect of volume growth of classical Ricci curvature. We use OR curvature in networks to compare the average distance between two nodes. FR curvature depicts the geodesic diffusivity of the classical Ricci curvature and we use FR curvature in networks to show the information spread at the ends of edges. Both MR and HR curvatures can capture the geodesics dispersal rate of the classical Ricci curvature. In this work, we ignore the weights of the edges in the network and calculate the average of edges for these four discrete Ricci curvatures according to equations (2.1-2.4), respectively, and considering the computational complexity, we only use the path between nodes whose length is less than or equal to $4$ in the calculations of MR and HR curvatures.

2.3 Discrete Wavelet

Wavelet analysis is a time-frequency analysis method and can achieve high resolution in both time and frequency domains. Through decomposing the curvature time series of our financial networks into several components based on various frequencies, wavelet analysis is able to filter out the chaotic components, so as to remove the influence of noises and improve the prediction performance effectively.

The wavelet transform is roughly divided into continuous transform and discrete transform and both are based on two specific functions: mother wavelet function and daughter wavelet function. For the continuous case, assuming $\psi\in L^{2}(\mathbb{R})$ and $\widetilde{\psi}(\omega)$ is the Fourier transform of $\psi(t)$ , $\psi(t)$ is called mother wavelet function, if $\widetilde{\psi}(\omega)$ meets:

\displaystyle C_{\psi}=\int\frac{|\widetilde{\psi}(\omega)|^{2}}{|\omega|}d\omega<\infty.

And the definition of daughter wavelet function is as followed:

\displaystyle\psi_{a,b}(t)=\frac{1}{\sqrt{|a|}}\psi\left(\frac{t-b}{a}\right),

(2.5)

where $a$ and $b$ are respectively called expansion factor and translation factor.

Due to the fact that our curvature data is based on the daily returns of stocks, we utilize the discrete wavelet transform to decompose the time series. Assigning $2^{-j}$ and $k2^{-j}$ to $a$ and $b$ in equation(2.5), discrete daughter wavelet function is as followed:

\displaystyle\psi_{2^{-j},k2^{-j}}(t)=2^{j/2}\psi(2^{j}t-k),

(2.6)

where $j,k\in\mathbb{Z}$ . For brevity, we use $\psi_{j,k}(t)$ instead of $\psi_{2^{-j},k2^{-j}}(t)$ from now on. The discrete wavelet transform corresponding $\psi_{j,k}(t)$ is as followed:

\displaystyle DWf(j,k)=\langle f,\psi_{j,k}\rangle=2^{j/2}\int_{-\infty}^{+\infty}f(t)\overline{\psi}(2^{j}t-k)dt,

(2.7)

where $f(t)\in L^{2}(\mathbb{R})$ and $\overline{\psi}$ is the conjugate of $\psi$ .

Our denoising process of wavelet decomposition is divided in to the following three steps:

Step1: determine a wavelet function and the number of decomposition layers, and then decompose the original time series.

Step2: select an appropriate threshold to eliminate the fluctuation exceeding the threshold and retain the specific signals.

Step3: reconstruct the retained signals to form a new signal.

3 Data and Methods

3.1 Data Description

The data of this paper are collected from Eastmoney(www.eastmoney.com), including daily closing prices for $N=111$ stocks, $T=3889$ trading days, from January 4, 2005 to December 31, 2020. All the $N=111$ stocks are constituents of CSI 300 Index. Due to some unavoidable factors such as stock suspensions, some stocks are missing their prices on certain trading days. Considering that the stock prices do not change too much in a short period of time, we fill the gaps with the data of previous trading time.

First for each stock, we construct a daily return time series $r_{k}(t)$ according to the formula as followed:

\displaystyle r_{k}(t)=\ln P_{k}(t)-\ln P_{k}(t-1),

where $k=1,2,\cdots,N$ , $t=2,3,\cdots,T$ and $P_{k}(t)$ is the adjusted closing price of the $k$ th stock at time $t$ . Then, the equal-time Pearson cross-correlation coefficients $c_{ij}$ of the daily return time series of stock $i$ and stock $j$ is defined as

\displaystyle c_{ij}(t)=\frac{Cov(r_{i},r_{j})}{\sigma_{i}\sigma_{j}},

where $Cov(r_{i},r_{j})$ is the covariance of $r_{i}$ and $r_{j}$ in a time interval of length $\tau$ , $i,j=1,\cdots,N$ , $t$ indicates the end date of the interval of $\tau$ trading days. In our empirical research, we use the following two schemes to divide time series in order to better illustrated the reliability of our conclusion by comparison of these two approaches.

(i) A non-overlapping time interval of $\tau$ =22 trading days (one trading month),

(ii)An overlapping time interval of $\tau$ =22 days, with a rolling shift of $\Delta$ =5 trading days (one trading week).

Corresponding to correlation coefficients, we construct the distance measures $d_{ij}$ which are widely used for the construction of financial networks[10, 54, 15].

\displaystyle d_{ij}(t)=\sqrt{2(1-c_{ij}(t))}.

3.2 Threshold Network Construction

Firstly, for a given time interval of $\tau$ trading days ending on trading day $t$ , we get a distance matrix $D_{\tau}(t)$ whose elements are $d_{ij}(t)$ . This distance matrix $D_{\tau}(t)$ can be considered as an edge-weighted complete graph $G_{\tau}(t)$ , whose nodes are stocks and the weight of an edge between stocks $i$ and $j$ is given by $d_{ij}(t)$ . Next, with the help of Prim’s algorithm[44], we create MST $T_{\tau}(t)$ based on the complete graph $G_{\tau}(t)$ , which selects the most relevant connections of the stocks. Finally, to capture more significant information in the market, we add edges in $G_{\tau}(t)$ to connect corresponding nodes $i$ and $j$ in $T_{\tau}(t)$ if $c_{ij}(t)>\theta$ for some threshold $\theta$ . The complete graph constructed by MST and the threshold $\theta$ is called threshold network and is denoted as $S_{\tau}(t)$ .

In this paper, we set the threshold $\theta=0.75$ and use $S_{\tau}(t)$ for calculating different kinds of Ricci curvatures.

3.3 The Hybrid Forecasting Model

Due to the fact that the curvature time series is composed of nonlinear features, various temporal information and noises, it is challenging to achieve an accurate forecasting result. Wavelet decomposition can analyze the series from different scales, which can not only reflect the overall trend, but also extract the effective information of the series in details. On the other hand, as a deep learning model, LSTM is able to learn long-term correlations and mine complicated nonlinear relationships within the curvature series effectively. Based on the above facts, we propose a hybrid WD-LSTM model, combining the strengths of wavelet decomposition and long short-term memory network, to forecast the future trends of the market. The WD-LSTM model involves three phrases: decomposition, forecasting and integration. In the decomposition phrase, we decompose the original curvature series data into four high frequency sequences (detail) and one low frequency sequence (approximation). Next, in the forecasting phrase, LSTM is utilized to forecast each decomposed sequence respectively. Finally, the prediction results of all sub-sequences are aggregated in the integration phrase. The architecture of the WD-LSTM model is shown in Figure 1.

Refer to caption — Figure 1: The architechture of WD-LSTM

LSTM used in the forecasting phrase is a specially designed RNN and suitable for processing and forecasting important events with very long intervals and delays in the time series. The architecture of LSTM at time $t$ is composed of four units: forget gate, input gate, output gate and cell state, which is shown in Figure 2. To clarify the details of LSTM, we use $W$ , $U$ and $b$ with different subscripts to denote the linear coefficients and biases of these units.

The output $f_{t}$ of forget gate at time $t$ represents the probability of forgetting the hidden cell state of the previous layer, which can be calculated by:

\displaystyle f_{t}=(\sigma W_{f}h_{t-1}+U_{f}x_{t}+b_{f}),

where $\sigma$ is the sigmoid activation function, $h_{t-1}$ denotes the state of hidden layer at time $t-1$ , $x_{t}$ denotes the input vector at time $t$ .

The input gate is responsible for processing the current input signal and composed of two parts depending on sigmoid and $\tanh$ activation functions respectively. This gate can be formulated as:

\displaystyle\left\{\begin{aligned} i_{t}&=\sigma(W_{i}h_{t-1}+U_{i}x_{t}+b_{i})\\ a_{t}&=\tanh(W_{a}h_{t-1}+U_{a}x_{t}+b_{a})\\ \end{aligned}\right.

The cell state is updated according to forget gate and input gate which is formulated as:

\displaystyle C_{t}=C_{t-1}\odot f_{t}+a_{t}\odot i_{t},

where $\odot$ denotes the Hadamard product.

The output gate is formulated as:

\displaystyle\begin{aligned} O_{t}&=\sigma(W_{o}h_{t-1}+U_{o}x_{t}+b_{o})\\ \end{aligned}.

With the output state $O_{t}$ and hidden cell state $C_{t}$ at time $t$ , the hidden state of the cell is updated as:

\displaystyle\begin{aligned} h_{t}&=O_{t}\odot\tanh(C_{t})\\ \end{aligned}.

Finally, we set a forecast unit which is a fully connected neural network with outputs be the forecasting values $Y_{t}$ of the time series at time $t+1$ according to the hidden state $h_{t}$ :

\displaystyle Y_{t}=\sigma(Wh_{t}+b).

To complete the building of the whole LSTM model, we set four layers including input layer, LSTM layer, fully connected layer and regression layer, as shown in Figure 1, where the regression layer is used to give the mean square error of the outputs.

4 Empirical Results

4.1 Market Stability

Exploring the explanatory power of Ricci curvature for the stability of China’s stock market is one of the main purposes of this paper. We analyze the logarithmic returns of constituents of CSI 300 index over a 16-year period (2005–2020) by means of building the undirected network $S_{\tau}(t)$ with the threshold $\theta=0.75$ . The MST and threshold network constructed based on the data is shown in Figure 3 and Tabel 2 lists some of the ticker symbols corresponding to numbers of nodes in the figure.

Number	18	27	35	49	58	74	97	108
Ticher Symbol	600109	600183	600346	600570	600703	601607	000786	002008

Table 2: List of some of the ticker symbols

Figure 4 depicts four curvature time series of the threshold network $S_{\tau}(t)$ building with non-overlapping time intervals ( $\tau=22$ trading days) and Figure 5 with a rolling shift of $\Delta=5$ . Obviously, the fluctuation trends of the curvature time series which are obtained by using two different data processing methods are essentially consistent, which confirms the generalization performance of our methods and the reliability of our conclusions.

We list some of the major events in China’s financial market between 2005 and 2020 in Table 3. As key events in the market, during these events, the rule, structure, participants or external environment of the market have changed significantly and the stability should be poorer than the normal periods. To verify the effectiveness of the geometric quantities of networks, we compare these events and the curvature time series, and find out that the fluctuations of the curvature time series can capture these key information of the market well. Some of the events are marked with dotted lines in Figure 4 and 5.

Number	Events	Time/Period
1	Shareholding Reform	May 2005
2	Subprime mortgage crisis	Aug 2007
3	International Financial Crisis	2008-2009
4	Establishment of GEM	30 Oct 2009
5	First CSI 300 futures contracts listed	16 Apr 2010
6	CSRC proposed eight key tasks	14 Jan 2011
7	PBOC cut RMB RRR	30 Nov 2011
8	Suspension of IPO	2013
9	The mix-up event of Everbright Securities	16 Aug 2013
10	Market Crash in China	15 Jun-9 Jul 2015
11	Implementation of the meltdown mechanism	1 Jan 2016
12	Establishment of the STAR Market	5 Nov 2018
13	Launch of Shanghai-London Stock Exchange	17 Jun 2019
14	First listing of the STAR stocks	22 Jul 2019
15	Impact of COVID-19	3 Mar-1 May 2020

Table 3: List of some market events between 2005 and 2020

Combining the results in Figure 4 and 5, and the events in Table 3, we find that the four discrete Ricci curvatures can depict the market stability. During the periods of those key events, the curvature time series fluctuates to different degrees. In particular, when the news is significantly good or bad, the time series shows large fluctuations. We therefore believe that the discrete Ricci curvatures can serve as good indicators of the stability for China’s stock market.

4.2 Forecasting of the Systemic Stability

To accomplish another main purpose, we apply the WD-LSTM model to analyze the curvature time series and forecast the future trend of China’s stock market. The WD-LSTM model contains three phrases: decomposition, forecast and integration. The empirical results through the above three phrases are presented below in details.

Decomposition of Curvature Series: According to (2.6) and (2.7), we first decompose the original curvature series into four high frequency sequences (detail) and one low frequency sequence (approximation). For brevity, we choose FR curvature series ( $\Delta=5$ ) as an example and present its decomposition results in Figure 6.

Forecast of Decomposed Sequences: The second step of the WD-LSTM model is to forecast each component decomposed by the WD module by using the LSTM module. In our experiment, each decomposed sequences is divided into training set and testing set according to the proportion of $80\%$ and $20\%$ . Since $\tau=22$ and $\Delta=5$ , the training time series is from February 2, 2015 to November 2, 2017. The number of LSTM layer is set to be $200$ . While in the process of training the LSTM model, the max iteration and the initial learning rate is set to be $250$ and $0.005$ . Besides, the optimizer of LSTM is chosen to be Adam and the gradient threshold is set to be $1$ . After training by using the back-propagation algorithm, we use the hidden state $h_{t-1}$ to forecast the value at time $t$ , where $t$ is from November 9, 2017 to December 31, 2020.

Figure 7 presents the forecasting result of decomposed sequences of FR curvature series.

Integration of Forecasting Results: The final step of the WD-LSTM model is to integrate the forecasting results of decomposed sequences. After the integration phrase, we can get the final forecasting results of the curvature series. We show the final forecasting results of the four Ricci-type curvature series in Figure 8. We also list the evaluation metrics of the final forecasting results, including mean absolute error (MAE), mean square error (MSE) and $R^{2}$ , in Table 4.

	OR	MR	HR	FR
MAE	0.0156	0.5409	73.9460	2.4413
MSE	0.0004	0.8745	14087.5311	18.1357
$R^{2}$	0.9653	0.9295	0.8701	0.9459

Table 4: The evaluation metrics of the WD-LSTM model

4.3 Model Comparison $\&$ Empirical Summary

To verify the superiority of the WD-LSTM model, in this subsection, we carry out a comparative experiment where a basic LSTM model is utilized to forecast the four Ricci-type curvature series directly. Table 5 presents the evaluation metrics of the single LSTM model’s final forecasting results.

	OR	MR	HR	FR
MAE	0.0950	2.3464	200.4158	9.8831
MSE	0.0146	14.2501	186815.2867	251.1875
$R^{2}$	-0.1271	-0.1495	-0.7230	0.2509

Table 5: The evaluation metrics of the single LSTM

Comparing Table 4 and Table 5, it is obvious that for each evaluation metric, the forecasting performance of the WD-LSTM model is significantly better than that of the basic LSTM model for all the four Ricci-type curvature series. It implies that the wavelet decomposition plays a remarkable role and the hybrid model can handle the strong nonlinearity, complex time characteristics and noise interference of the curvature series better than the single LSTM model.

Furthermore, there must be performance differences between the four Ricci curvatures. Samal, et al.[30] have shown that FR curvature are more sensitive and can detect both crashes and bubbles in USA S $\&$ P-500 and Japanese Nikkei-225 markets more efficiently. For China’s stock market, comparison of $R^{2}$ metrics for the four kinds of curvatures in Table 4 and Table 5 obviously implies that the performance of the hybrid model is better than a single LSTM model. In the hybrid model, all the four curvatures have excellent explanatory power for depict and forecast the stability of China’s stock market. In particular, the $R^{2}$ metric of OR curvature series is closer to $1$ than those of the other three. We can infer that the OR curvature series is more suitable for the domestic market, which is different from the conclusion about the foreign market. This may reflect the different characteristics of domestic and foreign markets. According to the definitions of these two curvatures, FR curvature is mainly aimed at capturing the diffusion characteristics of the geodesic, which is more sensitive to events than other curvatures, and can better capture the details of the market. While OR curvature measures the relative distance between two respective neighborhoods of two vertices that form an edge. Therefore, it is more suitable for the domestic market where macro-control measures are implemented more effectively and the co-movement effect of the stock sectors is more obvious.

5 Conclusion

In this paper, we apply different types of discrete Ricci curvatures of networks to characterize the systemic stability of China’s stock market. We verify the reliability of our methods by monitoring the fluctuations of the constituents of CSI 300 index from 2005 to 2020 in conjunction with Table 3. We find that network curvatures can be used as good indicators for the systemic stability of China’s stock market.

Based on the above, we also make a more in depth application of the geometric measure. A hybrid WD-LSTM model, combing wavelet decomposition with long short-term memory network, is applied to forecast the future trends of the systemic stability for China’s stock market by means of modeling and predicting the curvature series data. Comparing to the single LSTM model, the WD-LSTM model performs significantly better. Moreover, the empirical result shows that OR curvature is most suitable for the domestic market and the proposed hybrid model has excellent forecasting performance.

In summary, we use discrete Ricci curvature as a measure of the stability for China’s financial market and apply an effective hybrid model to forecast the future trends. Our methods and models are very helpful to develop new financial regulatory tools to better identify, forecast, and prevent market risks and contribute to financial stability.

Acknowledgements. This research was supported by New Liberal Arts Research and Reform Practice Project of Ministry of Education (NO. 2021060011) and the Emerging Interdisciplinary Project of CUFE.

References

[1] Sutherland A, Financial market integration and macroeconomic volatility, The Scandinavian Journal of Economic, 1996, 98: 521-539.
[2] Baig T, Goldfajn I, Financial Market Contagion in the Asian Crisis, IMF Staff Papers, 1999, 46: 167-195.
[3] Subrahmanyam A, Titman S, Feedback from stock prices to cash flows, The Journal of Finance, 2002, 56: 2389-2413.
[4] Bouchaud J P, and Potters M, Theory of financial risk and derivative pricing: From statistical physics to risk management, Cambridge University Press, Cambridge, 2003.
[5] Ghoulmie F, Cont R, Nadal J P, Heterogeneity and feedback in an agent-based market model, Journal of Physics: Condensed Matter, 2005, 17: 1259.
[6] Chakraborti A, Challet D, Chatterjee A, et al., Statistical mechanics of competitive resource allocation using agent-based models, Physics Reports, 2015, 552: 1-25.
[7] Mantegna R N, and Stanley H E, An introduction to econophysics: Correlations and complexity in finance, Cambridge University Press, Cambridge, 2007.
[8] Battiston S, Flache A, Garlaschelli D, et al., Complexity theory and financial regulation, Science, 2016, 351: 818-819.
[9] Plerou V, Gopikrishnan P, Rosenow B, et al., Universal and nonuniversal properties of cross correlations in financial time series, Physics Review Letters, 1999, 83: 1471-1474.
[10] Mantegna R N, Hierarchical structure in financial markets, The European Physical Journal B-Condensed Matter and Complex Systems, 1999, 11: 193-197.
[11] Laloux L, Cizeau P, Bouchaud J P, et al., Noise dressing of financial correlation matrices, Physical Review Letters, 1999, 83: 1467-1470.
[12] Gopikrishnan P, Rosenow B, Plerou V, et al., Quantifying and interpreting collective behavior in financial markets, Physical Review E, 2001, 64: 035106.
[13] Kullmann L, Kertész J, Kaski K, Time dependent cross-correlations between different stock returns: A directed network of influence, Physical Review E, 2002, 66: 026125.
[14] Plerou V, Gopikrishnan P, Rosenow B, et al., Random matrix approach to cross correlations in financial data, Physical Review E, 2002, 65: 066126.
[15] Onnela J P, Chakraborti A, Kaski K, et al., Dynamics of market correlations: Taxonomy and portfolio analysis, Physical Review E, 2003, 68: 056110.
[16] Dussert C, Rasigni G, Rasigni M, et al., Minimal spanning tree: A new approach for studying order and disorder, Physical Review B, 1986, 34: 3528.
[17] Miccichè S, Bonanno G, Lillo F, et al., Degree stability of a minimum spanning tree of price return and volatility, Physica A, 2003, 324: 66-73.
[18] Kumar S, Deo N, Correlation and network analysis of global financial indices, Physical Review E, 2012, 86: 026101.
[19] Tumminello M, Aste T, Di Matteo T, et al., A tool for filtering information in complex systems, Proceedings of the National Academy of Sciences, 2005, 102: 10421-10426.
[20] Pharasi H K, Sharma K, Chatterjee R, et al., Identifying long-term precursors of financial market crashes using correlation patterns, New Journal of Physics, 2018, 20: 103041.
[21] Chakraborti A, Sharma K, Pharasi H K, et al., Emerging spectra characterization of catastrophic instabilities in complex systems, New Journal of Physics, 2020, 22: 063043.
[22] Krioukov D, Papadopoulos F, Kitsak M, et al., Hyperbolic geometry of complex networks, Physical Review E, 2010, 82: 036106.
[23] Bianconi G, Interdisciplinary and physics challenges of network theory, Europhysics Letters, 2015, 111: 56001.
[24] Sreejith R P, Mohanraj K, Jost J, et al., Forman curvature for complex networks, Journal of Statistical Mechanics: Theory and Experiment, 2016, 063206.
[25] Samal A, Sreejith R P, Gu J, et al., Comparative analysis of two discretizations of Ricci curvature for complex networks, Scientific Reports, 2018, 8: 8650.
[26] Jost J, Riemannian geometry and geometric analysis, Springer International Publishing, Berlin, 2017.
[27] Ni C, Lin Y, Luo F, et al., Community detection on networks with Ricci flow, Scientific Reports, 2019, 9: 9984.
[28] Sandhu R, Georgiou T, Reznik E, et al., Graph curvature for differentiating cancer networks, Scientific Reports, 2015, 5: 12323.
[29] Sandhu R S, Georgiou T T, Tannenbaum A R, Ricci curvature: An economic indicator for market fragility and systemic risk, Science Advances, 2016, 2: 1501495.
[30] Samal A, Pharasi H K, Ramaia S J, et al., Network geometry and market instability, Royal Society Open Science, 2020, 8: 201734.
[31] Jin Y H, Zhang Q, Shan L F, et al., Characteristics of venture capital network and its correlation with regional economy: Evidence from China, PLoS One, 2015, 10: 0137172.
[32] Zhang W P, Zhuang X T, The stability of Chinese stock network and its mechanism, Physica A, 2019, 515: 748-761.
[33] Wang Y L, Zhang Q P, Yang X G, Evolution of the Chinese guarantee network under financial crisis and stimulus program, Nature Communications, 2020, 11: 2693.
[34] Cavalcante R C, Brasileiro R C, Souza V L F, et al., Computational intelligence and financial markets: A survey and future directions, Expert Systems with Applications, 2016, 55: 194-211.
[35] Werbos P J, Backpropagation through time: What it does and how to do it, Proceedings of the IEEE, 1990, 78: 1550-1560.
[36] Hochreiter S, Untersuchungen zu dynamischen neuronalen netzen, Diploma thesis, T.U. München, 1991.
[37] Hochreiter S, Schmidhuber J, Long short-term memory, Neural Computation, 1997, 9: 1735-1780.
[38] Gers F A, Schmidhuber J, Cummins F, Learning to forget: Continual prediction with LSTM, Neural Computation, 2000, 12: 2451-2471.
[39] Kumar S, Ningombam D, Short-term forecasting of stock prices using long short term memory, International Conference on Information Technology (ICIT), 2018.
[40] Liu Y, Novel volatility forecasting using deep learning long short term memory recurrent neural networks, Expert Systems with Applications, 2019, 132: 99-109.
[41] Huang Y, Gao Y, Gan Y, et al., A new financial data forecasting model using genetic algorithm and long short-term memory network, Neurocomputing, 2021, 425: 207-218.
[42] Štifanić D, Musulin J, Miočević A, et al., Impact of COVID-19 on forecasting stock prices: An integration of stationary wavelet transform and bidirectional long short-term memory, Complexity, 2020, 1846926.
[43] Peng L, Wang L, Xia D, et al., Effective energy consumption forecasting using empirical wavelet transform and long short-term memory, Energy, 2022, 238: 121756.
[44] Prim R C, Shortest connection networks and some generalizations, The Bell System Technical Journal, 1957, 36: 1389-1401.
[45] Ollivier Y, Ricci curvature of metric spaces, Comptes Rendus de l’Académie des Sciences-Series I-Mathematics, 2007, 345: 643-646.
[46] Ollivier Y, Ricci curvature of Markov chains on metric spaces, Journal of Functional Analysis, 2009, 256: 810-864.
[47] Vaserstein L N, Markov processes over denumerable products of spaces, describing large systems of automata, Problemy Peredachi Informatsii, 1969, 5: 64-72.
[48] Lin Y, Lu L, Yau ST, Ricci curvature of graphs, Tohoku Mathematical Journal, 2011, 63: 605-627.
[49] Forman R, Bochner’s method for cell complexes and combinatorial Ricci curvature, Discrete $\&$ Computational Geometry, 2003, 29: 323-374.
[50] Sreejith R P, Jost J, Saucan E, et al., Systematic evaluation of a new combinatorial curvature for complex networks, Chaos, Solitons $\&$ Fractals, 2017, 101: 50-67.
[51] Menger K, Untersuchungen über allgemeine Metrik. Vierte Untersuchung. Zur Metrik der Kurven, Mathematische Annalen, 1930, 103: 466-501.
[52] Saucan E, Samal A, Jost J, A simple differential geometry for complex networks, Network Science, 2020, 9: 106-133.
[53] Haantjes J, Distance geometry: Curvature in abstract metric spaces, Proc. Kon. Ned. Akad. v. Wetenseh., 1947, 50: 302-314.
[54] Mantegna R N, Information and hierarchical structure in financial markets, Computer Physics Communications, 1999, 121-122: 153-156.