This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Stability of China’s Stock Market: Measure and Forecast by Ricci Curvature on Network

Xinyu Wang School of Science, China Agricultural University, Beijing, 100091, China Liang Zhao [ Corresponding author, School of Mathematical Sciences,
Key Laboratory of Mathematics and Complex Systems of MOE,
Beijing Normal University, Beijing 100875, China
Ning Zhang School of Finance, Chinese Fintech Research Center,
Central University of Finance and Economics, Beijing, 102206, China
Liu Feng Haibo Lin
Abstract

The systemic stability of a stock market is one of the core issues in the financial field. The market can be regarded as a complex network whose nodes are stocks connected by edges that signify their correlation strength. Since the market is a strongly nonlinear system, it is difficult to measure the macroscopic stability and depict market fluctuations in time. In this paper, we use a geometric measure derived from discrete Ricci curvature to capture the higher-order nonlinear architecture of financial networks. In order to confirm the effectiveness of our method, we use it to analyze the CSI 300 constituents of China’s stock market from 2005–2020 and the systemic stability of the market is quantified through the network’s Ricci type curvatures. Furthermore, we use a hybrid model to analyze the curvature time series and predict the future trends of the market accurately. As far as we know, this is the first paper to apply Ricci curvature to forecast the systemic stability of domestic stock market, and our results show that Ricci curvature has good explanatory power for the market stability and can be a good indicator to judge the future risk and volatility of the domestic market.

keywords:
stability, Ricci curvature, network, stock market
journal: \ast\ast\ast

1 Introduction

Through more than thirty years of development, China’s capital market has grown continuously. With improvements of the trading mechanism, the market stability has been gradually enhanced and the market plays a more and more important role in optimizing the social financing structure and promoting the allocation of resources. On the other hand, China’s financial market is in its infancy, and abnormal market fluctuations still occur occasionally. For example, from 2007 to 2008, the Shanghai Composite Index fell from 61246124, the highest point, to 16641664, a drop of 70%70\%. During the market crash in 2015, the market experienced significant abnormal fluctuations which lasted for half a year. As the key factors of derivative pricing and financial risk management, it is of great significance to study how to measure and forecast the market stability reasonably and accurately. This kind of ability to analyze and predict the market is conducive to the objective and quantifiable evaluation of China’s financial market, to the analysis of the market stability factors and the formulation of targeted policies, so as to realize the early warning and prevention of financial risks and the maintenance of financial stability.

The stock market is a nonlinear and non-stationary system with strong volatility, tight coupling and asymmetry. Individual stocks in the market interact each other and the abnormal fluctuations of individuals may quickly enlarge to the whole market. To better understand the highly correlated market, as well as to achieve monitoring and adjustment of it, economists advocate the use of many new tools and interdisciplinary approaches, such as trigger points, feedback, contagion and complexity theory[1, 2, 3, 4, 5, 6]. In particular, to describe the stability macroscopically, we should not consider each individual separately, but should regard the market as a whole system, which coincides with the nature of complex networks[7, 8]. Empirical cross-correlation among stock prices has been extensively studied and explored more than two decades[9, 10, 11, 12, 13, 14]. The correlation between stock returns allows us to construct a variety of correlation-based networks, such as minimum spanning trees (MST)[10, 16, 17, 15] or threshold networks[18], where nodes represent stocks and edges represent correlation strength (or converted to a distance metric). In recent years, correlation-based networks become one of the common tools for modeling and analyzing complex financial systems[14, 15, 19, 20, 21].

Since there are interactions that occur among groups of more nodes besides pairwise interactions, to reveal the higher-order nonlinear relationship in a network [22, 23, 24, 25], curvature, which is a key concept in geometry proposed by Gauss and Riemann[26], can be an appropriate and powerful tool, and it has been increasingly used as network metrics in recent years[24, 25, 28, 27]. In 2015, Sandhu et al.[29] applied the graph curvature to cancer networks for the first time. Sandhu et al.[28] also studied the evolution of Ollivier-Ricci curvature in the financial threshold network and showed that Ollivier-Ricci curvature can be used to determine the stability of USA S&\&P-500 over the period 1998-2013. A recent study by Samal et al.[30] confirms that discrete Ricci curvature can be an excellent indicator of stability and volatility for financial markets of USA and Japan. For the financial market in China, relevant studies have confirmed that it has significant small-world effect and scale-free feature[31, 32, 33], which provides us a theoretical basis for the combination of network geometry and domestic financial market. In summary, the description of the stability of the domestic stock market through geometric measurement is the first motivation of the research work in this paper.

In addition to measure the stability, prediction of trends of the market is also an exciting research area and this is another main purpose of this paper. We will use a hybrid machine learning model combing deep neural network and wavelet decomposition to achieve this goal. We remark that, because the financial curvature time series are complex, non-stationary and very noisy, the classic time series models, such as ARIMA, GARCH, et al., are not suitable for this task.

Since deep learning models can successfully extract features of real-world data, combining deep learning with financial market forecasting is regarded as a charming strategy[34]. Among them, recurrent neural network (RNN)[35, 36] is a kind of recursive neural network that is input from sequence data, recursive in the direction of the evolution of sequence, and chained by all nodes. To overcome gradient disappearance and gradient explosion of RNN, a specific kind of RNN named Long Short–Term Memory (LSTM)[37, 38], which takes into account the long-term dependence of time series, is gradually used in time series forecasting. Kumar and Ningombam[39] evaluated the effectiveness of LSTM for making predictions about stock prices of APPL(Apple Inc./NASDAQ). Liu[40] applied LSTM to the large interval volatility forecasting of S&\&P 500 and AAPL, and finally concluded that LSTM can achieve a better forecasting result than GARCH(1,1). Huang, et al.[41] decomposed financial data into long-term and short-term trends by variational mode decomposition and then utilized LSTM to predict the future trends of the sequences.

Wavelet decomposition (WD) is an approach that describes the relationship between the time series in time and frequency domains simultaneously. Through wavelet decomposition, the noise feature of time series can be fixed. Therefore, it is natural to combine wavelet decomposition and forecasting models to improve the prediction accuracy of time series. In the research of impact of COVID-19 on the global economy, Štifanić, et al.[42] integrated the stationary wavelet transform and bidirectional long short-term memory neural network to forecast Crude Oil and stock prices and achieved satisfactory results. Peng, et al.[43] applied a LSTM-based model into energy consumption forecasting, which also combined wavelet decomposition and LSTM, and achieved better prediction accuracy compared with the basic LSTM model.

In the present paper, according to the above work and our two main purposes, we first construct a threshold network based on the daily returns of the constituents of CSI 300 index over 16 years. A main objective of this study is to confirm that discrete Ricci curvature can be applied to networks of China’s stock market and can accurately describe its systemic stability. We find that Ricci curvature provides a good response to the systemic characteristic of the financial market in China and we can use this tool to to identify important events (good or bad) in the market. As another main contribution, we develop a hybrid forecasting model which provides a good response to the future trends of the market.

2 Preliminaries

2.1 Graph and Minimum Spanning Tree

In mathematics, we usually call a network a graph, which is composed of a finite set of nodes and a set of edges between nodes, denoted as G(V,E)G(V,E), where GG is denoted as a graph, VV is the set of nodes in GG, and EE is the set of edges in GG. Table 1 list some of the concepts related to graph.

Professional terminology Definition
Directed Edge The edge has directions
Undirected Edge The edge has no direction
Directed graph All edges of the graph are directed edges
Undirected graph All edges in the graph are undirected
Directed complete graph A directed graph with edges between any two nodes
Undirected complete graph An undirected graph with edges between any two nodes
Weight Edge–related numbers
Table 1: Basic concepts of graphs

For brevity, we only discuss undirected graphs. Two nodes of a graph are said to be connected if there is a path between them. If any two nodes in the graph are connected, the graph is called a connected graph. The spanning tree of a connected graph with nn vertices is a connected subgraph that contains all nn vertices, but has only n1n-1 edges. If an edge is added to a spanning tree, it necessarily forms a ring, and if an edge is reduced, it is no longer a connected graph.

Minimum Spanning Tree(MST): In a given undirected graph G=(V,E)G=(V,E), euve_{uv} represents the edge connecting nodes uu and vv, and ωuv\omega_{uv} represents the weight of this edge. If there exists TT which is a spanning tree of GG and ω(T)\omega(T) is minimal, TT is called a minimal spanning tree of GG. We usually use Prim’s algorithm[44] to implement the construction of minimum spanning trees of a graph.

2.2 Ricci-type Curvatures for Network Analysis

As an important geometric quantity, the classical Ricci curvature quantifies the deviation for the tangent direction and requires a smooth manifold as well as a tensor and higher order derivatives[26]. This requirement is not applicable to discrete graphs or networks, so it is necessary to discretize it to apply in networks. In this work, we apply four different types of discrete Ricci curvatures to the threshold network of China’s stock market. Their definitions and applications can be found in many relevant literatures. For completeness, we briefly describe their definitions here.

Ollivier-Ricci Curvature: This is a widely used discretization[25, 27, 28] of the classical Ricci curvature raised by Olliver[45, 46]. In recent years it has also been applied to financial networks[29, 30]. In a space with positive curvature, the average distance between balls is less than the center distance, while in a negative curved space, the opposite conclusion is reached. Ollivier-Ricci(OR) curvature extends the above observations from balls (volumes) to measures (probabilities), and the OR curvature of the edge ee connecting nodes uu and vv is defined as

O(e)=1W1(mu,mv)d(u,v).\displaystyle O(e)=1-\frac{W_{1}(m_{u},m_{v})}{d(u,v)}. (2.1)

In (2.1), mum_{u} and mvm_{v} represent measures concentrated at nodes uu and vv, d(u,v)d(u,v) is the distance between uu and vv, and W1W_{1} is the Wasserstein distance[47] between the discrete probability measures mum_{u} and mvm_{v}. The Wasserstein distance is given by

W1(mu,mv)=infμu,vΠ(mu,mv)(u,v)V×Vd(u,v)μu,v(u,v),\displaystyle W_{1}(m_{u},m_{v})=\underset{\mu_{u,v}\in\Pi(m_{u},m_{v})}{\inf}\sum_{(u^{\prime},v^{\prime})\in V\times V}d(u^{\prime},v^{\prime})\mu_{u,v}(u^{\prime},v^{\prime}),

where Π(mu,mv)\Pi(m_{u},m_{v}) is the set of probability measures μu,v\mu_{u,v} that satisfy

uVμu,v(u,v)=mv(v),vVμu,v(u,v)=mu(u)\displaystyle\sum_{u^{\prime}\in V}\mu_{u,v}(u^{\prime},v^{\prime})=m_{v}(v^{\prime}),~{}~{}~{}\sum_{v^{\prime}\in V}\mu_{u,v}(u^{\prime},v^{\prime})=m_{u}(u^{\prime})

In addition, the probability distribution mum_{u} for uVu\in V must be specified, which is chosen to be uniform over the neighboring nodes of uu[48].

Forman-Ricci Curvature: Forman-Ricci(FR) Curvature is based on the relationship between the Riemannian Laplace operator and the Ricci curvature[49]. It has been shown that FR curvature and edge betweenness centrality are highly correlated[25, 50]. In the undirected network, the FR curvature of edge ee connecting nodes uu and vv is defined as[24]

F(e)=ωe(ωuωe+ωvωeeue,eve[ωuωeωeu+ωvωeωev]),\displaystyle F(e)=\omega_{e}\left(\frac{\omega_{u}}{\omega_{e}}+\frac{\omega_{v}}{\omega_{e}}-\sum_{e_{u}\sim e,e_{v}\sim e}\left[\frac{\omega_{u}}{\sqrt{\omega_{e}\omega_{e_{u}}}}+\frac{\omega_{v}}{\sqrt{\omega_{e}\omega_{e_{v}}}}\right]\right), (2.2)

where ωe\omega_{e}, ωu\omega_{u} and ωv\omega_{v} denote the weights of the edge ee, the nodes uu and vv respectively. In addition, euee_{u}\sim e and evee_{v}\sim e denote the set of edges connecting uu and vv, respectively, but excluding the edge ee.

Menger-Ricci Curvature: Menger’s approach[51] is based on viewing the graph as a metric space, and the path length between two nodes is treated as the distance between two points in the metric space. Suppose TT is a triangle in the metric space with sides aa, bb and cc, then Menger curvature of TT is given by

M(T)=1R(T)=p(pa)(pb)(pc)abc,\displaystyle M(T)=\frac{1}{R(T)}=\frac{\sqrt{p(p-a)(p-b)(p-c)}}{a\cdot b\cdot c},

where p=(a+b+c)/2p=(a+b+c)/2 and R(T)R(T) is the radius of the circumscribed circle of the triangle TT. Then, Menger-Ricci(MR) curvature of an edge ee in a network can be defined as[52]

M(e)=TeeM(Te),\displaystyle M(e)=\sum_{T_{e}\sim e}M(T_{e}), (2.3)

where TeeT_{e}\sim e denotes the set of triangles formed by side ee.

Haantjes-Ricci Curvature: Haantjes[53] defined the curvature of a curve in a metric space as the ratio of the arc length to the chord length of the curve. For a discrete network, suppose that π=v0,v1,vn\pi=v_{0},v_{1},\cdots v_{n} is a simple path between nodes v0v_{0} and vnv_{n}, l(π)l(\pi) is the length of the path and d(v0,vn)d(v_{0},v_{n}) is the shortest distance between nodes v0v_{0} and vnv_{n}. Haantjes-Ricci(HR) curvature of the simple path π\pi is

H2(π)=l(π)d(v0,vn)d(v0,vn)3.\displaystyle H^{2}(\pi)=\frac{l(\pi)-d(v_{0},v_{n})}{d(v_{0},v_{n})^{3}}.

Then, HR curvature of an edge ee can be defined as

H(e)=πeH(π),\displaystyle H(e)=\sum_{\pi\sim e}H(\pi), (2.4)

where πe\pi\sim e denote the paths that connect the nodes anchoring the edge ee.

The above four discretizations focus on capturing different geometric properties portrayed by the classical Ricci curvature. OR curvature can well capture the aspect of volume growth of classical Ricci curvature. We use OR curvature in networks to compare the average distance between two nodes. FR curvature depicts the geodesic diffusivity of the classical Ricci curvature and we use FR curvature in networks to show the information spread at the ends of edges. Both MR and HR curvatures can capture the geodesics dispersal rate of the classical Ricci curvature. In this work, we ignore the weights of the edges in the network and calculate the average of edges for these four discrete Ricci curvatures according to equations (2.1-2.4), respectively, and considering the computational complexity, we only use the path between nodes whose length is less than or equal to 44 in the calculations of MR and HR curvatures.

2.3 Discrete Wavelet

Wavelet analysis is a time-frequency analysis method and can achieve high resolution in both time and frequency domains. Through decomposing the curvature time series of our financial networks into several components based on various frequencies, wavelet analysis is able to filter out the chaotic components, so as to remove the influence of noises and improve the prediction performance effectively.

The wavelet transform is roughly divided into continuous transform and discrete transform and both are based on two specific functions: mother wavelet function and daughter wavelet function. For the continuous case, assuming ψL2()\psi\in L^{2}(\mathbb{R}) and ψ~(ω)\widetilde{\psi}(\omega) is the Fourier transform of ψ(t)\psi(t), ψ(t)\psi(t) is called mother wavelet function, if ψ~(ω)\widetilde{\psi}(\omega) meets:

Cψ=|ψ~(ω)|2|ω|𝑑ω<.\displaystyle C_{\psi}=\int\frac{|\widetilde{\psi}(\omega)|^{2}}{|\omega|}d\omega<\infty.

And the definition of daughter wavelet function is as followed:

ψa,b(t)=1|a|ψ(tba),\displaystyle\psi_{a,b}(t)=\frac{1}{\sqrt{|a|}}\psi\left(\frac{t-b}{a}\right), (2.5)

where aa and bb are respectively called expansion factor and translation factor.

Due to the fact that our curvature data is based on the daily returns of stocks, we utilize the discrete wavelet transform to decompose the time series. Assigning 2j2^{-j} and k2jk2^{-j} to aa and bb in equation(2.5), discrete daughter wavelet function is as followed:

ψ2j,k2j(t)=2j/2ψ(2jtk),\displaystyle\psi_{2^{-j},k2^{-j}}(t)=2^{j/2}\psi(2^{j}t-k), (2.6)

where j,kj,k\in\mathbb{Z}. For brevity, we use ψj,k(t)\psi_{j,k}(t) instead of ψ2j,k2j(t)\psi_{2^{-j},k2^{-j}}(t) from now on. The discrete wavelet transform corresponding ψj,k(t)\psi_{j,k}(t) is as followed:

DWf(j,k)=f,ψj,k=2j/2+f(t)ψ¯(2jtk)𝑑t,\displaystyle DWf(j,k)=\langle f,\psi_{j,k}\rangle=2^{j/2}\int_{-\infty}^{+\infty}f(t)\overline{\psi}(2^{j}t-k)dt, (2.7)

where f(t)L2()f(t)\in L^{2}(\mathbb{R}) and ψ¯\overline{\psi} is the conjugate of ψ\psi.

Our denoising process of wavelet decomposition is divided in to the following three steps:

Step1: determine a wavelet function and the number of decomposition layers, and then decompose the original time series.

Step2: select an appropriate threshold to eliminate the fluctuation exceeding the threshold and retain the specific signals.

Step3: reconstruct the retained signals to form a new signal.

3 Data and Methods

3.1 Data Description

The data of this paper are collected from Eastmoney(www.eastmoney.com), including daily closing prices for N=111N=111 stocks, T=3889T=3889 trading days, from January 4, 2005 to December 31, 2020. All the N=111N=111 stocks are constituents of CSI 300 Index. Due to some unavoidable factors such as stock suspensions, some stocks are missing their prices on certain trading days. Considering that the stock prices do not change too much in a short period of time, we fill the gaps with the data of previous trading time.

First for each stock, we construct a daily return time series rk(t)r_{k}(t) according to the formula as followed:

rk(t)=lnPk(t)lnPk(t1),\displaystyle r_{k}(t)=\ln P_{k}(t)-\ln P_{k}(t-1),

where k=1,2,,Nk=1,2,\cdots,N, t=2,3,,Tt=2,3,\cdots,T and Pk(t)P_{k}(t) is the adjusted closing price of the kkth stock at time tt. Then, the equal-time Pearson cross-correlation coefficients cijc_{ij} of the daily return time series of stock ii and stock jj is defined as

cij(t)=Cov(ri,rj)σiσj,\displaystyle c_{ij}(t)=\frac{Cov(r_{i},r_{j})}{\sigma_{i}\sigma_{j}},

where Cov(ri,rj)Cov(r_{i},r_{j}) is the covariance of rir_{i} and rjr_{j} in a time interval of length τ\tau, i,j=1,,Ni,j=1,\cdots,N, tt indicates the end date of the interval of τ\tau trading days. In our empirical research, we use the following two schemes to divide time series in order to better illustrated the reliability of our conclusion by comparison of these two approaches.

(i) A non-overlapping time interval of τ\tau=22 trading days (one trading month),

(ii)An overlapping time interval of τ\tau=22 days, with a rolling shift of Δ\Delta=5 trading days (one trading week).

Corresponding to correlation coefficients, we construct the distance measures dijd_{ij} which are widely used for the construction of financial networks[10, 54, 15].

dij(t)=2(1cij(t)).\displaystyle d_{ij}(t)=\sqrt{2(1-c_{ij}(t))}.

3.2 Threshold Network Construction

Firstly, for a given time interval of τ\tau trading days ending on trading day tt, we get a distance matrix Dτ(t)D_{\tau}(t) whose elements are dij(t)d_{ij}(t). This distance matrix Dτ(t)D_{\tau}(t) can be considered as an edge-weighted complete graph Gτ(t)G_{\tau}(t), whose nodes are stocks and the weight of an edge between stocks ii and jj is given by dij(t)d_{ij}(t). Next, with the help of Prim’s algorithm[44], we create MST Tτ(t)T_{\tau}(t) based on the complete graph Gτ(t)G_{\tau}(t), which selects the most relevant connections of the stocks. Finally, to capture more significant information in the market, we add edges in Gτ(t)G_{\tau}(t) to connect corresponding nodes ii and jj in Tτ(t)T_{\tau}(t) if cij(t)>θc_{ij}(t)>\theta for some threshold θ\theta. The complete graph constructed by MST and the threshold θ\theta is called threshold network and is denoted as Sτ(t)S_{\tau}(t).

In this paper, we set the threshold θ=0.75\theta=0.75 and use Sτ(t)S_{\tau}(t) for calculating different kinds of Ricci curvatures.

3.3 The Hybrid Forecasting Model

Due to the fact that the curvature time series is composed of nonlinear features, various temporal information and noises, it is challenging to achieve an accurate forecasting result. Wavelet decomposition can analyze the series from different scales, which can not only reflect the overall trend, but also extract the effective information of the series in details. On the other hand, as a deep learning model, LSTM is able to learn long-term correlations and mine complicated nonlinear relationships within the curvature series effectively. Based on the above facts, we propose a hybrid WD-LSTM model, combining the strengths of wavelet decomposition and long short-term memory network, to forecast the future trends of the market. The WD-LSTM model involves three phrases: decomposition, forecasting and integration. In the decomposition phrase, we decompose the original curvature series data into four high frequency sequences (detail) and one low frequency sequence (approximation). Next, in the forecasting phrase, LSTM is utilized to forecast each decomposed sequence respectively. Finally, the prediction results of all sub-sequences are aggregated in the integration phrase. The architecture of the WD-LSTM model is shown in Figure 1.

Refer to caption
Figure 1: The architechture of WD-LSTM

LSTM used in the forecasting phrase is a specially designed RNN and suitable for processing and forecasting important events with very long intervals and delays in the time series. The architecture of LSTM at time tt is composed of four units: forget gate, input gate, output gate and cell state, which is shown in Figure 2. To clarify the details of LSTM, we use WW, UU and bb with different subscripts to denote the linear coefficients and biases of these units.

The output ftf_{t} of forget gate at time tt represents the probability of forgetting the hidden cell state of the previous layer, which can be calculated by:

ft=(σWfht1+Ufxt+bf),\displaystyle f_{t}=(\sigma W_{f}h_{t-1}+U_{f}x_{t}+b_{f}),

where σ\sigma is the sigmoid activation function, ht1h_{t-1} denotes the state of hidden layer at time t1t-1, xtx_{t} denotes the input vector at time tt.

The input gate is responsible for processing the current input signal and composed of two parts depending on sigmoid and tanh\tanh activation functions respectively. This gate can be formulated as:

{it=σ(Wiht1+Uixt+bi)at=tanh(Waht1+Uaxt+ba)\displaystyle\left\{\begin{aligned} i_{t}&=\sigma(W_{i}h_{t-1}+U_{i}x_{t}+b_{i})\\ a_{t}&=\tanh(W_{a}h_{t-1}+U_{a}x_{t}+b_{a})\\ \end{aligned}\right.

The cell state is updated according to forget gate and input gate which is formulated as:

Ct=Ct1ft+atit,\displaystyle C_{t}=C_{t-1}\odot f_{t}+a_{t}\odot i_{t},

where \odot denotes the Hadamard product.

The output gate is formulated as:

Ot=σ(Woht1+Uoxt+bo).\displaystyle\begin{aligned} O_{t}&=\sigma(W_{o}h_{t-1}+U_{o}x_{t}+b_{o})\\ \end{aligned}.

With the output state OtO_{t} and hidden cell state CtC_{t} at time tt, the hidden state of the cell is updated as:

ht=Ottanh(Ct).\displaystyle\begin{aligned} h_{t}&=O_{t}\odot\tanh(C_{t})\\ \end{aligned}.

Finally, we set a forecast unit which is a fully connected neural network with outputs be the forecasting values YtY_{t} of the time series at time t+1t+1 according to the hidden state hth_{t}:

Yt=σ(Wht+b).\displaystyle Y_{t}=\sigma(Wh_{t}+b).
Refer to caption
Figure 2: The architechture of LSTM

To complete the building of the whole LSTM model, we set four layers including input layer, LSTM layer, fully connected layer and regression layer, as shown in Figure 1, where the regression layer is used to give the mean square error of the outputs.

4 Empirical Results

4.1 Market Stability

Exploring the explanatory power of Ricci curvature for the stability of China’s stock market is one of the main purposes of this paper. We analyze the logarithmic returns of constituents of CSI 300 index over a 16-year period (2005–2020) by means of building the undirected network Sτ(t)S_{\tau}(t) with the threshold θ=0.75\theta=0.75. The MST and threshold network constructed based on the data is shown in Figure 3 and Tabel 2 lists some of the ticker symbols corresponding to numbers of nodes in the figure.

Refer to caption
(a) MST
Refer to caption
(b) Threshold Network
Figure 3: The MST and threshold network
Number 18 27 35 49 58 74 97 108
Ticher Symbol 600109 600183 600346 600570 600703 601607 000786 002008
Table 2: List of some of the ticker symbols

Figure 4 depicts four curvature time series of the threshold network Sτ(t)S_{\tau}(t) building with non-overlapping time intervals (τ=22\tau=22 trading days) and Figure 5 with a rolling shift of Δ=5\Delta=5. Obviously, the fluctuation trends of the curvature time series which are obtained by using two different data processing methods are essentially consistent, which confirms the generalization performance of our methods and the reliability of our conclusions.

We list some of the major events in China’s financial market between 2005 and 2020 in Table 3. As key events in the market, during these events, the rule, structure, participants or external environment of the market have changed significantly and the stability should be poorer than the normal periods. To verify the effectiveness of the geometric quantities of networks, we compare these events and the curvature time series, and find out that the fluctuations of the curvature time series can capture these key information of the market well. Some of the events are marked with dotted lines in Figure 4 and 5.

Number Events Time/Period
1 Shareholding Reform May 2005
2 Subprime mortgage crisis Aug 2007
3 International Financial Crisis 2008-2009
4 Establishment of GEM 30 Oct 2009
5 First CSI 300 futures contracts listed 16 Apr 2010
6 CSRC proposed eight key tasks 14 Jan 2011
7 PBOC cut RMB RRR 30 Nov 2011
8 Suspension of IPO 2013
9 The mix-up event of Everbright Securities 16 Aug 2013
10 Market Crash in China 15 Jun-9 Jul 2015
11 Implementation of the meltdown mechanism 1 Jan 2016
12 Establishment of the STAR Market 5 Nov 2018
13 Launch of Shanghai-London Stock Exchange 17 Jun 2019
14 First listing of the STAR stocks 22 Jul 2019
15 Impact of COVID-19 3 Mar-1 May 2020
Table 3: List of some market events between 2005 and 2020
Refer to caption
Figure 4: Type (i) curvature time series
Refer to caption
Figure 5: Type (ii) curvature time series

Combining the results in Figure 4 and 5, and the events in Table 3, we find that the four discrete Ricci curvatures can depict the market stability. During the periods of those key events, the curvature time series fluctuates to different degrees. In particular, when the news is significantly good or bad, the time series shows large fluctuations. We therefore believe that the discrete Ricci curvatures can serve as good indicators of the stability for China’s stock market.

4.2 Forecasting of the Systemic Stability

To accomplish another main purpose, we apply the WD-LSTM model to analyze the curvature time series and forecast the future trend of China’s stock market. The WD-LSTM model contains three phrases: decomposition, forecast and integration. The empirical results through the above three phrases are presented below in details.

Decomposition of Curvature Series: According to (2.6) and (2.7), we first decompose the original curvature series into four high frequency sequences (detail) and one low frequency sequence (approximation). For brevity, we choose FR curvature series (Δ=5\Delta=5) as an example and present its decomposition results in Figure 6.

Forecast of Decomposed Sequences: The second step of the WD-LSTM model is to forecast each component decomposed by the WD module by using the LSTM module. In our experiment, each decomposed sequences is divided into training set and testing set according to the proportion of 80%80\% and 20%20\%. Since τ=22\tau=22 and Δ=5\Delta=5, the training time series is from February 2, 2015 to November 2, 2017. The number of LSTM layer is set to be 200200. While in the process of training the LSTM model, the max iteration and the initial learning rate is set to be 250250 and 0.0050.005. Besides, the optimizer of LSTM is chosen to be Adam and the gradient threshold is set to be 11. After training by using the back-propagation algorithm, we use the hidden state ht1h_{t-1} to forecast the value at time tt, where tt is from November 9, 2017 to December 31, 2020.

Figure 7 presents the forecasting result of decomposed sequences of FR curvature series.

Refer to caption
Figure 6: The decomposition result of FR curvature series
Refer to caption
Figure 7: The forecasting result of decomposed sequences (FR curvature)

Integration of Forecasting Results: The final step of the WD-LSTM model is to integrate the forecasting results of decomposed sequences. After the integration phrase, we can get the final forecasting results of the curvature series. We show the final forecasting results of the four Ricci-type curvature series in Figure 8. We also list the evaluation metrics of the final forecasting results, including mean absolute error (MAE), mean square error (MSE) and R2R^{2}, in Table 4.

Refer to caption
Figure 8: The final forecasting results of four curvature series
OR MR HR FR
MAE 0.0156 0.5409 73.9460 2.4413
MSE 0.0004 0.8745 14087.5311 18.1357
R2R^{2} 0.9653 0.9295 0.8701 0.9459
Table 4: The evaluation metrics of the WD-LSTM model

4.3 Model Comparison &\& Empirical Summary

To verify the superiority of the WD-LSTM model, in this subsection, we carry out a comparative experiment where a basic LSTM model is utilized to forecast the four Ricci-type curvature series directly. Table 5 presents the evaluation metrics of the single LSTM model’s final forecasting results.

OR MR HR FR
MAE 0.0950 2.3464 200.4158 9.8831
MSE 0.0146 14.2501 186815.2867 251.1875
R2R^{2} -0.1271 -0.1495 -0.7230 0.2509
Table 5: The evaluation metrics of the single LSTM

Comparing Table 4 and Table 5, it is obvious that for each evaluation metric, the forecasting performance of the WD-LSTM model is significantly better than that of the basic LSTM model for all the four Ricci-type curvature series. It implies that the wavelet decomposition plays a remarkable role and the hybrid model can handle the strong nonlinearity, complex time characteristics and noise interference of the curvature series better than the single LSTM model.

Furthermore, there must be performance differences between the four Ricci curvatures. Samal, et al.[30] have shown that FR curvature are more sensitive and can detect both crashes and bubbles in USA S&\&P-500 and Japanese Nikkei-225 markets more efficiently. For China’s stock market, comparison of R2R^{2} metrics for the four kinds of curvatures in Table 4 and Table 5 obviously implies that the performance of the hybrid model is better than a single LSTM model. In the hybrid model, all the four curvatures have excellent explanatory power for depict and forecast the stability of China’s stock market. In particular, the R2R^{2} metric of OR curvature series is closer to 11 than those of the other three. We can infer that the OR curvature series is more suitable for the domestic market, which is different from the conclusion about the foreign market. This may reflect the different characteristics of domestic and foreign markets. According to the definitions of these two curvatures, FR curvature is mainly aimed at capturing the diffusion characteristics of the geodesic, which is more sensitive to events than other curvatures, and can better capture the details of the market. While OR curvature measures the relative distance between two respective neighborhoods of two vertices that form an edge. Therefore, it is more suitable for the domestic market where macro-control measures are implemented more effectively and the co-movement effect of the stock sectors is more obvious.

5 Conclusion

In this paper, we apply different types of discrete Ricci curvatures of networks to characterize the systemic stability of China’s stock market. We verify the reliability of our methods by monitoring the fluctuations of the constituents of CSI 300 index from 2005 to 2020 in conjunction with Table 3. We find that network curvatures can be used as good indicators for the systemic stability of China’s stock market.

Based on the above, we also make a more in depth application of the geometric measure. A hybrid WD-LSTM model, combing wavelet decomposition with long short-term memory network, is applied to forecast the future trends of the systemic stability for China’s stock market by means of modeling and predicting the curvature series data. Comparing to the single LSTM model, the WD-LSTM model performs significantly better. Moreover, the empirical result shows that OR curvature is most suitable for the domestic market and the proposed hybrid model has excellent forecasting performance.

In summary, we use discrete Ricci curvature as a measure of the stability for China’s financial market and apply an effective hybrid model to forecast the future trends. Our methods and models are very helpful to develop new financial regulatory tools to better identify, forecast, and prevent market risks and contribute to financial stability.

Acknowledgements. This research was supported by New Liberal Arts Research and Reform Practice Project of Ministry of Education (NO. 2021060011) and the Emerging Interdisciplinary Project of CUFE.

References

  • [1] Sutherland A, Financial market integration and macroeconomic volatility, The Scandinavian Journal of Economic, 1996, 98: 521-539.
  • [2] Baig T, Goldfajn I, Financial Market Contagion in the Asian Crisis, IMF Staff Papers, 1999, 46: 167-195.
  • [3] Subrahmanyam A, Titman S, Feedback from stock prices to cash flows, The Journal of Finance, 2002, 56: 2389-2413.
  • [4] Bouchaud J P, and Potters M, Theory of financial risk and derivative pricing: From statistical physics to risk management, Cambridge University Press, Cambridge, 2003.
  • [5] Ghoulmie F, Cont R, Nadal J P, Heterogeneity and feedback in an agent-based market model, Journal of Physics: Condensed Matter, 2005, 17: 1259.
  • [6] Chakraborti A, Challet D, Chatterjee A, et al., Statistical mechanics of competitive resource allocation using agent-based models, Physics Reports, 2015, 552: 1-25.
  • [7] Mantegna R N, and Stanley H E, An introduction to econophysics: Correlations and complexity in finance, Cambridge University Press, Cambridge, 2007.
  • [8] Battiston S, Flache A, Garlaschelli D, et al., Complexity theory and financial regulation, Science, 2016, 351: 818-819.
  • [9] Plerou V, Gopikrishnan P, Rosenow B, et al., Universal and nonuniversal properties of cross correlations in financial time series, Physics Review Letters, 1999, 83: 1471-1474.
  • [10] Mantegna R N, Hierarchical structure in financial markets, The European Physical Journal B-Condensed Matter and Complex Systems, 1999, 11: 193-197.
  • [11] Laloux L, Cizeau P, Bouchaud J P, et al., Noise dressing of financial correlation matrices, Physical Review Letters, 1999, 83: 1467-1470.
  • [12] Gopikrishnan P, Rosenow B, Plerou V, et al., Quantifying and interpreting collective behavior in financial markets, Physical Review E, 2001, 64: 035106.
  • [13] Kullmann L, Kertész J, Kaski K, Time dependent cross-correlations between different stock returns: A directed network of influence, Physical Review E, 2002, 66: 026125.
  • [14] Plerou V, Gopikrishnan P, Rosenow B, et al., Random matrix approach to cross correlations in financial data, Physical Review E, 2002, 65: 066126.
  • [15] Onnela J P, Chakraborti A, Kaski K, et al., Dynamics of market correlations: Taxonomy and portfolio analysis, Physical Review E, 2003, 68: 056110.
  • [16] Dussert C, Rasigni G, Rasigni M, et al., Minimal spanning tree: A new approach for studying order and disorder, Physical Review B, 1986, 34: 3528.
  • [17] Miccichè S, Bonanno G, Lillo F, et al., Degree stability of a minimum spanning tree of price return and volatility, Physica A, 2003, 324: 66-73.
  • [18] Kumar S, Deo N, Correlation and network analysis of global financial indices, Physical Review E, 2012, 86: 026101.
  • [19] Tumminello M, Aste T, Di Matteo T, et al., A tool for filtering information in complex systems, Proceedings of the National Academy of Sciences, 2005, 102: 10421-10426.
  • [20] Pharasi H K, Sharma K, Chatterjee R, et al., Identifying long-term precursors of financial market crashes using correlation patterns, New Journal of Physics, 2018, 20: 103041.
  • [21] Chakraborti A, Sharma K, Pharasi H K, et al., Emerging spectra characterization of catastrophic instabilities in complex systems, New Journal of Physics, 2020, 22: 063043.
  • [22] Krioukov D, Papadopoulos F, Kitsak M, et al., Hyperbolic geometry of complex networks, Physical Review E, 2010, 82: 036106.
  • [23] Bianconi G, Interdisciplinary and physics challenges of network theory, Europhysics Letters, 2015, 111: 56001.
  • [24] Sreejith R P, Mohanraj K, Jost J, et al., Forman curvature for complex networks, Journal of Statistical Mechanics: Theory and Experiment, 2016, 063206.
  • [25] Samal A, Sreejith R P, Gu J, et al., Comparative analysis of two discretizations of Ricci curvature for complex networks, Scientific Reports, 2018, 8: 8650.
  • [26] Jost J, Riemannian geometry and geometric analysis, Springer International Publishing, Berlin, 2017.
  • [27] Ni C, Lin Y, Luo F, et al., Community detection on networks with Ricci flow, Scientific Reports, 2019, 9: 9984.
  • [28] Sandhu R, Georgiou T, Reznik E, et al., Graph curvature for differentiating cancer networks, Scientific Reports, 2015, 5: 12323.
  • [29] Sandhu R S, Georgiou T T, Tannenbaum A R, Ricci curvature: An economic indicator for market fragility and systemic risk, Science Advances, 2016, 2: 1501495.
  • [30] Samal A, Pharasi H K, Ramaia S J, et al., Network geometry and market instability, Royal Society Open Science, 2020, 8: 201734.
  • [31] Jin Y H, Zhang Q, Shan L F, et al., Characteristics of venture capital network and its correlation with regional economy: Evidence from China, PLoS One, 2015, 10: 0137172.
  • [32] Zhang W P, Zhuang X T, The stability of Chinese stock network and its mechanism, Physica A, 2019, 515: 748-761.
  • [33] Wang Y L, Zhang Q P, Yang X G, Evolution of the Chinese guarantee network under financial crisis and stimulus program, Nature Communications, 2020, 11: 2693.
  • [34] Cavalcante R C, Brasileiro R C, Souza V L F, et al., Computational intelligence and financial markets: A survey and future directions, Expert Systems with Applications, 2016, 55: 194-211.
  • [35] Werbos P J, Backpropagation through time: What it does and how to do it, Proceedings of the IEEE, 1990, 78: 1550-1560.
  • [36] Hochreiter S, Untersuchungen zu dynamischen neuronalen netzen, Diploma thesis, T.U. München, 1991.
  • [37] Hochreiter S, Schmidhuber J, Long short-term memory, Neural Computation, 1997, 9: 1735-1780.
  • [38] Gers F A, Schmidhuber J, Cummins F, Learning to forget: Continual prediction with LSTM, Neural Computation, 2000, 12: 2451-2471.
  • [39] Kumar S, Ningombam D, Short-term forecasting of stock prices using long short term memory, International Conference on Information Technology (ICIT), 2018.
  • [40] Liu Y, Novel volatility forecasting using deep learning long short term memory recurrent neural networks, Expert Systems with Applications, 2019, 132: 99-109.
  • [41] Huang Y, Gao Y, Gan Y, et al., A new financial data forecasting model using genetic algorithm and long short-term memory network, Neurocomputing, 2021, 425: 207-218.
  • [42] Štifanić D, Musulin J, Miočević A, et al., Impact of COVID-19 on forecasting stock prices: An integration of stationary wavelet transform and bidirectional long short-term memory, Complexity, 2020, 1846926.
  • [43] Peng L, Wang L, Xia D, et al., Effective energy consumption forecasting using empirical wavelet transform and long short-term memory, Energy, 2022, 238: 121756.
  • [44] Prim R C, Shortest connection networks and some generalizations, The Bell System Technical Journal, 1957, 36: 1389-1401.
  • [45] Ollivier Y, Ricci curvature of metric spaces, Comptes Rendus de l’Académie des Sciences-Series I-Mathematics, 2007, 345: 643-646.
  • [46] Ollivier Y, Ricci curvature of Markov chains on metric spaces, Journal of Functional Analysis, 2009, 256: 810-864.
  • [47] Vaserstein L N, Markov processes over denumerable products of spaces, describing large systems of automata, Problemy Peredachi Informatsii, 1969, 5: 64-72.
  • [48] Lin Y, Lu L, Yau ST, Ricci curvature of graphs, Tohoku Mathematical Journal, 2011, 63: 605-627.
  • [49] Forman R, Bochner’s method for cell complexes and combinatorial Ricci curvature, Discrete &\& Computational Geometry, 2003, 29: 323-374.
  • [50] Sreejith R P, Jost J, Saucan E, et al., Systematic evaluation of a new combinatorial curvature for complex networks, Chaos, Solitons &\& Fractals, 2017, 101: 50-67.
  • [51] Menger K, Untersuchungen über allgemeine Metrik. Vierte Untersuchung. Zur Metrik der Kurven, Mathematische Annalen, 1930, 103: 466-501.
  • [52] Saucan E, Samal A, Jost J, A simple differential geometry for complex networks, Network Science, 2020, 9: 106-133.
  • [53] Haantjes J, Distance geometry: Curvature in abstract metric spaces, Proc. Kon. Ned. Akad. v. Wetenseh., 1947, 50: 302-314.
  • [54] Mantegna R N, Information and hierarchical structure in financial markets, Computer Physics Communications, 1999, 121-122: 153-156.