This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Causal inference in survival analysis using longitudinal observational data: Sequential trials and marginal structural models

Ruth H. Keogh1, Jon Michael Gran2, Shaun R. Seaman3,
Gwyneth Davies4, Stijn Vansteelandt1,5
(1Department of Medical Statistics and Centre for Statistical Methodology, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
2Oslo Centre for Biostatistics and Epidemiology, Department of Biostatistics, Institute of Basic Medical Sciences, University of Oslo, P.O. Box 1122 Blindern, 0317 Oslo, Norway
3MRC Biostatistics Unit, University of Cambridge, Institute of Public Health, Forvie Site, Robinson Way, Cambridge CB2 0SR, UK
4Population, Policy and Practice Research and Teaching Department, Great Ormond Street Institute of Child Health, University College London, London, WC1N 1EH, UK
5Department of Applied Mathematics, Computer Science and Statistics, Ghent University, 9000 Ghent, Belgium
)
Abstract

Longitudinal observational patient data can be used to investigate the causal effects of time-varying treatments on time-to-event outcomes. Several methods have been developed for controlling for the time-dependent confounding that typically occurs. The most commonly used is inverse probability weighted estimation of marginal structural models (MSM-IPTW). An alternative, the sequential trials approach, is increasingly popular, in particular in combination with the target trial emulation framework. This approach involves creating a sequence of ‘trials’ from new time origins, restricting to individuals as yet untreated and meeting other eligibility criteria, and comparing treatment initiators and non-initiators. Individuals are censored when they deviate from their treatment status at the start of each ‘trial’ (initiator/non-initiator) and this is addressed using inverse probability of censoring weights. The analysis is based on data combined across trials. We show that the sequential trials approach can estimate the parameter of a particular MSM, and compare it to a MSM-IPTW with respect to the estimands being identified, the assumptions needed and how data are used differently. We show how both approaches can estimate the same marginal risk differences. The two approaches are compared using a simulation study. The sequential trials approach, which tends to involve less extreme weights than MSM-IPTW, results in greater efficiency for estimating the marginal risk difference at most follow-up times, but this can, in certain scenarios, be reversed at late time points. We apply the methods to longitudinal observational data from the UK Cystic Fibrosis Registry to estimate the effect of dornase alfa on survival.

1 Introduction

Longitudinal observational data on patients can allow for the estimation of treatment effects over long follow-up periods and in diverse populations. A key task when aiming to estimate causal effects from observational data is to account for confounding of the treatment-outcome association. With longitudinal data on treatment and outcome, the treatment-outcome association may be subject to so-called time-dependent confounding. Such confounding is present when there are time-dependent covariates affected by past treatment, also affecting later treatment use and outcome. Over the last two decades a large statistical literature has built up on methods for estimating causal treatment effects in the presence of time-dependent confounding, and their use in practice is becoming more widespread.

In this paper we focus on estimating the effects of a longitudinal treatment regime on a time-to-event outcome, using longitudinal observational data on treatment and covariates obtained at approximately regular visits, alongside the time-to-event information. When there is time-dependent confounding, standard methods for survival analysis, such as Cox regression with adjustment for baseline or time-updated covariates, do not in general enable estimation of causal effects. Several methods have been described for estimating the causal effects of longitudinal treatment regimes on time-to-event outcomes. Marginal structural models (MSMs) estimated using inverse probability of treatment weighting (IPTW) were introduced by Robins et al.1 and extended to survival outcomes by Hernan et al.2 through marginal structural Cox models. A review by Clare et al. in 20193 found this approach to be by far the most commonly used in practice, and we refer to this as the MSM-IPTW approach. An alternative but related approach, which we refer to as the ‘sequential trials’ approach, was described by Hernan et al. in 20084 and then Gran et al. in 20105.

In the sequential trials approach, artificial ‘trials’ are mimicked from a sequence of new time origins which could, for example, be defined by study visits at which new information is recorded for each individual who remains under observation. At each time origin individuals are divided into those who have just initiated the treatment under investigation and those who have not yet initiated the treatment. Within each mimicked trial, individuals are artificially censored at the time at which their treatment status deviates from what it was at the time origin, if such deviation occurs. Inverse probability of censoring weighting is used to account for dependence of this artificial censoring on time-dependent characteristics. Effects of being treated or untreated starting from each time origin can then be estimated using, for example, weighted pooled logistic or Cox regression. There have been several applications of the sequential trials approach. Danaei et al.(2013) 6 applied it to electronic health records data to estimate the effect of statins on occurrence of coronary heart disease. Clark et al. (2015)7 studied whether onset of impaired sleep is followed by changes in health related behavior. Bhupathiraju et al. (2017)8 investigated the effect of hormone therapy use on chronic disease risk in the Nurses’ Health Study. Some papers have also used the sequential trials approach alongside other methods and compared the results empirically. Suttorp et al (2015)9 studied the effect of high dose of a particular agent used in kidney disease patients on all-cause mortality with both a sequential trials approach and MSM-IPTW using data from a longitudinal cohort. Thomas et al. (2020)10 used the sequential trials approach together with sequential stratification and time-dependent propensity score matching when studying the effect of statins on cardiovascular outcomes using the Framingham Offspring cohort.

The sequential trials approach is attractive because it mimics a series of randomized trials in a fairly straightforward and explicit manner. Several recent papers have advocated to emulate a ‘target trial’ in investigations of causal effects using observational data 11, 12, 13, 14, 15. The target trial is the hypothetical randomized controlled trial that one would like to conduct. The use of target trials provides a structured framework that guides the steps of an investigation, with the target trial protocol including specification of eligibility criteria, time origin, and the estimands of interest. There is strong link between the target trial framework of Hernan and Robins (2016)11 and the general roadmap for causal inference proposed by Petersen and van der Laan (2014)16. These approaches help to avoid biases in estimation of treatment effects using observational data, and to clarify assumptions and methodology.

Applications of the sequential trials approach, including in the original papers of Hernan et al.4 and Gran et al.5, have focused on estimation of hazard ratios. However, it has not been discussed in detail what the formal causal interpretation of this hazard ratio is and Gran and Aalen (2019)17 highlighted the need for further work to establish the causal estimand being estimated. This is important both to clarify the interpretation and to inform simulation studies to study the properties of the estimators. Karim et al. (2018)18 compared the sequential trials approach with MSM-IPTW based on Cox models using simulations, but did not account for the fact that the two analyses estimate different quantities, which are therefore not directly comparable 17. The aim of this paper is to formally describe what and how causal estimands can be identified using the sequential trials approach in a time-to-event setting and to contrast these with estimands typically identified using MSM-IPTW. We show that the causal parameter estimated from the sequential trials approach can be described in terms of the parameters of a particular marginal structural model. We also establish more broadly what estimands the two approaches are suitable for identifying, what assumptions are needed and how the data are used differently.

In the causal inference literature on the sequential trials approach and MSM-IPTW for time-to-event outcomes, the focus has mostly been on Cox proportional hazard models and estimation of hazard ratios. However, Aalen additive hazard models for hazard differences are increasingly being used, both in general to avoid the proportional hazards assumption and in causal inference due to their useful property of being collapsible 19. We will therefore outline methods in terms of both proportional and additive hazards models. As hazard ratios, and generally also hazard differences, have been shown not to have a straightforward causal interpretation 20, 21, 22, we demonstrate how marginal survival probabilities can be obtained under both hazard models for both the sequential trials and MSM-IPTW approaches, using a simple standardization procedure. This enables results from hazard models to be converted into more meaningful causal quantities such as risk differences or risk ratios. The two approaches are compared in a simulation study, in which we assess their relative efficiency for estimation of the same quantities, making use of the properties of additive hazards models 23.

The paper is organised as follows. In Section 2 we describe a motivating example, in which the aim is to estimate the effect of long term use of the treatment dornase alfa on the composite outcome of death or transplant for people with cystic fibrosis (CF), using longitudinal data from the UK Cystic Fibrosis Registry. In Section 3 we introduce notation and describe the components of a general target trial in this context. In Section 4 we outline estimation using the MSM-IPTW approach and the sequential trials approach, showing how the latter can also be understood as fitting a particular MSM, and in Section 5 we discuss in detail the similarities and differences between the two approaches. The two approaches are then compared using a simulation study in Section 6, and applied to the motivating example in Section 7. Accompanying R code for performing the simulation is provided at https://github.com/ruthkeogh/sequential_trials. We conclude with a discussion in Section 8. Our focus is on MSM-IPTW and the sequential trials approach. Petersen et al. (2007)24 described history-adjusted MSMs (HA-MSM), which are connected to both approaches, but which have not been used much in practice, perhaps due to initial criticisms of the method 25, 26. In Section 8 we discuss the link between HA-MSM and MSM-IPTW and the sequential trials approach.

2 Motivating example

As a motivating example we will investigate the long term impacts of a treatment used in CF on the composite time-to-event outcome of death or transplant. CF is an inherited, chronic, progressive condition affecting around 10,500 individuals in the UK and over 70,000 worldwide 27, 28, 29. Survival in CF has improved considerably over recent decades and the estimated median survival age for a person born with CF in the UK today is 51.6 for males and 45.7 for females 30, 31, 27.

One of the most common symptoms of CF is a build up of mucus in the lungs, which leads to an increased prevalence of bacterial growth in the airways and a decline in lung function 32. Therefore, it is common for people with CF to routinely use aerosolized mucoactive agents, which help to break down the layer of mucus in the lungs making clearance easier. Recombinant human deoxyribonuclease, commonly known as dornase alfa (DNase), is one such mucolytic treatment which was authorised for use in January 1994. DNase is the most commonly used treatment for the pulmonary consequences of CF in the UK, used by 67.6% of people in 2019 27. It is a nebulised treatment, administered daily on a long term basis. The efficacy of DNase for health outcomes in CF has been studied in several randomized controlled trials 33. Most trials had short term follow-up and focused on the impact of DNase on lung function. Seven trials from the 1990s investigated mortality as a secondary outcome but follow-up was short and findings inconclusive; a meta-analysis based on these studies gave a risk ratio estimate of 1.70 (95% CI 0.70 to 4.14) comparing users versus non-users 33.

The effect of DNase use on lung function and requirement for intravenous antibiotics was investigated using observational data from the UK CF Registry 34, 35. We now use the UK CF Registry to investigate the impact of DNase on the composite outcome of death or transplant through emulation of a target trial. The majority of transplants in people with CF are lung transplants, however we consider any transplant here. The UK CF Registry is a national, secure database sponsored and managed by the Cystic Fibrosis Trust 36. It was established in 1995 and records demographic data and longitudinal health data on nearly all people with CF in the UK, to date capturing data on over 12,000 individuals. Data are collected in a standardized way at designated (approximately) annual visits on over 250 variables in several domains, and have been recorded using a centralised database since 2007. In this study we use data from 2008 to 2018, plus some data on prior years to define eligibility according to our criteria outlined below. At each annual visit it is recorded whether or not an individual had been using DNase in the past year. We identified potential confounders of the association between DNase use and death or transplant based on expert clinical input, and these include both time-fixed and time-dependent variables - see Section 7. We focus on the impact of initiating and continuing use of DNase on the risk of death or transplant up to 11 years of follow-up compared with not using DNase. In Section 7 we describe the target trial protocol for addressing this question and how it is emulated using the observational data.

3 Target trial and notation

We begin by outlining a general target trial protocol in the context of estimating the effect of a longitudinal treatment regime on a time-to-event outcome, using the elements recommended by Hernan and Robins11. This is a protocol for a hypothetical trial that, if possible, could have been performed to estimate the estimand of interest. We will later focus on how to emulate this trial when we have longitudinal observational data on treatment, covariates and the outcome.

Eligibility criteria

Individuals are eligible to enter the trial from the first time that they meet a specified criteria (for example based on age, clinical characteristics and counter-indicators to the treatment in question) during a particular recruitment period. Individuals who have previously used the treatment or treatments under study are not eligible.

Treatment strategies

We consider a binary treatment AA (treatment or control), and let A¯0\underline{A}_{0} denote treatment status from the time they enter the trial (time 0) onwards. Individuals are assigned to the treatment or control group, which they are asked to sustain throughout follow-up. Those who sustain the treatment throughout follow-up have A¯0=1\underline{A}_{0}=1, and those who do not use treatment throughout follow-up have A¯0=0\underline{A}_{0}=0.

Assignment procedures

At the time of entering the trial, individuals are randomized to the treatment group (A¯0=1\underline{A}_{0}=1) or the control group (A¯0=0\underline{A}_{0}=0). In a real trial, double- or single-blind assignment of the treatment would be desirable in contexts where blinding is feasible. However, it is not realistic to emulate blinding when the target trial is emulated using the observational data at hand. Therefore in the target trial both participants and their clinical teams are aware of their treatment group.

Follow-up period

Individuals are followed up from randomization, at t=0t=0, until the time of the event of interest TT, the time horizon of interest τ\tau, or censoring CC, whichever occurs first.

Outcome

The observed outcome for a given individual is T~=min(T,C,τ)\tilde{T}=\mathrm{min}(T,C,\tau).

Causal estimands of interest

Let us define Ta¯0=1T^{\underline{a}_{0}=1} as the counterfactual event time had an individual received the treatment from time zero onwards (‘always treated’) and Ta¯0=0T^{\underline{a}_{0}=0} as the counterfactual event time had an individual not received the treatment from time zero onwards (at least for the follow-up time of interest) (‘never treated’). We can then define our causal estimands of interest as contrasts between functions of the distributions of Ta¯0=1T^{\underline{a}_{0}=1} and Ta¯0=0T^{\underline{a}_{0}=0}. These correspond to per-protocol effects because they involve comparisons between outcomes if treatment level aa (a=0,1a=0,1) were sustained throughout the follow-up period.

Effects of treatments on time-to-event outcomes can be quantified in a number of ways. A common measure is the ratio of hazards between the treatment and control groups, made under the assumption that hazards are proportional over the period of follow-up, hTa¯0=1(t)/hTa¯0=0(t)=eβh_{T^{\underline{a}_{0}=1}}(t)/h_{T^{\underline{a}_{0}=0}}(t)=e^{\beta}, where hTa¯0=a(t)h_{T^{\underline{a}_{0}=a}}(t) denotes the hazard in the hypothetical world in which all individuals are observed in treatment group aa (a=0,1a=0,1). However, as hazard ratios do not have a straightforward causal interpretation, it is additionally advisable to consider causal contrasts between risks 20, 21, 22. In this paper we will focus on the risk difference as the primary estimand of interest:

Pr(Ta¯0=1>τ)Pr(Ta¯0=0>τ).\Pr(T^{\underline{a}_{0}=1}>\tau)-\Pr(T^{\underline{a}_{0}=0}>\tau). (1)

This risk difference contrasts the risk of the event by the time horizon τ\tau, under the conditions of being ‘always treated’ versus ‘never treated’. There may be several time horizons of interest, i.e. different values of τ\tau, and we let τmax\tau_{\mathrm{max}} denote the maximum time horizon for which the risk difference is to be obtained. Alternative quantities with a causal interpretation are the risk ratio Pr(Ta¯0=1>τ)/Pr(Ta¯0=0>τ)\Pr(T^{\underline{a}_{0}=1}>\tau)/\Pr(T^{\underline{a}_{0}=0}>\tau) and contrasts between restricted mean survival times E(min(Ta¯0,τ))E(\mathrm{min}(T^{\underline{a}_{0}},\tau)) 37.

Analysis plan for the target trial

If the target trial in fact could have been implemented, the estimation of the causal risk difference in (1) would make use of analysis methods for randomized controlled trials. In a randomized controlled trial in the absence of non-adherence or informative drop-out, the counterfactual event times are independent of the treatment, i.e. Ta¯0A0T^{\underline{a}_{0}}\perp\!\!\!\perp A_{0} for all a0a_{0}. Under this assumption, the causal risk difference of interest is equal to the observed risk difference:

Pr(Ta¯0=1>τ)Pr(Ta¯0=0>τ)=Pr(T>τ|A0=1)Pr(T>τ|A0=0).\Pr(T^{\underline{a}_{0}=1}>\tau)-\Pr(T^{\underline{a}_{0}=0}>\tau)=\Pr(T>\tau|A_{0}=1)-\Pr(T>\tau|A_{0}=0). (2)

The risks Pr(T>τ|A0=a)\Pr(T>\tau|A_{0}=a) (a=0,1a=0,1) could, for example, be estimated nonparametrically using the Kaplan-Meier estimator. In practice, non-adherence and informative drop-out are often present, and the analysis should take account of this, with IPTW being one possible approach.

4 Emulating the target trial using longitudinal observational data

4.1 Observational data set-up

Let us now assume a setting in which individuals are observed at regular ‘visits’ (data collection times) over a particular calendar period, reflecting the type of data that arise in our motivating example and in similar data sources. Let t=0t=0 denote the start of follow-up, defined as the time at which a patient meets the eligibility criteria. We assume a ‘closed cohort’ such that all individuals in the study cohort were included at the same calendar time. In practice, some historical data are typically required to establish whether an individual meets the eligibility criteria at time t=0t=0, e.g. past treatment status. The cohort is assumed to be followed up to (at least) the maximum time horizon of interest for the risk difference, τmax\tau_{\mathrm{max}}. Information on treatment status and other covariates is observed at visits k=0,1,,Kk=0,1,\ldots,K, and the time of visit kk is denoted tkt_{k}. Without loss of generality, we assume that tk=kt_{k}=k, for k0k\geq 0, and K=τmaxK=\lfloor\tau_{\mathrm{max}}\rfloordenotes the time of most recent visit before time τmax\tau_{\mathrm{max}} (τmax<τmax\lfloor\tau_{\mathrm{max}}\rfloor<\tau_{\mathrm{max}}). At each visit kk we observe a binary treatment status AkA_{k} and a set of time-dependent covariates LkL_{k}. To simplify the notation, each LkL_{k} (k=0,1,k=0,1,\ldots) also includes any time-fixed covariates. A bar over a time-dependent variable indicates the history, that is A¯k={A0,A1,,Ak}\bar{A}_{k}=\{A_{0},A_{1},\ldots,A_{k}\} and L¯k={L0,L1,,Lk}\bar{L}_{k}=\{L_{0},L_{1},\ldots,L_{k}\}. An underline indicates the future, so that A¯k={Ak,Ak+1,}\underline{A}_{k}=\{A_{k},A_{k+1},\ldots\} denotes treatment status from time kk onwards.

Refer to caption
Figure 1: Directed acyclic graph (DAG) illustrating relationships between treatment AA, time-dependent covariates LL, discrete time outcome YY, and unmeasured covariates UU. Baseline covariates ZZ are omitted from the diagram but are assumed to potentially affect all other variables.

The data set-up is illustrated in the directed acyclic graph (DAG) in Figure 1, using a discrete-time setting where Yk=I(tk1T<tk)Y_{k}=I(t_{k-1}\leq T<t_{k}) is an indicator of whether the event occurs between visits k1k-1 and kk. One can imagine extending the DAG by adding a series of small time intervals between each visit, at which events are observed (but not AA or LL, which are assumed constant between visits). As the time intervals become very small we approach the continuous time setting. From the DAG, we can see that LkL_{k} are time-dependent confounders. Time-dependent confounding occurs when there are time-dependent covariates that predict subsequent treatment use, are affected by earlier treatment, and affect the outcome through pathways that are not just through subsequent treatment. The DAG also includes a variable UU, which has direct effects on LkL_{k} and YkY_{k} but not on AkA_{k}. UU is an unmeasured individual frailty and we include it because it is realistic that such individual frailty effects exist in practice. Because UU is not a confounder of the association between AkA_{k} and YkY_{k} (after controlling for the observed confounders), the fact that it is unmeasured does not affect our ability to estimate causal effects of treatments. The DAG could be extended in various ways, for example, we could incorporate long term effects of LL on AA and vice versa by adding arrows from LkL_{k} to Ak+1A_{k+1} and from AkA_{k} to Lk+2L_{k+2}. Long term effects of AA and LL on survival could also be added, for example by adding arrows from LkL_{k} and AkA_{k} to Yk+2Y_{k+2}.

4.2 The emulated trial

Next, we outline how the target trial outlined in Section 3 is emulated using the observational data described above. Each element of the target trial protocol should be considered. The eligibility criteria, treatment strategies, assignment strategies, and outcome in the emulated trial should be the same as far as possible as those in the target trial. An iterative process is appropriate when designing the target trial protocol, to take into consideration knowledge about the observational data. In some settings there may be some differences in eligibility criteria in the target trial and the emulated trial - for example, if the study pertains to individuals diagnosed with a particular infection, in the target trial a laboratory-confirmed diagnosis may be specified, whereas this may not possible in the emulated trial if diagnosis route is not recorded in the available observational data. There may also be differences in some aspects of treatment, for example the target trial may specify the dose of the treatment, whereas dose information may not be available in the observational data and so the emulated trial is pragmatic about dose. Making clear the differences between the target trial and the emulated trial can help us to understand possible sources of bias when we emulate the target trial using observational data.

The causal estimand of interest for the emulated trial is the same as that in the target trial. In our setting, the causal estimand compares marginal risks in a follow-up period τ\tau under two treatment regimes: always treated (a¯0=1\underline{a}_{0}=1) or never treated (a¯0=0\underline{a}_{0}=0). In the observational data, individuals who are untreated initially can subsequently start treatment and vice versa. Treatment initiation and treatment switching may depend on time-dependent patient characteristics that also affect the outcome, as illustrated in the DAG in Figure 1. This must be addressed in the analysis, and the analysis stage is where we find the biggest differences between the emulated trial and target trial. However, we emphasise that the causal estimand in the emulated trial is the same as in the target trial. The time-dependent confounding is addressed in different ways in the MSM-IPTW approach and the sequential trials approach. We now describe how to estimate the causal risk difference (expression (1)) using these two approaches.

4.3 Analysis using MSM-IPTW

In the MSM-IPTW approach, a MSM is specified for the hazard. The counterfactual hazard under the possibly counter-to-fact treatment regime a¯0\underline{a}_{0} is denoted hTa¯0(t)h_{T^{\underline{a}_{0}}}(t). In a slight abuse of standard notation we let t\lfloor t\rfloor denote the time of the most recent visit before time tt (so t\lfloor t\rfloor is always <t<t), and a¯t\bar{a}_{\lfloor t\rfloor} denotes treatment pattern up to the most recent visit prior to tt. The MSM is usually assumed to take the form of a Cox proportional hazards model 38

hTa¯0(t)=h0(t)eg(a¯t;βA),h_{T^{\underline{a}_{0}}}(t)=h_{0}(t)e^{g(\bar{a}_{\lfloor t\rfloor};\beta_{A})}, (3)

where h0(t)h_{0}(t) is the baseline counterfactual hazard, and g(a¯t;βA)g(\bar{a}_{\lfloor t\rfloor};\beta_{A}) is a function of treatment pattern a¯t\bar{a}_{\lfloor t\rfloor} to be specified, and βA\beta_{A} is a vector of log hazard ratios. However, the hazards could take various other forms and we will also consider MSMs based on Aalen’s additive hazard model 39, 40, which has the form

hTa¯0(t)=α0(t)+g(a¯t;αA(t)),h_{T^{\underline{a}_{0}}}(t)=\alpha_{0}(t)+g(\bar{a}_{\lfloor t\rfloor};\alpha_{A}(t)), (4)

where α0(t)\alpha_{0}(t) is the baseline counterfactual hazard and αA(t)\alpha_{A}(t) is a vector of time-dependent coefficients. In both hazard models the g()g(\cdot) function equals zero when a¯t=0\bar{a}_{\lfloor t\rfloor}=0. In a simple version of the MSM, the hazard at time tt is assumed to depend only on the current level of treatment: g(a¯t;βA)=βAatg(\bar{a}_{\lfloor t\rfloor};\beta_{A})=\beta_{A}a_{\lfloor t\rfloor} or g(a¯t;αA(t))=αA(t)atg(\bar{a}_{\lfloor t\rfloor};\alpha_{A}(t))=\alpha_{A}(t)a_{\lfloor t\rfloor}. Other options include that the hazard depends on duration of treatment, g(a¯t;βA)=βAk=0takg(\bar{a}_{\lfloor t\rfloor};\beta_{A})=\beta_{A}\sum_{k=0}^{\lfloor t\rfloor}a_{k} or g(a¯t;αA(t))=αA(t)k=0takg(\bar{a}_{\lfloor t\rfloor};\alpha_{A}(t))=\alpha_{A}(t)\sum_{k=0}^{\lfloor t\rfloor}a_{k}; or on the history of treatment in a more general way, g(a¯t;βA)=k=0tβAkakg(\bar{a}_{\lfloor t\rfloor};\beta_{A})=\sum_{k=0}^{\lfloor t\rfloor}\beta_{Ak}a_{k} or g(a¯t;αA(t))=k=0tαAk(t)akg(\bar{a}_{\lfloor t\rfloor};\alpha_{A}(t))=\sum_{k=0}^{\lfloor t\rfloor}\alpha_{Ak}(t)a_{k}.

The MSM cannot be estimated directly from the observed data because of time-dependent confounding by LkL_{k}. In the MSM-IPTW approach, individuals are assigned time-dependent weights and the MSM is fitted to the observed weighted data. The weight at time tt for a given individual is the inverse of the probability of their observed treatment pattern up to time tt given their time-dependent covariate history, L¯t\bar{L}_{\lfloor t\rfloor}. These weights can be large for some individuals, which induces large uncertainty in estimates from the MSM, and stabilized weights are typically used instead. The use of MSMs estimated using IPTW to estimate causal effects of joint treatments over time involves the four key assumptions of no interference, positivity, consistency, and conditional exchangeability (i.e. no unmeasured confounding) 1, 41, 2. Details on the weights and the assumptions are provided in the Supplementary Material (Sections A1 and A2). Weights are also discussed in detail in the context of the simulation study in Section 6.

The MSM can also be extended to include conditioning on baseline covariates L0L_{0}:

hTa¯0(t|L0)=h0(t)eg(a¯t;βA)+βLL0h_{T^{\underline{a}_{0}}}(t|L_{0})=h_{0}(t)e^{g(\bar{a}_{\lfloor t\rfloor};\beta_{A})+\beta_{L}L_{0}} (5)

or

hTa¯0(t|L0)=α0(t)+g(a¯t;αA(s))+αL(t)L0.h_{T^{\underline{a}_{0}}}(t|L_{0})=\alpha_{0}(t)+g(\bar{a}_{\lfloor t\rfloor};\alpha_{A}(s))+\alpha_{L}(t)L_{0}. (6)

When covariates L0L_{0} are included in the MSM, they can also be included in the model in the numerator of the stabilized weights.

The survival probabilities required for the estimand in (1) can be estimated using the relation between the hazard and survivor functions. For the Cox MSM in (3) we have

Pr(Ta¯0>τ)=exp{eg(a0;β~A)01h0(s)𝑑seg(a¯1;β~A)12h0(s)𝑑seg(a¯t;β~A)ττh0(s)𝑑s},\begin{split}\Pr(T^{\underline{a}_{0}}>\tau)&=\exp\left\{-e^{g(a_{0};\tilde{\beta}_{A})}\int_{0}^{1}h_{0}(s)ds-e^{g(\bar{a}_{1};\tilde{\beta}_{A})}\int_{1}^{2}h_{0}(s)ds\cdots-e^{g(\bar{a}_{\lfloor t\rfloor};\tilde{\beta}_{A})}\int_{\lfloor\tau\rfloor}^{\tau}h_{0}(s)ds\right\},\end{split} (7)

where the integrated (cumulative) baseline hazards can be estimated using an IPTW Breslow’s estimator. Based on the Aalen MSM in (4) we have

Pr(Ta¯0>τ)=exp(0tα0(s)𝑑s01g(a0;αA(s))𝑑s12g(a¯1;αA(s))𝑑sττg(a¯τ;αA(s))𝑑s),\Pr(T^{\underline{a}_{0}}>\tau)=\exp\left(-\int_{0}^{t}\alpha_{0}(s)ds-\int_{0}^{1}g(a_{0};\alpha_{A}(s))ds-\int_{1}^{2}g(\bar{a}_{1};\alpha_{A}(s))ds\cdots-\int_{\lfloor\tau\rfloor}^{\tau}g(\bar{a}_{\lfloor\tau\rfloor};\alpha_{A}(s))ds\right), (8)

where the integrals are obtained as the cumulative coefficients estimated in the Aalen additive hazard model fitting process. The probabilities of interest under the longitudinal treatment regimes of ‘always treated’ and ‘never treated’ (Pr(Ta¯0=a>τ)\Pr(T^{\underline{a}_{0}=a}>\tau), a=0,1a=0,1) are the obtained by setting a¯0=a\underline{a}_{0}=a in (7) or (8). These probabilities are marginal and they refer to the population of nn individuals meeting the target trial eligibility criteria at t=0t=0; we denote this population by 𝒞0\mathcal{C}_{0}. The MSMs including baseline covariates, as in (5) and (6), can be used to obtain the conditional probabilities Pr(Ta¯0=aτ|L0)\Pr(T^{\underline{a}_{0}=a}\geq\tau|L_{0}) for each individual in 𝒞0\mathcal{C}_{0}. Estimates of the marginal survival probabilities Pr(Ta¯0>τ)\Pr(T^{\underline{a}_{0}}>\tau) can then be obtained using empirical standardization:

Pr^(Ta¯0=aτ)=1ni𝒞0Pr^(Tia¯0=aτ|L0,i)\begin{split}\widehat{\Pr}(T^{\underline{a}_{0}=a}\geq\tau)&=\frac{1}{n}\sum_{i\in\mathcal{C}_{0}}\widehat{\Pr}(T^{\underline{a}_{0}=a}_{i}\geq\tau|L_{0,i})\end{split} (9)

where Pr(Tia¯0=aτ|L0,i)\Pr(T^{\underline{a}_{0}=a}_{i}\geq\tau|L_{0,i}) is estimated using the relation Pr(Tia¯0=aτ|L0,i)=exp{0τhTa¯0=a(u|L0,i)𝑑u}\Pr(T^{\underline{a}_{0}=a}_{i}\geq\tau|L_{0,i})=\exp\left\{-\int_{0}^{\tau}h_{T^{\underline{a}_{0}=a}}(u|L_{0,i})du\right\}, and where the integrals are estimated as outlined above for (7) or (8). In practice, the empirical standardization can be done by creating two copies of each individual in 𝒞0\mathcal{C}_{0} and setting A0=A1==Aτ=1A_{0}=A_{1}=\ldots=A_{\lfloor\tau\rfloor}=1 for one copy and A0=A1==Aτ=0A_{0}=A_{1}=\ldots=A_{\lfloor\tau\rfloor}=0 for the other. The covariate values L0,iL_{0,i} are the same for both copies. The estimated conditional survival probabilities are then obtained for each individual under both treatment regimes (i.e. for both copies), and then the average calculated across individuals under both treatment regimes.

If it is of interest to estimate the causal risk difference in (1) for several time horizons τ\tau, it is recommended to fit the MSM using all event times up to τmax\tau_{\mathrm{max}} and with administrative censoring at τmax\tau_{\mathrm{max}} to obtain risk difference estimates for all horizons ττmax\tau\leq\tau_{\mathrm{max}}, rather than fitting separate MSMs for different time horizons.

Confidence intervals for the estimated causal risk difference can be obtained by bootstrapping. The weights models and the MSM are fitted in each bootstrap sample, so that the bootstrap confidence intervals capture the uncertainty in the estimation of the weights as well as in the MSM.

4.4 Analysis using the sequential trials approach

In the MSM-IPTW approach each individual contributes to the emulated target trial from visit k=0k=0, at which time they meet the eligibility criteria. However, individuals may in fact meet the eligibility criteria for the target trial at more than one visit during the study period. In the sequential trials approach an individual can contribute to an emulated trial starting from any visit k=0,1,2,k=0,1,2,\ldots at which they meet the eligibility criteria. The emulated trial eligibility criteria are applied at each visit k=0,1,k=0,1,\ldots to form a sequence of emulated trials. Recall that individuals who have previously used the treatment are not eligible and so all individuals eligible for the kkth trial have A¯k1=0\bar{A}_{k-1}=0. Those eligible for the kkth trial therefore include ‘initiators’ who start treatment at visit kk (Ak=1A_{k}=1) and ‘non-initiators’ who remain untreated at visit kk (Ak=0A_{k}=0). Non-initiators in trial kk can appear as initiators in a later trial (k+1,k+1,\ldots). Individuals can therefore appear as initiators in only one trial, but as non-initiators in several trials.

In this paper we focus on a sequential trials approach in which the length of possible follow-up decreases for trials starting at later visits. An alternative would be to use a sequence of trials that all have equal length of follow-up. To explain this further, consider an example in which the observational cohort has visits at times k=0,1,2,3,4k=0,1,2,3,4 and follow-up to time 55 but our maximum time horizon of interest for the causal risk difference (1) is τmax=3\tau_{\mathrm{max}}=3. We could have 5 trials starting at times k=0,1,2,3,4k=0,1,2,3,4, with a decreasing length of follow-up from the start of each trial, such that the trials starting at times k=0,1,2k=0,1,2 have follow-up of length 3 or longer (and administrative censoring would be applied after 3 time units of follow-up), and the trials starting at times k=3,4k=3,4 only provide 2 and 1 time units of follow-up respectively. In the alternative we would only make use of trials starting at times k=0,1,2k=0,1,2, with administrative censoring applied after 3 time units of follow-up.

The sequential trials approach focuses on comparing survival only under the two treatment regimes: ‘always treated’, in which eligible individuals initiate treatment and then always continue treatment, and ‘never treated’, in which eligible individuals do not initiate treatment at the time of meeting eligibility criteria and remain untreated at all later times. Previous descriptions of the sequential trials approach have described the analysis models used but have not been explicit about the causal interpretation of the model parameters. Here we begin by defining MSMs for populations meeting the emulated trial eligibility criteria at visits k=0,1,2,,Kk=0,1,2,\ldots,K and explain how these relate to the estimand of interest in (1). We then outline how the MSMs can be estimated using the observational data under certain assumptions, using the methods described by Hernan et al.4 and Gran et al.5.

Consider the trial starting at visit kk. Let TA¯k=aT^{\underline{A}_{k}=a} denote the counterfactual event time under a treatment regime in which individuals follow their observed treatments up to and including visit k1k-1 and are then assigned to treatment aa (a=0,1a=0,1) from visit kk onwards (if they survive to visit kk). Let hk,TA¯k=a(tk|A¯k1=0,L¯k)h_{k,T^{\underline{A}_{k}=a}}(t-k|\bar{A}_{k-1}=0,\bar{L}_{k}) denote the hazard at time tkt-k after visit kk under this treatment regime, conditional on the baseline characteristics LkL_{k} at the start of trial kk. The time scale is time since the start of the trial. The conditioning on A¯k1=0\bar{A}_{k-1}=0 indicates the restriction to individuals who have not initiated treatment prior to the start of the kkth trial, which is part of the eligibility criteria. MSMs for this hazard using a Cox model and Aalen’s additive hazard model are

hk,TA¯k=a(tk|A¯k1=0,Lk)=h0k(tk)exp{af(tk;βAk)+βLkLk}h_{k,T^{\underline{A}_{k}=a}}(t-k|\bar{A}_{k-1}=0,L_{k})=h_{0k}(t-k)\exp\left\{af(t-k;\beta_{Ak})+\beta_{Lk}L_{k}\right\} (10)

and

hk,TA¯k=a(tk|A¯k1=0,Lk)=α0k(tk)+αAk(tk)a+αLk(tk)Lk.h_{k,T^{\underline{A}_{k}=a}}(t-k|\bar{A}_{k-1}=0,L_{k})=\alpha_{0k}(t-k)+\alpha_{Ak}(t-k)a+\alpha_{Lk}(t-k)L_{k}. (11)

Recall that LkL_{k} includes any time fixed covariates. The conditioning on LkL_{k} could be excluded, but we include it because conditioning on baseline variables is a feature of the previously-described sequential trials analysis methods, which we discuss below. In the Cox MSM (10) the hazards in the treated and untreated could be assumed proportional, f(tk;βAk)=βAkf(t-k;\beta_{Ak})=\beta_{Ak}, or we could allow the hazard to depend on duration of treatment, e.g. af(tk;βAk)=aβAk,0+aβAk,1(tk)af(t-k;\beta_{Ak})=a\beta_{Ak,0}+a\beta_{Ak,1}(t-k). The additive hazards MSM (11) naturally incorporates a flexible time-dependent effect of treatment on the hazard. Both models could be extended to include interactions between AkA_{k} and LkL_{k}. The above MSMs are conditional on LkL_{k} rather than L¯k\bar{L}_{k}, which is an assumption. They could be written as conditional on L¯k\bar{L}_{k}, but it is likely to be appropriate that the hazard for each trial kk depends on the same amount of history of LkL_{k}. For example, if we wish to allow the hazard for each trial kk to depend on LkL_{k} and Lk1L_{k-1} then the first trial would need to start at time k=1k=1 instead of k=0k=0, unless data on the LL were available prior to time 0.

Conditional survival probabilities (conditional on LkL_{k} and TkT\geq k) for the counterfactual event times are related to the hazard according to the formula

Pr(TA¯k=a>k+τ|Tk,A¯k1=0,Lk)=exp{kk+τhk,TA¯k=a(uk|A¯k1=0,Lk)𝑑u},ττmaxk.\begin{split}\Pr(T^{\underline{A}_{k}=a}>k+\tau|&T\geq k,\bar{A}_{k-1}=0,L_{k})\\ &=\exp\left\{-\int_{k}^{k+\tau}h_{k,T^{\underline{A}_{k}=a}}(u-k|\bar{A}_{k-1}=0,L_{k})du\right\},\quad\tau\leq\tau_{\mathrm{max}}-k.\end{split} (12)

Below we discuss how marginal survival probabilities, and therefore the causal estimand of interest (marginal risk difference), can be obtained. First, however, we outline how the MSMs in (10) or (11), and hence the conditional survival probabilities in (12), can be estimated using the observed data. The MSMs in (10) and (11) cannot be estimated directly from the observational data because not all individuals meeting the eligibility criteria at visit kk are then ‘always treated’ (A¯k=1\underline{A}_{k}=1) or ‘never treated’ (A¯k=0\underline{A}_{k}=0). In the sequential trials approach as described by Hernan et al.4 and Gran et al.5 individuals are artificially censored at the visit at which they switch from their treatment group at the start of a given trial. In the kkth trial individuals are censored at the earliest visit m>km>k such that AmAkA_{m}\neq A_{k}. This results in a modified data set in which all individuals in the trial starting at time kk have either A¯k=0\underline{A}_{k}=0 or A¯k=1\underline{A}_{k}=1 up to the earliest of their event time, their censoring time, or their artificial censoring time due to treatment switch. The treatment switching is expected to depend on time-dependent covariates that are also associated with the outcome, meaning that the artificial censoring is informative. It is addressed using weights, which we refer to as inverse probability of artificial-censoring weights (IPACW). The IPACW at times 0<s<10<s<1 after the start of trial kk are equal to 1. The IPACW at times ls<l+1l\leq s<l+1 (l1l\geq 1) after the start of trial kk is the inverse of the probability that the individual’s treatment status at time ll remained the same as their treatment status at the start of the trial, AkA_{k}, conditional on their observed covariates up to time ll. The MSMs in (10) and (11) are then fitted using weighted regression using the time-dependent IPACW, with aa in the hazard model replaced by the observed treatment status AkA_{k}. The conditioning on baseline covariates LkL_{k} in each trial controls for confounding of the association between treatment initiation at time kk and the hazard. Estimating the MSMs in this way is valid under the same assumptions of positivity, consistency and conditional exchangeability, as are required for the MSM-IPTW analysis. Further details on the IPACW are provided in the Supplementary Materials (Section A1). Hernan et al.4 used logistic regression to estimate the IPACW, whereas Gran et al.5 used Aalen’s additive hazard model. The IPACW could be fitted separately in each trial or combined across trials. After estimating the MSMs using this approach, the expression in (12) can be used to obtain estimates of conditional survival probabilities. The trial starting at k=0k=0 provides estimates of Pr(TA¯0=a>τ|L0)\Pr(T^{\underline{A}_{0}=a}>\tau|L_{0}) (ττmax\tau\leq\tau_{\mathrm{max}}), and the trial starting at k=1k=1 provides estimates of Pr(TA¯1=a>τ+1|A0=0,L1,T1)\Pr(T^{\underline{A}_{1}=a}>\tau+1|A_{0}=0,L_{1},T\geq 1) (ττmax1\tau\leq\tau_{\mathrm{max}}-1), (a=0,1a=0,1), and so on for k=2,,Kk=2,\ldots,K. These are conditional on covariates measured at the start of the trial, LkL_{k}.

After estimating the conditional survival probabilities in (12) estimates of marginal survival probabilities Pr(TA¯k=a>τ+k|A¯k1=0,Tk)\Pr(T^{\underline{A}_{k}=a}>\tau+k|\bar{A}_{k-1}=0,T\geq k) can be obtained via empirical standardization over the distribution of LkL_{k} at the start of each trial, as described for the MSM-IPTW approach in Section 4.3. This enables estimation of marginal risk differences:

Pr(Ta¯0=1>k+τ|A¯k1=0,Tk)Pr(Ta¯0=0>k+t|A¯k1=0,Tk),ττmaxk.\Pr(T^{\underline{a}_{0}=1}>k+\tau|\bar{A}_{k-1}=0,T\geq k)-\Pr(T^{\underline{a}_{0}=0}>k+t|\bar{A}_{k-1}=0,T\geq k),\qquad\tau\leq\tau_{\mathrm{max}}-k. (13)

The (true) marginal risk differences for the same time horizon τ\tau after the start of the trial will differ in general for trials starting at different times k=0,1,2,k=0,1,2,\ldots (for trials for which ττmaxk\tau\leq\tau_{\mathrm{max}}-k). Differences in the true value of the estimand for different trials arise when coefficients of the hazard model ((10) or (11)) depend on kk, but also because the marginal risk differences for different kk refer to populations with different distributions of baseline characteristics, LkL_{k}. Marginal risk differences that have the same true value across trials starting at different times k=0,1,k=0,1,\ldots can be estimated, using assumptions about the coefficients of the hazard models and standardisation to a common distribution of baseline covariates LkL_{k}.

Some or all parameters of the MSMs in (10) and (11) could be assumed common across trials (i.e. for all kk). The common parameters can then be estimated by fitting the hazard models using the observed data combined across trials. It may be reasonable in many settings to assume that the impact of the treatment on the hazard at a given time post-initiation does not depend on the visit kk at which treatment was initiated, i.e. βAk=βA\beta_{Ak}=\beta_{A} and αAk(tk)=αA(tk)\alpha_{Ak}(t-k)=\alpha_{A}(t-k) for all kk. Similarly the coefficients for LkL_{k} could be assumed constant across trials. Often the underlying time scale tt will be in a sense arbitrary. For example, in some settings it will be calendar year, and in others the time since joining a cohort. Provided any elements of time (e.g. age, calendar year) that affect the hazard are included in the set of covariates LkL_{k} then the baseline hazard may also be assumed common across trials, i.e. h0k(tk)=h0(tk)h_{0k}(t-k)=h_{0}(t-k) or α0k(tk)=α0(tk)\alpha_{0k}(t-k)=\alpha_{0}(t-k).

Suppose that all coefficients of the MSM in (10) and (11) are assumed to be the same across trials (i.e. for all kk). The MSM would be fitted using the data combined across trials using a weighted regression with time-dependent IPACW, and with adjustment for baseline covariates in each trial, LkL_{k}. When estimating the marginal risk differences by empirical standardization it should be specified clearly which population the marginal probabilities refer to. We suggest that the population of interest for the marginal risk difference is not the population of individuals used in the combined analysis, since this population is not well defined and many individuals appear in it more than once. A better alternative is to estimate the risk difference for the population 𝒞0\mathcal{C}_{0} of individuals meeting the target trial eligibility criteria at t=0t=0 (visit k=0k=0). This would make the marginal risk difference comparable with that estimated from the MSM-IPTW analysis. We note that the causal estimand in expression (1) is implicitly conditional on no prior treatment, which is part of the target trial eligibility criteria. Another possibility would be to estimate the risk difference for the population of individuals meeting the target trial eligibility criteria at the last visit time KK, as this population might be expected to be the most similar to the patient population now in terms of their distribution of characteristics.

Suppose, instead, that we wish to allow the baseline hazard the differ across trials, but that the coefficients for treatment (aa) and baseline covariates (LkL_{k}) are assumed the same across trials. The MSMs in(10) or (11) could be fitted using data combined across trials, but with a stratified baseline hazard. In this case, care should be taken over which baseline hazard is used to obtain the marginal risk difference estimates. For example, the baseline hazard corresponding to the trial starting at k=0k=0 may be used to obtain the marginal risk difference estimates for all time horizons ττmax\tau\leq\tau_{\mathrm{max}}. This would make the marginal risk difference comparable with that estimated from the MSM-IPTW analysis. The baseline hazard from the trial starting at the penultimate visit K1K-1 can only be used to estimate marginal risk difference estimates for time horizons ττmax(K1)\tau\leq\tau_{\mathrm{max}}-(K-1), and using baseline hazards corresponding to trials starting at visit k1k\geq 1 to estimate marginal risk differences would result in marginal risk differences that do not correspond to those estimated from the MSM-IPTW analysis.

In their descriptions of the sequential trials approach Gran et al.5 used the Cox proportional hazards model, and Hernan et al.4 used pooled logistic regression, which is equivalent as the times between visits gets smaller 43. Røysland et al.44 used an additive hazards model, though their aim was to estimate direct and indirect effects, which differs from our aims in this paper. Gran et al.5 assumed the hazard ratios for treatment in the Cox model to be common across the trials, but allowed a different baseline hazard in each trial. Hernan et al.4 allowed the odds ratio for treatment to differ across trials and also performed a test for heterogeneity, though it is unclear whether they allowed separate intercept parameters (baseline hazards) across trials. If the treatment effect is assumed the same across trials, it is recommended to assess this assumption using a formal test.

As in the MSM-IPTW approach, confidence intervals for the estimated causal risk difference can be obtained by bootstrapping. The formation of the sequential trials, estimation of the weights, fitting of the conditional MSMs, and obtaining the risk difference are all repeated in each bootstrap sample.

5 Comparing MSM-IPTW and the sequential trials approach

5.1 Estimating causal effects: an illustrative example using a causal tree diagram

In this section we show how the MSM-IPTW and sequential trials approaches can estimate the same causal estimand using an illustrative example and non-parametric estimates. Our aim is to provide insight into how the two approaches use the data differently to estimate the same quantity.

Consider a situation as depicted in the DAG in Figure 1, but with treatment and covariate information only collected at up to two visits (L0,A0L_{0},A_{0} measured at time 0, and L1,A1L_{1},A_{1} measured at time 1) and survival status observed at times 1 (Y1=I(0T<1)Y_{1}=I(0\leq T<1)) and 2 (Y2=I(1T<2)Y_{2}=I(1\leq T<2)). We focus on a single binary time-dependent confounder LL and omit the unobserved variable UU. Figure 2 shows a causal tree graph (see for example Richardson and Rotnitzky45), representing the different possible combinations of the variables L0,A0,Y1L_{0},A_{0},Y_{1} up to time 1, and then the different combinations of L1,A1,Y2L_{1},A_{1},Y_{2} among individuals who survive to time 1 (Y1=0Y_{1}=0). A total of nn individuals are observed at time 0. The numbers in brackets on the branches of the tree are the number of individuals who followed a given pathway up to that branch. The notation is as follows: nl0n_{l_{0}} denotes the number with L0=l0L_{0}=l_{0}; nl0,a0n_{l_{0},a_{0}} the number with L0=l0,A0=a0L_{0}=l_{0},A_{0}=a_{0}; nl0a0y1n^{y_{1}}_{l_{0}a_{0}} the number with L0=l0,A0=a0,Y1=y1L_{0}=l_{0},A_{0}=a_{0},Y_{1}=y_{1}; nl0a0,l10n^{0}_{l_{0}a_{0},l_{1}} the number with L0=l0,A0=a0,Y1=0,L1=l1L_{0}=l_{0},A_{0}=a_{0},Y_{1}=0,L_{1}=l_{1}; nl0a0,l1a10n^{0}_{l_{0}a_{0},l_{1}a_{1}} the number with L0=l0,A0=a0,Y1=0,L1=l1,A1=a1L_{0}=l_{0},A_{0}=a_{0},Y_{1}=0,L_{1}=l_{1},A_{1}=a_{1}; and nl0a0,l1a10y2n^{0y_{2}}_{l_{0}a_{0},l_{1}a_{1}} the number with L0=l0,A0=a0,Y1=0,L1=l1,Y2=y2L_{0}=l_{0},A_{0}=a_{0},Y_{1}=0,L_{1}=l_{1},Y_{2}=y_{2}.

Our interest is in comparing survival probabilities under two treatment regimes: always treated (A0=A1=1A_{0}=A_{1}=1) and never treated (A0=A1=0A_{0}=A_{1}=0). The branches representing these two treatment regimes are shown with thick lines in the causal tree diagram. The MSM analysis makes use of the causal tree diagram as depicted in Figure 2. Under the sequential trials approach, we create a trial starting at time 0 and a trial starting at time 1. In the trial starting at time 0 individuals who survive to time 1 are censored at time 1 if A1A0A_{1}\neq A_{0}, i.e. not all branches after time 1 are used. The trial starting at time 1 is restricted to individuals with A0=0A_{0}=0 and Y1=0Y_{1}=0. The sections of the causal tree diagram for these two trials are shown in Figure 3.

Consider the probability of survival to time 1 with treatment aa, Pr(Y1A0=a=0)\Pr(Y_{1}^{A_{0}=a}=0). The MSM-IPTW approach estimates this using the results that (under the conditions of consistency, positivity and conditional exchangeability)

Pr(Y1A0=a=0)=E{I(A0=a)(1Y1)Pr(A0=a|L0)}.\Pr(Y_{1}^{A_{0}=a}=0)=E\left\{\frac{I(A_{0}=a)(1-Y_{1})}{\Pr(A_{0}=a|L_{0})}\right\}. (14)

Based on the tree diagram, a non-parametric estimate of this is

Pr^(Y1A0=a=0)=1n{n0a0(n0an0)1+n1a0(n1an1)1},\widehat{\Pr}(Y_{1}^{A_{0}=a}=0)=\frac{1}{n}\left\{n_{0a}^{0}\left(\frac{n_{0a}}{n_{0}}\right)^{-1}+n_{1a}^{0}\left(\frac{n_{1a}}{n_{1}}\right)^{-1}\right\}, (15)

where the two contributions come from individuals with L0=0L_{0}=0 and L0=1L_{0}=1. On the other hand, the sequential trial starting at time 0 estimates Pr(Y1A0=a=0|L0)\Pr(Y_{1}^{A_{0}=a}=0|L_{0}). By consistency and conditional exchangeability we have Pr(Y1A0=a=0|L0)=Pr(Y1=0|A0=a,L0)\Pr(Y_{1}^{A_{0}=a}=0|L_{0})=\Pr(Y_{1}=0|A_{0}=a,L_{0}). Based on the tree diagram, a non-parametric estimate of this using the trial starting at time 0 is Pr(Y1=0|A0=a,L0=l0)=nl0a0nl0a,l0=0,1\Pr(Y_{1}=0|A_{0}=a,L_{0}=l_{0})=\frac{n_{l_{0}a}^{0}}{n_{l_{0}a}},\quad l_{0}=0,1. Using the result that Pr(Y1A0=a=0)=l0Pr(Y1=0|A0=a,L0=l0)Pr(L0=l0)\Pr(Y_{1}^{A_{0}=a}=0)=\sum_{l_{0}}\Pr(Y_{1}=0|A_{0}=a,L_{0}=l_{0})\Pr(L_{0}=l_{0}) (assuming consistency and conditional exchangeability) it follows that a non-parametric estimate of the marginal probability Pr(Y1A0=a=0)\Pr(Y_{1}^{A_{0}=a}=0) based on the trial starting at time 0 is

Pr^(Y1A0=a=0)=n0a0n0an0n+n1a0n1an1n,\widehat{\Pr}(Y_{1}^{A_{0}=a}=0)=\frac{n_{0a}^{0}}{n_{0a}}\frac{n_{0}}{n}+\frac{n_{1a}^{0}}{n_{1a}}\frac{n_{1}}{n}, (16)

which is the same as the estimate using MSM-IPTW (i.e. equation (15)). Therefore the MSM-IPTW and sequential trials approaches give identical estimates of Pr(Y1A0=a=0)\Pr(Y_{1}^{A_{0}=a}=0). The equivalence between inverse probability weighted estimates and standardized estimates obtained using the g-formula in the non-parametric setting is well established (see for example 2, 46). In the sequential trials approach, the trial starting at time 1 can be used to estimate the probability of survival to time 2 after initiating the treatment strategy (a=0,1a=0,1) conditional on survival to time 1 and on A1=0A_{1}=0. The non-parametric estimate is

Pr^(Y2A1=a=0|Y1=0,A1=0)=(n00,0a0,0+n10,0a0,0n00,0a0+n10,0a0)(n00,00+n10,00n000+n100)+(n00,1a0,0+n10,1a0,0n00,1a0+n10,1a0)(n00,10+n10,10n000+n100).\begin{split}\widehat{\Pr}(Y_{2}^{A_{1}=a}=0|Y_{1}=0,A_{1}=0)=&\left(\frac{n_{00,0a}^{0,0}+n_{10,0a}^{0,0}}{n_{00,0a}^{0}+n_{10,0a}^{0}}\right)\left(\frac{n_{00,0}^{0}+n_{10,0}^{0}}{n_{00}^{0}+n_{10}^{0}}\right)\\ &+\left(\frac{n_{00,1a}^{0,0}+n_{10,1a}^{0,0}}{n_{00,1a}^{0}+n_{10,1a}^{0}}\right)\left(\frac{n_{00,1}^{0}+n_{10,1}^{0}}{n_{00}^{0}+n_{10}^{0}}\right).\end{split} (17)

The marginal probability estimate Pr^(Y1A0=a=0)\widehat{\Pr}(Y_{1}^{A_{0}=a}=0) refers to a population in which Pr(L0=1)=n1/n\Pr(L_{0}=1)=n_{1}/n, whereas the marginal probability estimate Pr^(Y2A1=a=0|Y1=0,A1=0)\widehat{\Pr}(Y_{2}^{A_{1}=a}=0|Y_{1}=0,A_{1}=0) refers to a population with a different distribution of L1L_{1}. The estimate from the trial starting at time 1 could alternatively be standardized to the distribution of L0L_{0} at time 0:

Pr^Std(Y2A1=a=0|Y1=0,A1=0)=(n00,0a0,0+n10,0a0,0n00,0a0+n10,0a0)(n0n)+(n00,1a0,0+n10,1a0,0n00,1a0+n10,1a0)(n1n).\widehat{\Pr}^{\mathrm{Std}}(Y_{2}^{A_{1}=a}=0|Y_{1}=0,A_{1}=0)=\left(\frac{n_{00,0a}^{0,0}+n_{10,0a}^{0,0}}{n_{00,0a}^{0}+n_{10,0a}^{0}}\right)\left(\frac{n_{0}}{n}\right)+\left(\frac{n_{00,1a}^{0,0}+n_{10,1a}^{0,0}}{n_{00,1a}^{0}+n_{10,1a}^{0}}\right)\left(\frac{n_{1}}{n}\right). (18)

Under the assumption that PrStd(Y2A1=a=0|Y1=0,A1=0)=Pr(Y1A0=a=0)\Pr^{\mathrm{Std}}(Y_{2}^{A_{1}=a}=0|Y_{1}=0,A_{1}=0)=\Pr(Y_{1}^{A_{0}=a}=0), a combined estimate of Pr(Y1A0=a=0)\Pr(Y_{1}^{A_{0}=a}=0) could be obtained from the two trials, for example using an inverse-variance-weighted combination of the estimates from the two trials.

We can also show that non-parametric estimates of Pr(Y2A¯1=a=0)\Pr(Y_{2}^{\bar{A}_{1}=a}=0) from the two methods are the same. The MSM-IPTW approach uses

Pr(Y2A¯1=a=0)=E{I(A0=a,A1=a)(1Y1)(1Y2)Pr(A0=a|L0)Pr(A1=a|A0=a,L0,L1)}.\Pr(Y_{2}^{\bar{A}_{1}=a}=0)=E\left\{\frac{I(A_{0}=a,A_{1}=a)(1-Y_{1})(1-Y_{2})}{\Pr(A_{0}=a|L_{0})\Pr(A_{1}=a|A_{0}=a,L_{0},L_{1})}\right\}. (19)

Based on the tree diagram, a non-parametric estimate of this is

Pr^(Y2A¯1=a=0)=1n{n0a,0a0,0(n0an0)1(n0a,0a0n0a,00)1+n0a,1a0,0(n0an0)1(n0a,1a0n0a,10)1+n1a,0a0,0(n1an1)1(n1a,0a0n1a,00)1+n1a,1a0,0(n1an1)1(n1a,1a0n1a,10)1},\begin{split}\widehat{\Pr}(Y_{2}^{\bar{A}_{1}=a}=0)=&\frac{1}{n}\left\{n_{0a,0a}^{0,0}\left(\frac{n_{0a}}{n_{0}}\right)^{-1}\left(\frac{n_{0a,0a}^{0}}{n_{0a,0}^{0}}\right)^{-1}+n_{0a,1a}^{0,0}\left(\frac{n_{0a}}{n_{0}}\right)^{-1}\left(\frac{n_{0a,1a}^{0}}{n_{0a,1}^{0}}\right)^{-1}\right.\\ &+\left.n_{1a,0a}^{0,0}\left(\frac{n_{1a}}{n_{1}}\right)^{-1}\left(\frac{n_{1a,0a}^{0}}{n_{1a,0}^{0}}\right)^{-1}+n_{1a,1a}^{0,0}\left(\frac{n_{1a}}{n_{1}}\right)^{-1}\left(\frac{n_{1a,1a}^{0}}{n_{1a,1}^{0}}\right)^{-1}\right\},\end{split} (20)

where the four contributions come from individuals with the four possible combinations of (L0,L1)(L_{0},L_{1}).

The sequential trials approach estimates Pr(Y2A¯1=a=0|L0)\Pr(Y_{2}^{\bar{A}_{1}=a}=0|L_{0}), which can be written

Pr(Y2A¯1=a=0|L0)=Pr(Y2A¯1=a=0|Y1A0=a=0,L0)Pr(Y1A0=a=0|L0).\Pr(Y_{2}^{\bar{A}_{1}=a}=0|L_{0})=\Pr(Y_{2}^{\bar{A}_{1}=a}=0|Y_{1}^{A_{0}=a}=0,L_{0})\Pr(Y_{1}^{A_{0}=a}=0|L_{0}). (21)

The term Pr(Y1A0=a=0|L0)\Pr(Y_{1}^{A_{0}=a}=0|L_{0}) was considered above. The first term, Pr(Y2A¯1=a=0|Y1A0=a=0,L0)\Pr(Y_{2}^{\bar{A}_{1}=a}=0|Y_{1}^{A_{0}=a}=0,L_{0}), is estimated by IPACW, because individuals with A1A0A_{1}\neq A_{0} are censored at time 1, and the remaining individuals with A1=A0A_{1}=A_{0} weighted by Pr(A1=a|A0=a,L0,L1)1\Pr(A_{1}=a|A_{0}=a,L_{0},L_{1})^{-1}. Under the assumptions of consistency, positivity and conditional exchangeability we can write

Pr(Y2A¯1=a=0|Y1A0=a=0,L0)=E{I(A1=a)(1Y2)Pr(A1=a|A0=a,L0,L1)|Y1A0=a=0,A0=a,L0}.\Pr(Y_{2}^{\bar{A}_{1}=a}=0|Y_{1}^{A_{0}=a}=0,L_{0})=E\left\{\frac{I(A_{1}=a)(1-Y_{2})}{\Pr(A_{1}=a|A_{0}=a,L_{0},L_{1})}|Y_{1}^{A_{0}=a}=0,A_{0}=a,L_{0}\right\}. (22)

Based on the tree diagram, a non-parametric estimate of this is

Pr^(Y2A¯1=a=0|Y1A0=a=0,L0=l0)=Pr^(Y2A¯1=a=0|Y1=0,A0=a,L0=l0)=1nl0a0{nl0a,0a0,0(nl0a,0a0nl0a,00)1+nl0a,1a0,0(nl0a,1a0nl0a,10)1}.\begin{split}\widehat{\Pr}(Y_{2}^{\bar{A}_{1}=a}=0|Y_{1}^{A_{0}=a}=0,L_{0}=l_{0})&=\widehat{\Pr}(Y_{2}^{\bar{A}_{1}=a}=0|Y_{1}=0,A_{0}=a,L_{0}=l_{0})\\ &=\frac{1}{n_{l_{0}a}^{0}}\left\{n_{l_{0}a,0a}^{0,0}\left(\frac{n_{l_{0}a,0a}^{0}}{n_{l_{0}a,0}^{0}}\right)^{-1}+n_{l_{0}a,1a}^{0,0}\left(\frac{n_{l_{0}a,1a}^{0}}{n_{l_{0}a,1}^{0}}\right)^{-1}\right\}.\end{split} (23)

Using this result along with the estimate Pr^(Y1A0=a=0|L0=l0)=Pr^(Y1=0|A0=a,L0=l0)=nl0a/nl0\widehat{\Pr}(Y_{1}^{A_{0}=a}=0|L_{0}=l_{0})=\widehat{\Pr}(Y_{1}=0|A_{0}=a,L_{0}=l_{0})=n_{l_{0}a}/n_{l_{0}}, it can be shown that the sequential trials estimate of Pr(Y2A¯1=a=0)=l0=0,1Pr(Y2A¯1=a=0|L0=l0)Pr(L0=l0)\Pr(Y_{2}^{\bar{A}_{1}=a}=0)=\sum_{l_{0}=0,1}\Pr(Y_{2}^{\bar{A}_{1}=a}=0|L_{0}=l_{0})\Pr(L_{0}=l_{0}) is the same as the MSM-IPTW estimate in (20). By comparing (20) and (23) we can see clearly the connection between the weights used in the MSM-IPTW approach and those used in the sequential trials approach (IPACW).

5.2 Specification and estimation of MSMs

In practice, model-based estimates are typically required instead of non-parametric estimates, as we usually have multiple time-dependent confounders to consider, including continuous variables. In this section we compare in more detail the form of the MSMs used in the two approaches, and how they are estimated using inverse probability weights.

A key difference in the MSMs used in the two approaches is that in the MSM-IPTW the MSM for the hazard at time tt includes the history of treatment up to time tt, A¯t\bar{A}_{\lfloor t\rfloor}, whereas the MSM used in the sequential trials approach involves only the treatment status assigned at the start of the trial. The MSM-IPTW approach therefore requires that the relation between treatment history A¯t\bar{A}_{\lfloor t\rfloor} and the hazard is correctly specified, whereas in the sequential trials approach there are limited options for the form of the MSM because only two treatment patterns are considered, because after the artificial censoring all individuals used in the analysis are either always treated or never treated. In MSM-IPTW, when baseline covariates are excluded from the MSM, the MSM is a fully marginal model - the possibility of interaction between treatment and baseline covariates is not ruled out but does not have to be modelled. If the MSM used in the MSM-IPTW approach includes baseline covariates L0L_{0} then any interactions that exist between treatment and L0L_{0} must be included in the MSM and correctly specified. Similarly, in the sequential trials approach, if there are interactions between treatment and the baseline covariates at the start of each trial, LkL_{k}, then these must be included in the MSM and correctly specified. MSMs that condition on baseline covariates are therefore more susceptible to misspecification. The sequential trials approach could use weighting in the first time period instead of adjustment for baseline covariates, though that is not how the method has been described or used to date. Conditioning on baseline covariates in the sequential trials approach enables use of empirical standardization to obtain risk difference estimates for a population with any distribution of the baseline covariates. When combined with assumptions about the form of the MSM for the hazard in each trial, this enables us to estimate the same causal estimand in the sequential trials approach as in the MSM-IPTW approach.

If the sequential trials analysis makes the assumption that the effect of treatment on the hazard is the same in all trials, i.e. does not depend on the time of treatment initiation, then the sequential trials approach has information about the effect of treatment initiation from several time origins. If the assumptions are valid, this re-use of individuals from several time origins could lead to gains in efficiency in the marginal risk difference estimate at some time horizons τ\tau relative to the MSM-IPTW approach. On the other hand, in the MSM-IPTW analysis individuals who switch their treatment status during follow-up continue to contribute information to the analysis, whereas in the sequential trials analysis individuals cease to contribute information when they deviate from their treatment group at the start of a given trial. This means that in the sequential trials analysis the number of individuals with longer term follow-up will be smaller than in the MSM-IPTW analysis.

To fit the MSMs, both approaches require time-updated weights, and hence the data should be formatted with multiple rows per individual (one for each visit for MSM-IPTW, and one for each visit within each trial for the sequential trials approach). In the MSM-IPTW approach (without conditioning on baseline covariates), the MSM is estimated by weighting individuals in the observed data using time-updated weights. In the sequential trials approach the MSM is instead estimated through adjustment in trial kk for baseline covariates LkL_{k} and artificial censoring of individuals when they deviate from their initial treatment group at time kk. The artificial censoring is addressed using IPACW. The weight up to time 1 after the start of a given trial is always equal to 1. In the MSM-IPTW approach with conditioning on baseline variables L0L_{0} the IPTW weight at a given time is the same as the IPACW weight used in the sequential trials approach for the trial starting at time 0 for individuals following the ‘always treated’ or ‘never treated’ regimes. In the MSM-IPTW approach without conditioning on baseline variables the IPTW weights are proportional to the IPACW weights used in the sequential trials approach. Because the MSM-IPTW approach includes individuals following any treatment regime (not just ‘always treated’ or ‘never treated’), we may expect to see more extreme weights with increasing follow-up, compared with the weights used in the sequential trials approach.

Because the two approaches use the data differently and because the MSM-IPTW approach could make use of more extreme weights, it is of interest to investigate the relative efficiency of the two approaches for estimating the causal estimand. We note that it is only appropriate to consider the relative efficiency of the two approaches when they target the same causal estimand, which, as discussed in section 4.4, requires assumptions that some parameters of the MSM used in the sequential trials approach are common across trials.

5.3 Compatibility of MSMs used in the two approaches

In Sections 4.3 and 4.4 we considered MSMs based on Cox models or on additive hazard models. In this section we show that one can use an additive hazard model for the MSM for the sequential trials analysis (expression (11)) (with common parameters assumed across trials) and an additive hazard model for the MSM used in the MSM-IPTW approach (expression (4), without conditioning on L0L_{0}), and that both MSMs can be correctly specified. However, if we use a Cox model for the MSM for the sequential trials analysis (expression (10)) and a Cox model for the MSM used in the MSM-IPTW approach (expression (3), then both MSMs cannot simultaneously be correctly specified in general. This is because the parameters of additive hazard models are collapsible, whereas hazard ratios in the Cox model are non-collapsible. Martinussen and Vansteelandt (2013)19 explained the implications of collapsibility for the use of Aalen additive hazards models and Cox models in the context of estimating the causal effect of a point treatment on survival. Keogh et al. (2021)23 extended to a longitudinal setting similar to that considered in this paper. Here we outline some key points in the context of the simple setting depicted in Figures 2 and 3.

For the MSM-IPTW analysis in this section we again focus on an MSM that is not conditional on L0L_{0}, as in (3),(4). The MSMs used in a MSM-IPTW analysis and in the sequential trials analysis may be considered to be consistent with each other if there exists an underlying fully conditional hazard model that implies both the MSM used in MSM-IPTW and the MSM used in the sequential trials approach, which is conditional on baseline covariates at the start of each trial. As above we consider estimating Pr(Y1A0=a=0)\Pr(Y_{1}^{A_{0}=a}=0) (or Pr(TA¯0=a>1)\Pr(T^{\underline{A}_{0}=a}>1)) and Pr(Y2A¯1=a=0)\Pr(Y_{2}^{\bar{A}_{1}=a}=0) (or Pr(TA¯1=a>2)\Pr(T^{\underline{A}_{1}=a}>2)). First consider a fully conditional additive hazard model of the form

h(t|A¯t,L¯t)=α0(t)+αA(t)At+αL(t)Lt.h(t|\bar{A}_{\lfloor t\rfloor},\bar{L}_{\lfloor t\rfloor})=\alpha_{0}(t)+\alpha_{A}(t)A_{\lfloor t\rfloor}+\alpha_{L}(t)L_{\lfloor t\rfloor}. (24)

Under this model the conditional survival probability at time 1 is

Pr(Y1=0|A0,L0)=exp(01(α0(u)+αA(u)A0+αL(u)L0)𝑑u)\Pr(Y_{1}=0|A_{0},L_{0})=\exp\left(-\int_{0}^{1}(\alpha_{0}(u)+\alpha_{A}(u)A_{0}+\alpha_{L}(u)L_{0})du\right) (25)

and the marginal probability of survival to time 1 is

Pr(Y1A0=a=0)=e01(α0(u)+αA(u)A0)𝑑uPr(L0=0)+e01(α0(u)+αA(u)A0+αL(u))𝑑uPr(L0=1)=e01(α0(u)+αA(u)A0)𝑑u,\begin{split}\Pr(Y_{1}^{A_{0}=a}=0)&=e^{-\int_{0}^{1}(\alpha_{0}(u)+\alpha_{A}(u)A_{0})du}\Pr(L_{0}=0)+e^{-\int_{0}^{1}(\alpha_{0}(u)+\alpha_{A}(u)A_{0}+\alpha_{L}(u))du}\Pr(L_{0}=1)\\ &=e^{-\int_{0}^{1}(\alpha_{0}^{*}(u)+\alpha_{A}(u)A_{0})du},\end{split} (26)

where 01α0(u)=01α0(u)𝑑u+log{Pr(L0=0)+e01αL(u)𝑑uPr(L0=1)}\int_{0}^{1}\alpha_{0}^{*}(u)=\int_{0}^{1}\alpha_{0}(u)du+\log\{\Pr(L_{0}=0)+e^{-\int_{0}^{1}\alpha_{L}(u)du}\Pr(L_{0}=1)\}. This expression is in the form of the survival probability from a marginal additive hazard model of the form

hTA0=a(t)=α0(t)+αA(t)a.h_{T^{{}^{A_{0}=a}}}(t)=\alpha_{0}^{*}(t)+\alpha_{A}(t)a. (27)

It follows that we could use an additive hazard model for Pr(Y1A0=a=0|L0)\Pr(Y_{1}^{A_{0}=a}=0|L_{0}) in the sequential trials analysis and an additive hazard model for Pr(Y1A0=a=0)\Pr(Y_{1}^{A_{0}=a}=0) in the MSM-IPTW analysis, and that both models can be correctly specified.

Next consider the probabilities Pr(Y2A¯1=a=0)=Pr(Y2A¯1=a=0|Y1A0=a=0)Pr(Y1A0=a=0)\Pr(Y_{2}^{\bar{A}_{1}=a}=0)=\Pr(Y_{2}^{\bar{A}_{1}=a}=0|Y_{1}^{A_{0}=a}=0)\Pr(Y_{1}^{A_{0}=a}=0) and Pr(Y2A¯1=a=0|L0)=Pr(Y2A¯1=a=0|Y1A0=a=0,L0)Pr(Y1A0=a=0|L0)\Pr(Y_{2}^{\bar{A}_{1}=a}=0|L_{0})=\Pr(Y_{2}^{\bar{A}_{1}=a}=0|Y_{1}^{A_{0}=a}=0,L_{0})\Pr(Y_{1}^{A_{0}=a}=0|L_{0}). Under the fully conditional additive hazard model in (24) it can be shown that (see Supplementary Materials Section A3)

Pr(Y2A¯1=a=0|Y1A0=a=0,L0)=exp{12(α0(u)+α(u)a)du+Δ(a,L0)}\Pr(Y_{2}^{\bar{A}_{1}=a}=0|Y_{1}^{A_{0}=a}=0,L_{0})=\exp\left\{-\int_{1}^{2}(\alpha_{0}(u)+\alpha_{(}u)a)du+\Delta(a,L_{0})\right\} (28)

and

Pr(Y2A¯1=a=0|Y1A0=a=0)=exp{12(α0(u)+α(u)a)du+Δ(a)},\Pr(Y_{2}^{\bar{A}_{1}=a}=0|Y_{1}^{A_{0}=a}=0)=\exp\left\{-\int_{1}^{2}(\alpha_{0}(u)+\alpha_{(}u)a)du+\Delta^{*}(a)\right\}, (29)

where

Δ(a,L0)=log{Pr(L1=0|Y1=0,A0=a,L0)+e12αL(u)𝑑uPr(L1=1|Y1=0,A0=a,L0)}\Delta(a,L_{0})=\log\left\{\Pr(L_{1}=0|Y_{1}=0,A_{0}=a,L_{0})+e^{-\int_{1}^{2}\alpha_{L}(u)du}\Pr(L_{1}=1|Y_{1}=0,A_{0}=a,L_{0})\right\}

and

Δ(a)=log{eΔ(a,0)Pr(L0=0)+eΔ(a,0)01αL(u)𝑑uPr(L1=1)Pr(L0=0)+e01αL(u)𝑑uPr(L1=1)}.\Delta^{*}(a)=\log\left\{\frac{e^{\Delta(a,0)}\Pr(L_{0}=0)+e^{\Delta(a,0)-\int_{0}^{1}\alpha_{L}(u)du}\Pr(L_{1}=1)}{\Pr(L_{0}=0)+e^{-\int_{0}^{1}\alpha_{L}(u)du}\Pr(L_{1}=1)}\right\}.

It follows that there exists an underlying conditional hazard model that gives rise to an additive model for the sequential trials analysis and for the MSM-IPTW analysis.

Secondly consider a fully conditional Cox proportional hazards model of the form

h(t|A¯t,L¯t)=h0(t)eβAAt+βLLt.h(t|\bar{A}_{\lfloor t\rfloor},\bar{L}_{\lfloor t\rfloor})=h_{0}(t)e^{\beta_{A}A_{\lfloor t\rfloor}+\beta_{L}L_{\lfloor t\rfloor}}. (30)

Under this model the conditional survival probability at time 1 is Pr(Y1=0|A0,L0)=eH0(1)eβAA0+βLL0\Pr(Y_{1}=0|A_{0},L_{0})=e^{-H_{0}(1)e^{\beta_{A}A_{0}+\beta_{L}L_{0}}} where H0(1)=01h0(u)𝑑uH_{0}(1)=\int_{0}^{1}h_{0}(u)du. Using this, the marginal probability of interest can be written

Pr(Y1A0=a=0)=exp(H0(1)eβAA0)Pr(L0=0)+exp(H0(1)eβAA0+βL)Pr(L0=1).\Pr(Y_{1}^{A_{0}=a}=0)=\exp\left(-H_{0}(1)e^{\beta_{A}A_{0}}\right)\Pr(L_{0}=0)+\exp\left(-H_{0}(1)e^{\beta_{A}A_{0}+\beta_{L}}\right)\Pr(L_{0}=1). (31)

This expression for Pr(Y1A0=a=0)\Pr(Y_{1}^{A_{0}=a}=0) is not the form of the survival probability from a marginal Cox proportional hazards model. Therefore, if a Cox model was assumed for hTa¯0(t|L0)h_{T^{\underline{a}_{0}}}(t|L_{0}) in the sequential trials analysis, then this does not imply a Cox model for hTa¯0(t)h_{T^{\underline{a}_{0}}}(t) in the MSM-IPTW analysis, and vice-versa. If Cox models are used for a sequential trials analysis and a MSM-IPTW analysis then the two analyses make different modelling assumptions that cannot both be true. In practice, however, a Cox model could be a good working model for both approaches.

The result in this section has implications for comparing estimates obtained from the MSM-IPTW approach and the sequential trials approach, and we make use of this in the simulation study below by using additive hazards models.

Figure 2: Causal tree diagram illustrating a study with a binary treatment AkA_{k} and binary covariate LkL_{k}, both measured at two time points (t=0,1t=0,1). YtY_{t} is a discrete time survival outcome. Thick lines indicate branches for groups of individuals who were untreated or treated at both time points.
Refer to caption
Figure 3: Illustration of the sequential trials approach using causal tree diagram from Figure 2. (a) In trial starting at time t=0t=0 individuals with Y1=0Y_{1}=0 are censored at time 1 if A1A0A_{1}\neq A_{0}. (b) The trial starting at time t=1t=1 is restricted to individuals with A0=0A_{0}=0 (and Y1=0Y_{1}=0).
Refer to caption

6 A simulation study for the comparison of MSM-IPTW and the sequential trials approach

6.1 Simulation plan

To evaluate and compare the performance of the two methods in question we will now consider a simulation study based on the data generating simulation algorithm for longitudinal and time-to-event data described by Keogh et al. 23. We follow the general recommendations of Morris et al.(2019)47 on the conduct of simulation studies for evaluating statistical methods. R code for replicating the simulation is provided at https://github.com/ruthkeogh/sequential_trials.

Aim

Our aim is to compare the MSM-IPTW and sequential trials approaches for estimating the effect of a time-varying treatment on survival, subject to time-dependent confounding, from longitudinal observational data. Our focus is on a setting in which both approaches target the same causal estimand (see below) and hence it is relevant to assess their relative efficiency. Both methods are expected to produce approximately unbiased estimates when the modelling assumptions are met. We hypothesise that the sequential trials analysis could be more efficient than the MSM-IPTW analysis in certain settings, depending on the data generating mechanism and time horizon τ\tau. MSM-IPTW is inefficient when there are extreme weights. The efficiency of the sequential trials approach is expected to depend on the proportion of individuals always in the treatment group or always in the control group, since when many individuals switch over time this would result in few people contributing as follow-up time increases, and the potential for large weights.

Data generating mechanisms

Data are generated according to the DAG in Figure 1. Individuals are observed at up to 5 visits, with administrative censoring at time 5. We consider three simulation scenarios, starting with a ‘standard’ scenario (scenario 1) and then investigating the performance of the methods when certain aspects of the data generating mechanism are varied. The simulation scenarios are outlined in detail in Table 1. In all scenarios, once an individual starts treatment they remain on treatment, i.e., for any kk, there are no individuals that go from Ak=1A_{k}=1 to Ak+1=0A_{k+1}=0. Also in all scenarios there is a single time-dependent confounder LL, the model for LkL_{k} conditional on Ak1A_{k-1}, Lk1L_{k-1},kk and UU is the same, and event times are generated according to the same conditional additive hazards model

h(t|A¯t,L¯t,U)=α0+αAAt+αLLt+αUU.h(t|\bar{A}_{\lfloor t\rfloor},\bar{L}_{\lfloor t\rfloor},U)=\alpha_{0}+\alpha_{A}A_{\lfloor t\rfloor}+\alpha_{L}L_{\lfloor t\rfloor}+\alpha_{U}U. (32)

In scenarios 1 and 2 the log odds ratio for LkL_{k} in the logistic model for the probability of Ak=1A_{k}=1 is 0.5, giving ‘moderate’ dependence of treatment initiation on LkL_{k}. In scenario 3, the log odds ratio for LkL_{k} in the logistic model for the probability of Ak=1A_{k}=1 is increased to 3, meaning that there is strong dependence of treatment initiation on LkL_{k}.

In scenario 1 the intercept in the logistic model for the probability of Ak=1A_{k}=1 is such that 25% of individuals have A0=1A_{0}=1. In scenario 2 the intercept in the logistic model for the probability of Ak=1A_{k}=1 is reduced so that the proportion of individuals initiating treatment at a given visit is lower, with approximately 5% having A0=1A_{0}=1.

In scenarios 1, 2 and 3, respectively, approximately 57%, 62% and 56% of individuals have the event of interest during the 5 years of follow-up. We do not include any censoring apart from administrative censoring, though extensions to include this are straightforward.

The reason for using an additive hazards model for generating event times is that it enables us to specify a model that is correct under both the MSM-IPTW and the sequential trials analysis, as discussed in Section 5.3, thus enabling a fair comparison between the two approaches.

Estimand

The estimand of interest is the marginal risk difference in (1). We consider time horizons on a continuous scale up to time τ=5\tau=5. The population of interest for the marginal risk difference is the set of nn individuals observed at time 0.

Methods

The data are analysed using the MSM-IPTW approach and the sequential trials approach. In the MSM-IPTW approach we considered MSMs with and without conditioning on L0L_{0}. When the conditional additive hazard model is as in (32), it can be shown 23 that the correct form for the MSM used in the MSM-IPTW analysis using the MSM that is not conditional on L0L_{0} is

hTa¯0(t)=α0(t)+j=0tα~Aj(t)atjh_{T^{\underline{a}_{0}}}(t)=\alpha_{0}(t)+\sum_{j=0}^{\lfloor t\rfloor}\tilde{\alpha}_{Aj}(t)a_{\lfloor t\rfloor-j} (33)

and the correct form for the MSM conditional on L0L_{0} is

hTa¯0(t|L0)=α0(t)+j=0tα~Aj(t)atj+αL(t)L0.h_{T^{\underline{a}_{0}}}(t|L_{0})=\alpha_{0}(t)+\sum_{j=0}^{\lfloor t\rfloor}\tilde{\alpha}_{Aj}(t)a_{\lfloor t\rfloor-j}+\alpha_{L}(t)L_{0}. (34)

The correct form for the MSM used in the sequential trials analysis, under our data generating mechanism, is

hTa¯k=a(tk|Lk)=α0(tk)+αA(tk)a+αL(tk)Lk,k=0,,4.h_{T^{\underline{a}_{k}=a}}(t-k|L_{k})=\alpha_{0}(t-k)+\alpha_{A}(t-k)a+\alpha_{L}(t-k)L_{k},\qquad k=0,\ldots,4. (35)

The coefficients in the fully conditional hazard model in (32) do not depend on time, and this results in the coefficients in the MSM for the sequential trials analysis being a function of time since the start of the trial, but being the same across trials k=0,,4k=0,\ldots,4.

We use stabilized weights for all analyses. In the MSM-IPTW approach using the MSM in (33) (without conditioning on L0L_{0}) the stabilized IPTW at time tt are

k=0tPr(Ak|A¯k1)Pr(Ak|L¯k,A¯k1)\prod_{k=0}^{\lfloor t\rfloor}\frac{\Pr(A_{k}|\bar{A}_{k-1})}{\Pr(A_{k}|\bar{L}_{k},\bar{A}_{k-1})} (36)

and in the MSM-IPTW approach using the MSM in (34) (with conditioning on L0L_{0}) the stabilized IPTW at time tt are

k=0tPr(Ak|A¯k1,L0)Pr(Ak|L¯k,A¯k1),\prod_{k=0}^{\lfloor t\rfloor}\frac{\Pr(A_{k}|\bar{A}_{k-1},L_{0})}{\Pr(A_{k}|\bar{L}_{k},\bar{A}_{k-1})}, (37)

In the sequential trials approach the stabilized IPACW at time tt (for t1t\geq 1) after the start of the trial for the trial starting at visit kk are (for t1t\geq 1)

{1for Ak=1j=1tPr(Ak+j=0|Lk,A¯k+j1=0)Pr(Ak+j=0|L¯j,A¯k+j1=0)for Ak=0\left\{\begin{array}[]{cc}1&\mbox{for }A_{k}=1\\ \prod_{j=1}^{\lfloor t\rfloor}\frac{\Pr(A_{k+j}=0|L_{k},\bar{A}_{k+j-1}=0)}{\Pr(A_{k+j}=0|\bar{L}_{j},\bar{A}_{k+j-1}=0)}&\mbox{for }A_{k}=0\end{array}\right. (38)

The IPACW are equal to 1 up to time 1 after the start of each trial, i.e. up to time k+1k+1 for the trial starting at visit kk.

The weights take into account that individuals do not stop treatment once they start. In MSM-IPTW we have Pr(Ak=1|A¯k1=1)=1\Pr(A_{k}=1|\bar{A}_{k-1}=1)=1, Pr(Ak=1|L¯k,A¯k2,Ak1=1)=1\Pr(A_{k}=1|\bar{L}_{k},\bar{A}_{k-2},A_{k-1}=1)=1 and Pr(Ak=1|A¯k1=1,L0)=1\Pr(A_{k}=1|\bar{A}_{k-1}=1,L_{0})=1. All other probabilities used in the weights were estimated using logistic regression models fitted using all visits combined (k=0,,4k=0,\ldots,4). The probabilities Pr(Ak|A¯k2,Ak1=0)\Pr(A_{k}|\bar{A}_{k-2},A_{k-1}=0) were estimated using a logistic regression model for AkA_{k} with a separate intercept for each kk, in the subset of individuals with Ak1=0A_{k-1}=0. The probabilities Pr(Ak|A¯k2,Ak1=0,L0)\Pr(A_{k}|\bar{A}_{k-2},A_{k-1}=0,L_{0}) were estimated using a logistic regression model for AkA_{k} with L0L_{0} as a covariate, and allowing both the intercept and the coefficients for L0L_{0} to differ for each kk, fitted using the subset of individuals with Ak1=0A_{k-1}=0. The probabilities Pr(Ak|L¯k,A¯k2,Ak1=0)\Pr(A_{k}|\bar{L}_{k},\bar{A}_{k-2},A_{k-1}=0) were estimated using a logistic regression model for AkA_{k} with LkL_{k} as a covariate in the subset of individuals with Ak1=0A_{k-1}=0, which is the correct model under our data generating mechanism. Similar models were used to estimate the weights for the sequential trials analysis, but with L0L_{0} replaced by LkL_{k} in trial kk for the model used to estimate the probabilities in the numerator of the weights in (38). The models were fitted across all trials combined, which was valid under our data generating mechanism.

Performance measures

We assess the performance of the methods in terms of bias and efficiency of the risk difference estimates. To obtain the bias we need to know the true values. True values for the survival probabilities and the risk difference were obtained by generating data as though from a large randomized controlled trial, as explained in 23. For this, we first generated L0L_{0} for 1 million individuals, according to the model outlined in Table 1. Two data sets are then created, one in which the 1 million individuals are set to have Ak=1A_{k}=1 (t=0,,4t=0,\ldots,4) (‘always treated’) and another in which they are all set to have At=0A_{t}=0 (k=0,,4k=0,\ldots,4) (‘never treated’). In each data set the LkL_{k} (k=1,,4k=1,\ldots,4) are then generated sequentially using the model for LkL_{k} in Table 1, with LkL_{k} being generated conditional on Lk1L_{k-1}, UU and AkA_{k}. Event times were generated in each data set according to the conditional additive hazard model in (32). The true survival probabilities, and corresponding risk differences, under each treatment strategy (‘always treated’ and ‘never treated’) were then obtained using Kaplan-Meier estimates.

Plots are used to present survival probability estimates and bias and efficiency results for the risk differences at time horizons τ\tau in the range from 0 to 5. We also present results for the survival probabilities and risk differences at time horizons τ=1,2,3,4,5\tau=1,2,3,4,5 in tables. Estimates are accompanied by Monte Carlo errors.

Table 1: Simulation scenarios: data generating mechanisms
Data generating mechanism
General data UN(0,0.1)U\sim N(0,0.1)
generating mechanism L0N(U,1)L_{0}\sim N(U,1), LkN(δ0+δLLk1+δAAk1+δTk+U,1)L_{k}\sim N(\delta_{0}+\delta_{L}L_{k-1}+\delta_{A}A_{k-1}+\delta_{T}k+U,1) (k1k\geq 1)
logitPr(A0=1|L0)=γ0+γLL0\mathrm{logit}\Pr(A_{0}=1|L_{0})=\gamma_{0}+\gamma_{L}L_{0}
logitPr(Ak=1|A¯k1,L¯k)=γ0+γAAk1+γLLk\mathrm{logit}\Pr(A_{k}=1|\bar{A}_{k-1},\bar{L}_{k})=\gamma_{0}+\gamma_{A}A_{k-1}+\gamma_{L}L_{k}, (k1k\geq 1)
h(t|A¯t,L¯t)=α0+αAAt+αLLt+αUUh(t|\bar{A}_{\lfloor t\rfloor},\bar{L}_{\lfloor t\rfloor})=\alpha_{0}+\alpha_{A}A_{\lfloor t\rfloor}+\alpha_{L}L_{\lfloor t\rfloor}+\alpha_{U}U
Scenario 1 δ0=0,δL=0.8,δA=1,δT=0.1\delta_{0}=0,\delta_{L}=0.8,\delta_{A}=-1,\delta_{T}=0.1
γ0=1,γL=0.5\gamma_{0}=-1,\gamma_{L}=0.5
α0=0.2,αA=0.04,αL=0.015,αU=0.015\alpha_{0}=0.2,\alpha_{A}=-0.04,\alpha_{L}=0.015,\alpha_{U}=0.015
Scenario 2 δ0,δL,δA,δT\delta_{0},\delta_{L},\delta_{A},\delta_{T}: as in scenario 1
γ0=3,γL=0.5\gamma_{0}=-3,\gamma_{L}=0.5
α0,αA,αL,αU\alpha_{0},\alpha_{A},\alpha_{L},\alpha_{U}: as in scenario 1
Scenario 3 δ0,δL,δA,δT\delta_{0},\delta_{L},\delta_{A},\delta_{T}: as in scenario 1
γ0=1,γL=3\gamma_{0}=-1,\gamma_{L}=3
α0,αA,αL,αU\alpha_{0},\alpha_{A},\alpha_{L},\alpha_{U}: as in scenario 1

6.2 Results

For the MSM-IPTW approach we focus on the results obtained using the MSM conditional on L0L_{0}. Corresponding results for the MSM that is not conditional on L0L_{0} are shown in the Supplementary Materials (Supplementary Figures A1 and A2). Figure 4 shows the estimated survivor curves under the ‘always treated’ and ‘never treated’ regimes under the three simulation scenarios, alongside the true curves. Figure A3 summarises the corresponding results for bias in risk difference estimates. The results are summarised numerically at time points 1,2,3,4,51,2,3,4,5 in Table 2. The relative efficiency of the sequential trials approach compared with the MSM-IPTW approach is illustrated in Figure A4. Figure 7 shows plots of the largest weight used in the MSM-IPTW and sequential trials analyses in each of the 1000 simulated data sets, by time-period (because the weights are time-dependent).

In scenario 1 both methods give unbiased estimates of the risk difference at all time points. The sequential trials approach is more efficient up to time 4, after which the MSM-IPTW approach is more efficient. The efficiency gain from the sequential trials approach is greatest at earlier time points and diminishes as time progresses. Figure 7 shows that the sequential trials approach tends to use less extreme weights(IPACW) than the MSM-IPTW approach at the earlier time points, but that by time 4 after potential treatment initiation the IPTW do not appear to be more extreme than than the IPACW. The relative efficiency results are also a function of how the data are used and of modelling assumptions made. Under the assumption that treatment effects on the hazard are the same across trials (which is true under our data generating mechanism), the sequential trials approach uses information about the treatment effect on the hazard in time period 0<t<10<t<1 from 5 trials, time period 1t<21\leq t<2 from 4 trials and so on. Only the first trial (starting at k=0k=0) provides information about the time period 4t<54\leq t<5. The MSM-IPTW analysis draws information on the impact of treatment in time period 4t<54\leq t<5 from individuals who initiated treatment at times 0,1,2,3,40,1,2,3,4, Supplementary Table A1 shows the number of individuals observed at times t=0,1,2,3,4t=0,1,2,3,4 under the two approaches. The number of individuals observed at time 44 is considerably greater under the MSM-IPTW approach compared with the sequential trials approach.

In scenario 2, the sequential trials approach gives unbiased estimates of the risk difference at all time points. The MSM-IPTW approach gives unbiased estimates up to around time 3, with the risk difference being slightly biased upwards after time 3. Figure 4 shows that the survivor curve in the always treated group is slightly biased upwards, which we also see in Table 2. The bias is small in magnitude but statistically significant. Figure 7 comparing the IPACW to the IPCW shos that the largest weights tend to be considerably higher in the MSM-IPTW analysis compared with the sequential trials analysis. The difference between the IPACW and IPTW is much greater in scenario 2 compared with scenario 1, with the IPACW tending to be much less extreme than the IPTW at all time points. In scenario 2 the proportion of individuals initiating treatment at a given visit is low. Supplementary Table A1 shows that in scenario 2 the absolute numbers of individuals in the ‘always treated group’ is low across all visits. The bias for MSM-IPTW in scenario 2 is therefore attributed to the combination of large weights and finite sample bias due to small numbers of treated individuals. The efficiency plot in Figure A4 shows that the sequential trials analysis is more efficient than MSM-IPTW at all time points. From 4 and Table 2, we see that the variation in the estimates of the survival probabilities is much greater in MSM-IPTW for the ‘always treated’ regime. For the ‘never treated’ regime the variation is smaller using the sequential trials approach up to between times 2 and 3, after which the variation in the MSM-IPTW estimates is lower.

In scenario 3, both methods give bias in the risk difference after time 1 (Figure A3). Figure 4 shows that the survival probabilities for the ‘always treated’ regime are estimated without bias, but the survival probabilities for the ‘never treated’ regime have some upwards bias at later time points. The magnitude of the bias is small and similar under both methods. Figure A4 shows that the sequential trials analysis is more efficient up to around time 2.5, and the MSM-IPTW analysis is more efficient thereafter. In this scenario the time-varying covariate LkL_{k} is strongly predictive of treatment AkA_{k} (log odds ratio 3). Looking at the distribution of the largest weights in Figure 7 we see that after time 1 there are some very large weights under both methods. The largest weights tended to be higher in the MSM-IPTW approach. The number of individuals observed at each time point is greater under scenario 3 than scenario 2. In this case therefore, the bias may be attributed to near violations of the positivity assumption.

Results are similar from the MSM-IPTW analysis based on the MSM without conditioning on L0L_{0}. We also applied the analyses using truncated weights in which both IPTW and IPACW were truncated at the 95th percentile, which resulted in similar findings - see Supplementary Materials (Supplementary Figure A3 and Figure A4).

7 Application in the UK CF Registry

7.1 The target trial

We apply the methods to the motivating application introduced in Section 2. Table 3 summarises the target trial, which we emulate using longitudinal data on individuals observed in the UK CF Registry between 2008 and 2018, in addition to some data from up to three visits prior to 2008, which was used to ascertain eligibility (see below).

Some initial data set-up steps were taken as follows. The outcome is the composite of death or transplant, so visits recorded after transplant were omitted. Data are intended to be collected at annual visits. However, there are some visits considerably closer and wider apart. We omitted visits that were less than 6 months after the previous visit and where the previous and subsequent visit were less than 18 months apart. For individuals who do not have the event of interest (death or transplant) the censoring date was the earlier of 31 December 2018 and the date of the last visit plus 18 months. Where there was a period of more than 18 months after a given visit without the event or another visit being recorded, the individual was censored at the date of that visit plus 18 months, and any subsequent later visits excluded. The data allow for a maximum of 11 years of follow-up.

The eligibility criteria for the target trial are people with CF aged 12 or older and who have not used DNase for at least 3 years. For the emulated trial we identify individuals meeting these criteria according to UK CF Registry data recorded between 2008 and 2018. The emulated trial has some additional eligibility criteria that relate to the way in which the data are collected. An individual is defined as being eligible at a given visit if they have at least 3 preceding visits, with the patient recorded as not using DNase at each of those visits, and at least one visit in the prior 18 months. We used this (approximately) 3-year period of non-treatment use to minimise any bias arising from missing treatment data for an individual given year, or a missing annual review. Individuals who may have used DNase in the more distant past were included to increase sample size and because it was considered reasonable that the effect of re-starting DNase in this group would be similar to that for individuals who had never used DNase in the past. To be eligible at a given visit, individuals were also required to have observed measurements of forced expiratory volume in 1 second as percentage predicted (FEV1%) and body mass index (BMI) z-score at the visit and the previous visit. The other elements of the protocol are the same for the emulated trial as for the target trial. The treatment strategies are (i) start using DNase and continue using it throughout follow-up, and (ii) do not start using DNase during follow-up. The aim is to estimate risk differences between the two treatment strategies up to 11 years of follow-up.

There was a small amount of missing data in some of the time-dependent covariates. We used the last-observation-carried-forward to address this. For the genotype variable ‘missing’ was treated as a separate category.

7.2 Analysis

The MSM-IPTW analysis and the sequential trials analysis were both applied as outlined in Section 4 to estimate survival curves up to 11 years post-randomization under the two treatment regimes of ‘always treat’ or ‘never treat’ with DNase (Table 3), and corresponding risk differences. For the MSM-IPTW analysis an individual’s baseline visit (k=0k=0) was defined as the first visit in the period 2008-2018 at which an individual meets the emulated trial eligibility criteria. For the sequential trials analysis individuals were assessed for eligibility at each visit from the baseline visit onwards, resulting in 11 possible trials.

DNase use is recorded at each annual review visit and refers to whether the patient has been prescribed DNase over the past year. We identified a number of potential confounders, which include both demographic and clinical patient characteristics (see Table 4). These include two time-fixed covariates; sex and genotype class (a marker of the severity of the CF-causing mutation - low, high, not assigned, missing) 48, and several time-dependent covariates; age (in years), lung function measured using FEV1%, BMI z-score, chronic infection with Pseudomonas aeruginosa or use of nebulized antibiotics, infection with Staphylococcus aureus or Methicillin-resistant Staphylococcus aureus (MRSA), infection with Burkholderia cepacia, infection with Non-tuberculous mycobacteria (NTM), CF-related diabetes, pancreatic insufficiency, number of days on intravenous (IV) antibiotics at home or in hospital (categorized as 0, 1-14, 15-28, 29-42, 43+), other hospitalization (yes or no), use of other mucoactive treatments (hypertonic saline, mannitol or acetylcysteine), use of oxygen therapy, use of CFTR modulators (ivacaftor, lumacaftor/ivacaftor, or tezacaftor/ivacaftor). Of the time-dependent variables, FEV1% and BMI are measured on the day of the visit, whereas all of the other variables refer to information in the time since the previous visit (i.e. in the past year).

For each approach we obtained results using a Cox proportional hazards model and an Aalen’s additive hazard model (with time-varying coefficients) for the MSMs. In the MSMs used for the MSM-IPTW analysis the MSM included current treatment and treatment in the three prior years, and is conditional on the time-fixed covariates and baseline measures of the time-varying covariates, except visit year. The MSM used in the sequential trials analysis included the same set of covariates as used in the MSM-IPTW analysis, i.e. the time-fixed covariates and time-varying covariates as measured at the visit at the start of each trial. The continuous covariates, age, FEV1%,and BMI z-score, were modelled using restricted cubic splines (with 3 knots) in the MSMs. For the sequential trials analysis, we performed tests of whether the coefficient(s) for treatment status differed by trial. There was no evidence of this and so we used a combined analysis across trials. In the analyses using the Cox proportional hazards model we assessed the proportional hazards assumption for the treatment variable(s). In the sequential trials analysis there was no evidence against the proportional hazards assumption for the treatment variable, whereas in the MSM-IPTW analysis there was evidence against the proportional hazards assumption in a joint test for the four treatment variables in the model.

Logistic regression models were used to estimate stabilized weights for use in each analysis method. In the MSM-IPTW approach the model used for the denominator of the weights included all time-fixed covariates and the current values of the time-dependent covariates. The model used for the numerator of the weights included the time-fixed covariates and baseline measures of the time-varying covariates. In these models, DNase status recorded at visit kk is assumed to depend on FEV1% and BMI z-score recorded at visit k1k-1, and on other time-dependent variables as recorded at visit kk.

In the simulation we assumed that once an individual initiates treatment they always continue treatment. However, in this application some individuals also stop treatment. In the sequential trials analysis we therefore estimated separate IPACW for artificial censoring due to switching from Ak=0A_{k}=0 to Ak+1=1A_{k+1}=1 and for switching from Ak=1A_{k}=1 to Ak1=0A_{k-1}=0. The models for the weights included the same variables as included in the weights for the MSM-IPTW analysis. The continuous variables (Age, FEV1%, BMI z-score, visit year) were modelled using restricted cubic splines (with 3 knots) in the models for the weights.

For both approaches we also estimated stabilized weights for censoring due to loss-to-follow-up. These included the same variables as used in the treatment-related weights, with the difference that the censoring weights models for the probability of being censored between visits kk and k+1k+1 included measures of FEV1% and BMI z-score made at visit kk, rather than the measures from the previous visit. This is because the censoring indicator at visit kk refers to censoring before visit k+1k+1, whereas the treatment indicator at visit kk refers to treatment status since the previous visit.

There was missing data in some of the time-dependent covariates (\sim10% missingness in covariates at the baseline visit). We used the last-observation-carried-forward to address this. For the genotype variable ‘missing’ was treated as a separate category.

We wished to estimate the same marginal survival probabilities, and hence risk differences, using each analysis method. Both methods include conditioning on covariates measured at baseline. We are therefore able to standardize to any appropriate population. For this analysis we standardize to the population of individuals meeting the eligibility criteria in 2018. Visit year was included in the weights models but not in the set of variables included in the MSM. This is because there can only be little information about long term follow-up for individuals in the later years, which results in poor convergence in some models if visit year is included in the MSMs. There are therefore differences between the population to which the MSM-IPTW analyses refer and that to which the sequential trials analyses refer, in terms of the marginal distribution of baseline years from 2008-2017. We consider this to be a minor difference that is unlikely to result in clinically important differences between the results from the two approaches.

Bootstrapping was used to obtain 95% confidence intervals for the estimated survival curves under the two treatment regimes of ‘always treat’ or ‘never treat’ with DNase and the corresponding risk differences, using 1000 bootstrap samples and the percentile method.

7.3 Data overview

The eligibility criteria were met by 3855 individuals at at least one visit. Among these individuals there were 338 events (266 deaths and 72 transplants). Of the 3517 individuals who did not have the event, 1780 (51%) were administratively censored (defined as having the expected number of visits given their first year of meeting eligibility criteria, assuming one visit each year) and the remainder were censored due to loss-to-follow-up (which included individuals censored after a gap of more than 18 months without a visit). The characteristics of the individuals at the first visit at which the eligibility criteria were met are summarised in Table 4.

Table 5 summarises the number of individuals contributing to the MSM-IPTW and sequential trials analyses and their treatment status. The eligibility criteria were first met in 2008 (the first year we considered) by 1729 individuals (45%). In the sequential trials approach many individuals contribute to trials starting at multiple time points such that 18439 rows contribute information at the start of trials (Table 5(b)). At visit k=10k=10, there remain 643 individuals in the MSM-IPTW analysis but only 241 rows in the sequential trials analysis, due to the artificial censoring in the sequential trials approach. At the first visit at which eligibility criteria were met, 612 initiated DNase treatment (16%). Of those who initiated DNase at this first eligible visit, 69% used DNase at all visits, and of those who did not initiate DNase at the first eligible visit 53% were non-users at all visits. In the MSM-IPTW analysis the proportion of treated individuals is similar across visits, ranging from 15-17%. In the sequential trials analysis the proportion of treated individuals at a given visit is equivalent to the proportion ’always treated’ up to that visit (due to the artificial censoring), and this percentage increases over time from 15% at visit k=0k=0 to 22% at visit k=10k=10.

7.4 Results

Figure 8 shows the estimated survivor curves from the MSM-IPTW and sequential trials analyses, using Cox proportional hazards models and additive hazards models for the MSMs. The corresponding estimated risk differences are shown in Figure 9, where a risk difference greater than 1 indicates better survival in the ‘always treated’.

None of the results provide evidence that DNase use affects the outcome, either in the short or long term. In the MSM-IPTW analysis using a Cox proportional hazards model, the survival curve for the ‘always treated’ regime is consistently below that for the ‘never treated’ regime, with the differences increasing over time. In the sequential trials analysis using the Cox model the survival curves in the two treatment groups are very similar over time. Using the additive hazards model the survival curve for the ‘always treated’ regime is above that for the ‘never treated’ regime for the first few years, more so in the MSM-IPTW results, before the curves cross. As we would expect, the 95% CIs from using the Cox model are narrower than those from the additive hazards model. The 95% CIs are generally narrower under the sequential trials approach, except when using the additive hazards models in the later part of follow-up. Supplementary Figure A5 shows a plot of the distribution of the weights used under the two analysis approaches. This shows that a large number of rows are assigned a weight of 1 in the sequential trials analysis. This is because in the sequential trials data the first row corresponding to each trial has a weight of 1. There are no extreme weights in the MSM-IPTW analysis, but the distribution is less highly concentrated around 1.

8 Discussion

In this paper we have focused on estimating the effects of longitudinal treatment regimes on survival, using observational data subject to time-varying confounding of the treatment-outcome association. We have compared the MSM-IPTW approach with the sequential trials approach. A key contribution has been to specify causal estimands that can be identified using the sequential trials approach, and we showed how the previously described implementations can be shown to estimate underlying MSMs for counterfactual outcomes under certain assumptions. The methods were considered in the context of emulating a target trial and we outlined a general protocol for a target trial for longitudinal treatment regimes and time-to-event outcomes. We compared the MSM-IPTW and sequential trials approaches in terms of how the data are used, the form of the MSMs, how time-dependent confounding is addressed through the analysis, and when they identify the same parameters.

For time-to-event outcomes, estimands based on contrasts between risks rather than contrasts between hazards have been shown to have better causal interpretations. We outlined how the outputs from the sequential trials analysis and the MSM-IPTW analysis can be used to obtain marginal absolute risk estimates using empirical standardization, and hence to obtain marginal risk differences or ratios. Care should be taken over defining the population to which marginal risk estimates refer. Applications of the sequential trials approach in the literature have tended to focus on presentation of hazard ratio estimates. As well as giving hazard ratio estimates, Hernan et al. (2008)4 presented estimated survivor curves under the treated and untreated regimes. However, it was not clear how the survival curves were obtained or what population they referred to. Cole et al. (2017)49 and Garcia-Albinez et al (2017)13 also obtained survival curves using the sequential trials approach but similarly the methods were not described in detail. In this paper, we have provided clear guidance on how to obtain estimated survival curves from the MSM-IPTW and sequential trials approaches.

A key difference between the MSM-IPTW approach and the sequential trials approach is that the MSM-IPTW approach enables estimation of risks under any longitudinal treatment regime, whereas the sequential trials approach focuses only on ‘always treated’ and ‘never treated’ regimes. The sequential trials approach could be extended to consider different longitudinal treatment regimes, with individuals being censored when they deviate from a specified regime. This is an area for future investigation. We focused in this paper on a setting in which the trials used in the sequential trials approach have different lengths of follow-up, with trials starting at later times having shorter follow-up. It would also be possible to focus only on trials with the same fixed length of follow-up. This depends on the total length of follow-up available in the data, and on the time horizons of interest for the risk differences being estimated. For example, in the simulation we consider a situation with 5 visit times. Had we been interested in a risk difference for a time horizon of τ=2\tau=2, we could have used sequential trials of equal length starting at times 0,1,2, and 3. For the MSM analysis, the question would arise as to whether to focus on an analysis starting from time 0, or starting from time 3, say. Benefits of using sequential trials of equal length of follow-up could be that it is possible that each trial can estimate a separate baseline hazard covering the full length of follow-up needed for the risk difference.

Van der Laan et al. (2005)50 and Petersen and van der Laan (2007)24 introduced history-adjusted MSMs (HA-MSM), which are an extension of MSMs. In a HA-MSM, an MSM is used from a series of new time points (visits) in a study, with the MSM being conditional on covariates and treatment history up to that time, and assuming a common MSM across time points. Estimation of the MSMs is using IPTW. As well as being an extension of the MSM-IPTW approach, there is a clear connection between HA-MSMs and the sequential trials approach. Hernan and Robins (2016)11 referenced HA-MSMs in their paper describing the target trial framework when noting that eligibility criteria could be applied at multiple time points. A distinction between HA-MSMs and the sequential trials approach is that the sequential trials approach restricts to individuals who have not yet started the treatment at each new time origin, whereas the HA-MSM approach conditions on past treatment in the MSM. Like in MSMs, the HA-MSM approach considers all longitudinal treatment regimes, in contrast to the sequential trials approach, which focuses only on two treatment regimes. An example of the use of HA-MSMs in the context of a survival outcome was given by Bembom et al.(2009)51. However, the HA-MSM approach does not yet appear to have been much used in practice, perhaps due to initial criticisms of the method 25, 26. The sequential trials approach is also related to the sequential stratification method described by Schaubel et al. (2006)52, in which individuals initiating treatment at a given time point are matched to untreated individuals in a sequential manner over time. The analysis results in an estimate of the treatment effect on the treated. An area for future work is to further discuss the different sequential methods for causal inference for survival outcomes, in particularly to compare the estimands that they focus on and their assumptions.

In this paper we compared the MSM-IPTW and sequential trials approaches using a simulation study, focusing on bias and efficiency. We found that the sequential trials approach can be more efficient than the MSM-IPTW approach for estimating risk differences, but that this is not true at all time points: for risks at time points corresponding to longer follow-up the MSM-IPTW approach can be more efficient. We linked the efficiency gains from the sequential trials approach to the use of less extreme weights (IPACW) compared with the IPTW. It must also be recognised that efficiency gains from the sequential trials approach at some time points arises in part due to assumptions about consistency of the treatment effect across different trials (i.e. from different time origins). Throughout the paper we discussed the methods in terms of Cox proportional hazard models or Aalen additive hazard models for the MSM. Previous work in this context has focused on Cox models for the MSM. We have shown that the additive hazard model can have some advantages in terms of relaxing assumptions - for example, it naturally allows the hazard to depend on treatment duration in the sequential trials approach. Use of the the additive hazard model was also advantageous in the simulation study as it ensured that the analysis models used in both the MSM-IPTW approach and in the sequential trials approach could both be correctly specified, enabling a fair comparison of the methods. There have been few simulation comparisons of causal methods for time-to-event outcomes due to the challenges of making fair comparisons. As noted earlier, Karim et al.(2018)18 compared the MSM-IPTW and sequential trials approaches but focused on estimation of hazard ratios and compared efficiency of marginal and conditional hazard ratios, which are not the same quantity. In further work it would be of interest to use a similar simulation approach to compare other methods, including the other sequential methods noted above.

The methods were applied to investigate the effect of the treatment DNase for people with CF on the composite outcome of death or transplant using longitudinal observational data from the UK CF Registry. We did not find any evidence of a causal treatment effect using either the MSM-IPTW approach or the sequential trials approach. In recent work, longitudinal data from the UK CF Registry was used to estimate the longer term causal effect of DNase on the non-time-to-event outcomes of lung function and number of days of intravenous antibiotic medication using MSM-IPTW, g-computation and g-estimation 34, 35. This work found evidence of a benefit of DNase for people with low lung function, but not for those whose lung function was higher. Seaman et al. (2020)53 illustrated the structural nested cumulative survival time approach with an investigation of the effect of DNase use on survival using UK CF Registry data, and also did not find evidence of a treatment effect. Sawicki et al (2012)54 used longitudinal data from the US Cystic Fibrosis Foundation patient Registry to investigate the effect of DNase treatment on the risk of death in the next year using pooled logistic regression analyses, finding evidence of a reduced risk of death with DNase use. Limitations of our analysis include that Dnase is used by a large proportion of the population from a young age, and we focused on individuals initiating DNase aged 12 and older, as there are few deaths in younger individuals. It is possible that patients initiating DNase at older ages have particular characteristics that are also associated with survival, and which we did not fully capture in the set of adjustment variables used in our analysis. The number of events in our data was relatively low, which at least in part explains the wide uncertainty on our estimates. Our analysis included a relatively large number of adjustment variables compared to the number of events, which included several continuous variables, and we did not include any interaction terms in the weights models or the MSMs. In further work it would be of interest to investigate data-adaptive methods for flexible adjustment for the time-dependent confounders. Another important consideration in interpreting our results is that the UK CF Registry records DNase prescription, but adherence to the treatment is unknown and may be low and dependent on individual characteristics. In further work it would be of interest to use Registry data to investigate the impact of removing DNase treatment, particularly in light of new disease-modifying treatments that have been introduced for CF in recent years.

In summary, the MSM-IPTW and sequential trials approaches can both be used to estimate causal effects of time-varying treatments using observational data, controlling for time-dependent confounding. The sequential trials approach tends to use less extreme weights, which can result in substantial gains in efficiency relative to the MSM-IPTW approach. Both approaches rely on modelling assumptions. The MSM used in MSM-IPTW is prone to misspecification as it considers all treatment patterns, whereas the sequential trials approach focuses only on two regimes. However, the sequential trials approach is prone to model misspecification when a combined analysis across trials is used, and this risk of misspecification is arguably greater than that in the MSM-IPTW approach. The sequential trials approach is appealing in that it makes efficient use of longitudinal data that are often available in observational databases, and that it explicitly mimics a hypothetical trial. There are several areas for further development of this approach that could extend the types questions that it could be used to address.

Funding

RHK is funded by a UK Research & Innovation Future Leaders Fellowship (MR/S017968/1), JMG by the Research Council of Norway (Grant No. 273674), SRS by MRC programme grant MC_UU_00002/10, and GD by a UK Research & Innovation Future Leaders Fellowship (MR/T041285/1).

Data availability

This work used anonymized data from the UK Cystic Fibrosis Registry, which has Research Ethics Approval (REC ref: 07/Q0104/2). The use of the data was approved by the Registry Research Committee. Data are available following application to the Registry Research Committee
(https://www.cysticfibrosis.org.uk/the-work-we-do/uk-cf-registry/apply-for-data-from-the-uk-cf-registry). The authors thank people with CF and their families for consenting to their data being held in the UK CF Registry, and NHS teams in CF centres and clinics for the input of data into the Registry.

Figure 4: Simulation results: mean estimated survival curves obtained using the sequential trials analysis and the MSM-IPTW analysis. The faded lines show the estimates from each of the 1000 simulated data sets and the thick lines are the point-wise averages.
Refer to caption
Figure 5: Simulation results: bias in estimation of the risk difference using the sequential trials analysis and the MSM-IPTW analysis. The black line shows the bias at each time point and the grey area shows the Monte-Carlo 95% CI at each time point.
Refer to caption
Figure 6: Simulation results: relative efficiency of the sequential trials analysis compared with the MSM-IPTW analysis, defined as the ratio of the empirical variances of the risk difference estimates at each time.
Refer to caption
Figure 7: Simulation results: Plots of the largest weight (by time period) in each of the 1000 simulated data sets under the sequential trials analysis (IPACW) compared with the MSM-IPTW analysis. The weights are time-dependent and change at event visit k=0,,4k=0,\ldots,4. We obtained the largest weight in each time period, where in the sequential trials the time refers to time since the start of the trial. In the sequential trials analysis, the IPACW are equal to 1 up to time 1.
Refer to caption
Table 2: Simulation results: Estimated survival probabilities (’Est’) under the always treated (S1) and never treated (S0) regimes and risk difference(RD) (mean over 1000 simulations) at time τ\tau, with corresponding empirical standard deviation (over 1000 simulations). Corresponding bias in the estimates (×104\times 10^{4}), accompanied by the MC error (×100\times 100).
Time τ\tau
1 2 3 4 5
MSM-IPTW: scenario 1
S1 Est (SD) 0.854 (0.021) 0.738 (0.027) 0.644 (0.030) 0.566 (0.031) 0.498 (0.031)
Bias (MC error) 0.170 (0.066) 0.166 (0.084) 0.229 (0.094) 0.223 (0.097) 0.189 (0.099)
S0 Est (SD) 0.819 (0.015) 0.671 (0.019) 0.547 (0.024) 0.445 (0.029) 0.361 (0.033)
Bias (MC error) 0.002 (0.047) 0.114 (0.061) 0.100 (0.076) 0.137 (0.091) 0.098 (0.105)
RD Est (SD) 0.034 (0.026) 0.067 (0.033) 0.097 (0.040) 0.121 (0.044) 0.138 (0.047)
Bias (MC error) 0.168 (0.082) 0.053 (0.106) 0.129 (0.126) 0.086 (0.139) 0.092 (0.148)
Sequential trials: scenario 1
S1 Est (SD) 0.853 (0.015) 0.736 (0.019) 0.641 (0.022) 0.563 (0.024) 0.496 (0.027)
Bias (MC error) 0.068 (0.047) 0.010 (0.059) -0.068 (0.068) -0.086 (0.076) -0.037 (0.085)
S0 Est (SD) 0.819 (0.010) 0.669 (0.017) 0.544 (0.025) 0.442 (0.033) 0.359 (0.044)
Bias (MC error) -0.043 (0.031) -0.085 (0.054) -0.202 (0.079) -0.191 (0.105) -0.076 (0.140)
RD Est (SD) 0.034 (0.018) 0.067 (0.027) 0.097 (0.036) 0.121 (0.044) 0.137 (0.055)
Bias (MC error) 0.111 (0.058) 0.094 (0.085) 0.134 (0.112) 0.105 (0.140) 0.039 (0.173)
MSM-IPTW: scenario 2
S1 Est (SD) 0.855 (0.049) 0.739 (0.063) 0.648 (0.071) 0.573 (0.072) 0.506 (0.074)
Bias (MC error) 0.333 (0.154) 0.343 (0.199) 0.569 (0.224) 0.934 (0.227) 0.933 (0.233)
S0 Est (SD) 0.819 (0.012) 0.671 (0.015) 0.547 (0.016) 0.445 (0.018) 0.361 (0.018)
Bias (MC error) 0.018 (0.039) 0.123 (0.048) 0.133 (0.052) 0.112 (0.055) 0.077 (0.056)
RD Est (SD) 0.036 (0.050) 0.069 (0.065) 0.100 (0.073) 0.128 (0.075) 0.145 (0.077)
Bias (MC error) 0.315 (0.159) 0.219 (0.206) 0.436 (0.231) 0.823 (0.237) 0.856 (0.243)
Sequential trials: scenario 2
S1 Est (SD) 0.852 (0.026) 0.735 (0.036) 0.640 (0.042) 0.564 (0.048) 0.498 (0.056)
Bias (MC error) -0.032 (0.083) -0.125 (0.112) -0.162 (0.132) 0.038 (0.150) 0.195 (0.178)
S0 Est (SD) 0.819 (0.007) 0.669 (0.012) 0.544 (0.017) 0.442 (0.020) 0.359 (0.024)
Bias (MC error) -0.030 (0.023) -0.088 (0.039) -0.184 (0.052) -0.189 (0.063) -0.128 (0.075)
RD Est (SD) 0.033 (0.027) 0.066 (0.038) 0.096 (0.046) 0.122 (0.053) 0.140 (0.062)
Bias (MC error) -0.001 (0.086) -0.036 (0.122) 0.022 (0.145) 0.227 (0.168) 0.324 (0.197)
MSM-IPTW: scenario 3
S1 Est (SD) 0.852 (0.023) 0.736 (0.034) 0.642 (0.042) 0.564 (0.046) 0.497 (0.049)
Bias (MC error) 0.013 (0.074) 0.044 (0.108) -0.007 (0.134) 0.005 (0.146) 0.001 (0.154)
S0 Est (SD) 0.820 (0.018) 0.674 (0.040) 0.558 (0.056) 0.460 (0.067) 0.382 (0.072)
Bias (MC error) 0.100 (0.056) 0.478 (0.125) 1.165 (0.178) 1.631 (0.213) 2.181 (0.229)
RD Est (SD) 0.032 (0.033) 0.062 (0.065) 0.084 (0.086) 0.104 (0.097) 0.115 (0.103)
Bias (MC error) -0.087 (0.106) -0.434 (0.204) -1.172 (0.271) -1.626 (0.307) -2.180 (0.327)
Sequential trials: scenario 3
S1 Est (SD) 0.853 (0.017) 0.736 (0.027) 0.642 (0.044) 0.564 (0.060) 0.497 (0.060)
Bias (MC error) 0.046 (0.053) 0.005 (0.087) 0.021 (0.138) 0.079 (0.189) 0.077 (0.191)
S0 Est (SD) 0.819 (0.013) 0.672 (0.036) 0.554 (0.063) 0.458 (0.080) 0.383 (0.091)
Bias (MC error) -0.005 (0.041) 0.215 (0.115) 0.775 (0.198) 1.427 (0.254) 2.344 (0.287)
RD Est (SD) 0.033 (0.024) 0.064 (0.058) 0.088 (0.098) 0.106 (0.126) 0.114 (0.133)
Bias (MC error) 0.051 (0.076) -0.211 (0.182) -0.754 (0.309) -1.347 (0.399) -2.267 (0.420)
Table 3: Summary of the protocol of a target trial for studying the effect of DNase use on survival in people with cystic fibrosis (CF).
Protocol component
Eligibility criteria People with CF between 2008 and 2017 and aged 12 or older and who have not used DNase for at least 3 years.
Treatment strategies (i) Do not start using DNase during follow-up. (ii) Start using DNase and continue to use it throughout follow-up.
Assignment procedures Patients randomly assigned to either treatment strategy. Patients and their health care teams are aware of the patient’s treatment status.
Follow-up period Starts at randomization and ends at death or transplant, loss-to follow-up, or the end of 2018, whichever occurs first.
Outcome Death or transplant.
Causal contrasts of interest Per-protocol treatment effect. Survival curves up to 11 years under the two treatment strategies, and corresponding risk differences.
Table 4: Summary of individual characteristics at the first visit at which eligibility criteria were met.
Variable Mean (SD) Median (IQR)
Age (years) 24.32 (11.56) 21.6 (14.39,30.54)
FEV1% 76.12 (21.93) 78.8 (62.2,92.38)
BMI z-score 0.15 (1.22) 0.14 (-0.6,0.96)
N %
Sex
Male 2120 (55%)
Female 1735 (45%)
Genotype class
High 2586 (67%)
Low 596 (15%)
Not assigned 594 (15%)
Missing 79 (2%)
Presence of airway infections
Staphylococcus aureus or Methicillin-resistant Staphylococcus aureus (MRSA) 1676 (43%)
Pseudomonas Aeruginosa (or use of nebulized antibiotics) 2453 (64%)
Burkholderia cepacia 113 (3%)
Non-tuberculous mycobacteria (NTM) 162 (4%)
Presence of complications
CF-related diabetes 707 (18%)
Pancreatic insufficiency 3028 (79%)
Other treatments
Use of other mucoactive treatments (hypertonic saline, mannitol or acetylcysteine) 554 (14%)
Use of CFTR modulators (ivacaftor, lumacaftor/ivacaftor, or tezacaftor/ivacaftor) 54 (1%)
Use of oxygen therapy 153 (4%)
Hospitalisation (not for IV antibiotics) 145 (4%)
Days using IV antibiotics
0 2315 (60%)
1-14 640 (17%)
15-28 329 (9%)
29-42 227 (6%)
>>42 344 (9%)
Table 5: Summary of number of individuals contributing to the MSM-IPTW and sequential trials analyses and numbers untreated and treated: (a) by year, (b) by visit number.

(a) By year

MSM-IPTW Sequential trials
Year Number observed Untreated Treated Number observed Untreated Treated
2008 1729 1452 (84%) 277 (16%) 1729 1452 (84%) 277 (16%)
2009 537 440 (82%) 97 (18%) 1908 1599 (84%) 309 (16%)
2010 352 285 (81%) 67 (19%) 2004 1637 (82%) 367 (18%)
2011 359 310 (86%) 49 (14%) 2039 1731 (85%) 308 (15%)
2012 264 234 (89%) 30 (11%) 2040 1728 (85%) 312 (15%)
2013 179 156 (87%) 23 (13%) 1916 1652 (86%) 264 (14%)
2014 152 127 (84%) 25 (16%) 1866 1601 (86%) 265 (14%)
2015 123 100 (81%) 23 (19%) 1779 1519 (85%) 260 (15%)
2016 90 79 (88%) 11 (12%) 1629 1426 (88%) 203 (12%)
2017 70 60 (86%) 10 (14%) 1529 1325 (87%) 204 (13%)

(b) By visit number

MSM-IPTW Sequential trials
Visit Number observed Untreated Treated Number observed Untreated Treated
0 3855 3243 (84%) 612 (16%) 18439 15670 (85%) 2769 (15%)
1 3466 2901 (84%) 565 (16%) 16889 14321 (85%) 2568 (15%)
2 3121 2609 (84%) 512 (16%) 12302 10347 (84%) 1955 (16%)
3 2821 2350 (83%) 471 (17%) 8958 7408 (83%) 1550 (17%)
4 2500 2101 (84%) 399 (16%) 6308 5196 (82%) 1112 (18%)
5 2205 1851 (84%) 354 (16%) 4380 3564 (81%) 816 (19%)
6 1870 1565 (84%) 305 (16%) 2929 2335 (80%) 594 (20%)
7 1570 1308 (83%) 262 (17%) 1899 1490 (78%) 409 (22%)
8 1246 1036 (83%) 210 (17%) 1154 882 (76%) 272 (24%)
9 972 824 (85%) 148 (15%) 609 472 (78%) 137 (22%)
10 643 548 (85%) 95 (15%) 241 187 (78%) 54 (22%)
Figure 8: Estimated survival curves in the ‘always’ treated and ‘never treated’ with DNase from the MSM-IPTW and sequential trials analyses. The shaded areas show the 95% confidence intervals.
Refer to caption
Figure 9: Estimated risk difference (‘always’ treated vs ‘never treated’ with DNase) from the MSM-IPTW and sequential trials analyses. The shaded area shows the 95% confidence intervals.
Refer to caption

References

  • 1 Robins J, Hernán M, Brumback B. Marginal structural models and causal inference in epidemiology.. Epidemiology 2000; 11: 550–560.
  • 2 Hernán M, Brumback B, Robins J. Marginal Structural Models to Estimate the Causal Effect of Zidovudine on the Survival of HIV-Positive Men. Epidemiology 2000; 11: 561–569.
  • 3 Clare P, Dobbins T, Mattick R. Causal models adjusting for time-varying confounding – a systematic review of the literature. International Journal of Epidemiology 2019; 48: 254-265.
  • 4 Hernán M, Alonso A, Logan R, et al. Observational studies analyzed like randomized experiments: an application to postmenopausal hormone therapy and coronary heart disease.. Epidemiology 2008; 19: 766–779.
  • 5 Gran J, Røysland K, Wolbers M, et al. A sequential Cox approach for estimating the causal effect of treatment in the presence of time-dependent confounding applied to data from the Swiss HIV Cohort Study. Statistics in Medicine 2010; 29: 2757-2768.
  • 6 Danaei G, Rodríguez L, Cantero O, Logan R, Hernán M. Observational data for comparative effectiveness research: An emulation of randomised trials of statins and primary prevention of coronary heart disease. Statistical Methods in Medical Research 2013; 22: 70–96.
  • 7 Clark AJ, Salo P, Lange T, et al. Onset of impaired sleep as a predictor of change in health-related behaviours; analysing observational data as a series of non-randomized pseudo-trials. International journal of epidemiology 2015; 44(3): 1027–1037.
  • 8 Bhupathiraju S, Grodstein F, Rosner B, Stampfer M, Hu F, Willett W. Hormone Therapy Use and Risk of Chronic Disease in the Nurses Health Study: A Comparative Analysis With the Women’s Health Initiative. American Journal of Epidemiology 2017; 186: 696–708.
  • 9 Suttorp MM, Hoekstra T, Mittelman M, et al. Treatment with high dose of erythropoiesis-stimulating agents and mortality: Analysis with a sequential Cox approach and a marginal structural model.. Pharmacoepidemiology and Drug Safety 2015; 24: 1068-1075.
  • 10 Thomas LE, Yang S, Wojdyla D, Schaubel DE. Matching with time-dependent treatments: A review and look forward. Statistics in Medicine 2020.
  • 11 Hernán M, Robins J. Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available. American Journal of Epidemiology 2016; 183: 758–764.
  • 12 Hernán M, Sauer B, Hernandez-Diaz S, Platt R, Shrier I. Specifying a target trial prevents immortal time bias and other self-inflicted injuries in observational analyses. Journal of Clinical Epidemiology 2016; 79: 70–75.
  • 13 Garcia-Albeniz X, Hsu J, Hernán M. The value of explicitly emulating a target trial when using real world evidence: an application to colorectal cancer screening. European Journal of Epidemiology 2017; 32: 495-500.
  • 14 Hernán M. How to estimate the effect of treatment duration on survival outcomes using observational data.. British Medical Journal 2018; 360: k182.
  • 15 Didelez V. Commentary: Should the analysis of observational data always be preceded by specifying a target experimental trial?. International journal of epidemiology 2016; 45(6): 2049–2051.
  • 16 Petersen ML, van der Laan MJ. Causal models and learning from data: integrating causal modeling and statistical estimation. Epidemiology (Cambridge, Mass.) 2014; 25(3): 418.
  • 17 Gran J, Aalen O. Letter to the Editor: Comparison of statistical approaches dealing with time-dependent confounding in drug effectiveness studies (SMMR, Vol. 27, Issue 6, 2018). Statistical Methods in Medical Research 2019; 28: 321-322.
  • 18 Karim M, Petkau J, Gustafson P, et al. Comparison of statistical approaches dealing with time-dependent confounding in drug effectiveness studies. Statistical Methods in Medical Research 2018; 27: 1709–1722.
  • 19 Martinussen T, Vansteelandt S. On collapsibility and confounding bias in Cox and Aalen regression models. Lifetime data analysis 2013; 19(3): 279–296.
  • 20 Hernán M. The Hazards of Hazard Ratios. Epidemiology 2010; 21: 13–15.
  • 21 Aalen OO, Cook RJ, Røysland K. Does Cox analysis of a randomized survival study yield a causal treatment effect?. Lifetime data analysis 2015; 21(4): 579–593.
  • 22 Martinussen T, Vansteelandt S, Andersen P. Subtleties in the interpretation of hazard contrasts. Lifetime Data Analysis 2020; 36: 833–855.
  • 23 Keogh R, Seaman S, Gran J, Vansteelandt S. Simulating longitudinal data from marginal structural models using the additive hazard model. Biometrical Journal 2021.
  • 24 Petersen M, Deeks S, Martin J, van der Laan M. History-adjusted Marginal Structural Models for Estimating Time-varying Effect Modification. American Journal of Epidemiology 2007; 166: 985–993.
  • 25 Robins J, Hernán MA, Rotnitzky A. Invited Commentary: Effect Modification by Time-varying Covariates. American Journal of Epidemiology 2007; 166: 994-1002.
  • 26 Petersen M, van der Laan M. Petersen et al. Respond to ”Effect modification by time-varying covariates”. American Journal of Epidemiology 2007; 166: 1003-1004.
  • 27 Cystic Fibrosis Trust . UK Cystic Fibrosis Registry Annual Data Report 2019. 2020.
  • 28 Rowe S, Miller S, Sorscher E. Cystic Fibrosis. New England Journal of Medicine 2005; 352: 1992–2001.
  • 29 MacNeill SJ. Hodson and Geddes’ Cystic Fibrosisch. Epidemiology of Cystic Fibrosis: 18–40; CRC Press . 2016.
  • 30 Dodge J, Lewis P, Stanton M, Wilsher J. Cystic fibrosis mortality and survival in the UK: 1947-2003.. European Respiratory Journal 2007; 29: 522–526.
  • 31 Keogh R, Szczesniak R, Taylor-Robinson D, Bilton D. Up-to-date and projected estimates of survival for people with cystic fibrosis using baseline characteristics: A longitudinal study using UK patient registry data.. Journal of Cystic Fibrosis 2018; 17: 218–227.
  • 32 Pressler T. Review of recombinant human deoxyribonuclease (rhDNase) in the management of patients with cystic fibrosis. Biologics 2008; 2: 611–617.
  • 33 Yang C, Montgomery M. Dornase alfa for cystic fibrosis. Cochrane Database of Systematic Reviews 2018(9).
  • 34 Newsome S, Keogh R, Daniel R. Estimating long-term treatment effects in observational data: A comparison of the performance of different methods under real-world uncertainty. Statistics in Medicine 2018; 37: 2367–2390.
  • 35 Newsome S, Daniel R, Carr S, Bilton D, Keogh R. Investigating the effects of long-term dornase alfa use on lung function using registry data. Journal of Cystic Fibrosis 2019; 110–117.
  • 36 Taylor-Robinson D, Archangelidi O, Carr S, et al. Data Resource Profile: The UK Cystic Fibrosis Registry.. International Journal of Epidemiology 2017; 47: 9–10e.
  • 37 Royston P, Parmar M. Restricted mean survival time: an alternative to the hazard ratio for the design and analysis of randomized trials with a time-to-event outcome. BMC Medical Research Methodology 2013; 152: 152.
  • 38 Cox D. Regression Models and Life-Tables. Journal of the Royal Statistical Society (Series B) 1972; 34: 187–220.
  • 39 Aalen O. A linear regression model for the analysis of life times. Statistics in Medicine 1989; 8: 907–925.
  • 40 Aalen O, Borgan Ø, Gjessing H. Survival and Event History Analysis: A Process Point of View. New York: Springer . 2008.
  • 41 VanderWeele T. Concerning the consistency assumption in causal inference. Epidemiology 2009; 20: 880-883.
  • 42 Daniel R, Cousens S, De Stavola B, Kenward M, Sterne J. Methods for dealing with time-dependent confounding. Statistics in Medicine 2013; 32: 1584–1618.
  • 43 D’Agostino R, Lee ML, Belanger A, Cupples L, Anderson K, Kannel W. Relation of pooled logistic regression to time-dependent Cox regression analysis: The Framingham Heart Study. Statistics in Medicine 1990; 9: 1501–1515.
  • 44 Røysland K, Gran J, Ledergerber B, Wyl vV, Young J, Aalen O. Analyzing direct and indirect effects of treatment using dynamic path analysis applied to data from the Swiss HIV Cohort. Statistics in Medicine 2011; 30: 2947–2958.
  • 45 Richardson T, Rotnitzky A. Causal Etiology of the Research of James M. Robins. Statisticl Science 2014; 29: 459-484.
  • 46 Hernán M, Robins J. Causal Inference: What If. Boca Raton: Chapman and Hall/CRC . 2010.
  • 47 Morris T, White I, Crowther M. Using simulation studies to evaluate statistical methods. Statistics in Medicine 2019; 38: 2074–2102.
  • 48 McKone E, Goss C, Aitken M. CFTR Genotype as a Predictor of Prognosis in Cystic Fibrosis. Chest 2006; 130: 1141–1147.
  • 49 Cole S, Edwards J, Hall H, et al. Incident AIDS or Death After Initiation of Human Immunodeficiency Virus Treatment Regimens Including Raltegravir or Efavirenz Among Adults in the United States. Clinical Infectious Diseases 2017; 64: 1591–1596.
  • 50 van der Laan M, Petersen M, Joffe M. History-adjusted marginal structural models and statically-optimal dynamic treatment regimens.. International Journal of Biostatistics 2005; 1: article 4.
  • 51 Bembom O, van der Laan M, Haight T, Tager I. Leisure-time Physical Activity and All-cause Mortality in an Elderly Cohort. Epidemiology 2009; 20: 424–430.
  • 52 Schaubel D, Wolfe R, Port F. A sequential stratification method for estimating the effect of a time-dependent experimental treatment in observational studies.. Biometrics 2006; 62: 910–917.
  • 53 Seaman S, Dukes R, Vansteelandt S. Adjusting for time‐varying confounders in survival analysis using structural nested cumulative survival time models. Biometrics 2019.
  • 54 Sawicki G, Signorovitch J, Zhang J, Latremouille-Viau M, Wu E, Shi L. Reduced mortality in cystic fibrosis patients treated with tobramycin inhalation solution. Pediatric Pulmonology 2012; 47: 44–52.
  • 55 Cole S, Hernán M. Constructing inverse probability weights for marginal structural models. American Journal of Epidemiology 2008; 168: 656–664.

Causal inference in survival analysis using longitudinal observational data: Sequential trials and marginal structural models

Supplementary Material

A1 Inverse probability weights

A1.1 MSM-IPTW

To estimate MSMs using IPTW, the weight at time tt for individual ii is the inverse of their probability of their observed treatment pattern up time time tt given their time-dependent covariate history 2, 1:

Wi(t)=k=0t1Pr(Ak=Ak,i|L¯k,i,A¯k1,i,Tk)W_{i}(t)=\prod_{k=0}^{\lfloor t\rfloor}\frac{1}{\Pr(A_{k}=A_{k,i}|\bar{L}_{k,i},\bar{A}_{k-1,i},T\geq k)} (A1)

Some individuals can have very large weights, which can results in the parameters of the MSM being estimated very imprecisely, and therefore stabilized weights are typically used. The stabilized weight for individual ii is:

SWi(t)=k=0tPr(Ak=Ak,i|A¯k1,i,Tk)Pr(Ak=Ak,i|L¯k,i,A¯k1,i,Tk)SW_{i}(t)=\prod_{k=0}^{\lfloor t\rfloor}\frac{\Pr(A_{k}=A_{k,i}|\bar{A}_{k-1,i},T\geq k)}{\Pr(A_{k}=A_{k,i}|\bar{L}_{k,i},\bar{A}_{k-1,i},T\geq k)} (A2)

The model in the numerator of the stabilized weights can also include covariates measured at time 0, L¯0\bar{L}_{0}:

SWi(t)=k=0tPr(Ak=Ak,i|A¯k1,i,L¯0,i,Tk)Pr(Ak=Ak,i|L¯k,i,A¯k1,i,Tk),SW_{i}(t)=\prod_{k=0}^{\lfloor t\rfloor}\frac{\Pr(A_{k}=A_{k,i}|\bar{A}_{k-1,i},\bar{L}_{0,i},T\geq k)}{\Pr(A_{k}=A_{k,i}|\bar{L}_{k,i},\bar{A}_{k-1,i},T\geq k)}, (A3)

in which case the MSM must also condition on these variables, as noted in the main text.

The models used in the weights can be estimated using logistic regression, and they may be fitted using all visits combined or separately by visit.

A1.2 Sequential trials

For individuals in the trial starting at visit kk but who do not initiate treatment at that time (A¯k=0\bar{A}_{k}=0) the weight at time tkt-k from the start of the trial is

Wi,k0(tk)=j=k+1k+t1Pr(Aj=0|L¯j,i,A¯(k,j1),i=0,Tk)W^{0}_{i,k}(t-k)=\prod_{j=k+1}^{k+\lfloor t\rfloor}\frac{1}{\Pr(A_{j}=0|\bar{L}_{j,i},\bar{A}_{(k,j-1),i}=0,T\geq k)} (A4)

where A¯(k,j1)={Ak,Ak+1,,Aj1}\bar{A}_{(k,j-1)}=\{A_{k},A_{k+1},\ldots,A_{j-1}\} denotes the treatment status from visit kk to visit j1j-1. For those who initiate treatment at the start of trial kk (A¯k1=0,Ak=1\bar{A}_{k-1}=0,A_{k}=1) the weight is

Wi,k1(tk)=j=k+1k+t1Pr(Aj=1|L¯j,i,A¯k1,i=0,A¯(k,j1),i=1,Tk)W^{1}_{i,k}(t-k)=\prod_{j=k+1}^{k+\lfloor t\rfloor}\frac{1}{\Pr(A_{j}=1|\bar{L}_{j,i},\bar{A}_{k-1,i}=0,\bar{A}_{(k,j-1),i}=1,T\geq k)} (A5)

In the trial starting at visit kk, the weights are equal to 1 for all individuals between time kk and k+1k+1, i.e. up to follow-up time 1 in trial kk.

The corresponding stabilized weights are

SWi,k0(tk)=j=k+1k+tPr(Aj=0|L¯k,i,A¯(k,j1),i=0,Tk)Pr(Aj=0|L¯j,i,A¯(k,j1),i=0,Tk)SW^{0}_{i,k}(t-k)=\prod_{j=k+1}^{k+\lfloor t\rfloor}\frac{\Pr(A_{j}=0|\bar{L}_{k,i},\bar{A}_{(k,j-1),i}=0,T\geq k)}{\Pr(A_{j}=0|\bar{L}_{j,i},\bar{A}_{(k,j-1),i}=0,T\geq k)} (A6)

and

SWi,k1(tk)=j=k+1k+tPr(Aj=1|L¯k,i,A¯k1,i=0,A¯(k,j1),i=1,Tk)Pr(Aj=1|L¯j,i,A¯k1,i=0,A¯(k,j1),i=1,Tk)SW^{1}_{i,k}(t-k)=\prod_{j=k+1}^{k+\lfloor t\rfloor}\frac{\Pr(A_{j}=1|\bar{L}_{k,i},\bar{A}_{k-1,i}=0,\bar{A}_{(k,j-1),i}=1,T\geq k)}{\Pr(A_{j}=1|\bar{L}_{j,i},\bar{A}_{k-1,i}=0,\bar{A}_{(k,j-1),i}=1,T\geq k)} (A7)

where the probability in the numerator is conditional on the covariates L¯k\bar{L}_{k} observed at the start of trial kk.

The models used in the weights can be estimated using logistic regression, and they may be fitted using all visits combined or separately by visit, and across all trials combined or separately by trial.

A2 Assumptions

The no interference assumption is that the counterfactual event time for a given individual, Ta¯0T^{\underline{a}_{0}}, does not depend on the treatment received by any other individuals. The positivity assumption is that each individual has a strictly non-zero probability of receiving each given pattern of treatments over time. Consistency means that an individual’s observed outcome is equal to the counterfactual outcome when the assigned treatment pattern is set to that which was actually received, Ti=TiA¯0,iT_{i}=T_{i}^{\underline{A}_{0,i}}. The conditional exchangeability assumption can be expressed formally as TA¯k1,a¯kAk|A¯k1,L¯k,TkT^{\bar{A}_{k-1},\underline{a}_{k}}\perp\!\!\!\perp A_{k}|\bar{A}_{k-1},\bar{L}_{k},T\geq k for all feasible a¯k\underline{a}_{k}, where TA¯k1,a¯kT^{\bar{A}_{k-1},\underline{a}_{k}} denotes the counterfactual event time had an individual followed their observed treatment pattern up to time k1k-1, A¯k1\bar{A}_{k-1}, and had their treatments been set to a¯k\underline{a}_{k} from time kk onwards, given survival to time kk. The conditional exchangeability assumption means that among individuals who remain at risk of the event at time kk, the treatment AkA_{k} received at time kk may depend on past treatment and covariates A¯k1\bar{A}_{k-1} and L¯k\bar{L}_{k}, but that, conditional on these, it does not depend on the remaining lifetime that would apply if all future treatments were set to any particular values a¯k\underline{a}_{k}.

A3 Compatibility of MSMs used in the two approaches

To obtain expressions (28) and (29) in the main text we use the result that the probability Pr(Y2A¯1=a=0|Y1A0=a=0)\Pr(Y_{2}^{\bar{A}_{1}=a}=0|Y_{1}^{A_{0}=a}=0) can be written

Pr(Y2A¯1=a=0|Y1A0=a=0)=l0Pr(Y2A¯1=a=0|Y1A0=a=0,L0=l0)Pr(Y1=0|A0=a,L0=l0)Pr(L0=l0)l0Pr(Y1=0|A0=a,L0=l0)Pr(L0=l0)\begin{split}\Pr(Y_{2}^{\bar{A}_{1}=a}=0|&Y_{1}^{A_{0}=a}=0)=\sum_{l_{0}}\Pr(Y_{2}^{\bar{A}_{1}=a}=0|Y_{1}^{A_{0}=a}=0,L_{0}=l_{0})\frac{\Pr(Y_{1}=0|A_{0}=a,L_{0}=l_{0})\Pr(L_{0}=l_{0})}{\sum_{l_{0}^{\prime}}\Pr(Y_{1}=0|A_{0}=a,L_{0}=l_{0}^{\prime})\Pr(L_{0}=l_{0}^{\prime})}\end{split} (A8)

and the first term in the sum can then be written

Pr(Y2A¯1=a=0|Y1A0=a=0,L0)=l1Pr(Y2=0|Y1=0,A¯1=a,L0,L1=l1)Pr(L1=l1|Y1=0,A0=a,L0).\begin{split}\Pr(Y_{2}^{\bar{A}_{1}=a}=0|&Y_{1}^{A_{0}=a}=0,L_{0})=\sum_{l_{1}}\Pr(Y_{2}=0|Y_{1}=0,\bar{A}_{1}=a,L_{0},L_{1}=l_{1})\Pr(L_{1}=l_{1}|Y_{1}=0,A_{0}=a,L_{0}).\end{split} (A9)
Table A1: Simulation results. Summary of number of individuals observed at each time, number always treated and never treated, and corresponding percentages. Numbers and percentages are the mean across the 1000 simulations.
MSM-IPTW Sequential trials
Time kk Time kk
0 1 2 3 4 0 1 2 3 4
Scenario 1
Observed at time kk N 1000 829 694 588 502 2264 1414 873 509 238
% 100 83 69 59 50 100 62 39 22 10
Always treated from time 0 to kk N 279 237 204 177 155 643 514 397 282 155
% 28 29 29 30 31 28 36 46 55 65
Never treated from time 0 to kk N 721 424 249 144 82 1621 900 475 227 82
% 72 51 36 25 16 72 64 54 45 35
Scenario 2
Observed at time kk N 1000 821 675 556 458 3180 2178 1403 806 349
% 100 82 68 56 46 100 68 44 25 11
Always treated from time 0 to kk N 53 45 39 34 29 197 142 98 61 29
% 5 5 6 6 6 6 7 7 8 8
Never treated from time 0 to kk N 947 731 560 425 319 2983 2036 1305 745 319
% 95 89 83 76 70 94 93 93 92 92
Scenario 3
Observed at time kk N 1000 832 701 596 510 2083 1333 865 543 287
% 100 83 70 60 51 100 64 42 26 14
Always treated from time 0 to kk N 387 326 278 241 210 700 563 446 337 210
% 39 39 40 40 41 34 42 52 62 73
Never treated from time 0 to kk N 613 352 213 129 77 1383 770 419 206 77
% 61 42 30 22 15 66 58 48 38 27
Figure A1: Simulation results: bias in estimation of the risk difference using the sequential trials analysis and the MSM-IPTW analysis. The black line shows the bias at each time point and the grey area shows the Monte-Carlo 95% CI at each time point. The MSM-IPTW results are from the MSM not conditional on L0L_{0}.
Refer to caption
Figure A2: Simulation results: relative efficiency of the sequential trials analysis compared with the MSM-IPTW analysis, defined as the ratio of the empirical variances of the risk difference estimates at each time. The MSM-IPTW results are from the MSM not conditional on L0L_{0}.
Refer to caption
Figure A3: Simulation results using truncated weights: bias in estimation of the risk difference using the sequential trials analysis and the MSM-IPTW analysis. The black line shows the bias at each time point and the grey area shows the Monte-Carlo 95% CI at each time point. The MSM-IPTW results are from the MSM conditional on L0L_{0}.

. Refer to caption

Figure A4: Simulation results using truncated weights: relative efficiency of the sequential trials analysis compared with the MSM-IPTW analysis, defined as the ratio of the empirical variances of the risk difference estimates at each time. The MSM-IPTW results are from the MSM conditional on L0L_{0}.
Refer to caption
Refer to caption
Figure A5: Plots showing the distribution of the weights used in the MSM-IPTW and sequential trials analyses of the application using UK CF Registry data.

References

  • Cole and Hernan 2008 Cole, S. and Hernan, M. (2008). Constructing inverse probability weights for marginal structural models. American Journal of Epidemiology 168, 656–664.
  • Daniel et al. 2013 Daniel, R., Cousens, S., De Stavola, B., Kenward, M., and Sterne, J. (2013). Methods for dealing with time-dependent confounding. Statistics in Medicine 32, 1584–1618.