Protocol for an Observational Study on the Effects of Paternal Alcohol Use Disorder on Children’s Later Life Outcomes
Abstract.
The harmful effects of growing up with a parent with an alcohol use disorder have been closely examined in children and adolescents, and are reported to include mental and physical health problems, interpersonal difficulties, and a worsened risk of future substance use disorders. However, few studies have investigated how these impacts evolve into later life adulthood, leaving the ensuing long-term effects of interest. In this article, we provide the protocol for our observational study of the long-term consequences of growing up with a father who had an alcohol use disorder. We will use data from the Wisconsin Longitudinal Study to examine impacts on long-term economic success, interpersonal relationships, physical, and mental health. To reinforce our findings, we will conduct this investigation on two discrete subpopulations of individuals in our study, allowing us to analyze the replicability of our conclusions. We introduce a novel statistical design, called data turnover, to carry out this analysis. Data turnover allows a single group of statisticians and domain experts to work together to assess the strength of evidence gathered across multiple data splits, while incorporating both qualitative and quantitative findings from data exploration. We delineate our analysis plan using this new method and conclude with a brief discussion of some additional considerations for our study.
1. Background and Motivation
Alcohol use disorder (AUD), also known colloquially as alcohol dependence or alcoholism, has long been reported to run in families and numerous genetic factors have been identified as contributing to a greater risk of developing the disorder (Edenberg and Foroud [2013]). Along with this predisposition to future alcohol use disorders, growing up with a parent with AUD can adversely affect child and adolescent development, as well as their mental and physical health. Problematic drinking by parents can contribute to inconsistency or unpredictability in parenting behaviors and may also induce higher levels of marital conflict, a destabilizing influence on families associated with higher levels of child and adolescent alcohol use and aggression. It can also lead parents to model ineffective coping strategies and other problem behaviors (Windle [1996]). Moreover, children and adolescents of parents with AUD often have a less healthy lifestyle and more mental health problems compared to their peers (Serec et al. [2012]) and are more frequently affected by depression, anxiety, suicidal ideation, substance abuse, and interpersonal difficulties (Park and Schepp [2015]).
Unfortunately, these damaging effects are not uncommon and approximately 10.5% of children in the United States under 18 years of age live with a parent who has an alcohol use disorder (Lipari and Van Horn [2017]). Yet despite its prevalence, most studies on the impacts of growing up with a parent with AUD focus on the drawbacks experienced during childhood and adolescence (see Rossow et al. [2016] for a review), with fewer studying effects among young adults (e.g., Christoffersen and Soothill [2003], Klostermann et al. [2011]), and most research on adult outcomes limited to alcohol abuse or psychiatric problems (e.g., Anda et al. [2002]). Longitudinal studies such as Beardslee et al. [1986] present a unique opportunity to follow these children and examine how these issues evolve throughout their lives. We are interested in how various aspects of the later lives of these persons, particularly related to their health and wellness, are influenced by having grown up with a father who experienced alcohol use disorder. Previous literature has suggested that the effects of growing up with a father with AUD can manifest differently than those who had grown up with an affected mother or two affected parents (e.g., Steinhausen et al. [1984], Stout and Mintz [1996]), so we focus our investigation on the long-term impacts of growing up with a father who had an alcohol use disorder. Our findings will supplement and extend previous results on this topic, such as those recorded by Rydelius [1981] and Thor et al. [2022].
We will examine the impacts of growing up with a father with AUD on long-term economic success, interpersonal relationships, physical, and mental health using data from the Wisconsin Longitudinal Study (WLS), a long-term study of a random sample of over ten thousand individuals who graduated from Wisconsin high schools in 1957 (see, for instance, Sewell et al. [2003], Herd et al. [2014]). These data contain detailed information on the participants that were collected at various times over the decades since their graduation, making it uniquely suited to facilitate the analysis of later life outcomes as graduates reach their 50s, 60s, and 70s. In the 2011 wave of data collection, WLS respondents were asked, “When you were growing up, that is during your first 18 years, did you live with anyone who was a problem drinker or alcoholic?” Among the over one thousand graduates who answered affirmatively, more than 750 cited growing up with an affected father. This large sample size will help facilitate powerful data analysis and may lead to new discoveries that extend the current literature. The WLS also collected detailed measurements of variables that may confound the relationship between growing up with an affected parent and later life outcomes, such as those related to personal demographics and family background. We will use these variables to create matched sets of treated and control graduates, allowing us to compare the outcomes of similar individuals and lessen the burden of confounding biases.
We introduce a novel statistical design, called data turnover, to conduct this analysis. Data turnover allows for a single group of statisticians and domain experts to work together to assess the strength of evidence gathered across multiple data splits, while incorporating both qualitative and quantitative findings from data exploration. Exploratory data analysis allows us look at our data before making any (potentially dubious) assumptions, while enabling the recognition of any obvious errors, detection of outliers or anomalies, and shaping of new hypotheses. It also permits us to identify a variable that does not measure what we initially thought and help suggest a new one, or facilitate data-informed hypothesis selection (Bekerman et al. [2024]). Along with exploration, replication is a valuable tool for statisticians conducting observational studies. Replication of scientific results lends credence to these findings and serves to maintain and build trust in scientific knowledge as a whole. Previous works have facilitated analyses that allow for exploration but no replication (e.g., Cox [1975], Heller et al. [2009]), replication but no exploration (e.g., Zhao et al. [2018], Karmakar et al. [2019]), or both under the stringent requirement of two independent teams of investigators (Roy et al. [2022]). Data turnover allows us to explore the data and assess the replicability of our findings with only one group of researchers.
Evaluating the replicability of our findings will be made possible by analyzing the data separately on two discrete subpopulations of WLS graduates – those whose father graduated high school and those whose father did not. In observational studies like ours where we do not control assignment of individuals to receive treatment, obtaining similar results across multiple groups with different treatment assignment mechanisms strengthens the evidence that the treatment is in fact the cause of its ostensible effects (Rosenbaum [2001, 2015b]). Paul Rosenbaum (2015a) would liken our approach to accumulating evidence in a crossword puzzle: while each piece of evidence may be tentative when considered individually, assessing them jointly can lead to more convincing conclusions. Notably, the reasons for a more educated father drinking heavily may differ from those of a less educated father. For instance, those with less education might be more likely to live in neighborhoods with many alcohol retailers or more relaxed community norms with respect to excessive drinking ([Crum et al., 1993]). Meanwhile, an individual who graduated high school and continued his studies might have been exposed to excessive drinking at university and developed an unhealthy relationship with alcohol. Despite increased prevention efforts, binge drinking remains frequent among college students and one study found that roughly 40% of college students have binged in the previous two weeks (Wechsler et al. [2002], Clapp et al. [2003]). Employing data turnover, we can pre-specify how one group will be used to plan the analysis on the other group, and use exploratory data analysis on the other group to determine how to conduct inference on the original subpopulation. We refer to those outcomes for which the effect is significant on both groups as “replicable” findings (Bogomolov and Heller [2023]). In Section 5, we provide a detailed description on how we use this non-random cross-screening approach to identify replicable outcomes. We also discuss testing of the global null hypothesis; that is, identifying those outcomes which are affected on at least one of the two subpopulations.
The remainder of the document is organized as follows. We introduce the WLS dataset and describe our data processing in Section 2. In Section 3, we outline our matching methodology. We report our primary outcomes of interest in Section 4. Finally, we conclude with a description of our data analysis procedure in Section 5.
2. Wisconsin Longitudinal Study (WLS) Dataset
The WLS is an extensive, long-term study of 10,317 individuals who graduated from Wisconsin high schools in 1957. Detailed information pertaining to family background, childhood experiences, education, health, interpersonal relationships, economic status, and much more, were collected from the graduates or their parents across multiple waves from 1957 to 2020. These data are broadly representative of white American men and women, typically of German descent, who have graduated high school. The WLS sample does not well-represent certain large strata of American society, such as those of minority status and those without at least a high school education.
To investigate the myriad long-term impacts of growing up with a father with AUD, we examine outcomes collected during the 1992/1993 wave of data collection, where most graduates were in their early to mid 50s. We restrict our sample to those who completed both the phone interview and mail questionnaire to help mitigate issues with missingness due to non-response or censoring by death. We also omit those individuals who revealed growing up with more than one individual with AUD, those who reported an individual who was not their father, and subjects who did not report their father’s education level. Since we also have access to substantial information from siblings of the graduates, we impute any missing family background or treatment status data using answers provided by their siblings, if these data are available. Among the 2817 remaining graduates whose father did not complete a high school education, 2382 did not grow up with a problem drinker in the household, while 435 grew up with a father with AUD. On the other hand, there are 1846 remaining graduates whose father graduated high school, among which 1623 did not grow up with an affected father and 223 grew up with an affected father.
3. Matching
In order to lessen the burden of potential confounding biases, we create matched sets of treated and control graduates based on relevant baseline covariates to identify alike individuals and compare their outcomes. We employ matching separately for participants whose fathers did not graduate high school and those whose fathers completed their high school education. We list these variables on which we match in Appendix A.
It would be preferable to match treated and control graduates with exactly the same baseline covariates, yet doing so is difficult or impossible to implement when considering several covariates. Consequently, we instead look to create matched sets such that the mean value of each covariate among the matched treated subjects is close to that among the matched control subjects. We refer to this as covariate balance, which we assess using the absolute raw mean difference between matched treated and control units for binary covariates, and absolute standardized mean difference for continuous variables.
To deal with missingness in our baseline covariates, we assume that, conditional on the observed covariate values and pattern of missingness, the probability of treatment does not depend on the missing covariate values (Rosenbaum and Rubin [1984]). We can then append indicator variables for the pattern of missing data and impute any missing values for each covariate using the mean of the non-missing values. In each subpopulation of graduates, we remove subjects who lack overlap based on the pooled within group standard deviation of the estimated logit propensity score (Hansen [2009]). We then match each treated graduate to three controls via propensity score matching with a caliper, while exactly matching on sex, to achieve adequate covariate balance. Including multiple control units in each matched set allows us to improve the statistical efficiency of our estimators (Rosenbaum [2010], Chapter 8).
Among those graduates whose father did not complete a high school education, we obtain 434 matched sets (effective sample size 651), while we get 223 matched sets (effective sample size 334.5) of individuals whose father graduated high school. Effective sample sizes are calculated as the sum of the harmonic means of the number of units in treatment and control for each matched group (Hansen [2011]). Our matching successfully balances the individual covariates between treated and control groups, which we display graphically using Love plots in Appendix B.
4. Pre-Specified Outcomes of Interest
We collect representative outcomes related to health, well-being, interpersonal relationships, substance use, and economic success to examine the diverse long-term effects of growing up with a father with AUD. We list the corresponding variable name(s) in the WLS dataset, gathered during the 1992/1993 wave of data collection, in parentheses.
Physical Health and Wellness
-
•
Personal health rating. (z_mx001rer)
Mental Health and Wellness
-
•
Depression via modified CES-D. (z_mu001rec)
Interpersonal Relationships
-
•
Times meeting family + friends last month. (z_mz023rer, z_mz024rer)
Substance Use
-
•
Your alcohol use ever affect work or home? (z_ru032re, z_ru033re)
Economic Status
-
•
Total income in the last 12 months. (rp015ree)
5. Design of Data Analysis
We seek to identify which of the aforementioned outcomes were affected by growing up with a father affected by AUD. We will deem those outcomes for which the effect of the treatment is found to be significantly negative or positive in both subgroups - those raised by a more educated father and those with a less educated father - as replicable findings. We will also establish which of the outcomes are affected by the treatment on at least one of the two subpopulations, a weaker yet still informative standard of evidence akin to meta-analysis.
Our inference will be based on matched sets of graduates in the WLS sample, as described previously in Section 3. We introduce the following notation: for each outcome , we observe matched differences among those whose father graduated high school, and matched differences among those who did not. We use subscript since each outcome may have a different number of missing values, so the number of matched pairs may vary across outcomes (see Table 1 in Appendix C). For each outcome , we consider matched sets of graduates with more educated fathers, where each matched set has four subjects, one of which is treated with , while the others are controls with , such that . Sets have been matched for observed covariates such that for all . Let and denote the treated and control potential outcomes for the th subject in the th matched set when assigned to treatment and control, respectively, for . The observed value of outcome for this individual is ; thus, the individual treatment effect is not calculable (Neyman [1923], Rubin [1974]). Let be the vector of treatment assignments and be the vector of observed responses for outcome . These definitions are taken the same for individuals whose father did not graduate high school, for whom we replace superscript by superscript .
For each outcome , Fisher’s sharp null hypothesis (Fisher [1935]) for each subpopulation is
We assume Fisher’s sharp null and employ randomization inference to compute one-sided -values. For each outcome , we assess replicability by testing the composite null hypothesis given by the union of the sharp null hypotheses for both subpopulations of graduates, for which a rejection will allow us to conclude the existence of a treatment effect in each subgroup. We also test the global null by examining the intersection of the sharp null hypotheses for both subpopulations, for which a rejection will allow us to conclude the existence of a treatment effect in at least one subgroup. We propose a novel strategy to control the type I error rate of our analysis. Let be the number of subgroups where Fisher’s sharp null hypothesis regarding outcome is false. Our procedure outputs for each outcome a lower bound on the number of subgroups where Fisher’s sharp null hypothesis regarding this outcome is non-null. A lower bound is a trivial lower bound, a lower bound corresponds to the claim that outcome is non-null in at least one of the parts, i.e., a claim that the global null is false for this outcome. A lower bound corresponds to the claim that outcome is non-null in both parts, i.e., a replicability claim. Our data turnover method will guarantee control of the family-wise error rate (FWER) for these lower bounds, i.e., the probability that for at least one outcome its lower bound exceeds the true number of subgroups where Fisher’s null for this outcome is false, at no more than (see Benjamini and Heller [2008] and Benjamini et al. [2009] for similar error measures). Our new method may increase detection power compared to automated cross-screening (Zhao et al. [2018]) by allowing for a data-informed design plan and facilitating qualitative and quantitative insights from exploration, as well as the potential formation of novel hypotheses.
5.1. Data Turnover
-
Step 1:
Use the data for those graduates whose father did not graduate high school, hereafter referred to as subpopulation 1, to select the direction of the alternative regarding each outcome for those whose father did graduate high school, hereafter referred to as subpopulation 2, and select the hypotheses which will be tested for this latter subgroup. Finally, test the selected hypotheses for subpopulation 2.
-
-
Selection of directions of alternative for subpopulation 2: For each non-binary outcome , test on subpopulation 1 using a weighted version of -statistics, with weights of as proposed by Rosenbaum [2014], and return the right-sided and left-sided -values, and , respectively. For binary outcome , follow the same procedure, but instead conduct McNemar’s test to generate the -values. Select the right-sided alternative for outcome for subpopulation 2 if , and select the left-sided alternative otherwise. Define if right-sided alternative was selected and otherwise.
-
-
Selection of hypotheses for subpopulation 2: For each outcome , let be the minimum of the two one-sided -values for subpopulation 1. Select the hypotheses to be tested for subpopulation 2 based on each as follows: the hypothesis regarding outcome is selected if . Let be the set of indices of selected hypotheses, i.e., If there are no outcomes for which , take to be the outcome which has the minimum -value across all outcomes.
-
-
Testing the hypotheses for subpopulation 2: For each selected hypothesis with index , compute the one-sided -value where and are the right-sided and left-sided -values for outcome computed using the data for subpopulation 2, and is the indicator. Apply Holm’s procedure (Holm [1979]) on at level , where is the target FWER level for lower bounds. Let be the set of indices of outcomes such that for each , the null hypothesis for outcome was rejected on subpopulation 2.
-
-
-
Step 2:
In this step, we are able to explore the data in subpopulation 2 and choose in any manner how to conduct one-sided hypothesis tests for subpopulation 1. Test these hypotheses for subpopulation 1 and return the subset of rejected hypotheses, where is the number of any novel (i.e., not pre-specified) hypotheses tested for subpopulation 1, along with the indicator for the direction of the alternative for each where for right-sided alternative and for left-sided alternative.
-
Step 3:
For each , set where is the lower bound for the number of subpopulations for which the null hypothesis regarding outcome is false. Declare outcome as replicable if and only if and . Reject the global null for outcome or a novel hypothesis with index if and only if .
One intriguing question is, when working with split samples of unequal size, would it be more advantageous to pre-specify our analysis plan on the smaller or larger sample? Preliminary simulation results suggest the latter strategy, so that is how we describe our analysis plan above. We expect that incorporating data exploration in one subpopulation (see Step 2) will enhance the overall design of our study by facilitating data-informed decisions into our design and accommodating the possible development of new hypotheses. The flexibility afforded by our approach may allow for even further benefits we have not outlined nor yet anticipated. For replicability analysis, we are restricted to the pre-defined hypotheses in Section 4, but this constraint is removed for the analysis of the global null hypotheses and when conducting inference on each subgroup separately. This flexibility allows us to consider more complex null hypotheses, for instance, examining whether there is any effect modification among the outcomes. Moreover, the WLS dataset is richer than the list of outcomes outlined in Section 4, so an unrestricted look at the data may facilitate the development of additional novel hypotheses on related outcomes. For example, if self-rated physical health and wellness was affected, we might be interested in assessing which maladies in particular may have been driven by growing up with a father with AUD.
Moreover, we are also interested in reporting effect sizes for the rejected null hypotheses with well-defined parameters. Along with forming marginal confidence intervals for the selected parameters, we also suggest constructing on each subgroup level false coverage rate (FCR; Benjamini and Yekutieli [2005]) confidence intervals for these parameters, thereby providing an overall -level FCR guarantee on all confidence intervals constructed. On each subpopulation, we will determine the parameters for which a confidence interval will be constructed if the hypotheses regarding these parameters are rejected. If for a certain subpopulation where hypotheses were tested, parameters were selected at level , we will construct the confidence intervals for these parameters at level Consequently, the expected fraction of non-covering confidence intervals (i.e., the FCR) for each subgroup will be bounded above by .
References
- Anda et al. [2002] Robert F Anda, Charles L Whitfield, Vincent J Felitti, Daniel Chapman, Valerie J Edwards, Shanta R Dube, and David F Williamson. Adverse childhood experiences, alcoholic parents, and later risk of alcoholism and depression. Psychiatric Services, 53(8):1001–1009, 2002.
- Beardslee et al. [1986] William R Beardslee, Linda Son, and George E Vaillant. Exposure to parental alcoholism during childhood and outcome in adulthood: A prospective longitudinal study. The British Journal of Psychiatry, 149(5):584–591, 1986.
- Bekerman et al. [2024] William Bekerman, Abhinandan Dalal, Carlo del Ninno, and Dylan S Small. Planning for gold: Sample splitting for valid powerful design of observational studies. arXiv preprint arXiv:2406.00866, 2024.
- Benjamini and Heller [2008] Yoav Benjamini and Ruth Heller. Screening for partial conjunction hypotheses. Biometrics, 64(4):1215–1222, 2008.
- Benjamini and Yekutieli [2005] Yoav Benjamini and Daniel Yekutieli. False discovery rate–adjusted multiple confidence intervals for selected parameters. Journal of the American Statistical Association, 100(469):71–81, 2005.
- Benjamini et al. [2009] Yoav Benjamini, Ruth Heller, and Daniel Yekutieli. Selective inference in complex research. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 367(1906):4255–4271, 2009.
- Bogomolov and Heller [2023] Marina Bogomolov and Ruth Heller. Replicability across multiple studies. Statistical Science, 38(4):602–620, 2023.
- Christoffersen and Soothill [2003] Mogens Nygaard Christoffersen and Keith Soothill. The long-term consequences of parental alcohol abuse: A cohort study of children in Denmark. Journal of Substance Abuse Treatment, 25(2):107–116, 2003.
- Clapp et al. [2003] John D Clapp, James E Lange, Cristel Russell, Audrey Shillington, and Robert B Voas. A failed norms social marketing campaign. Journal of Studies on Alcohol, 64(3):409–414, 2003.
- Cox [1975] David R Cox. A note on data-splitting for the evaluation of significance levels. Biometrika, 62(2):441–444, 1975.
- Crum et al. [1993] Rosa M Crum, John E Helzer, and James C Anthony. Level of education and alcohol abuse and dependence in adulthood: A further inquiry. American Journal of Public Health, 83(6):830–837, 1993.
- Edenberg and Foroud [2013] Howard J Edenberg and Tatiana Foroud. Genetics and alcoholism. Nature Reviews Gastroenterology & Hepatology, 10(8):487–494, 2013.
- Fisher [1935] Ronald A Fisher. The Design of Experiments. Oliver and Boyd, Edinburgh, 1935.
- Hansen [2009] Ben B Hansen. Propensity score matching to recover latent experiments: Diagnostics and asymptotics. Technical report, University of Michigan, 2009.
- Hansen [2011] Ben B Hansen. Propensity score matching to extract latent experiments from nonexperimental data: A case study. In Looking back: Proceedings of a conference in honor of Paul W. Holland, pages 149–181. Springer, 2011.
- Heller et al. [2009] Ruth Heller, Paul R Rosenbaum, and Dylan S Small. Split samples and design sensitivity in observational studies. Journal of the American Statistical Association, 104(487):1090–1101, 2009.
- Herd et al. [2014] Pamela Herd, Deborah Carr, and Carol Roan. Cohort profile: Wisconsin longitudinal study (WLS). International Journal of Epidemiology, 43(1):34–41, 2014.
- Holm [1979] Sture Holm. A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics, pages 65–70, 1979.
- Karmakar et al. [2019] Bikram Karmakar, Benjamin French, and Dylan S Small. Integrating the evidence from evidence factors in observational studies. Biometrika, 106(2):353–367, 2019.
- Klostermann et al. [2011] Keith Klostermann, Rui Chen, Michelle L Kelley, Valarie M Schroeder, Abby L Braitman, and Theresa Mignone. Coping behavior and depressive symptoms in adult children of alcoholics. Substance Use & Misuse, 46(9):1162–1168, 2011.
- Lipari and Van Horn [2017] Rachel N Lipari and Struther L Van Horn. Children living with parents who have a substance use disorder. Substance Abuse and Mental Health Services Administration, 2017.
- Neyman [1923] Jerzy Neyman. On the application of probability theory to agricultural experiments. Essay on principles. Annals of Agricultural Sciences, pages 1–51, 1923.
- Park and Schepp [2015] Sihyun Park and Karen G Schepp. A systematic review of research on children of alcoholics: Their inherent resilience and vulnerability. Journal of Child and Family Studies, 24:1222–1231, 2015.
- Rosenbaum [2001] Paul R Rosenbaum. Replicating effects and biases. The American Statistician, 55(3):223–227, 2001.
- Rosenbaum [2010] Paul R Rosenbaum. Design of Observational Studies. Springer, 2010.
- Rosenbaum [2014] Paul R Rosenbaum. Weighted M-statistics with superior design sensitivity in matched observational studies with multiple controls. Journal of the American Statistical Association, 109(507):1145–1158, 2014.
- Rosenbaum [2015a] Paul R Rosenbaum. Cochran’s causal crossword. Observational Studies, 1(1):205–211, 2015a.
- Rosenbaum [2015b] Paul R Rosenbaum. How to see more in observational studies: Some new quasi-experimental devices. Annual Review of Statistics and Its Application, 2(1):21–48, 2015b.
- Rosenbaum and Rubin [1984] Paul R Rosenbaum and Donald B Rubin. Reducing bias in observational studies using subclassification on the propensity score. Journal of the American Statistical Association, 79(387):516–524, 1984.
- Rossow et al. [2016] Ingeborg Rossow, Lambert Felix, Patrick Keating, and Jim McCambridge. Parental drinking and adverse outcomes in children: A scoping review of cohort studies. Drug and Alcohol Review, 35(4):397–405, 2016.
- Roy et al. [2022] Samrat Roy, Marina Bogomolov, Ruth Heller, Amy M Claridge, Tishra Beeson, and Dylan S Small. Protocol for an observational study on the effects of giving births from unintended pregnancies on later life physical and mental health. arXiv preprint arXiv:2210.05169, 2022.
- Rubin [1974] Donald B Rubin. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66(5):688, 1974.
- Rydelius [1981] Per-Anders Rydelius. Children of alcoholic fathers: Their social adjustment and their health status over 20 years. Almqvist & Wiksell Periodical Company, 1981.
- Serec et al. [2012] Maša Serec, Igor Švab, Marko Kolšek, Vesna Švab, Diana Moesgen, and Michael Klein. Health-related lifestyle, physical and mental health in children of alcoholic parents. Drug and Alcohol Review, 31(7):861–870, 2012.
- Sewell et al. [2003] William H Sewell, Robert M Hauser, Kristen W Springer, and Taissa S Hauser. As we age: A review of the Wisconsin longitudinal study, 1957–2001. Research in Social Stratification and Mobility, 20:3–111, 2003.
- Steinhausen et al. [1984] Hans-Christoph Steinhausen, Dietmar Göbel, and Veronica Nestler. Psychopathology in the offspring of alcoholic parents. Journal of the American Academy of Child Psychiatry, 23(4):465–471, 1984.
- Stout and Mintz [1996] M Lisa Stout and Laurie B Mintz. Differences among nonclinical college women with alcoholic mothers, alcoholic fathers, and nonalcoholic parents. Journal of Counseling Psychology, 43(4):466, 1996.
- Thor et al. [2022] Siri Thor, Tomas Hemmingsson, Anna-Karin Danielsson, and Jonas Landberg. Fathers’ alcohol consumption and risk of substance-related disorders in offspring. Drug and Alcohol Dependence, 233:109354, 2022.
- Wechsler et al. [2002] Henry Wechsler, Jae Eun Lee, Meichun Kuo, Mark Seibring, Toben F Nelson, and Hang Lee. Trends in college binge drinking during a period of increased prevention efforts: Findings from 4 Harvard School of Public Health College Alcohol Study surveys: 1993–2001. Journal of American College Health, 50(5):203–217, 2002.
- Windle [1996] Michael Windle. Effect of parental drinking on adolescents. Alcohol Health and Research World, 20(3):181, 1996.
- Zhao et al. [2018] Qingyuan Zhao, Dylan S Small, and Paul R Rosenbaum. Cross-screening in observational studies that test many hypotheses. Journal of the American Statistical Association, 113(523):1070–1084, 2018.
Appendix A Baseline Covariates
We gather variables related to personal demographics and family background to create matched sets of treated and control graduates. We list the corresponding variable name in the WLS dataset in parentheses, adding a ∗ if the covariate was constructed (i.e., modified) from the listed variable. These covariates were gathered over two waves delineated below:
– Collected in 1957:
-
•
Coarsened population of town in which graduate attended high school. (rlur57∗)
-
•
Year of birth. (z_brdxdy)
-
•
Sex of respondent. (z_sexrsp)
– Collected in 1975:
-
•
Father’s years of education in 1957. (bmfaedu)
-
•
Father’s Duncan SEI score for 1957 job. (bmfoc3u)
-
•
Mother’s years of education in 1957. (bmmaedu)
-
•
Natural log of parental income in 1957. (bmpin1∗)
-
•
Coarsened nationality of father. (natfth∗)
-
•
Coarsened religious preference of family in 1957. (relfml∗)
-
•
Total number of siblings. (sibstt)
Appendix B Covariate Balance


Appendix C Description of Matched Sets
Outcome | NHS (Subpop. 1) | HS (Subpop. 2) |
Number of Sets | Number of Sets | |
Personal Health Rating | 432 | 222 |
Depression Score | 421 | 216 |
Times Meeting Family & Friends | 429 | 215 |
Alcohol Affect Work or Home | 161 | 86 |
Income | 431 | 223 |
-
•
Note. NHS = graduates whose father did not graduate high school.
HS = graduates whose father completed high school education.