Skip Navigation

This Article
Right arrow Extract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (19)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Cummings, P.
Right arrow Articles by Greenland, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Cummings, P.
Right arrow Articles by Greenland, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Epidemiol Rev 2003;25:43-50
Copyright © 2003 by the Johns Hopkins Bloomberg School of Public Health


METHODOLOGICAL ISSUES

Matched Cohort Methods for Injury Research

Peter Cummings1, Barbara McKnight2 and Sander Greenland3

1 Harborview Injury Prevention and Research Center and the Department of Epidemiology, School of Public Health and Community Medicine, University of Washington, Seattle, WA.
2 Department of Biostatistics, School of Public Health and Community Medicine, University of Washington, Seattle, WA.
3 Departments of Epidemiology and Statistics, University of California, Los Angeles, CA.

Received for publication July 1, 2002; accepted for publication January 30, 2003.


    INTRODUCTION
 TOP
 INTRODUCTION
 USEFUL FEATURES OF MATCHED...
 ANALYTICAL METHODS
 A POTENTIAL LIMITATION
 A POSSIBLE STRENGTH
 CONCLUSIONS
 ACKNOWLEDGMENTS
 REFERENCES
 
This article reviews the design and analysis of matched cohort studies of injuries where exposed study subjects are matched to others not exposed. We focus on the situation in which data are available for the matched groups with at least one member who had the study outcome, but data are absent or incomplete for matched groups that have no members with the outcome.

When matching is done in a case-control study, those with the outcome are matched to those without the outcome on certain confounder measures; this distorts the exposure status of the controls to be like that of the cases in regard to the matching variables (and perhaps other variables as well) (1). As a consequence, the selected controls may not represent the exposure experience of the entire population from which the cases were derived. Therefore, matching is a source of selection bias in a case-control study. The bias it produces can be removed in the analysis by accounting for the matching since, conditional on the values of the matching variables, controls will be representative of the source population.

The consequence of one-to-one matching in a cohort study is different. A variable can be a confounder only if, in the study cohort, it is associated with but not affected by the exposure and is independently predictive of the outcome. If each exposed study subject is perfectly matched to an unexposed subject on the value of some variable, and if there is no subject loss or missing data, there will be no association of exposure with the matching variable in the data, and confounding by the matching variable will be eliminated (2). Confounding by the matching variable could still occur, however, if an imbalance arose between the exposed and unexposed study subjects; this might happen, for example, if follow-up were less complete for one group compared with the other, or if some records were omitted from the analysis because of missing data.

Despite the potential of matching to prevent confounding in a cohort study and the potential of matching to sometimes increase study efficiency (2), it appears that this design is rarely used. For many cohort studies, matching exposed persons to one or several unexposed persons would be laborious. Furthermore, it might be wasteful in that matches might be unavailable for some potential cohort members. Today, cohort studies usually avoid matching, and the data are analyzed using regression methods that make it relatively easy to adjust for potential confounding factors that might otherwise be used for matching.


    USEFUL FEATURES OF MATCHED COHORT STUDIES
 TOP
 INTRODUCTION
 USEFUL FEATURES OF MATCHED...
 ANALYTICAL METHODS
 A POTENTIAL LIMITATION
 A POSSIBLE STRENGTH
 CONCLUSIONS
 ACKNOWLEDGMENTS
 REFERENCES
 
Matched methods (in cohort studies or clinical trials) are attractive if the matching can be done within a person. To study whether a padded hip protector can decrease the risk of a hip fracture, we could randomize the two hips of each study subject; one hip would be allocated to wear a hip protector and the other hip would serve as a control. If a study subject fractured one hip, the other hip would be considered to be censored at that time. This design would avoid imbalance due to dropouts or missing data. It would control well for both the propensity of a person to fall and the likelihood of fracture if a fall occurred. This design would be more efficient than the cluster randomized designs used in some clinical trials of hip protectors (3, 4). It might also be useful for studies that attempt to evaluate the relative benefit offered by two types of hip protector or by two types of wrist protector for in-line skaters. The design assumes, however, that the devices being studied do not influence the side to which a study subject might fall. Factors that are constant within a person, such as age, sex, or whether the person is right or left handed, could be examined as modifiers of any effect of the studied device.

A second feature of matched cohort studies makes this design especially attractive for the study of certain injuries: Risk ratios, relative rates, or hazard ratios may be estimated using information from only those matched sets with at least one study subject who had the outcome. For example, Walker et al. (57) studied the association between vasectomy and subsequent myocardial infarction. Using computerized data from a health maintenance organization, they matched men who had a vasectomy with other men on year of birth and length of follow-up in the health plan. Of 4,830 matched pairs, 36 pairs had one man with a myocardial infarction. The authors analyzed these data using a Cox proportional hazards model stratified on the pairs (8). Because this technique uses information from only the matched pairs with at least one outcome event, chart review for potential confounding variables was limited to the records of 36 men who had a heart attack and their 36 matched cohort members; chart review for the other 9,588 men was not necessary.

Matched cohort methods have been applied to studies of traffic crashes because data on all vehicles with a death are sometimes available. In the United States, the National Highway Traffic Safety Administration maintains the Fatality Analysis Reporting System; these data, collected since 1975, contain information about all crashes on public roads in which someone died within 30 days (9, 10). Because over 99 percent of all crash-related deaths of pedestrians or vehicle occupants occur within 30 days of the crash and (for legal reasons) vital status is diligently ascertained, follow-up regarding death is nearly complete. The available data contain over 100 variables with information about the crash and all involved vehicles and individuals in the crash event. There were 41,611 deaths and 56,668 vehicles in the 1999 data, compared with the estimated 11,232,000 vehicles involved in all crashes in that year in the United States (9).

Occupants of the same car or riders on the same motorcycle can be thought of as matched in regard to crash- and vehicle-related variables, such as place, time, type of crash (e.g., hit by another vehicle, struck a tree), nearest hospital, ambulance response time, vehicle speed, vehicle model, and so on. Some variables, such as speed, may be difficult to measure or estimate; speed is missing in about half of crash records for a vehicle with a death, during years 1975 through 1998. Yet these variables are still matched for occupants of a vehicle. Other variables, such as vehicle make and model, may have so many values that categorizing them and controlling for them in an analysis may be difficult. By matching on vehicle, we can estimate the risk ratio associated with an exposure that does vary within a vehicle, such as seat belt use or occupant age, and the potential confounding influence of vehicle and crash characteristics that do not vary within a vehicle can be eliminated, even if some of these characteristics cannot be measured.


    ANALYTICAL METHODS
 TOP
 INTRODUCTION
 USEFUL FEATURES OF MATCHED...
 ANALYTICAL METHODS
 A POTENTIAL LIMITATION
 A POSSIBLE STRENGTH
 CONCLUSIONS
 ACKNOWLEDGMENTS
 REFERENCES
 
The crude risk ratio

Imagine a hypothetical matched cohort study of 10,000 cars that crashed. Let us assume that each car had two persons in the front seat, and we are interested in estimating the risk ratio for death in the driver seat position compared with that in the passenger seat. Further, assume that these were severe crashes and that the average risk (incidence proportion) of death for passengers was 0.10, while that for drivers was 0.08; this would result, on average, in the data in table 1, which produce the average risk for drivers as (A + B)/(A + B + C + D) = 800/10,000 = 0.08, the average risk for passengers as (A + C)/(A + B + C + D) = 1,000/10,000 = 0.10, and the risk ratio of death for drivers compared with passengers as ((A + B)/(A + B + C + D))/((A + C)/(A + B + C + D)) = (A + B)/(A + C) = 800/1,000 = 0.80. This estimate is just the count of dead drivers divided by the count of dead passengers; nothing special is needed to account for the matching. Information from cells A, B, and C in table 1 enters into this calculation; these are the vehicles, or matched pairs, in which one or both occupants died. However, information from cell D, in which both persons lived, is unnecessary, because the count of exposed persons (drivers) and the count of unexposed persons (passengers) are equal by virtue of the matched design. With information from only vehicles with a death, 1,720 vehicles in this example, we can estimate the crude risk ratio that would have been obtained if we had information on all 10,000 vehicles. Although we call this the crude risk ratio, the matched design ensures that it is not confounded by vehicle or crash characteristics. Whether or not we use all 10,000 vehicles, our information about the risk ratio comes from the accidents where at least one occupant died.


View this table:
[in this window]
[in a new window]
 
TABLE 1. Hypothetical data for a study of fatality in vehicles according to whether the front seat position was that of the driver or passenger*
 
If all crashes had identical characteristics, the crude risk ratio estimate would apply to the individual crashes. In reality, important characteristics such as severity will vary. If the risk ratio varies with severity or other characteristics related to risk, the crude risk ratio (being only a ratio of average risks) may not apply to any individual crash (11), and the data may not provide enough information to detect this variation. This drawback would arise even if we had data regarding all crashes, not just those in which one or more occupants died.

Advantages. The crude risk ratio is simple to calculate and controls for confounding by the matching variables, provided there is no loss to follow-up or missing data.

Disadvantages. The crude risk ratio fails to control for confounding by variables not used in matching, and it averages over crash characteristics, which may be misleading if the risk ratio varies with those characteristics.

Mantel-Haenszel method

Mantel-Haenszel methods for stratified summaries of odds ratios have been extended to summarize risk ratios using matched-pair cohort data (1, 1216). The estimator of the risk ratio from the matched pairs reduces to the crude risk ratio described above. Using the notation from table 1, this estimator is (A + B)/(A + C). The natural log of this estimator has a variance estimator of (B + C)/((A + B)(A + C)) (1, 17). This estimator requires no information from the pairs in which no one had the outcome, and in simulated data it provides the desired confidence interval coverage (18). This method has been applied, chiefly for pedagogic purposes, to traffic crash data (1, 18, 19).

Advantages. The log risk ratio and its variance are simple to calculate, as only three counts are needed. We are not aware of commercial software that implements this method, but a program that executes the method in Stata statistical software is available at the following website: http://depts.washington.edu/hiprc/prevmat.htm (20).

Disadvantages. This method has only limited ability to control for potential confounders or to examine risk-ratio variation. If we wished to estimate the risk ratio of death for occupants wearing a seat belt compared with those not wearing a seat belt, we might be concerned that occupant sex could confound this association. We could limit our comparison to those pairs composed of only men and only women, but this would waste potentially informative data by discarding pairs consisting of one male and one female. Furthermore, front seat occupants necessarily differ with regard to seat position. If the risk of death associated with being in the driver seat were different from that in the passenger seat, this might confound the association between seat belts and death; using the Mantel-Haenszel method, it is impossible to control for sources of confounding, such as seat position, that are always discordant within a matched pair. Finally, this method cannot examine the possibility that risk ratios differ by seat position.

In our experience, there has been so little confounding by seat position, age, and sex that Mantel-Haenszel estimates of risk ratios associated with seat belt use and motorcycle helmet use are fairly close to fully adjusted estimates (18, 19). However, because analytical methods that can account for confounding and effect modification due to characteristics that vary within a matched pair are available, we see little reason to use the Mantel-Haenszel method.

Double-pair method

In 1986, Evans (21) published a statistical procedure that he called the double-pair method. The method can be thought of as an adaptation of the Mantel-Haenszel method. To estimate seat belt effects, for example, we first select those vehicles in which both driver and passenger were not belted and one or both died (table 2); we use these records to compute the Mantel-Haenszel estimate of the risk ratio of death associated with seat position only. Next, we select vehicles with a belted driver and an unbelted passenger, at least one of whom died; we use these records to compute the Mantel-Haenszel estimate of the risk ratio of death associated with both belt use and seat position. To estimate the risk ratio associated with seat belts only, we divide the second of these risk ratios by the first. It is this division of one risk ratio by another that distinguishes this method from the Mantel-Haenszel method.


View this table:
[in this window]
[in a new window]
 
TABLE 2. Data for a study of driver fatality in vehicles according to whether the driver was wearing a seat belt*
 
To use all the data, belted and unbelted drivers are also compared with belted passengers; the risk ratio estimate for belts only from this comparison is then summarized with the estimate from the comparison with unbelted passengers, using weights based upon the inverse of the variance for each estimate. Similar calculations are then made to estimate seat belt effects for passengers.

Evans proposed a variance estimator for the log of the ratio of two risk ratios. His formula and that in another recent paper (22) assume that the marginal counts (A + B and A + C in table 2) are independent, which cannot be correct because these counts share the pairs with two deaths (A in table 2). We suggest two alternatives: one derived from the variance estimator for the Mantel-Haenszel method (1, 17) and one derived from an estimator based on the delta method (18). For the latter, one uses an asymptotic variance for the log of the ratio of two risk ratios,

log[(A + B)/(A + C)]/[(E + F)/(E + G)],

given by

{[(A x (A + B + C) + (B x C)) x (F + G)] + [(E x (E + F + G) + (F x G)) x (B + C)]}/[(A + B) x (A + C) x (E + F) x (E + G)].

For the data in table 2, the risk ratio estimate associated with seat belt use was 0.36 (95 percent confidence interval: 0.34, 0.39). In simulated data, these two suggested estimators produced virtually identical results, and both had desirable coverage properties (18).

The double-pair method has been applied several times to traffic crash data (18, 19, 2338).

Advantages. Risk ratios and variances are simple to calculate with this method, and separate estimates for driver and passenger seat positions are produced. A program that implements the method in Stata statistical software is available at http://depts.washington.edu/hiprc (20). However, a variance estimator for the log of the average risk ratio for all front seat occupants (drivers and passengers) has not been published; we have used bootstrap methods (18).

Disadvantages. The double-pair method has limited ability to control for potential confounding by occupant variables such as age or sex. The method cannot produce estimates unless there are some pairs concordant on the exposure; that is, both are belted or not belted. Furthermore, bias may arise if a variable related to the vehicle (such as make or weight) or the crash (such as speed or rollover) modifies the association of a within-vehicle varying confounder, such as seat position, with the risk of death (18). Despite these problems, estimates of seat belt effects and motorcycle helmet effects from this method are fairly close to more fully adjusted estimates (18, 19, 39). Given the limitations of the double-pair method, we prefer the methods described below.

Estimating equations

Davis (40) and Liang (41) derived regression generalizations of the Mantel-Haenszel matched pair odds ratio based on the theory of estimating equations. Greenland (39) extended these methods to modeling the paired Mantel-Haenszel risk ratio and illustrated the approach in an analysis of the association of motorcycle helmet use with death. This approach has been extended to accommodate more than two persons in each matched set; one study estimated the risk ratio of death for persons occupying a truck bed compared with those in the truck cab (42).

Advantages. This method can accommodate both confounding and effect modification (interaction terms). It can be programmed in software that can iteratively solve an equation. A program written for GAUSS software (43) is available at http://darwin.epib.cwru.edu/~witte/gauss.

Disadvantages. Because the method is not implemented by specific commands in a commercial software package, it may not be convenient for some users. The results appear to be nearly identical to those from other risk ratio methods that may be more convenient and are described next.

Conditional Poisson regression

Poisson regression can be used to estimate risk ratios (4446). If we condition additionally on the number of deaths in each matched set, the Poisson likelihood becomes a product multinomial likelihood, and we can estimate risk ratios for occupants within the same vehicle; for a single binary covariate, this becomes a product binomial likelihood. In deference to the terminology used by others who have published this conditional likelihood expression, we call this conditional Poisson regression (4653). (This is not the same likelihood, however, as a "conditional Poisson" likelihood published in one recent text (54).)

Poisson regression is often used to estimate incidence rate ratios, rather than ratios of mean event counts, by incorporating data on the person-time at risk into the model as an offset; that is, the log of person-time is entered with its coefficient fixed at 1 (55). In the case of pairs in traffic crashes, the potential follow-up time is identical within pairs (30 days), so risk ratios over the 30 days after a crash can be estimated by setting person-time to 1 month for all subjects, or by ignoring person-time altogether. This regression method has been applied to case-crossover studies of possible adverse outcomes after vaccination (56, 57) and a study of racial discrimination in housing rentals (58), and it has been used in a few studies of traffic crashes (19, 5961).

Advantages. As with most regression methods, it is easy to accommodate confounding and effect modification in a flexible manner. The method can be extended to include more than two persons in each matched set. This method has been implemented in commercial software, for example, in the Stata procedure, xtpois (20).

Disadvantages. In a simulation study, the confidence intervals provided by this method were too wide; 95 percent confidence intervals covered 98 percent of the simulated results (18). Because the number of pairs that are typically examined in this kind of analysis is in the thousands, this may not be a serious drawback; confidence intervals tend to be quite narrow.

Cox proportional hazards regression

Proportional hazards models can be stratified on vehicle, generating estimates based on within-vehicle comparisons; if the time to death or censoring is set to some arbitrary constant value and if the Breslow or Efron methods are used to account for tied survival times (8, 62, 63), the results will be the same as those from conditional Poisson regression, as the likelihoods for these methods are identical when the data come only from matched pairs. This method has been applied for the purposes of illustration in one paper about traffic crashes (18). (If the exact marginal (64) or exact partial (65) likelihoods are used to account for tied survival times, the results will be the odds ratios; the likelihoods for these methods are the same as those for conditional logistic regression, which we discuss later.)

One could use these models to estimate hazard ratios, rather than risk ratios, by incorporating data on the actual survival time; time to death is included in US data for fatal crashes, although it is often missing. For a chronic disease outcome, such as time to death after a diagnosis of cancer, it makes sense to use actual survival time, which is often many years. Death is inevitable, but we are interested in postponing death. However, if death occurs after an injury, it is often nearly instantaneous and any delay in death is usually measured in a few days. Postponing death a few days may not be desirable if the additional survival time is spent in an intensive care unit on a ventilator. Therefore, ignoring time to death and estimating the risk ratio for death within a 30-day interval may be a good choice for studies of traffic crashes.

In some studies, however, we might wish to account for time to an injury event. Imagine that we wished to study how the hazard of a fall varied with shoe characteristics. Each study subject who wore shoes with a leather sole might be paired with someone who wore shoes with a rubber sole. Pairs could be matched on sex, year of age, home floor surface (carpeted or bare), and number of falls in the past 6 months. Cohort members would be followed until they fell or until they were censored by the end of the study or by dropping out of the study; a hazard ratio could be estimated using time to fall or censoring, stratified on the pairs.

Advantages. This method is available in many statistical software packages and is preferable when time to an event, rather than just whether an event occurred, is of interest.

Disadvantages. As with the conditional Poisson method, which provides the same estimates for matched-pair data, the risk ratio confidence intervals from this method appear to be too wide in simulations (18).

Conditional logistic regression

Conditional logistic regression is often applied to case-control studies in which cases and controls are matched on variables that may be confounders (1, 66, 67). This method has been used to analyze data for a few studies of traffic crashes in which persons were either matched within the same vehicle or were in different vehicles that were in the same crash (6871). The odds ratio from conditional logistic regression will closely approximate the risk ratio if, in the study population from which the matched pairs arose, the proportion of persons who experienced the outcome was small (1, 72, 73) and, in addition, the baseline risk varies little across subgroups (11). Death is indeed rare in crashes: In 1999, there were an estimated 11,232,000 drivers in US crashes and 25,210 (0.2 percent) drivers died (9). However, crashes are not homogenous in regard to the risk of death; many deaths occur in severe crashes where the risk of death is great. As a consequence, it is common that two persons die in the same vehicle and, in this situation, the odds ratio will be further from 1.0 than the risk ratio (11, 1719, 39).

Conditional logistic regression uses only the "discordant-pair" information in cells B and C of table 1, in contrast to the other methods we have reviewed, which also use the information in cell A, the pairs with two dead occupants. Because the proportion of all pairs in cells A, B, and C that fall within cell A typically ranges from 15 to 20 percent in actual crash studies (19, 61, 71), the odds ratio may poorly approximate the risk ratio and hence more poorly estimate the effects of factors on caseload (39).

Suppose that we wished to study an exposure that actually reduced mortality by an average of 50 percent in all crashes. One million vehicles crashed, and in each vehicle one occupant was exposed and the other was not. The average mortality was 1 percent among the unexposed and 0.05 percent among the exposed. Only the vehicles with at least one dead person are needed to produce odds ratios or risk ratios; table 3 illustrates that, as the proportion of pairs that are both dead increases, the odds ratio diverges from the risk ratio. In a matched cohort study of crash data, the risk ratio of death for those who wore a seat belt compared with those unbelted was estimated as 0.39 (95 percent confidence interval: 0.37, 0.41) using conditional Poisson regression; the odds ratio of death was 0.28 using conditional logistic regression (18, 61).


View this table:
[in this window]
[in a new window]
 
TABLE 3. Hypothetical data for a matched-pair analysis of an exposure that reduces the risk of death by 50%*
 
Advantages. There are none compared with other regression methods.

Disadvantages. Conditional logistic regression estimates the odds ratio, which is often further from 1.0 than the risk ratio and which more poorly represents the impact on caseload (39). The pairs in which both persons died are ignored.


    A POTENTIAL LIMITATION
 TOP
 INTRODUCTION
 USEFUL FEATURES OF MATCHED...
 ANALYTICAL METHODS
 A POTENTIAL LIMITATION
 A POSSIBLE STRENGTH
 CONCLUSIONS
 ACKNOWLEDGMENTS
 REFERENCES
 
The matched cohort methods that we have described can produce estimates of risk ratios within all occupant pairs that crash. However, these estimates may not apply to the average solo driver who crashes, if those drivers differ from paired drivers with respect to characteristics that modify the risk ratio associated with exposure. For example, one study estimated that the risk ratio for using seat belts in rollover crashes was 0.23, while the estimate was 0.47 in other crashes (18). This difference may be attributed to the ability of a belt to prevent ejection in a crash, a common feature of fatal rollover injuries. If rollovers were more common for solo drivers, compared with drivers who crashed with a passenger, the average risk ratio associated with using a seat belt for a solo driver would be somewhat greater than that for drivers in a matched-pair study. This difficulty can be addressed if the matched-pair analysis produces separate risk ratio estimates for those features of solo drivers or their crashes that differ substantially from those of matched drivers.


    A POSSIBLE STRENGTH
 TOP
 INTRODUCTION
 USEFUL FEATURES OF MATCHED...
 ANALYTICAL METHODS
 A POTENTIAL LIMITATION
 A POSSIBLE STRENGTH
 CONCLUSIONS
 ACKNOWLEDGMENTS
 REFERENCES
 
Misclassified data can distort any measure of association. Limiting a crash analysis to matched pairs with a death may use data that are less subject to misclassification than data for all crashes. Fatal crashes are investigated with more effort than "fender-benders." Many states have specially trained police officers who collect information regarding a crash that involves a death. Data in the Fatality Analysis Reporting System are considerably more complete than data in the Generalized Estimates System, a systematic sample of police reports for all crashes in the United States (74, 75).

There is evidence that police data about seat belt use on routine crash forms may often be inaccurate (76). Some researchers have expressed concern that, if some crash survivors told police that they were belted when they were not, any benefit of seat belts would be exaggerated (37). Some survivors might be motivated to claim falsely that they were belted in order to avoid a fine for not using their seat belt. However, a recent matched-pair analysis of crashes in which someone died reported excellent agreement between police reports of belt use in fatal crashes and ascertainment of belt use by trained investigators for the National Accident Sampling System Crashworthiness Data System (60, 77). The trained investigators made their determination of seat belt use from actual belt inspection for over 90 percent of the occupants; risk ratio estimates for belt use were essentially the same when data based on inspection were used and when data based upon police reports were used.


    CONCLUSIONS
 TOP
 INTRODUCTION
 USEFUL FEATURES OF MATCHED...
 ANALYTICAL METHODS
 A POTENTIAL LIMITATION
 A POSSIBLE STRENGTH
 CONCLUSIONS
 ACKNOWLEDGMENTS
 REFERENCES
 
Conceptualizing data regarding fatal crashes as a matched cohort study offers several advantages. First, risk ratio estimates for all crashes can be produced by using data from only those occupant groups in which one or more died. These data have been available in the United States since 1975. The expense of collecting these data is far less than the expense of collecting data for all crashes.

Second, creating estimates from occupants of the same vehicle provides protection against confounding by vehicle- and crash-related factors. Even if data were available for all crashes, the difficulties of controlling for differences in crash forces between vehicles are considerable; a within-vehicle analysis may be preferable.

Third, an analysis of vehicles with a death may use data that are more complete and more accurate than routine data for all crashes, thereby improving the accuracy of effect estimates.


    ACKNOWLEDGMENTS
 TOP
 INTRODUCTION
 USEFUL FEATURES OF MATCHED...
 ANALYTICAL METHODS
 A POTENTIAL LIMITATION
 A POSSIBLE STRENGTH
 CONCLUSIONS
 ACKNOWLEDGMENTS
 REFERENCES
 
This work was supported by grants R49/CCR002570 and R49/CCR019477-01 from the Centers for Disease Control and Prevention, Atlanta, Georgia.

The authors thank students at the University of Washington for the idea of studying hip protectors by randomizing the two hips of a single person.


    FOOTNOTES
 
Correspondence to Dr. Peter Cummings, 1524 Bear Creek Drive, Bishop, CA 93514 (email: peterc{at}u.washington.edu).


    REFERENCES
 TOP
 INTRODUCTION
 USEFUL FEATURES OF MATCHED...
 ANALYTICAL METHODS
 A POTENTIAL LIMITATION
 A POSSIBLE STRENGTH
 CONCLUSIONS
 ACKNOWLEDGMENTS
 REFERENCES
 

  1. Rothman KJ, Greenland S. Modern epidemiology. 2nd ed. Philadelphia, PA: Lippincott-Raven Publishers, 1998:147–61, 283–5.
  2. Greenland S, Morgenstern H. Matching and efficiency in cohort studies. Am J Epidemiol 1990;131:151–9.[Abstract/Free Full Text]
  3. Kannus P, Parkkari J, Niemi S, et al. Prevention of hip fracture in elderly people with use of a hip protector. N Engl J Med 2000;343:1506–13.[Abstract/Free Full Text]
  4. Parker MJ, Gillespie LD, Gillespie WJ. Hip protectors for preventing hip fractures in the elderly. Oxford, England: The Cochrane Database of Systematic Reviews, 2003. (http://www.cochranelibrary.com/CLIB/CLIBINET.EXE?A=10&S=0&Q=1045078652&U=&B=0&E=0&R=0&F=&H=&D=0&L=0&N=0&M=0&C=0&T=hip+protector).
  5. Walker AM, Jick H, Hunter JR, et al. Vasectomy and non-fatal myocardial infarction. Lancet 1981;1:13–15.[CrossRef][ISI][Medline]
  6. Walker AM. Efficient assessment of confounder effects in matched follow-up studies. Appl Stat 1982;31:293–7.[CrossRef]
  7. Walker AM, Jick H, Hunter JR, et al. Vasectomy and nonfatal myocardial infarction: continued observation indicates no elevation of risk. J Urol 1983;130:936–7.[ISI][Medline]
  8. Hosmer DW Jr, Lemeshow S. Applied survival analysis: regression modeling of time to event data. New York, NY: John Wiley & Sons, Inc, 1999:106–8.
  9. Traffic safety facts 1999: a compilation of motor vehicle crash data from the Fatality Analysis Reporting System and the General Estimates System. Washington, DC: National Highway Traffic Safety Administration, 2000.
  10. Tessmer JM. FARS analytic reference guide: 1975 to 1996. Washington, DC: National Highway Traffic Safety Administration, 1997.
  11. Greenland S. Interpretation and choice of effect measures in epidemiologic analyses. Am J Epidemiol 1987;125:761–8.[Free Full Text]
  12. Mantel N, Haenszel W. Statistical aspects of the analysis of data from retrospective studies. J Natl Cancer Inst 1959;22:719–48.[ISI][Medline]
  13. Nurminen N. Asymptotic efficiency of general noniterative estimation of common relative risk. Biometrika 1981;68:525–30.[Abstract/Free Full Text]
  14. Tarone RE. On summary estimators of relative risk. J Chronic Dis 1981;34:463–8.[CrossRef][ISI][Medline]
  15. Kleinbaum DG, Kupper LL, Morgenstern H. Epidemiologic research: principles and quantitative methods. New York, NY: Van Nostrand Reinhold Company, 1982:345.
  16. Rothman KJ. Modern epidemiology. Boston, MA: Little, Brown and Company, 1986:276.
  17. Greenland S, Robins JM. Estimation of a common effect parameter from sparse follow-up data. Biometrics 1985;41:55–68.[CrossRef][ISI][Medline]
  18. Cummings P, McKnight B, Weiss NS. Matched-pair cohort methods in traffic crash research. Accid Anal Prev 2003;35:131–41.[CrossRef][ISI][Medline]
  19. Norvell DC, Cummings P. Association of helmet use with death in motorcycle crashes: a matched-pair cohort study. Am J Epidemiol 2002;156:483–7.[Abstract/Free Full Text]
  20. Stata statistical software: release 7.0. College Station, TX: Stata Corporation, 2000.
  21. Evans L. Double pair comparison—a new method to determine how occupant characteristics affect fatality risk in traffic crashes. Accid Anal Prev 1986;18:217–27.[CrossRef][ISI][Medline]
  22. Hertz ES. A note on the variance of paired comparisons estimates. Accid Anal Prev 2002;34:401–4.[CrossRef][ISI][Medline]
  23. Evans L. The effectiveness of safety belts in preventing fatalities. Accid Anal Prev 1986;18:229–41.[CrossRef][ISI][Medline]
  24. Kahane CJ. An evaluation of child passenger safety: the effectiveness and benefits of safety belts. Washington, DC: National Highway Traffic Safety Administration, 1986. (DOT HS 806-890).
  25. Sikora J. Relative risk of death for ejected occupants in fatal traffic accidents. Washington, DC: National Highway Traffic Safety Administration, 1986. (DOT HS 807-096).
  26. Evans L. Fatality risk reduction from safety belt use. J Trauma 1987;27:746–9.[ISI][Medline]
  27. Evans L, Frick MC. Helmet effectiveness in preventing motorcycle driver and passenger fatalities. Accid Anal Prev 1988;20:447–58.[CrossRef][ISI][Medline]
  28. Evans L. Risk of fatality from physical trauma versus sex and age. J Trauma 1988;28:368–78.[ISI][Medline]
  29. Evans L. Rear seat restraint system effectiveness in preventing fatalities. Accid Anal Prev 1988;20:129–36.[CrossRef][ISI][Medline]
  30. Evans L, Frick MC. Seating position in cars and fatality risk. Am J Public Health 1988;78:1456–8.[Abstract/Free Full Text]
  31. Partyka S. Lives saved by child restraints from 1982 through 1987. Washington, DC: National Highway Traffic Safety Administration, 1988. (DOT HS 807-371).
  32. Esterlitz JR. Relative risk of death from ejection by crash type and crash mode. Accid Anal Prev 1989;21:459–68.[CrossRef][ISI][Medline]
  33. Evans L, Frick MC. Relative fatality risk in different seating positions versus car model year. Accid Anal Prev 1989;21:581–7.[CrossRef][ISI][Medline]
  34. Evans L. Restraint effectiveness, occupant ejection from cars, and fatality reductions. Accid Anal Prev 1990;22:167–75.[CrossRef][ISI][Medline]
  35. Kahane CJ. Fatality reduction by air bags: analysis of accident data through early 1996. Washington, DC: National Highway Traffic Safety Administration, 1996. (DOT HS 808-470).
  36. Graham JD, Goldie SJ, Segui-Gomez M, et al. Reducing risks to children in vehicles with passenger air bags. Pediatrics 1998;102:e3.[Abstract/Free Full Text]
  37. Kahane CJ. Fatality reduction by safety belts for front-seat occupants of cars and light trucks: updated and expanded estimates based on 1986–99 FARS data. Washington, DC: National Highway Traffic Safety Administration, 2000. (DOT HS 809-199).
  38. Evans L. Female compared with male fatality risk from similar physical impacts. J Trauma 2001;50:281–8.[ISI][Medline]
  39. Greenland S. Modelling risk ratios from matched cohort data: an estimating equation approach. Appl Stat 1994;43:223–32.[CrossRef]
  40. Davis LJ. Generalization of the Mantel-Haenszel estimator to nonconstant odds ratios. Biometrics 1985;41:487–95.[CrossRef]
  41. Liang KY. Extended Mantel-Haenszel estimating procedure for multivariate logistic regression models. Biometrics 1987;43:289–99.[CrossRef][ISI][Medline]
  42. Anderson CL, Agran PF, Winn DG, et al. Fatalities to occupants of cargo areas of pickup trucks. Accid Anal Prev 2000;32:533–40.[CrossRef][ISI][Medline]
  43. GAUSS. Maple Valley, WA: Aptech Systems, Inc, 2002.
  44. Gourieroux C, Monfort A, Tognon C. Pseudo-maximum likelihood methods: theory. Econometrica 1984;52:681–700.[CrossRef]
  45. Lloyd CJ. Statistical analysis of categorical data. New York, NY: John Wiley & Sons, Inc, 1999:84–7.
  46. Wooldridge JM. Econometric analysis of cross section and panel data. Cambridge, MA: The MIT Press, 2002:646–9, 674–6.
  47. Hausman JA, Hall BH, Griliches Z. Econometric models for count data with an application to the patents-R and D relationship. Econometrica 1984;52:909–38.[CrossRef][ISI]
  48. Farrington CP. Relative incidence estimation from case series for vaccine safety evaluation. Biometrics 1995;51:228–35.[CrossRef][ISI][Medline]
  49. Cameron AC, Trivedi PK. Regression analysis of count data. New York, NY: Cambridge University Press, 1998:282–3.
  50. Stata reference manual: release 7. College Station, TX: Stata Press, 2001:404–15.
  51. Hardin J, Hilbe J. Generalized linear models and extensions. College Station, TX: Stata Press, 2001:196–7.
  52. Diggle PJ, Heagerty P, Liang K-Y, et al. Analysis of longitudinal data. New York, NY: Oxford University Press, 2002:184–6.
  53. Hardin JW, Hilbe JM. Generalized estimating equations. Boca Raton, FL: Chapman & Hall/CRC, 2003:36–9.
  54. Lachin JM. Biostatistical methods: the assessment of relative risks. New York, NY: John Wiley & Sons, Inc, 2000:186–7, 344–5.
  55. Breslow NE, Day NE. Statistical methods in cancer research. Vol II. The design and analysis of cohort studies. Lyon, France: International Agency for Research on Cancer, 1987:178–229. (IARC scientific publication no. 82).
  56. Farrington CP, Nash J, Miller E. Case series analysis of adverse reactions to vaccines: a comparative evaluation. Am J Epidemiol 1996;143:1165–73.[Abstract/Free Full Text]
  57. Kramarz P, DeStefano F, Gargiullo PM, et al. Does influenza vaccination exacerbate asthma? Analysis of a large cohort of children with asthma. Vaccine Safety Datalink Team. Arch Fam Med 2000;9:617–23.[Abstract/Free Full Text]
  58. Page M. Racial and ethnic discrimination in urban housing markets: evidence from a recent audit survey. J Urban Economics 1995;38:183–206.[CrossRef]
  59. Cummings P, McKnight B, Rivara FP, et al. Association of driver air bags with driver fatality: a matched cohort study. BMJ 2002;324:1119–22.[Abstract/Free Full Text]
  60. Cummings P. Association of seat belt use with death: a comparison of estimates based on data from police and estimates based on data from trained crash investigators. Inj Prev 2002;8:338–41.[Abstract/Free Full Text]
  61. Cummings P, Wells JD, Rivara FP. Estimating seat belt effectiveness using matched-pair cohort methods. Accid Anal Prev 2003;35:143–9.[CrossRef][ISI][Medline]
  62. Kalbfleisch JD, Prentice RL. The statistical analysis of failure time data. New York, NY: John Wiley & Sons, Inc, 1980:76–8.
  63. Kleinbaum DG. Survival analysis: a self-learning text. New York, NY: Springer-Verlag, 1996.
  64. Kalbfleisch JD, Prentice RL. Marginal likelihoods based on Cox’s regression and life model. Biometrika 1973;60:267–78.[Abstract/Free Full Text]
  65. Cox DR. Partial likelihood. Biometrika 1975;62:269–76.[Abstract/Free Full Text]
  66. Breslow NE, Day NE. Statistical methods in cancer research. Vol I. The analysis of case-control studies. Lyon, France: International Agency for Research on Cancer, 1980:248–79. (IARC scientific publication no. 32).
  67. Hosmer DW, Lemeshow S. Applied logistic regression. New York, NY: John Wiley & Sons, Inc, 2000:223–43.
  68. Perneger T, Smith GS. The driver’s role in fatal two-car crashes: a paired "case-control" study. Am J Epidemiol 1991;134:1138–45.[Abstract/Free Full Text]
  69. Barry S, Ginpil S, O’Neill TJ. The effectiveness of air bags. Accid Anal Prev 1999;31:781–7.[CrossRef][ISI][Medline]
  70. Corneli HM, Cook LJ, Dean JM. Adults and children in severe motor vehicle crashes: a matched-pairs study. Ann Emerg Med 2000;36:340–5.[CrossRef][ISI][Medline]
  71. Crandall CS, Olson LM, Sklar DP. Mortality reduction with air bag and seat belt use in head-on passenger car collisions. Am J Epidemiol 2001;153:219–24.[Abstract/Free Full Text]
  72. MacMahon B, Trichopoulos D. Epidemiology: principles and methods. Boston, MA: Little, Brown, 1996:169.
  73. Zhang J, Yu KF. What’s the relative risk? A method of correcting the odds ratio in cohort studies of common outcomes. JAMA 1998;280:1690–1.[Abstract/Free Full Text]
  74. Cummings P, Koepsell TD, Rivara FP, et al. Air bags and passenger fatality according to passenger age and restraint use. Epidemiology 2002;13:525–32. (Erratum in Epidemiology 2002;13:749).[CrossRef]
  75. Shelton TST. National Accident Sampling System. General Estimates System technical note, 1988 to 1990. Washington, DC: National Highway Traffic Safety Administration, 1991. (DOT HS 807-796).
  76. Greenberg L. Police Accident Report (PAR) Quality Assessment Project. Washington, DC: National Highway Traffic Safety Administration, 1996. (DOT HS 808-487).
  77. National Automotive Sampling System. 1997 Crashworthiness Data System. Data collection, coding, and editing manual. Washington, DC: National Highway Traffic Safety Administration, 1997.

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Arch Pediatr Adolesc MedHome page
P. Cummings
Propensity Scores
Arch Pediatr Adolesc Med, August 1, 2008; 162(8): 734 - 737.
[Full Text] [PDF]


Home page
Am J EpidemiolHome page
E. R. Braver, J. A. Kufera, M. T. Alexander, M. Scerbo, K. Volpini, and J. P. Lloyd
Using Head-on Collisions to Compare Risk of Driver Death by Frontal Air Bag Generation: A Matched-Pair Cohort Study
Am. J. Epidemiol., March 1, 2008; 167(5): 546 - 552.
[Abstract] [Full Text] [PDF]


Home page
JAMAHome page
P. Cummings
Hip Protectors and Hip Fracture
JAMA, November 14, 2007; 298(18): 2139 - 2139.
[Full Text] [PDF]


Home page
Inj. Prev.Home page
M. Zhu, P. Cummings, H. Chu, and L. J Cook
Association of rear seat safety belt use with death in a traffic crash: a matched cohort study
Inj. Prev., June 1, 2007; 13(3): 183 - 185.
[Abstract] [Full Text] [PDF]


Home page
Clin TrialsHome page
G. Y. Zou
One relative risk versus two odds ratios: implications for meta-analyses involving paired and unpaired binary data
Clinical Trials, February 1, 2007; 4(1): 25 - 31.
[Abstract] [PDF]


Home page
JAMAHome page
P. Cummings
Effects of differences between peer reviewers suggested by authors and by editors.
JAMA, September 13, 2006; 296(10): 1231 - 1231.
[Full Text] [PDF]


Home page
JAMAHome page
S. Schroter, L. Tite, A. Hutchings, and N. Black
Effects of Differences Between Peer Reviewers Suggested by Authors and by Editors--Reply
JAMA, September 13, 2006; 296(10): 1231 - 1232.
[Full Text] [PDF]


Home page
Am J EpidemiolHome page
C. M. Olson, P. Cummings, and F. P. Rivara
Association of First- and Second-Generation Air Bags with Front Occupant Death in Car Crashes: A Matched Cohort Study
Am. J. Epidemiol., July 15, 2006; 164(2): 161 - 169.
[Abstract] [Full Text] [PDF]


Home page
Inj. Prev.Home page
K M Smith and P Cummings
Passenger seating position and the risk of passenger death in traffic crashes: a matched cohort study.
Inj. Prev., April 1, 2006; 12(2): 83 - 86.
[Abstract] [Full Text] [PDF]


Home page
Diabetes CareHome page
M. C. Gulliford, J. Charlton, and R. Latinovic
Increased Utilization of Primary Care 5 Years Before Diagnosis of Type 2 Diabetes: A matched cohort study
Diabetes Care, January 1, 2005; 28(1): 47 - 52.
[Abstract] [Full Text] [PDF]


Home page
Am J EpidemiolHome page
S. Greenland
Model-based Estimation of Relative Risks and Other Epidemiologic Measures in Studies of Common Outcomes and in Case-Control Studies
Am. J. Epidemiol., August 15, 2004; 160(4): 301 - 305.
[Abstract] [Full Text] [PDF]


Home page
Am J EpidemiolHome page
RE: "MATCHED COHORT METHODS FOR INJURY RESEARCH"
Am. J. Epidemiol., August 1, 2004; 160(3): 300 - 300.
[Full Text] [PDF]


Home page
JAMAHome page
P. Cummings and F. P. Rivara
Car Occupant Death According to the Restraint Use of Other Occupants: A Matched Cohort Study
JAMA, January 21, 2004; 291(3): 343 - 349.
[Abstract] [Full Text] [PDF]


Home page
Am J EpidemiolHome page
P. Cummings
RE: "ESTIMATING THE RELATIVE RISK IN COHORT STUDIES AND CLINICAL TRIALS OF COMMON OUTCOMES"
Am. J. Epidemiol., January 15, 2004; 159(2): 213 - 213.
[Full Text] [PDF]


Home page
Epidemiol RevHome page
S. B. Thacker and E. J. MacKenzie
Preface: The Ro