Roberta De Angelis, MS1, Andrea Tavilla, MS1, Arduino Verdecchia, ScD1, Steve Scoppa, BS2, Mark Hachey, MS2, Eric J. Feuer, PhD3, and Angela B. Mariotto, PhD3


1National Center of Epidemiology, Italian National Institute of Health, Rome, Italy; 2Information Management Services, Inc., Silver Spring, Maryland; 3Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, Maryland


Received: April 22, 2008; Revised: October 23, 2008; Accepted: October 27, 2008. Published online: February 26, 2009 ©C 2009 American Cancer Society. DOI: 10.1002/cncr.24217, www.interscience.wiley.com.


Geographic Variability and Time Trends, 2005-2015.


KEYWORDS: breast neoplasms, prevalence, registries, forecasting, regional health planning.


Corresponding author: Roberta De Angelis, MS, Instituto Superiore di Sanita, Centro Nazionale di Epidemiologia, Viale Regina Elena 299, Rome 00161, Italy; Fax: (011) 0039-06-49904285; E-mail: roberta.deangelis@iss.it.


Summary

BACKGROUND: Breast cancer continues to place a significant burden on the healthcare system. Regional prevalence measures are instrumental in the development of cancer control policies. Very few populationbased cancer registries are able to provided local, long-term incidence and follow-up information that permits the direct calculation of prevalence. Model-based prevalence estimates are an alternative when this information is lacking or incomplete. The current work represents a comprehensive collection of female breast cancer prevalence from 2005 to 2015 in the United States and the District of Columbia (DC).

METHODS: Breast cancer prevalence estimates were derived from state-specific cancer mortality and survival data using a statistical package called the Mortality-Incidence Analysis Model or MIAMOD. Cancer survival models were derived from the Surveillance, Epidemiology, and End Results Program data and were adjusted to represent state-specific survival. Comparisons with reported incidence for 39 states and DC had validated estimates.

RESULTS: By the year 2010, 2.9 million breast cancer survivors are predicted in the US, equaling 1.85% of the female population. Large variability in prevalent percentages was reported between states, ranging from 1.4% to 2.4% in 2010. Geographic variability was reduced when calculating age-standardized prevalence proportions or cancer survivors by disease duration, including 0 to 2 years and 2 to 5 years. The residual variability in age-adjusted prevalence was explained primarily by the statespecific, age-adjusted breast cancer incidence rates. State-specific breast cancer survivors are expected to increase from 16% to 51% in the decennium from 2005 to 2015 and by 31% at the national level.

CONCLUSIONS: To the authors' knowledge, the current study is the first to provide systematic estimations of breast cancer prevalence in all US states through 2015. The estimated levels and time trends were consistent with the available population-based data on breast cancer incidence, prevalence, and population aging. Cancer 2009;115:1954-66. © 2009 American Cancer Society.




Currently, there are an estimated 2,477,847 breast cancer survivors in the United States, and 65% of these patients have survived for ≥5 years since their initial diagnosis.[1] Breast cancer survivors encompass women who receive initial cancer treatments to women who receive post-treatment, routine follow-up. Increasing prevalence is a result of advances in breast cancer research, which continues to focus on developing effective breast cancer screening techniques, minimizing the toxic effects of treatment, and decreasing cancer recurrence. In addition, the aging babyboom generation and longer life expectancy experienced in the United States also will contribute to the increasing number of breast cancer survivors. Traditionally, US public health programs are developed and disseminated at the state or more local level. This research aids state authorities in making more informed decisions regarding public health programming and allocation of health resources using breast cancer prevalence estimations.


To calculate complete prevalence, a long history of cancer incidence and follow-up data are necessary. To date, only a few US states have collected this type of data, therefore a statistical method was applied to the consistent data available for all states. The Mortality-Incidence Analysis Model (MIAMOD) method uses state-specific mortality (breast cancer and all causes) data and modeled state-specific survival as inputs to derive incidence and complete prevalence estimates and projections. Estimates were validated by comparisons with reported incidence data for 39 states and the District of Columbia (DC) provided by cancer registration programs. For the remaining states that were not covered by cancer registration, MIAMOD incidence estimates were compared with estimates derived from a different methodology: an ecologic regression model of incidence on sociodemographic variables.[2] In this article, we investigated the variability in breast cancer prevalence by state that is correlated with different demographic structure and breast cancer incidence patterns at the state level.



Materials and methods


Mortality and Population Data

Single age and year state-specific female mortality data for breast cancer and all causes of death from the National Center of Health Statistics and respective populations from the US Census Bureau are available for calendar years from 1969 to 2005 from the Surveillance, Epidemiology, and End Results (SEER)*Stat databases.[3,4] The state population projections from 2006 to 2015 were obtained from the US Census Bureau.[5]


State-specific Breast Cancer Survival Model: The Surveillance, Epidemiology, and End Results Baseline Model

Data from 1975 to 2004 from the 9 initial SEER Program registries (SEER-9)[4] were used to calculate female breast cancer relative survival rates by 3-year period of diagnosis (1975-1977, 1978-1980,..., 2002-2004) and by age at diagnosis (ages 15-44 years, 45-54 years, 55-64 years, 6574 years, 75-99 years, and ≥85 years).[4] Because cancer survival information was needed before 1975 to include all past diagnosis years, we fit the data to a parametric Weibull cure model,[6,7] which is described in detail in a technical report.[8]


State-specific Relative Risk of Breast Cancer Death

We adjusted the SEER baseline survival to represent statespecific survival by applying state-specific relative risks that reflected a greater or smaller risk of breast cancer death in a specific state relative to the SEER-9 areas. The method, which was proposed by Mariotto et al,[9] consists of regressing 5-year breast cancer survival on sociodemographic variables for all counties in the SEER-9 areas. Once the regression model is estimated, state-specific 5-year survival is calculated by extrapolating the model to sociodemographic variables by county at the state level. Data sources on this specific application are described in a technical report.[8]


Estimation of Incidence and Prevalence by State: The Mortality-Incidence Analysis Model Method

State-specific breast cancer prevalence and incidence were estimated from state-specific mortality and population data and from the survival model described above using the MIAMOD method.[10,11] The method is based on equations relating mortality and prevalence for a given cancer to incidence and survival probabilities. The model assumes that survival is known, as estimated above, and that cancer incidence follows an age, period, and cohort (APC) model on the logistic scale. The APC incidence parameters are estimated from a Poisson generalized linear regression model on breast cancer deaths. Once incidence is estimated, prevalence can be calculated from incidence and survival. Further details on the method are given in a technical report.[8]


Prevalence Projections, 2006 to 2015

MIAMOD prevalence projections from 2006 to 2015 were based on assumptions of future trends of survival, incidence, population, and other causes mortality. It was assumed that survival was constant with rates equal to those estimated for the last period of data, 2004. Incidence was projected using the previously estimated age and cohort incidence model. This model describes slow changes in incidence, mostly the effect of risk factors, but no period changes. The population projections were based on the general assumption that recent state-specific trends in fertility, mortality, domestic migration, and international migration will continue.[5] Other causes mortality also was assumed to be constant, as observed in the last years of observed data.


Population denominators were not available for annual ages after age 84 years. Prevalence for ages ≥85 years was estimated by applying prevalence proportions at ages 80 to 84 years to the populations of women aged ≥85 years. The age-adjusted rates were based on the US 2000 standard population.


Validation of Mortality-Incidence Analysis Model Estimates

MIAMOD estimates of breast cancer incidence cases were compared with reported cases for which data were available. For 9 states participating in the SEER Program, data were available through 2005 with different starting years. An incidence database was obtained through an agreement with the North American Association of Central Cancer Registries (NAACCR). US cancer registries that report data to the NAACCR participate in the SEER Program, or the Centers for Disease Control and Prevention National Program of Cancer Registries (NPCR), or both and receive support from the state, province, or territory where they are located. The NPCR states that participated in this study were those that met NAACCR registry certification standards for providing complete, accurate, and timely data for at least 3 consecutive years during 1995 to 2003 and agreed to release incidence data for this project (30 states and DC).[12] The start and end years of available data varied, and first breast cancers were calculated from the respective database. Table 1 displays the data source and range of years compared for each state. For the remaining 11 states, no reported cases were available, so we compared MIAMOD estimates with independent incidence estimates from ecologic regression analysis at the county level.[2]


Because the MIAMOD estimates represent individual counts rather than tumor counts, we compared the incidence of first breast cancers reported in the respective databases. The determination of first cancers depends on the length of the registration period. For example, a woman who was diagnosed with 2 breast cancers in 1992 and 1998 would be recorded as a case in 1992 if she resided in 1 of the SEER states and in 1998 if she lived in 1 ofthe NPCR regions.


For each state, we calculated the mean absolute percentage difference (MAPD) between the MIAMOD estimated (ey) and reported (oy) number of breast cancer cases diagnosed up to age 84 years over years y=y1,...,ym:


For the 5 SEER states (Connecticut, New Mexico, Hawaii, Iowa, and Utah) 30-year limited duration prevalent cases from MIAMOD were compared with the corresponding reported values[4] (Table 1).


Table 1. Validation of Mortality-Incidence Analysis Model Estimates Against Reported Data From the Surveillance, Epidemiology, and End Results, National Program of Cancer Registries, and Cancer Incidence in North America Databases and From Ecologic Regression Incidence Estimations: Ages 0 to 84 Years*,†




Results


For 48 of the 51 states, the estimated number of new breast cases was consistent with reported data within 10% of the MAPD value (Table 1). Larger discrepancies were observed in Utah (10.8%), Arkansas (13.2%), and Wyoming (18.9%); note that the latter had a quite small population and that Arkansas was compared with different estimates,[2] subject to its own sources of errors. Estimated and reported 30-year prevalent cases in 2005 were reasonably close (absolute percentage difference, 1%-8%) in the SEER states, except in Utah, where breast cancer survivors were fairly overestimated (absolute percentage difference, 21.4%).


We estimate that there were 2.4 million breast cancer survivors in the United States in 2005 (Table 2), a number very similar to the published 2005 prevalence estimate of 2,477,847 survivors.[1] Crude prevalence proportions varied by state, ranging from 1.15% (Alaska) to 2.03% (Florida), because of differences among states in breast cancer incidence, survival, and population age structure. The geographic variability of prevalence also was high among younger women and older women, ranging from 0.55% to 0.94% in women aged <65 years and between 4.7% and 8.3% for patients aged ≥65 years. On average, 17% and 21% of the total 2005 US breast cancer prevalent cases are women diagnosed in the previous 2 years and between 2 and 5 years, respectively. These percentages are quite stable between US states (range, 15%19% and 19%-23%, respectively). The proportion of short-term survivors is higher in the younger age group (22% within 2 years and 25% between 2 and 5 years) than in the older age group (14% and 18%, respectively). The higher proportion of long-term survivors in the elderly reflects the overall favorable prognosis of breast cancer.


Table 2. State-Specific and Total United States Estimates of Crude Breast Cancer Prevalence in 2005 by Year Since Diagnosis and Age at Prevalence*




Figure 1 displays the age-adjusted female breast cancer prevalent percentage (Fig. 1A) and its percentage increase from 2005 to 2015 (Fig. 1B) for all US states. The southern states, except Florida, have a lower ageadjusted prevalence compared with the northern states. Conversely, it is predicted that southern states will have a higher increase in age-adjusted prevalence.




Figure 1. Female breast cancer age-adjusted prevalent percentage in 2005 (A) and percentage increase of age-adjusted prevalent percentage from 2005 to 2015 (B) by US state.


Figure 2 displays micromaps of age-adjusted breast cancer prevalence, age-adjusted incidence, crude prevalence proportions, and percentage of the female population aged >65 years by US state in 2010. The maps are ordered by age-adjusted prevalent percentage. After adjusting by age, between-state variability in crude prevalence (1.4%-2.4%) is reduced (1.3%-1.8%). Age-adjusted prevalence clearly is correlated more with incidence (0.83 correlation) than with the elderly population (0.25 correlation), whereas crude prevalence is correlated both with incidence (0.73) and the with population aged ≥65 years (0.80 correlation).



Figure 2. Geographic comparison of the breast cancer burden in 2010 by US state: Shown are the age-adjusted prevalent percentage (%), the age-adjusted incidence rates per 100,000 population, the crude prevalent percentage (%), and the percentage proportion of the population aged >65 years (%). The micromaps are ranked by age-adjusted prevalent percentage.


Breast cancer survivors are expected to continue growing from 2005 to 2015 in all US states (Table 3), producing an overall increase of almost 1 million survivors in just a decennium (from 2,403,000 to 3,421,000). The percentage of survivors (among the population) will increase from 1.6% in 2005 to 2.1% in 2015. Although the number of survivors diagnosed within 5 years is expected to increase (by 28% in absolute terms), it is expected to represent a lower proportion of the overall survivors (from 37% in 2005 to 33% in 2015), indicating that prevalence is increasing because ofhigher numbers of long-term survivors.


Table 3. Projections 2005 to 2015 of State-specific and Total United States Breast Cancer Prevalence (Absolute Numbers of Survivors and Crude Percentage Proportions of the Population) and the Prevalence of Those Diagnosed in the Previous 5 Years*




A geographic comparison of breast cancer prevalence dynamics from 2005 to 2015 is represented in Figure 3. The corresponding dynamics of incidence and of the elderly female population also are displayed to interpret the geographic variability. Age-adjusted cancer prevalence (Fig. 3, column 1) will increase by a percentage varying from 8% to 32%, whereas the crude prevalence (Fig. 3, column 3) will increase from 16% to 51%, depending on the state. The state ranking of age-adjusted prevalence increase is highly correlated with the ranking of estimated incidence increases (from —3% to 18%) (Fig. 3, column 2). Note that the states with prevalence growth <15% (the first 20 states shown in Fig. 3, column 1) are those in which the age-adjusted incidence is expected to reduce or stabilize (percentage increases <5%) (Fig. 3, column 2). Some of the differences in ranking between the percentage increase in age-adjusted and crude prevalence can be explained by aging (Fig. 3, column 4). For example, in Alaska, for the top value in population aging, the percentage increase of prevalence moves from 49 to 26 when it is adjusted by age.



Figure 3. Geographic comparison of projected breast cancer dynamics in the decennium from 2005 to 2015: Shown are the percentage increases from 2005 to 2015 in the age-adjusted prevalent percentage (%), the age-adjusted incidence rates (%), the crude prevalent percentage (%), and the population aged >65 years (%). The micromaps are ranked by the percentage increase in the age-adjusted prevalence proportion.



Discussion


To our knowledge, this is the first time that the number of women living with breast cancer in the United States has been estimated in a systematic way for all US states. These estimates represent the total breast cancer prevalence, including all women with a past diagnosis ofbreast cancer. This study includes estimates by age and years since diagnosis as well as cancer prevalence projections up to 2015.


The number of women living with breast cancer in 2005 ranged from 3691 in Alaska to 261,883 in California. Crude prevalence varied between 1.15% in Alaska and 2.03% in Florida. These numbers are influenced by the state population size and the proportion of elderly population. The age-adjusted prevalent percentage is useful for removing discrepancies caused by differences in population age structure and for producing interesting geographic patterns. The southern states, with the exception of Florida, have lower prevalence compared with the northern states. However, it is predicted that they will have a higher increase in prevalence in the next 10 years compared with the northern states. Studies on risk factors have revealed a positive correlation between patterns of mammography use and income[13,14] that may explain part of the south-north disparity. A variable proportion of breast cancer survivors (range, 35%-42%) had been diagnosed within the previous 5 years. This proportion is lower for the population aged ≥65 years, in which longterm survivors represent the large majority.


The prevalence estimates and projections presented here were based on a statistical method (MIAMOD)[10] that uses state-specific information on cancer mortality and survival. This method was used because it permitted a uniform estimation of breast cancer prevalence across all the US states using available data. When a long series of historic incidence is available, other methods[15-17] can be used. Except for few states that participate in the SEER Program, the application of these methods is not feasible. However, for most of the US states, a short and varying series of incidence data from either SEER or NPCR is available. The comparison of MIAMOD incidence estimates against the reported incidence series produced similar numbers for most of the states. The exceptions were Utah and Wyoming. Comparable results, except for Arkansas, also were observed for the remaining 11 states compared with independent incidence estimates. Generally, MIAMOD incidence underestimated the reported data. This is consistent with intrinsic limitations ofthe reported incidence series in determining the first cancers, depending on the length ofthe registration period.


The 2005 US breast cancer prevalence estimate, which was calculated by summing state estimates, was very similar to a published estimate of 2,477,847.[1] The finding that 2 distinct methods yielded similar estimates is reassuring.


In this work, we defined an individual as a survivor from the moment of diagnosis. Also, the registries data were based on cancer diagnosis. Although this definition is becoming the most prevalent, statistics based on it may not capture all individuals who are affected by cancer, such as family members and caregivers who are impacted by the survivorship experience.[18]


Several limitations are noteworthy. First, the MIAMOD method relies on accurate estimates of mortality and survival. No particular quality problem is reported for breast cancer mortality data by state. However, survival data are not available for the majority of the states and are estimated. The baseline survival model, which reflects SEER survival, has demonstrated a good fit to survival data (results not shown) over a 30-year period of diagnosis and for all age groups. State-specific survival was estimated using relative risks that reflected a higher or smaller risk of breast cancer death compared with SEER. The estimates were obtained through an ecologic regression analysis of breast cancer survival on sociodemographic variables at the county level. Although cross-validation[9] analysis indicated that the survival estimates generally provided a good fit, the ecologic associations inherent in these models may not fit well in every state. Also, it is possible that, for some states, we were not able to capture other differences that may exist between SEER and a specific state[19], however the lack of survival data makes this models a viable option.


Second, prevalence estimates were based on an APC incidence model estimate. Although the APC model is flexible enough to fit a variety of incidence dynamics, it is back-calculated from mortality and survival dynamics. Survival improvements, which are particularly relevant for breast cancer, are captured well in the baseline survival model. The long mortality time series used in this study allowed for the estimation of robust incidence age and cohort effects. However, because of the latency between incidence and mortality, the back-calculation method is not able to capture the recent changes in incidence, like changes associated with the reduced use of hormonereplacement therapy[20] in recent years. Thus, this reduction was not represented in the prevalence projection from 2006 to 2015. For the projections, we assumed no period effect. Nevertheless, a reduction and stabilization of incidence was estimated in 16 states.


Third and finally, the projections were based on several assumptions: flat survival rates after 2004, dynamic population projection, constant other causes mortality, and incidence varying according to the estimated age and cohort effects. This assumption attempts to capture age and risk factor effects on breast cancer incidence, eliminating period effects from the projections, such as screening, because temporal trends are more difficult to forecast.


According to these assumptions, the number of breast cancer survivors is expected to grow in all states at various increasing rates (from 20% to 50% in the decennium 2005-2015). It is estimated that age-adjusted incidence will drop slightly or flatten from 2005 to 2015 in about 20 US States (those with percentage increases <5%) (Fig. 3, column 2), and this partially reduces prevalence growth. These states are located mostly on the eastern and western coast and in northern areas: those with highest breast cancer incidence and prevalence rates and with higher use of mammography screening.[13] We predicted a higher increase in southern states. This may be associated with an increase in the use of screening mammography (because levels were low in the past) and with high obesity and smoking rates,[13] which are known risk factors for breast cancer. Breast cancer survivors in 2015 are expected to be older than in 2005 because of population aging and to have a longer disease history, eg, a higher proportion of patients diagnosed >5 years previously (Table 3).


These estimates provide an order of magnitude assessment of the breast cancer burden and can be used by policy makers and health authorities to inform decisions about the allocation of funds, identifying priorities, and planning more targeted cancer control strategies. Given the estimated projections, breast cancer burden will continue to represent a major demand on public health services. Continuing surveillance for breast cancer is needed 5 years postdiagnosis, and prevalence data by disease duration are important for planning research on the quality of life of cancer survivors, because breast cancer is becoming even more a chronic disease.[21] Estimates like those provided in this report support the recommendation by the Institute of Medicine that the National Institutes of Health should strengthen its use of data that estimate the burden of disease in setting its priorities.[22]



Conflict of Interest Disclosures


Supported by grants 263-MQ-514300 (to A.T.) and 263-MQ-320101 (to A.T.) from the National Institutes of Health.




References


Ries LAG, Melbert D, Krapcho M, et al. SEER Cancer Statistics Review, 1975-2005. Bethesda, Md: National Cancer Institute; 2007. Available at: http://seer.cancer.gov/csr/ 1975-2005. Accessed February 12, 2009. 

Pickle LW, Hao Y, Jemal A, et al. A new method of estimating United States and state-level cancer incidence counts for the current calendar year. CA Cancer J Clin. 2007;57:30-42. 

Surveillance Research Program, National Cancer Institute. SEER*Stat Software, Version 6.4.4. Bethesda, Md: National Cancer Institute; 2008. Available at: www.seer. cancer.gov/seerstat. Accessed February 12, 2009. 

Surveilknce,Epidemiology, and End Results (SEER) Program. SEER 9 Registries, 1975-2005. Bethesda, Md: National Cancer Institute; 2008 (released April 2008, based on the November 2007 submission). Available at: www.seer.cancer.gov. Accessed February 12, 2009. 

USCensus Bureau PD. Interim Projections Consistent With Census 2000 (released March 2004). Available at: http:// www.census.gov/population/www/projections/usinterimproj/. Accessed February 12, 2009. 

Verdecchia A, De Angelis R, Capocaccia R, et al. The cure for colon cancer: results from the EUROCARE study. Int J 

Cancer. 1998;77:322-329. 

De Angelis R, Capocaccia R, Hakulinen T, Soderman B, Verdecchia A. Mixture models for cancer survival analysis: application to population-based data with covariates. Stat Med. 1999;18:441-454. 

Mariotto A, De Angelis R. The Method to Estimate Breast Cancer Prevalence at State Level: 2005-2015. Statistical Research and Applications Branch, NCI, Technical Report 2008-01. Available at: http://srab.cancer.gov/reports/. Accessed February 12, 2009. 

Mariotto A, Capocaccia R, Verdecchia A, et al. Projecting SEER cancer survival rates to the US: an ecological regression approach. Cancer Causes Control. 2002;11:101-111. 

Verdecchia A, Capocaccia R, Eqidi V, Golini A. A method for the estimation of chronic disease morbidity and trends from mortality data. Stat Med. 1989;8:201-216. 

De Angelis G, De Angelis R, Frova L, Verdecchia A. MIAMOD: a computer package to estimate chronic disease morbidity using mortality and survival data. Comput Programs Biomed. 1994;44:99-107. 

North American Association of Central Cancer Registries (NAACCR). NAACCR Incidence-CINA Analytic File, 1995-2003. Springfield, Ill: NAACCR; 2008. 

Pickle LW, Su Y. Within-state geographic patterns of health insurance coverage and health risk factors in the United States. Am J Prev Med. 2002;22:75-83. 

Wells BL. Stage at diagnosis in breast cancer: race and socioeconomic factors. Am JPublicHealth. 1992;82:1383-1385. 

Capocaccia R, De Angelis R. Estimating the completeness of prevalence based on cancer registry data. Stat Med. 1997;16:425-440. 

Merrill RM, Capocaccia R, Feuer EJ, Mariotto A. Cancer prevalence estimates based on tumour registry data in the Surveillance, Epidemiology, and End Results (SEER) Program. Int J Epidemiol. 2000;29:197-207. 

Verdecchia A, De Angelis G, Capocaccia R. Estimation and projections of cancer prevalence from cancer registry data. Stat Med. 2002;21:3511-3526. 

Hewitt ME, Greenfield S, Stovall EE, eds. From Cancer Patient to Cancer Survivor: Lost in Transition. Committee on Cancer Survivorship: Improving Care and Quality of Life. Institute of Medicine and National Research Council of the National Academies. Washington, DC: National Academies Press; 2005. 

Coleman MP, Quaresma M, Berrino F, et al; CONCORD Working Group. Cancer survival in 5 continents: a worldwide population-based study (CONCORD). Lancet Oncol. 20089:730-756. 

Ravdin PM, Cronin KA, Howlader N, et al. The decrease in breast-cancer incidence in 2003 in the United States. N Engl J Med. 2007;356:1670-1674. 

Holland JC, Reznik I. Pathways for psychosocial care of cancer survivors. Cancer. 2005;104(11 suppl):2624-2637. 

Committee on the NIH Research Priority-Setting Process. Scientific Opportunities and Public Needs: Improving Priority Setting and Public Input at the National Institutes of Health, Institute of Medicine. Washington, DC: National Academy Press; 1998.