Introduction

Age at menarche has been declining over the last two centuries in most parts of the world, and among the girls in the United States, there is evidence that it continues to do so [1]. In particular, black girls reach menarche earlier than their white counterparts and the ongoing rate of menarche decline appears to be faster among black girls [13]. Consistent with the life course theory, early life exposures may be important in programming the risk of cancer later in life [4, 5]. Because early pubertal development and menarcheal activation are considered a critical window of vulnerability to breast carcinogenesis [6], secular trends toward early puberty are a major public health concern. These developments underscore the importance of examining this potentially modifiable risk factor for breast cancer in early life [7].

Among girls in the United States, recent epidemiologic evidence suggests that low socioeconomic status (SES) may be related to early puberty [810], as determined by entry into the stage 2 of breast development based on the staging system developed by Tanner and Whitehouse [11]. A comprehensive review published in 2003, however, reported inconsistent associations between socioeconomic circumstances and the onset of puberty: developed versus developing countries showed particularly marked variations [12]. Evidence also varies according to how SES was measured, in particular whether utilizing parental education or household income. For example, in one multiethnic cohort study, lower maternal education predicted early menarche but household income was unrelated [13]. More recently, studies have linked early puberty to both lower parental educational attainment [810] and lower household income [10, 14]. Methodological issues that may account for such inconsistencies in studies to date include cohort effects and variations in the sensitivity of SES indicators to different racial/ethnic groups.

Menarcheal activation is modulated by the hypothalamic–pituitary–gonadal system, particularly endogenous estradiol and lower sex hormone binding globulin [15, 16]. In addition, menstrual onset occurs downstream of a cascade of endocrine changes including increased secretion of gonadotropin-releasing hormone (GnRH), growth hormone (GH), and insulin [17, 18]. Because many predictors of age at menarche often covary with socioeconomic settings (e.g. anthropometric status, physical activity, family functioning), explanatory pathways remain poorly understood [19]. Efforts to identify the explanatory mechanisms are further complicated by the multidimensional nature of SES, since its relationship with outcome varies according to how it is measured [20]. What the aforementioned review [12] alludes to, nevertheless, is that some aspects of SES likely play a role in black–white differentials in the onset of menarche in the United States. Notably, such black–white differentials are not observed in the developing world: black African girls do not have a significantly increased risk of early puberty compared to their white African counterparts [21, 22].

In a prospective cohort study of black and white girls, who were followed with annual exams from the age of 9 or 10 through adolescence, we examined the association of two commonly used socioeconomic indicators—household income and parental education—with early menarche. A specific goal was to determine empirically whether the SES factor selected (household income or parental education) could affect the findings regarding racial/ethnic differences in the timing of menarche. Further, we sought to determine whether the relationship between SES and the onset of menarche can be explained by anthropometric factors [percentage fat by skinfolds, body fat distribution as indicated by the ratio between triceps skinfolds and the sum of subscapular and suprailiac skinfolds, body mass index (BMI), upper thigh circumference], caloric intake, physical activity, and aspects of the family environment.

Methods

Study design and population

The National Growth and Health Study is a cohort study of 2,379 girls (1,213 black and 1,166 white), aged 9 or 10 years at entry. The cohort was followed annually until ages 18–19 years. The present analysis focuses on baseline assessments of predictors of age at menarche. The study design, protocol, and baseline characteristics have been previously reported in detail elsewhere [23].

Participants were recruited by three clinical centers: University of California at Berkeley, University of Cincinnati/Cincinnati Children’s Hospital Medical Center, and Westat, Inc./Group Health Association in Rockville, MD. Berkeley recruited participants from public and parochial schools in the Richmond Unified School District. That area was chosen based on census tract data showing approximately equal percentages of black and white children with the least degree of income and occupational disparity between the races. Cincinnati recruited participants from public and parochial schools in greater Cincinnati that were selected to be racially and socioeconomically representative of Hamilton County (which includes inner city, urban residential, and suburban areas). The cohort selected by Westat was randomly drawn from a membership listing of potentially eligible families who had girls aged 9 or 10 and who were enrolled in the prepaid medical program of Group Health Association, a large Washington, DC, area health maintenance organization.

The rationale for recruiting participants from three different geographical areas was to allow for a wide range of socioeconomic backgrounds and to reduce the chance of biased results due to local habits or local ancestry. To reduce the influence of cultural diversity, eligibility was restricted to girls living in racially concordant households and excluded Hispanics of either race or other ethnic groups. If a guardian was not the natural parent, investigators attempted to ascertain the race of the natural parent.

The Institutional Review Board approval for the present analyses was obtained from the University of California, San Francisco. The public release data set used in this study was obtained directly from the NHLBI.

Study variables

All study variables, with the exception of girls’ age at menarche, were collected at baseline (Y1 of the study).

Definition of endpoints

Age at menarche was assessed by a female research assistant during the annual examinations. Girls were asked whether their menstrual periods had started, and if so, to specify the date. Also, in subsequent annual exams, participants were asked retrospectively to recall age at menarche (expand). For the purposes of this analysis, age at menarche was estimated from the menarche onset date participants provided after the first affirmative response to the question regarding the onset of menses. Age at menarche was calculated from the date of birth to the date of the first menstrual period. Consistent with established cutoffs [4], early menarche was defined as <12 years, while average or late menarche was defined as 12 years or older. Reliability of self-reported age at menarche is well established [24].

Definition of socioeconomic status

Measures of SES obtained during the baseline interviews were used in the present analyses. Parents or guardians reported their highest educational level and household income. Efforts were made to interview both caregivers, but when this was not possible, the available parent (or legal guardian) was asked to report on their partner’s education and income. If parents were educationally discordant, ‘maximum parental education’ was derived from the educational level of the parent with higher educational qualifications using a trichotomous scale (‘high school or less,’ ‘some college,’ and ‘college or more’). To assess annual household income, a four-point scale was utilized (‘less than $9,999,’ ‘$10,000–29,999,’ ‘$30,000–39,999,’ and ‘≥$40,000’).

Covariates

The following baseline covariates that were previously associated with age at menarche [19] were considered in the present analyses.

Anthropometric factors. Height, weight, and body fat measurements were taken from girls and parents by examiners who were centrally trained and certified as part of the study. Weight was measured using a Health-o-meter electronic scale (Model 482). Two weight measurements were obtained, and a third measurement was taken if the first two differed by more than 0.3 kg. Using a custom portable stadiometer (Creative Health Products, Plymouth, Mich.), each girl’s height was measured when she was in socks, with her heels together, toes apart at a 45° angle, and head in the Frankfort horizontal plane. Two height measures were obtained with a third measure taken if the first two were more than 0.5 cm apart. Girth measurements were obtained at the upper thigh and at the upper arm, using a fiberglass tape. Although an imprecise measure of obesity, BMI was used in this study to estimate the prevalence of obesity and was calculated using the Quetelet index—weight/height2 (i.e., kg/m2). BMI is moderately correlated with subscapular skinfolds and with percentage of body fat based on underwater weighting. Thus, NGHS used another independent measure of body composition suitable for population studies: skinfolds at three sites. This measure was obtained at the triceps (halfway between the acromion process and the olecranon process), suprailiac (about 20 mm above the iliac crest in the axillary line), and subscapular sites (about 20 mm below the tip of the scapula, at an angle of 45° to the lateral side of the body) using Holtain calipers (Pfister Import-Export, Inc., Carlstadt, NJ). Skinfolds and girths were measured twice and repeated a third time if the first two measurements were discrepant by more than 1.0 mm and 0.5 cm, respectively. The average of the two or the closest two of three measurements was used in data analysis. Girth and skinfold measurements and the estimation of percentage body fat were performed according to Cameron’s method [25].

Caloric intake and physical activity. Girls completed a three-day food diary and a three-day physical activity diary (AD) at baseline. These measures were described in detail elsewhere [26, 27]. Both diaries included one weekend day and two weekdays. The completed food and activity diaries were reviewed by centrally trained and certified nutritionists at the Nutrition Coordinating Center at the University of Minnesota for the first two years and thereafter at the Dietary Data Entry Center at Cincinnati Children’s Hospital, using the Nutrient Data System developed by the University of Minnesota. The NGHS dietary data collection method has been previously validated [26]. The estimation of daily activity levels was based on the sum of the recorded activities, which were scored with approximate metabolic equivalent (MET) levels and duration of the activity [27]. Coding and scoring of physical activity information were centrally coordinated by the Physical Activity Data Unit at the University of Pittsburgh.

Family discord. Assessment of psychosocial and environmental characteristics of families was conducted using a subset of seven items from the Family Environment Scale (FES, [28]), which measured girls’ perceptions of their actual family environment (‘In my family we feel closer to each other than to people outside the family,’ ‘In my family we like to spend free time with each other,’ ‘In my family we feel very close to each other,’ ‘When our family gets together for activities, everybody comes,’ ‘We can easily think of things to do together as a family,’ ‘In my family we ask other family members about our decisions,’ ‘In my family togetherness is very important.’). This scale has been used in diverse populations [28]. Questions were assessed on a four-point scale and then reverse scored, with higher levels representing more family discord. Items were then summed to develop a summary score for the subscale.

Family structure. Participants were asked to report on the number of parents and siblings in their household.

Maternal age at menarche (in years) was obtained by recall from mothers and modeled as a continuous variable.

Statistical analysis

Descriptive statistics, including subject characteristics by race, were evaluated using contingency table analyses, the χ2 test for trend in proportions and t-test for comparison of groups on the continuous data. Modeling progressed from univariate to multivariate, guided by a priori considerations [29]. Multicollinearity among the explanatory variables was examined. Logistic regression analyses were performed to estimate odds ratios (ORs) and 95% confidence intervals (CIs) for the association between the predictors (SES, anthropometric factors, caloric intake, physical activity, maternal age at menarche, family structure, and family discord) and the timing of menarche onset. Household income and parental education were modeled as both polytomous and ordered categorical variables, and the ORs (95% CI) for levels of income and education (1, 2, 3+) were estimated in relation to the group with the lowest income and education. Interactions between race and SES were evaluated. Race-specific models were estimated to evaluate the extent to which results differed by this variable. The α error was set at 0.05 and all reported p-values are two-sided. Analyses were performed using Intercooled STATA 9.0 for Windows (Stata Corporation, College Station, TX).

Results

Study characteristics

The present analyses included 1,091 black and 986 white girls, who provided information on age at menarche. Of the small number of girls who were menarcheal at baseline, 0.01% were white (n = 10) and 3% were black (n = 30). As shown in Fig. 1, age at menarche was normally distributed in both groups and ranged from 8 to 17 years. Further, proportionately more black girls were menarcheal before 12 years of age compared to their white counterparts (46%, n = 468 vs. 26%, n = 240, respectively, p < 0.0001, Fig. 1).

Fig. 1
figure 1

Distribution of age at menarche by race. Vertical bars show mean ages at menarche (white girls, mean = 12.7 years; black girls, mean = 12.1 years)

The most complex multivariate model including household income, parental education, anthropometric factors (percentage fat by skinfolds, body fat distribution as indicated by the ratio between triceps skinfolds and the sum of subscapular and suprailiac skinfolds, BMI, upper thigh circumference), caloric intake, physical activity, maternal age at menarche, family discord, and family structure as predictors of age at menarche was based on 657 (65%) blacks and 669 (72%) whites.

The distribution of baseline characteristics by race is presented in Table 1. Black girls were significantly taller and heavier (all p < 0.0001). Proportionately, more black girls than their white counterparts had lower parental educational attainment: 30.1% (n = 307) vs. 20.1% (n = 189), respectively (p < 0.0001). In addition, proportionately more blacks than whites were poor, as defined by household income below $9,999: 26.7% (n = 272) vs. 7.6% (n = 71), p < 0.0001. It is notable that household income and parental education were more strongly correlated among white girls than among their black counterparts (Spearman’s rank correlation coefficients were 0.51 vs. 0.43, respectively). Black girls were also significantly more likely to be less physically active (p < 0.0001), to have higher caloric intake (p < 0.0001), and to live in more adverse family environments (as indicated by the FES subscale; p = 0.003).

Table 1 Study variables by race and univariate logistic regression odds ratios (ORs) and (95% CI) with early menarche (<12 years)

Univariate predictors of early menarche

Table 1 shows univariate logistic regression ORs and 95% CIs between study variables and early menarche. Parental education was not associated with early menarche (OR = 0.81, 95% CI 0.64–1.02 among white girls and OR = 0.91, 95% CI 0.72–1.15 among black girls). In contrast, there was a statistically significant association of lower household income with early menarche among white girls (OR = 0.74, 95% CI 0.61–0.91). The reverse was true among black girls: there was a significant relationship between higher household income and early menarche (OR = 1.20, 95% CI 1.03–1.41). We next examined anthropometric factors and found an association between higher BMI and early menarche among both white girls (OR = 1.15, 95% CI 1.09–1.22) and black girls (OR = 1.13, 95% CI 1.08–1.18). Other significant anthropometric predictors of early menarche were percentage fat by skinfolds (OR = 1.05, 95% CI 1.03–1.07 among white girls versus OR = 1.03, 95% CI 1.02–1.04 among black girls), upper thigh circumference (OR = 1.10, 95% CI 1.08–1.13 among white girls versus OR = 1.07, 95% CI 1.05–1.09 among black girls), and body fat distribution as indicated by the ratio between triceps skinfolds and the sum of subscapular and suprailiac skinfolds (OR = 0.17, 95% CI 0.09–0.35 among white girls versus OR = 0.20, 95% CI 0.10–0.42 among black girls). In addition, higher maternal age at menarche (as indicated by a continuous variable) was protective against early menarche among both white girls (OR = 0.75, 95% CI 0.65–0.85) and their black counterparts (OR = 0.80, 95% CI 0.73–0.88). Further, measures of caloric intake, physical activity, and the family environment were not significantly related to early menarche in either racial group.

Race, socioeconomic status, and early menarche

Being black was associated with early menarche (OR = 2.42, 95% CI 2.01–2.92). After household income was taken into account, the OR was 2.46 (95% CI 2.01–3.02). When parental education was controlled, the estimate did not change significantly (OR = 2.33, 95% CI 1.92–2.82). While the effect of one SES indicator—household income—on the timing of menarche differed between black and white girls as reflected by a significant interaction between household income and race (p < 0.0001, Table 2), the interaction was not significant with the other SES indicator—parental education (Table 3).

Table 2 Univariate and multivariate logistic regression odds ratios (95% CI) between income strata and early menarche (<12 years)
Table 3 Univariate and multivariate logistic regression odds ratios (95% CI) between education strata and early menarche (<12 years)

Tables 2 and 3 show estimates of associations between socioeconomic factors and early menarche unadjusted and adjusted for covariates that were positively associated with outcome in univariate analyses: percentage fat by skinfolds, body fat distribution as indicated by the ratio between triceps skinfolds and the sum of subscapular and suprailiac skinfolds, BMI, upper thigh circumference, and maternal age at menarche. In addition, when household income was modeled, we adjusted for parental education, and conversely, modeling the predictive value of parental education meant controlling for household income. White girls in the highest quartile of household income were at a significantly lower risk of early menarche after taking covariates into consideration (OR = 0.37, 95% CI 0.18–0.80). In addition, the inverse pattern was observed among black girls: those in the highest quartile of household income were at an increased risk of early menarche compared to the lowest income quartile (adjusted OR = 2.15, 95% CI 1.27–3.63, Table 2). As shown in Table 3, parental education was not associated with early menarche among the girls in the present study. Notably, when adjusted models were extended to include caloric intake, physical activity, number of parents and siblings, and the family environment subscale, no significant changes in estimates in any of the models were observed (data not shown).

Discussion

In the present study, nearly half of black girls were menarcheal before 12 years of age compared to approximately a quarter of their white counterparts. Overall, parental education was unrelated to age at menarche in the present cohort. The novel finding of this study is that the graded association between household income and age at menarche was strong and significant among black girls but less clear among white girls. Specifically, while higher levels of household income were protective against early menarche among white girls (adjusted OR approximately 0.40 for the highest socioeconomic stratum), the opposite was true for black girls who were more likely to go through early menarche at higher levels of household income (adjusted OR 2.15 for the highest stratum). This pattern of findings among black girls is more akin to that of developing countries where higher SES has been linked with early menarche [12], presumably due to nutrition-related factors.

While a positive association between household income and age at menarche is consistent with at least two other studies [8, 10], the lack of statistically significant influences of parental education on this outcome in the present study contrasts some previous reports [9, 13]. This may perhaps be due to the fact that parental education has been measured inconsistently (maternal educational attainment only versus maximum educational attainment of both parents/guardians). Thus, as we hypothesized, the SES factor selected (household income versus parental education) affected the findings regarding racial/ethnic differences in the timing of menarche. Since components of SES have different meanings in certain racial/ethnic groups [20], it is important to go beyond household income and parental education when elucidating socioeconomic risk factors for early puberty [20]. Indeed, there is evidence that at the same income levels, wealth and debt are independent predictors of racial/ethnic disparities in health outcomes [30, 31].

As a secondary goal, we attempted to determine the pathways by which socioeconomic characteristics affect age at menarche. SES represents proxies for other risk factors [32], and likely underlies at least two proximal determinants of the onset of puberty: anthropometric factors and environmental exposure (e.g., toxic agents, residential crowding) [33]. With regard to anthropometry, recent longitudinal studies have consistently indicated that adiposity in early childhood predicts early puberty [8, 9]. The gain in body fat may be one of the key signals, possibly through secretion of the fat-derived protein leptin, for stimulating the hypothalamus to increase secretion of GnRH [34], which in turn stimulates the pituitary–ovarian axis and initiates the pubertal cascade. In the present study, several anthropometric factors (upper thigh circumference, BMI, percent body fat, body fat distribution) were positively associated with age at menarche. Other potential mediating variables such as caloric intake, MET hour physical activity, and the family environment were unrelated to age at menarche in univariate analyses. Our findings indicate that these factors did not fully explain the association between household income and age at menarche in multivariate models. Further complicating the picture was the finding that BMI, while consistently associated with SES among white girls, was unrelated to SES among their black counterparts in the NGHS study [35].

The current study has several design strengths, but also some notable limitations. It included a large cohort of black and white girls from three study sites with long follow-up. We were able to evaluate, both cross-sectionally and longitudinally, the contribution of socioeconomic factors to girls’ menses onset. However, specific data on socioeconomic indicators and covariates prior to the participants’ study entry at age 9 or 10 were not available and, therefore, were not evaluated in our analysis. While we were able to adjust for a number of potential explanatory variables linking SES and menarche, including anthropometric factors, we were unable to probe exposure to hormonally active environmental agents [36] as potential mediators of the association between SES and age at menarche.

In conclusion, our findings show that nearly half of black girls were menarcheal before 12 years of age compared to approximately a quarter of their white counterparts. Compared with those in the lowest income quartile, white girls in the top quartile of household income were at significantly lower risk of early menarche after taking into account maternal age at menarche, parental educational attainment, and anthropometry. The inverse was true for black girls, with those in the highest income quartile showing a higher prevalence of early menarche. If contemporary cohort studies of economically diverse girls confirmed the relevance of household income resources for pubertal timing and were also able to include a wider array of potential mediating variables (e.g., energy balance health behaviors, exposure to hormonally active agents), the resulting information may generate important insights into the pathways leading to the social patterning of pubertal development.