Article Text


Growth hormone (GH) provocation tests and the response to GH treatment in GH deficiency
  1. T J Cole1,
  2. P C Hindmarsh2,
  3. D B Dunger3
  1. 1Centre for Paediatric Epidemiology and Biostatistics, Institute of Child Health, University College, London, UK
  2. 2Centre for Human Growth and Maturation, Department of Medicine, University College, London, UK
  3. 3Department of Paediatrics, University of Cambridge, Cambridge, UK
  1. Correspondence to:
    Professor T J Cole
    Centre for Paediatric Epidemiology and Biostatistics, Institute of Child Health, 30 Guilford Street, London WC1N 1EH, UK;


Objective: To identify factors, particularly the growth hormone (GH) provocation test result, affecting growth response to GH treatment in children with GH deficiency (GHD).

Subjects: A total of 337 prepubertal GHD patients aged <10 years from the UK Pharmacia KIGS database (GH response to provocation test <20 mU/l).

Outcome measure: Annual change in height standard deviation score (SDS) (revised UK reference) in the first and second years of treatment.

Results: Height increased by 0.74 SDS units (SD 0.39) in the first year of treatment and 0.37 units (SD 0.27) in the second. Adjusting for age, height, weight, midparent height, and injection frequency, the strongest predictor of first year growth response was the GH provocation test result; halving the result predicted an extra height increment of 0.09 units (p<0.0001). It predicted the second year response less well (p<0.0002) and after adjusting for the first year response was not predictive at all.

Conclusions: Among patients referred for possible GHD, the GH provocation test, though not a gold standard for diagnosis, is a valuable predictor of growth response in the first year of treatment. A year’s treatment is recommended for cases with a marginal provocation test result, with the option to continue treatment if the response is adequate. The value of unified protocols for single or repeated provocation tests needs to be assessed.

  • GH, growth hormone
  • GHD, growth hormone deficiency
  • KIGS, Kabi International Growth Study
  • SDS, standard deviation score
  • growth hormone
  • growth response
  • provocation test
  • short stature

Statistics from

Growth hormone deficiency (GHD) is suspected in subjects with short stature and a reduced growth velocity in whom other causes of poor growth have been excluded.1 The diagnosis is confirmed by a provocation test which identifies a subnormal response to a GH secretagogue such as insulin, clonidine, glucagon, arginine, or l-dopa. This is generally viewed as more effective than measuring the spontaneous GH secretion2 though a recent small study suggests the opposite.3 In subjects with documented hypothalamic/pituitary pathology or additional pituitary hormone deficits, and in those with a classical phenotype and severe isolated GHD (GH response <10 mU/l or <5 ng/ml), the diagnosis is generally straightforward. However the diagnosis of partial GHD (10 mU/l⩽GH response<20 mU/l) is more problematic. The various GH provocation tests, and laboratory measures of GH, vary in specificity and sensitivity and the definition of partial GHD is dependent on an arbitrary cut off. A second test may improve the discrimination, but consecutive tests on the same day may be affected by down regulation of the hypothalamic/pituitary axis and require complex statistical analysis.4 Surrogate measures of GH secretion, such as IGF-1 and IGFBP-3, are useful to rule out the diagnosis but are not sufficiently sensitive for the diagnosis of GHD.5,6

The growth response to a standard replacement dose of GH should discriminate those with GHD from those with normal variant short stature, but this response may be influenced by other factors such as midparent height, birth weight, and current weight.7 Both pretreatment growth rate8 and stature9 have been described as predictors of growth response although neither study adjusted for regression to the mean.10 The peak GH response to pharmacological testing1,8 and 24 h GH secretion11 have been shown to be related to growth response. Most recently Ranke and colleagues12 have developed prediction models for each of the first 4 years of GH treatment, including all the above variables. It is unclear though how useful these models are for deciding whether or not to provide GH treatment.

In order to determine the degree to which the GH provocation test result predicts the response to GH, and to improve the management of patients whose provocation test result is borderline, we analysed data from GH deficient subjects (GH response <20 mU/l) registered in the UK Pharmacia KIGS database.


Cases were selected from the UK Pharmacia KIGS (Kabi International Growth Study) database12 on the basis of a diagnosis of isolated GHD in children <10 years of age treated with GH, Tanner pubic hair stage 1, GH provocation test result <20 mU/l, and follow up >1 year. Cases with a history of pituitary or hypothalamic tumours or cranial radiotherapy were excluded. A total of 337 patients were identified with height measurements at baseline (t0) when GH was first given, and 1 year later (t1). Some patients also had measurements 1 year before (t−1) and 2 years after (t2) baseline. The times were defined to be the nominal time (that is, −1, 1, and 2 years relative to t0) ±2 months. On 5% of occasions patients had more than one measurement in the ±2 month window, which were then averaged.

The KIGS was set up by Kabi Pharmacia (now Pfizer) as a post-market surveillance programme for patients receiving GH. Participating patients (or their parents) gave signed informed consent for their data to be included in the database and used for the purposes of medical research. Irreversibly anonymised data for the current study were provided to the authors by Dr Patrick Wilton (Pharmacia).

The primary outcome measure was the first year growth response, that is, the change in height standard deviation score (SDS) (British revised reference13,14) during the first year of GH treatment, adjusted for other covariates. A secondary outcome was the growth response in the second year of treatment. Growth response was adjusted for the exact time between measurements. Other covariates included height SDS and weight SDS at the start of treatment, age, sex, birthweight SDS, gestation, midparent height SDS (mean of two parents’ SDS), GH dose and injection rate, growth rate in the pre-treatment year, and the GH provocation test result. This latter variable was obtained using local protocols including local GH assays. To account for skewness it was logged and multiplied by 100, so that its regression coefficient indicates the effect on height SDS change of a 1% change in the provocation test result.15

The factors predicting growth response were identified by multiple regression analysis. The relationship between growth response and provocation test was explored using a partial regression plot, which is a scatterplot where each factor is replaced by its residual adjusted for other factors in the relevant model. Analysis was carried out in Data Desk version 6.2 (Data Description, Ithaca, NY, USA).


Background information

Table 1 summarises the 337 patients at baseline (t0), and during the 2 years of treatment (t1 and t2). Overall there was a preponderance of boys, with midparent height SDS, birth weight SDS, and weight SDS all relatively low. The provocation test result (maximum GH peak) was also low, while details of GH treatment were similar in years 1 and 2.

Table 1

 Summary statistics at baseline and during years 1 and 2 of treatment in 337 pre-pubertal IGHD patients aged under 10 years (70% male)

Height SDS was fairly constant during the year before treatment (mean −3.4 SDS units), increasing to −2.7 after 1 year of treatment and −2.3 after 2 years (table 2). Height SDS at baseline depended on age (0.07 units greater per year, p = 0.03), sex (girls 0.3 units shorter than boys, p = 0.004), and the GH provocation test result (0.0021 units greater per %, p<0.0001).

Table 2

 Summary statistics of height SDS and change in height SDS for different time periods

There was a big growth response in the first year of treatment and a smaller response in the second, which was similar in the two sexes. The SD of height increment was greater in the first year than before or after, indicating the heterogeneity of the growth response.

First year growth response

Table 3 summarises the regression analysis for the first year of treatment. The 10 factors together explained 42% of the variance, with a residual SD of 0.29 height SDS units. The trend in height SDS increment with age was curvilinear, becoming less steep with age (fig 1). Baseline height SDS and weight SDS showed that shorter and heavier children gained more height with treatment, while midparent height SDS predicted a greater response in those with taller parents. The frequency of GH injections was important, but not the total GH dose. Injection frequency was three to seven injections per week, with 28% receiving three, and 79% six or seven.

Table 3

 Multiple regression analysis of the increase in height SDS during the first year of GH treatment on a series of prognostic factors (n = 300)

Figure 1

 The curvilinear relation between height SDS change and age in the first year of GH treatment.

The GH provocation test result was the most predictive factor, accounting for 9.9% of the variance. Figure 2 shows the partial regression plot between growth response and provocation test, where each factor has been adjusted for the other factors in table 3. The fitted regression line is also shown.

Figure 2

 The relation between height SDS change in the first year of GH treatment and maximum GH peak (%), each expressed as residuals adjusted for the other factors in the model of table 3.

There were also two marginally significant interaction terms (not shown). The provocation test result was less negative (that is, less predictive) in older children (interaction p = 0.06), and the GH dose effect was smaller in tall children (p = 0.09). There were no significant sex interactions (p>0.1), so the sexes were pooled for analysis.

Second year growth response

Just three factors predicted growth response in the second year: provocation test (p = 0.0005), GH dose, and injection rate (p∼0.05). The provocation test coefficient was half that seen in the first year. Together the factors explained 6.8% of the variance, and the residual SD was 0.26 height SDS units based on 232 patients.

By far the best predictor of second year growth response was the first year response. Adding it to the above model (along with baseline height SDS at year 1) tripled the explained variance to 21.6%, reduced the residual SD to 0.24 and rendered the provocation test result insignificant (p = 0.6) (table 4).

Table 4

 Multiple regression analysis of the increase in height SDS during the second year of treatment on a series of prognostic factors (n = 232)

Growth prediction before GH treatment

The one factor predicting growth in the pre-treatment year was age, with older patients growing faster (p = 0.001). The analysis was based on 153 patients and explained 6.9% of the variance with a residual SD of 0.20.

Height and height increment correlations

Table 5 gives the correlations between height and height increment at the various time points. Heights prior to GH treatment at t−1 and t0 were highly correlated, r = 0.97 indicating strong tracking. Height also tracked strongly during GH treatment, with a correlation of 0.95 between heights at times t1 and t2. The introduction of GH therapy disturbed the tracking, with strikingly lower correlations for heights before and after treatment, for example, 0.9 for the two pre-treatment years versus year 1, and 0.8 versus year 2.

Table 5

 Correlations between height SDS and annual change in height SDS


This group of very short children, mean height SDS −3.4 at baseline, showed a dramatic increase of 0.74 SDS in height after a year of GH treatment. Despite this their growth pre-treatment was sufficient to maintain their centile position, with a fall of only 0.05 in mean height SDS over the year (table 2). This is at first sight surprising, as untreated GH deficient patients cross height centiles downwards. It suggests the likely inclusion of non-GHD patients in the cohort some of whom were crossing centiles upwards.

A similar conclusion can be drawn from the patients’ heterogeneous response to GH treatment. Some were genuinely deficient and grew better with treatment, while others were not deficient and grew no better when treated. This shows itself in several ways. Firstly, there was no association between the provocation test result and growth prior to GH treatment, but a strongly significant and negative association after treatment (table 3). Ranke et al12 also found the provocation test result to be strongly predictive of post-treatment acceleration.

Secondly, the SD of the growth response almost doubled as a result of GH treatment, from 0.21 the year before to 0.39 the year after (table 2). Some patients responded to GH while others did not, and the extra heterogeneity in response on introducing GH was most likely due to the GH itself. The smaller SD of growth response in the second year (0.27) accords with this, in that the GH effect waned after the first year.

Thirdly, before-treatment and after-treatment height tracked strongly (table 5), and almost as strongly as seen in normal children,16 but heights before and after treatment correlated with each other far less strongly, for example, for the pre-treatment years versus years 1 and 2. This again shows that the introduction of GH treatment generated a heterogeneous response, with some patients responding and others not.

The correlation of 0.45 between height SDS change in the first and second years of treatment (table 5) was remarkably high, given the measurement errors involved. Of course measurement error impacts on all the correlations in table 5, but particularly so for correlations between successive velocities where error in the common measurement (that is, height at t1) counts twice. The high correlation indicates that first year growth response was a good predictor of later response. Furthermore, after adjusting for it the provocation test taken 1 year earlier was completely uninformative (table 4). So the first year response, once known, is far better than the provocation test for predicting final height increment.

The relationship between test result and subsequent growth while treated with GH was essentially linear on the log scale (fig 2). A halving of the provocation test result (for example, from 10 to 5 mU/l) corresponds to a 69% decrease on the log (%) scale,15 which in turn relates to an increase of −69×−0.0013 = +0.09 height SDS units, an upward shift in velocity of 0.3 SDs given the residual SD of 0.29 SDS units. This looks a modest velocity effect for a large change in the test result, but if maintained over time it would appreciably increase final height.

The test’s relatively poor performance may reflect measurement error due to the lack of standardisation in methodology and inter-laboratory differences. Bright et al17 found that the provocation test had 82% sensitivity but only 25% specificity to detect GHD, low predictive power which they attributed to inter-individual differences in GH handling. It is likely that both factors are relevant, but using a standardised protocol would be a relatively simple way to improve the signal to noise ratio.

The younger children grew more rapidly than the older children (table 3 and fig 1), and the provocation test result increased with age (r = +0.20, p = 0.0002). So younger children were more likely to have GHD than constitutional delay, and were more likely to respond to GH treatment. The frequency of GH injections was more significant than the dose during the first year of treatment, but this reversed in the second year. Both the dose and the injection rate were similar in the 2 years, so there is no obvious explanation for the switch. As more and more children receive six to seven injections per week, the dose will become the important factor.

Baseline height SDS had a highly significantly negative effect in the first year of treatment, due partly to regression to the mean and partly to the presence of baseline weight SDS in the model. The inclusion of both weight and height shows that short fat children were those that respond best to GH treatment. This of course describes the classic phenotype of GH deficiency.

Midparent height and birth weight were predictive during the first year of treatment, but not before or after. This suggests that they only emerged as predictive when the growth rate was sufficiently high.

In conclusion, the outcome of the GH provocation test was closely related to growth response in the first year of treatment. Growth response is effectively the “gold standard” to quantify the degree of GH insufficiency in the individual patient, since the aim of treatment is to increase height velocity and ultimately adult height. Growth responses in the first 2 years are highly correlated, emphasising the value of first year response for predicting later height gain attributable to treatment.

So the initial GH treatment decision should be based on the provocation test using the conservative cut off of 20 mU/l. Then if the first year growth response is sufficiently small, logically the treatment should be stopped at that point unless compliance is felt to be poor and can be improved. This decision should ideally be based on growth response adjusted for all the factors in table 3 with the exception of the provocation test result. The provocation test justifies treatment in the first place, but after 1 year it is superseded by the growth response and should not be adjusted for again. In practice of course such adjustments are unrealistic without dedicated software.

The way forward is to improve the predictive power of the GH provocation test, by collecting information on local variations in test procedures and comparing the results with those based on a standardised protocol and centralised GH assay. We propose a clinical trial to test the effectiveness of such a protocol.


We thank the KIGS International Board and Executive Scientific Committee for permission to access the KIGS data, and Dr Patrick Wilton for providing the data. We are also grateful to members of the British Society for Paediatric Endocrinology and Diabetes (BSPED) for contributing their patients to KIGS. Finally we thank the two reviewers for their comments which have considerably improved the paper.


View Abstract


  • Conflicts of interest: all three authors have from time to time received support from Pfizer for contributing to conferences on child growth.

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.