Skip to main content

Analysis of heterogeneous growth changes in longitudinal height of children



There have been methodologies developed for a wide range of longitudinal data types; nevertheless, the conventional growth study is restricted if individuals in the sample have heterogeneous growth trajectories across time. Using growth mixture modeling approaches, we aimed to investigate group-level heterogeneities in the growth trajectories of children aged 1 to 15 years.


This longitudinal study examined group-level growth heterogeneities in a sample of 3401 males and 3200 females. Data were analyzed using growth mixture modeling approaches.


We examined different trajectories of growth change in children across four low- and middle-income countries using a data-driven growth mixture modeling technique. The study identified two-group trajectories: the most male samples group (n = 4260, 69.7%) and the most female samples group (n = 2341, 81.6%). The findings show that the two groups had different growth trajectories. Gender and country differences were shown to be related to growth factors; however, the association varied depending on the trajectory group. In both latent groups, females tended to have lower growth factors (initial height and rate of growth) than their male counterparts. Compared with children from Ethiopia, children from Peru and Vietnam tended to exhibit faster growth in height over time: In contrast, children from India showed a lower rate of change in both latent groups than that of children from Ethiopia.


The height of children in four low- and middle-income countries showed heterogeneous changes over time with two different groups of growth trajectories.


Describing how individuals vary over time is one of the main focuses in the study of development [1]. Repeated measurements are commonly used to study individual and population growth processes of stability and change over time [2]. Thus, selecting analytical methodologies that capture growth changes is an important point in longitudinal data analysis. Growth curve modeling has offered a set of techniques that are useful for modeling between-individual variations and within-individual change in growth [3]. In analyzing the change process, it is reasonable to assume that not everyone changes in similar trajectories. There may be underlying classes for each individual, with each class having its own set of parameters guiding the change process, or possibly following distinct functional forms of change entirely [4]. In such instances, the application of typical longitudinal data analysis techniques may not give appropriate results.

Growth modeling techniques, such as mixed-effects and latent growth models, assume that the sample is drawn from a single population with a single set of parameters (e.g., means, variances, covariance) [5, 6]. For a single population, covariates that explain part of the variances of the growth parameters (e.g., intercept and slope) can reflect observed variation in growth curve models. However, in the situation of unobserved heterogeneity, the assumption of a single population underlying the growth curves must be relaxed [2]. In contrast, multiple-group latent growth models can enable simultaneous modeling of change for multiple observed groups, in which parameters describing growth patterns are examined to see if they are invariant across groups. However, they require prior information about the group membership of an individual and they assume various growth trajectories can only be examined within subgroups that have observed identity variables (e.g., gender, etc.). Furthermore, when the group membership of an individual is unknown, unobserved subgroups of individuals may exist within the population, which may exhibit variability in their latent trajectories. Growth mixture modeling is a viable approach for identifying multiple unique trajectories in different unobserved subgroups. It is appropriate when subgroups of individuals with different trajectories are expected and individual group membership is unknown a priori [5,6,7].

Growth mixture models can be considered as an extension of the multiple-group latent growth model. An individual's group membership can be generated from the information of the estimated probabilities of latent class [2]. Hence, the growth mixture model refers to a model with categorical latent variables that reflect subgroups where group membership is unknown, and it is known as a finite mixture model. The mixture represents various latent trajectory classes [8]. Therefore, this study aims to present the growth mixture modeling approach to identify groups of children with different growth trajectories of physical height and to examine growth differences across different groups of individuals.


Study sample

Longitudinal height data were obtained from the Young Lives cohort study carried out in Ethiopia, India, Peru and Vietnam from 2002 to 2016. The Young Lives study is a 15-year longitudinal study of the changing nature of childhood poverty designed to collect information on children growing up in four low- and middle-income countries. The Young Lives study employed multistage sampling techniques with the first stage involving a selection of sentinel locations from each country. Sentinel site monitoring is a public health concept that entails a purposive sampling of a small number of settings that are thought to reflect a specific population or area, and then being studied uniformly at relatively wide ranges. Following that, 20 sentinel sites were selected at non-random in each study country. Following the selection of 20 sentinel sites, households with children in the appropriate age groups were chosen at random. This technique, which was implemented in 2002, resulted in the selection of 2,000 infants (ages 6 to 18 months) at random and considered as a younger cohort. Simultaneously, 1,000 older children (aged 7 to 8 years) were chosen at random in the same sites and considered as older cohort [9, 10]. Details regarding sampling and participant recruitment have been discussed in previously published works [9, 11].

The qualitative and quantitative survey data were gathered in five rounds. The first survey round was carried out in 2002 when the children were on average one year of age (younger cohort) and eight years of age (older cohort), the second survey was carried out in 2006, the third was in 2009, the fourth was in 2013, and the fifth was carried out in 2016. The anthropometric data were collected as quantitative data. Details regarding sampling and participant recruitment were discussed in previously published studies [12,13,14,15,16,17,18]. From a younger cohort, a total of 3401 males and 3200 females with measured height 5 times from ages 1 to 15 years were included in this study. The Young Lives data are publicly available and can be accessed from

Mixture models with Known group Membership

The goal of growth curve modeling is to characterize and test inter-individual variations in intra-individual change [19]. Latent growth models comprise the prediction of one latent class which is defined as a single population mean trajectory or multiple latent classes which are defined as one mean trajectory per class. Each latent group can be thought of as a collection of subjects that have similar growth trajectories that are not observed in the data [20]. In conventional growth modeling techniques, the sample is taken from a single population with a single set of growth parameters. A growth curve model can be extended to investigate variations in outcome growth among known latent multiple groups [5, 20,21,22]. Hence, multiple-group latent growth curve modeling required prior knowledge about an individual's group membership and the number of groups; for instance, gender can be divided into male group and female group, which are known and observed groups. Independent growth models are described for each group, which indicate the pattern of change, the mean change and the extent of between-group variations in the amount of change [5]. Multiple-group latent growth models [3] for g groups (g = 1, 2, …, G) can be expressed as follows:

$$y_{ti}^{{\left( {\text{g}} \right)}} = \alpha_{i}^{{\left( {\text{g}} \right)}} + \beta_{i}^{{\left( {\text{g}} \right)}} {\uplambda }_{t}^{{\left( {\text{g}} \right)}} + \varepsilon_{ti}$$


$$\alpha_{i}^{{\left( {\text{g}} \right)}} = \mu_{\alpha }^{{\left( {\text{g}} \right)}} + e_{\alpha }^{{\left( {\text{g}} \right)}}$$
$$\beta_{i}^{{\left( {\text{g}} \right)}} = \mu_{\beta }^{{\left( {\text{g}} \right)}} + e_{\beta }^{{\left( {\text{g}} \right)}}$$

Equation (1) can be expressed in the matrix form of structural equation modeling as:

$${\varvec{y}}^{{\left( {\text{g}} \right)}} = \eta^{{\left( {\text{g}} \right)}} {\Lambda }^{{\left( {\text{g}} \right)}} + \varepsilon^{{\left( {\text{g}} \right)}}$$


$${\varvec{y}}^{{\left( {\text{g}} \right)}} = \left[ {\begin{array}{*{20}c} {y_{1i}^{{\left( {\text{g}} \right)}} } \\ {y_{2i}^{{\left( {\text{g}} \right)}} } \\ {\begin{array}{*{20}c} {\begin{array}{*{20}c} {y_{3i}^{{\left( {\text{g}} \right)}} } \\ \vdots \\ \end{array} } \\ {y_{Ti}^{{\left( {\text{g}} \right)}} } \\ \end{array} } \\ \end{array} } \right],{\Lambda }^{{\left( {\text{g}} \right)}} = \left[ {\begin{array}{*{20}c} {\begin{array}{*{20}c} 1 \\ 1 \\ {\begin{array}{*{20}c} 1 \\ \vdots \\ \end{array} } \\ \end{array} } & {\begin{array}{*{20}c} 0 \\ 1 \\ {\begin{array}{*{20}c} {{\uplambda }_{3}^{{\left( {\text{g}} \right)}} } \\ \vdots \\ \end{array} } \\ \end{array} } \\ 1 & {{\uplambda }_{T}^{{\left( {\text{g}} \right)}} } \\ \end{array} } \right],\eta^{{\left( {\text{g}} \right)}} = \left[ {\begin{array}{*{20}c} {\alpha_{i}^{{\left( {\text{g}} \right)}} } \\ {\beta_{i}^{{\left( {\text{g}} \right)}} } \\ \end{array} } \right],\varepsilon^{{\left( {\text{g}} \right)}} = \left[ {\begin{array}{*{20}c} {\varepsilon_{1i}^{{\left( {\text{g}} \right)}} } \\ {\varepsilon_{2i}^{{\left( {\text{g}} \right)}} } \\ {\begin{array}{*{20}c} {\begin{array}{*{20}c} {\varepsilon_{3i}^{{\left( {\text{g}} \right)}} } \\ \vdots \\ \end{array} } \\ {\varepsilon_{Ti}^{{\left( {\text{g}} \right)}} } \\ \end{array} } \\ \end{array} } \right]$$

where \(y_{ti}\) is the measured outcome for the i-th individual at time t = 1, 2, …, T, \(\alpha_{i}^{{\left( {\text{g}} \right)}}\) and \(\beta_{i}^{{\left( {\text{g}} \right)}}\) are latent variables representing the initial and growth rate for i-th individual in g group, respectively, \(\Lambda_{t}\) represents the basis function for the slope component, and \(\varepsilon_{ti}\) is a time-specific residual. Individual variations in growth over time are allowed in this model since the growth intercept \(\alpha_{i}\) and slope \(\beta_{i}\) vary across individuals, resulting in individually varying patterns for \(y_{ti}\) over time.

Mixture models with unknown group membership

In the multiple-group modeling approach, the group membership of each individual in the sample is known or observed. In some circumstances, observations may come from multiple populations with no way of identifying in which population a given observation belongs to. When group membership of individuals is not known, mixture modeling gives a way for estimating models. It is a popular data analysis technique for detecting heterogeneity in a group or population through multiple unobserved subgroups, characterizing change over time within each unobserved subgroup and investigating variations in change across unobserved subgroups [5, 23]. This technique simultaneously classifies individuals into different groups and estimates the latent curves in each group. Individuals' latent classes can be identified using longitudinal finite mixture models [8]. In finite mixture models, the individual under study is considered to be made up of various latent classes [24]. Each class represents a set of heterogeneous observations in which factors affect the response in different ways [20]. To capture heterogeneities in the population, the model separates individuals into different latent classes, each with its functional form for the trajectory. Individuals are then assigned to classes depending on the degree of their similarity [25] using posterior probabilities. The data are used to infer posterior probability for each individual in all classifications. To put it another way, for each individual, a posterior probability is calculated for each class, and the individual is categorized into the class with the highest posterior probability [22]. The model equation [3] can be formulated as:

$$y_{ti} = \mathop \sum \limits_{{{\text{g}} = 1}}^{G} P_{i}^{{\left( {\text{g}} \right)}} \left( {\alpha_{i}^{{\left( {\text{g}} \right)}} + \beta_{i}^{{\left( {\text{g}} \right)}} {\uplambda }_{t}^{{\left( {\text{g}} \right)}} + \varepsilon_{ti} } \right)$$

where \(P_{i}^{{\left( {\text{g}} \right)}}\) is the probability that the i-th individual belongs to the g-th group such that \(P_{i}^{{\left( {\text{g}} \right)}} \ge 0\), \(\mathop \sum \limits_{{{\text{g}} = 1}}^{G} P_{i}^{{\left( {\text{g}} \right)}} = 1\), and g > 1. For continuous longitudinal outcomes, \(y_{ti}\) is normally distributed with mean \(\mu^{{\text{g}}}\) and variance–covariance \({\Sigma }^{{\text{g}}}\), \(y_{ti} \sim MVN\left( {\mu^{{\text{g}}} ,{\Sigma }^{{\text{g}}} } \right)\). There are two sources of variation in \({\Sigma }^{{\text{g}}}\): between-individual variation given by random growth parameters (the intercept and slope) (\(D^{{\text{g}}}\)) and within-individual variation given by errors (\(R^{{\text{g}}}\)) [26].

The conventional growth modeling approach is not well-suited for investigating heterogeneous change in a single population. A growth mixture model considers that the group is made up of diverse subgroups of trajectories (i.e., a “mixture” of distributions), allowing for heterogeneous trajectories to be accommodated [22]. The first procedure in fitting a growth mixture model to the data is to estimate a conventional latent growth curve model to see the assumptions of homogeneous variance with a common growth function. Fitting a latent growth curve model to the data, in this case, enables the study to analyze the overall fit of the data to a particular shape of trajectory before accounting for trajectory heterogeneity. The presence of heterogeneous trajectories, or subgroups with diverse trajectories, is suggested by poor model fit and statistically significant variances in growth parameters [22]. To compare the fit of nested models, the likelihood ratio test can be utilized. The Bayesian information criterion (BIC) is used to determine the number of groups for non-nested models [21]. The number of groups is gradually increased until a minimum BIC is achieved [27].

As depicted in Fig. 1, a growth mixture model accounts for heterogeneity by including multiple groups or latent categorical variables (C) of individual trajectories in addition to components of the conventional growth model in the same modeling approach. Class C can be observed or unobserved/latent classes which represent the latent trajectory classes that underpin the latent growth variables.

Fig. 1
figure 1

Conditional growth mixture model for five waves of data


This longitudinal study was designed to examine the group-specific trajectory in height growth from age 1 to 15 years using a mixture modeling technique. Four low- and middle-income countries were the group variable of interest of the study (Ethiopia: n = 1582, 24%, India: n = 1648, 25%, Peru: n = 1627, 24.6%, Vietnam: n = 1744, 26.4%). The summary of descriptive statistics for the endogenous variables and response frequencies for five measurement occasions is given in Table 1.

Table 1 Descriptive statistics of measurement of height for five measurement occasions by countries

The height of the children was measured in five rounds from 2002 to 2016. The first survey round was carried out in 2002 when the children were on average one year of age, the second survey was carried out in 2006, the third in 2009, the fourth in 2013, and the fifth in 2016 when the children were on average 15 years of age. The younger children were tracked from one to fifteen years, at ages 1, 5, 8, 12 and 15 years. The mean height of children given in Table 1 further revealed that in all four countries, there are increasing tendencies of mean height from age one to age 15 years. However, there were variations in mean height changes over time among the four countries. On all measurement occasions from ages 1 to 15 years, children in Vietnam had the highest mean height, and children in Peru had the second highest mean height (Table 1).

Latent basis growth mixture model with observed groups

The first step in modeling a growth mixture is to see how well a single-group basic latent growth model fits the data [6]. The basic latent growth curve model assumes the growth trajectories (Fig. 4) among children in Ethiopia, India, Peru and Vietnam differ only in their means intercepts, rate of growth or the impact of the covariates on these variables. However, the variation of intercept and slope was statistically significant (\(\psi_{\alpha \alpha } = 12.94,p < 0.0001\), \(\psi_{\beta \beta } = 0.07,p = 0.004\), respectively) and the functional form of the trajectory may also differ. The significant variation in the growth function between subjects indicates the presence of heterogeneity in longitudinal changes of height growth. To address these challenges, a single latent group was extended to observed multiple groups with considerably different growth patterns from the entire estimate. Hence, a separate functional form and data set were established for each group based on the residential country of the children.

Figure 2 depicts the observed group patterns by countries, which represent different growth trajectories in different countries. In this case, we might identify four groups of growth trajectories (Ethiopia, India, Peru and Vietnam), each with its own predicted growth parameters. As shown in Fig. 2, the individual height of children in all four countries is separately displayed with nonlinear changes over time. The minimum and maximum heights in each measurement occasion were likewise displayed in Fig. 2. Figure 3 shows the structural equation framework for multiple-group unconditional growth mixture models for these observed groups.

Fig. 2
figure 2

Growth trajectories for observed multiple groups

Fig. 3
figure 3

Multiple-group growth mixture models for height in Ethiopia, India, Peru and Vietnam

Using multiple-group models, testing for multiple-group invariance across groups was performed. For instance, M1: unconstrained model (no equality constraints imposed across the groups), M2: testing group-varying in factor loadings, M3: testing group-varying in mean intercept, M4: testing group-varying in slope component, and M5: testing group-varying random variance (\(\psi_{{\alpha_{g} }} ,\psi_{{\beta_{g} }}\)) and covariance (\(\psi_{{\alpha \beta_{g} }}\)) in latent intercept and slope components. The likelihood ratio test was used to test the invariance nature of the growth components across the G groups due to the nested link between a model with constraints and an unconstrained model. Table 2 shows the chi-square difference test statistics for these models.

Table 2 A summary of fit statistics for multiple-group invariance tests

The chi-square difference (\(\Delta x^{2}\)) between the two nested models can be used to perform a significance test to see if the more restricted model fits the same as the less restrictive model without the restrictions. A significant \(\Delta x^{2}\) indicates that the four classes differ in at least one of the growth parameters examined. Model 2 is a latent basis model in which the loading factors of \(\lambda_{1} = 0,\lambda_{5} = 14\) constraints were applied to all groups. The rest loading factors, \(\lambda_{2}\),\(\lambda_{3}\) and \(\lambda_{4}\), are unconstrained across all groups to permit different functional forms of trajectories for each group. A chi-square difference test of 458.699 with p = 0.000 in Table 2 indicates that at least one group has distinct growth patterns. Model 3 and 4 examined group-varying in mean intercept and rate of change, respectively. The small value of p = 0.000 suggests that at least one group has a different mean intercept and rate of change than other groups. The parameter estimates of these models are presented in Table 3.

Table 3 Parameter estimates of multiple-group analysis of longitudinal height data for Ethiopia, India, Peru and Vietnam

A comparison of the growth parameters across the four groups shows variations in growth patterns in height. Ethiopia captured the initial mean height of 71.38, p < 0.0001 with the rate of change 6.11, p < 0.0001. India captured the initial mean height of 72.15, p < 0.0001 with the rate of change 5.98, p < 0.0001. Peru captured the initial mean height of 71.77, p < 0.0001 with the rate of change of 6.13, p < 0.0001. Vietnam captured the initial mean height of 72.40, p < 0.0001 with the rate of change of 6.19, p < 0.0001. The variance–covariance structure varies depending on the detected group trajectories. In all groups, the intercept and slope variance were significant. The significant and positive covariance between mean intercept and slope for both Peru and Vietnam groups indicate that children who had a higher initial height tended to be growing at a faster rate. However, the covariance was not significant in the Ethiopian and Indian groups (Ethiopia: \(\psi_{\alpha \beta } = 0.061, p = 0.446, India: \psi_{\alpha \beta } = 0.116,{\text{p}} = 0.083\)).

Latent basis growth mixture model with unobserved groups

Individual categorization of trends based on prior information may result in developmental profiles being over-or under-fitted [28]. Instead, in the absence of a priori group classification, the growth mixture model provides an analytical opportunity to identify between different developmental shapes [2]. A growth mixture model can extend a multiple-group approach by introducing latent classes and a probabilistic categorization for each individual [28].

Figure 4 shows the plot of individual height measurements of the children against time for five waves of data. The visual information displayed in Fig. 4 shows that there is some indication of differences in height growth between and within the children. It also shows nonlinear growth changes in a child’s height over time. The between-individual variability in height growth was small in early childhood and increases with a child's age. Figure 4 also suggests that with a few exceptions, individuals with a lower initial height at age one also exhibited a lower height throughout the measurement occasions and other individuals who started with relatively high initial mean height also grew taller over time. This suggests that there could be latent or unobserved subgroups of individuals who show heterogeneous trajectories. For such instances, a growth mixture model is a reasonable model to investigate individuals’ growth heterogeneity by categorizing them into latent categories with more homogeneous trajectories. We analyzed height data collected from children in four low- and middle-income countries. Height measurements were taken on five occasions. For specification purposes, the first and the last measurement occasions were pre-specified to \(\lambda_{1} = 0\) and \(\lambda_{5} = 14\) factor loadings to accommodate nonlinearity trajectories.

Fig. 4
figure 4

Individuals growth trajectories across five measurement occasions

The number of latent groups in a growth mixture model is a major research subject. This is accomplished by fitting models with varying numbers of groups in a sequential manner and comparing the fitted models. Three growth mixture models were fitted along with the time-independent covariates to estimate the number of potential latent groups in the height data: a one-group model (H0: g = 1), a two-group model (H0: g = 2) and a three-group model (H0: g = 3). Information criteria–based indices were used to compare these models. Since models with distinct numbers of groups are not nested, a likelihood ratio test is not useable for model comparisons. Instead, Muthen and Muthen (2000) suggest using Bayesian Information Criteria (BIC) to choose the best number of latent groups and a model with a lower value is better-fitting [3, 21]. Table 4 shows the results of model fit indices for three different group growth mixture models. The one-group growth mixture model is the conventional latent growth curve model. The results suggest that the two-group model fits the data much better than the one-group and three-group models as it had a lower BIC value. The two-class model, which is the best-fitting model, divides the data into two groups (Fig. 5). Table 4 shows that the first and the second groups of the growth mixture model comprised 4260 (64.5%) and 2341 (35.5%) of the population, respectively. Females were estimated to comprise 30.3% and 81.6% of groups 1 and 2, respectively, and males were estimated to comprise 69.7%, and 18.4% of groups 1 and 2, respectively.

Table 4 The fit of an optimal number of groups for growth mixture models
Fig. 5
figure 5

Two-group individual growth trajectories over five time points

The evaluation of group membership predictors and their variations in growth parameters within each group is another important aspect of the growth mixture model. A latent basis growth curve model was employed as a baseline model to capture variations in height over five measurement occasions. The results of parameter estimates of the latent basis growth mixture model with two groups are shown in Table 5. The results indicate that each of the two groups exhibited distinct patterns. It can be seen that a latent basis form fits well both latent groups. Group 1 showed a significant \(\left( {\alpha_{\mu } = 72.63,p < 0.0001} \right)\) initial height measurement at age one and a significant rate of change over time \(\left( {\beta_{\mu } = 6.27,p < 0.0001} \right)\). Group 2 also showed a significant \(\left( {\alpha_{\mu } = 72.75,p < 0.0001} \right)\) initial height measurement at age one and a significant rate of change over time \(\left( {\beta_{\mu } = 5.92,p < 0.0001} \right)\). The significant intercept variance (Group 1: \(\psi_{\alpha \alpha } = 11.93,p < 0.0001\) and Group 2: \(\psi_{\alpha \alpha } = 14.23,p < 0.0001\)) and slope variance (Group 1: \(\psi_{\beta \beta } = 0.14,p < 0.0001\) and Group 2: \(\psi_{\beta \beta } = 0.13,p < 0.0001\)) indicate that there were considerable inter-individual variations in the initial level of height measurement and rate of changes across individuals, respectively. For group 1, the parameter values of slope factor loadings were estimated to be \(\lambda_{1} = 0\left( {fixed} \right),\lambda_{2} = 5.19,\lambda_{3} = 7.72,\lambda_{4} = 11.03,\lambda_{5} = 14\left( {fixed} \right)\) and for group 2 \(\lambda_{1} = 0\left( {fixed} \right),\lambda_{2} = 5.77,\lambda_{3} = 8.54,\lambda_{4} = 12.76,\lambda_{5} = 14(fixed\). The significantly different values of slope factor loadings for groups 1 and 2 indicate that latent group 1 and group 2 have different functional forms of trajectories. For group 1, the non-significant covariance between the mean intercept and rate of growth (\(\psi_{\alpha \beta } = - 0.045,p = 0.205\)) revealed that there was no association between the two growth components, while it was significant in group 2 (\(\psi_{\alpha \beta } = - 0.184,p < 0.0001\)), indicating that children with higher initial levels of mean height at age one progressed less rapidly in their growth over time than those with lower initial levels of mean height.

Table 5 Parameter estimates of 2‑group latent basis growth mixture model

The next work is to investigate and evaluate the likely differences in the covariate influences on these two group trajectories. This is accomplished by introducing group-specific time-invariant covariates, creating a conditional growth mixture model, to capture the difference in the group-specific growth factors. This conditional model entails defining analysis of continuous latent growth factors and groups of categorical latent variables on gender and residing country of children. The fitted results of the conditional growth mixture model are given in Table 5. Gender and country differences were found to be associated with the growth factors, but the relationship differed with the trajectory group. For instance, females tended to have lower growth factors compared with their male counterparts in both groups.

Compared with children from Ethiopia, children from Peru and Vietnam tended to exhibit faster growth in height over time: Group 1: (\(\beta_{Peru} = 0.04,p = 0.035,\beta_{Viet} = 0.09,p < 0.0001\)) and Group 2: (\(\beta_{Peru} = 0.07,p = 0.01,\beta_{Viet} = 0.17,p < 0.0001\)). In contrast, children from India showed a lower rate of change in both latent groups than that of children from Ethiopia: Group 1: (\(\beta_{India} = - 0.15,p < 0.0001\)) and Group 2: (\(\beta_{India} = - 0.12,p = 0.01,p < 0.0001\)). Furthermore, the findings showed that the effects of selected covariates on initial levels of mean height varied throughout latent trajectory groups.


Many analyses of longitudinal data assume that there is a single population on a single trend. Data from a single population assumed that individuals contribute comparable information about a hypothesized growth process. However, individuals may not have similar growth trajectories over time and they can be classified into various groups based on their growth trajectories. Because of the intrinsic heterogeneity found in trajectories of some individual characteristics, the assumption of a single population is not reasonable. In the current study, two mixture modeling approaches were applied to investigate growth heterogeneity in the height of children in four low- and middle-income countries. First, it is when the individual's group membership is known and second when the individual's group membership is unknown, and there are multiple groups with different growth trajectories. The latter technique permits for the evaluation of height change over time while explicitly permitting the latent growth structure to be parameterized with a specific number of groups.

The findings of the study identified that there was a substantial difference in growth parameters across the four observed groups. The differences in growth trajectories were observed among children. Subsequently, we defined an unobserved growth mixture model as an extension of a multiple-group growth model that is useful for analyzing the growth variations in multiple latent or unobserved subgroups. We performed Growth mixture models with one, two and three groups applying a latent basis growth curve model for all groups to estimate the number of potential latent groups. The BIC scores for these group models were evaluated, and the two-group latent basis mixture model was preferred for the current height data. In this model, individuals were categorized into two groups, with Group 1 accounting for 64.5 percent of the total and Group 2 accounting for 35.5 percent. Group 1 had the most male samples (69.7%), and Group 2 had the most female samples (81.6%). Two group trajectories were identified in the previous longitudinal study that employed growth mixture modeling to estimate weight-to-height trajectories in children aged 4 to 12 years [29]. The previous study has investigated sex-specific latent height class trajectories and identified the three-class model for males (low, intermediate and high) and the two-class model for females (low and high) (low and high). The study reported that in both sexes, height was significantly distinct between classes [30].

The results of the study demonstrated that the growth trajectories of children in four low- and middle-income countries were categorized into two different group trajectories. Gender and country differences were found to be associated with the growth factors, but the relationship differed with the trajectory group. Females had lower growth factors in both latent groups compared with males. In both trajectory groups, compared with children from Ethiopia, those from Peru and Vietnam grew taller more quickly over time. Children from India, on the other hand, showed a lower rate of change in both latent groups than children from Ethiopia.

When studying growth trajectories within a population, the growth mixture model provides a lot of potential for identifying group heterogeneity. Growth trajectories were found to be heterogeneous and the study identified two different group trajectories. The mean, variance and weight structures of the two groups were all different.

The limitation of the study is that it focused only on four low and middle-income countries with few covariates. As a result, further study is needed to address these limitations.


The study identified that the height of children in four low- and middle-income countries showed heterogeneous changes over time with two different groups of growth trajectories. Group 1 had the most male samples, and Group 2 had the most female samples. This study may also provide valuable insights for a better understanding of how to model growth heterogeneity in the context of mixture models.

Availability of data and materials

The datasets analyzed during the current study are available in the Young Lives study repository,


  1. Wohlwill JF. The study of behavioral development. San Diego: Academic Press; 1973.

    Google Scholar 

  2. Reinecke J, Seddig D. Growth mixture models in longitudinal research. AStA Adv Stat Anal. 2011;95(4):415–34.

    Article  Google Scholar 

  3. Bollen K, Curran P. Latent curve models: a structural equation perspective. John Wiley & Sons;2006.

  4. Kelley K. Estimating nonlinear change models in heterogeneous populations when class membership is unknown: defining and developing the latent classification differential change model (Doctoral dissertation). University of Notre Dame, Indiana;2005.

  5. Ram N, Grimm KJ. Methods and Measures: growth mixture modeling: a method for identifying differences in longitudinal change among unobserved groups. Int J Behav Dev. 2009;33(6):565–76.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Wang M, Bodner TE. Growth mixture modeling: Identifying and predicting unobserved subpopulations with longitudinal data. Organ Res Methods. 2007;10(4):635–56.

    Article  Google Scholar 

  7. Kwon JY, Sawatzky R, Baumbusch J, Lauck S, Ratner PA. Growth mixture models: a case example of the longitudinal analysis of patient-reported outcomes data captured by a clinical registry. BMC Med Res Methodol. 2021;21(1):1–11.

    Article  Google Scholar 

  8. McLachlan G, Peel D. Finite mixture models. Technometrics. New York: John Wiley & Sons;2000.

  9. Humphries DL, Dearden KA, Crookston BT, Fernald LC, Stein AD, Woldehanna T, et al. Cross-sectional and longitudinal associations between household food security and child anthropometry at ages 5 and 8 years in Ethiopia, India, Peru, and Vietnam. J Nutr. 2015;145(8):1924–33.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Lives Y. A guide to young lives research. Oxford: Young Lives; 2017.

    Google Scholar 

  11. Barnett I, Ariana P, Petrou S, Penny ME, Duc LT, Galab S, et al. Cohort profile: the young lives study. Int J Epidemiol. 2013;42(3):701–8.

    Article  PubMed  Google Scholar 

  12. Wake SK, Zewotir T, Lulu K, Fissuh YH. Longitudinal trends and determinants of stunting among children aged 1–15 years. Arch Public Heal. 2023;81(1):1–12.

    Google Scholar 

  13. Wake SK, Zewotir T, Muluneh EK. Studying latent change process in height growth of children in Ethiopia, India, Peru and Vietnam. BMC Pediatr. 2022;22(1):1–9.

    Article  Google Scholar 

  14. Wake SK, Zewotir T, Muluneh EK. Growth characteristics of four low-and middle-income countries children born just after the millennium development goals. J Biostat Epidemiol. 2021;7(2):108–19.

    Google Scholar 

  15. Wake SK, Zewotir T, Muluneh EK. Variations in physical growth trajectories among children aged 1–15 years in low and middle income countries: piecewise model approach. Malaysian J Public Health Med. 2021;21(3):200–8.

    Google Scholar 

  16. Wake SK, Baye BA, Gondol KB. Longitudinal study of growth variation and its determinants in body weight of children aged 1–15 years in Ethiopia. Iran J Pediatr. 2022;32(4): e122662.

    Article  Google Scholar 

  17. Wake SK, Zewotir T, Muluneh EK. Nonlinear physical growth of children from infancy to middle adolescence in low- and middle-income countries. J Res Health Sci. 2021;21(4): e00533.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Morrissey B, Taveras E, Allender S, Strugnell C. Sleep and obesity among children: a systematic review of multiple sleep dimensions. Pediatr Obes. 2020;15(4):1–21.

    Article  Google Scholar 

  19. Curran PJ, Obeidat K, Losardo D. Twelve frequently asked questions about growth curve modeling. J Cogn Dev. 2010;11(2):121–36.

    Article  PubMed  PubMed Central  Google Scholar 

  20. van der Nest G, Lima Passos V, Candel MJJM, van Breukelen GJP. An overview of mixture modelling for latent evolutions in longitudinal data: modelling approaches, fit statistics and software. Adv Life Course Res. 2020;43: 100323.

    Article  PubMed  Google Scholar 

  21. Muthén BO, Muthén LK. Integrating person-centered and variable-centered analyses: growth mixture modeling with latent trajectory classes. Alcohol Clin Exp Res. 2000;24(6):882–91.

    Article  PubMed  Google Scholar 

  22. Wickrama K, Lee TK, O’Neal CW, Lorenz F. Higher-order growth curves and mixture modeling with Mplus. Higher-order growth curves and mixture modeling with Mplus: Routledge; 2016.

    Book  Google Scholar 

  23. Biesanz JC, Deeb-Sossa N, Papadakis AA, Bollen KA, Curran PJ. The role of coding time in estimating and interpreting growth curve models. Psychol Methods. 2004;9(1):30–52.

    Article  PubMed  Google Scholar 

  24. Berlin KS, Williams NA, Parra GR. An introduction to latent variable mixture modeling (Part 1): overview and cross-sectional latent class and latent profile analyses. J Pediatr Psychol. 2014;39(2):174–87.

    Article  PubMed  Google Scholar 

  25. Diallo TMO, Morin AJS, Lu HZ. Performance of growth mixture models in the presence of time-varying covariates. Behav Res Methods. 2017;49(5):1951–65.

    Article  PubMed  Google Scholar 

  26. Davies CE, Glonek GFV, Giles LC. The impact of covariance misspecification in group-based trajectory models for longitudinal data with non-stationary covariance structure. Stat Methods Med Res. 2017;26(4):1982–91.

    Article  PubMed  Google Scholar 

  27. Muth´en B, Asparouhov T. Growth mixture modeling: Analysis with non-Gaussian random effects. In: Fitzmaurice G, Davidian M, Verbeke G, Molenberghs G, editors. Longitudinal data analysis: handbook of modern statistical methods book of modern statistical methods. London: Chapman & Hall/CRC.;2008.

  28. Shiyko MP, Ram N, Grimm kevin J. An overview of growth Mixture Modeling: a simple nonlinear application in OpenMx. In: Hoyle RH, editor. Handbook of structural equation modeling. New York, London;2012.

  29. Koning M, Hoekstra T, De Jong E, Visscher TLS, Seidell JC, Renders CM. Identifying developmental trajectories of body mass index in childhood using latent class growth (mixture) modelling: associations with dietary, sedentary and physical activity behaviors: a longitudinal study. BMC Public Health. 2016;16(1):1–12.

    Article  Google Scholar 

  30. Barrios PL, Garcia-Feregrino R, Rivera JA, Barraza-Villarreal A, Hernández-Cadena L, Romieu I, et al. Height trajectory during early childhood is inversely associated with fat mass in later childhood in Mexican boys. J Nutr. 2019;149(11):2011–9.

    Article  PubMed  PubMed Central  Google Scholar 

Download references


We thank the Young Lives study for giving us access to the data files.



Author information

Authors and Affiliations



SKW analyzed and drafted this manuscript. As a principal advisor for SKW, TZ designed, advised and supervised the start, the analysis and the write-up of the manuscript. EKM advised and supervised the analysis and the write-up of the manuscript. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Senahara Korsa Wake.

Ethics declarations

Ethics approval and consent to participant

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wake, S.K., Zewotir, T. & Muluneh, E.K. Analysis of heterogeneous growth changes in longitudinal height of children. J Health Popul Nutr 42, 78 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: