Editorial Type:
Article Category: Research Article
 | 
Online Publication Date: 13 Jan 2025

Craniofacial growth prediction models based on cephalometric landmarks in Korean and American children

,
,
,
,
, and
Page Range: 219 – 226
DOI: 10.2319/052324-399.1
Save
Download PDF

ABSTRACT

Objectives

To compare differences in craniofacial growth prediction results for Korean and American children according to growth prediction models developed using Korean and American longitudinal growth data.

Materials and Methods

Growth prediction models based on cephalometric landmarks were built for each population using longitudinally taken lateral cephalograms of Korean children and American children of northern European origin. The sample sizes of the serial datasets were 679 and 1257 for Korean and American children, respectively. On each cephalogram, 78 cephalometric landmarks were identified. The prediction models were based on the partial least squares method with 160 input and 154 output variables. For each group, growth was predicted by applying the prediction models developed using data from the same and different populations. The growth prediction results were compared and analyzed.

Results

The growth prediction results obtained with the prediction model developed using data from the same population were more accurate (P < .0001). The results distinctively visualized the discrepancies in the growth prediction results if different population types were not considered.

Conclusions

Applying a growth prediction model generated using data from the same population may be desirable.

INTRODUCTION

Anatomical variability can exist between different populations around the world. If different ethnic types are all treated the same, a serious error could be made.1 The issue of craniofacial growth in differing ethnicities may be of interest in diverse countries, particularly where heterogeneity of ethnicities exists, such as in the US.

Evaluation and prediction of craniofacial growth are of great importance in orthodontics. However, this topic is only considered in a few publications.2–4 Since growth is influenced by genetic and circumstantial factors, various growth evaluation materials and methods have been tailored to specific populations.2–7 While the importance of considering population differences in predicting growth has been emphasized traditionally, the consequences of overlooking these differences in a growing child have never been provided or visualized.

A major obstacle when conducting growth studies may be the need to collect longitudinal serial growth data meticulously. Due to funding and ethical reasons, it has become challenging to take repeated photographic and radiographic records of children regularly. Recently, Moon et al.2,3 reported individualized craniofacial growth prediction models using longitudinal serial cephalograms of 410 Korean children. In addition, Roseth4 developed growth prediction models using longitudinal serial growth datasets of American children of northern European origin collected by the Mathews Growth Study. These two datasets provide an opportunity for comparison of how different ethnicities affect growth prediction results.

Development of a growth prediction model requires a sophisticated method that can handle multiple input and output variables that are significantly correlated. The partial least squares (PLS) method is a machine learning method with the advantages of multivariate multiple linear regression and principal component analysis through dimensional reduction and modeling with latent variables.8,9 Since data characteristics commonly used in orthodontic research typically include relatively large numbers of skeletal, dental, and soft tissue variables that are correlated to each other, PLS has mostly been applied for predicting soft tissue changes resulting from combined surgical-orthodontic treatment,10–14 orthodontic treatment outcomes,15 and in predicting craniofacial growth,2,3 where multiple input and output variables are prevalent. One of the significant advantages of the PLS algorithm is its ability to account for nearly all skeletal and soft tissue cephalometric variables for each individual subject. This capability is achieved through linear combination and iterative computations involving the topology of 78 cephalometric landmarks during the PLS algorithm procedures. In other words, even if specific vertical and anteroposterior cephalometric variables are not explicitly included among the input variables, the algorithm still effectively considers factors such as whether an individual subject has a hypodivergent or hyperdivergent facial growth pattern or Class II or III relationship, among others.8,9

The purpose of this study was to compare craniofacial growth prediction results for Korean and American children according to growth prediction models developed using Korean and American longitudinal growth data, by applying the PLS method.

MATERIALS AND METHODS

The Institutional Review Board for the Protection of Human Subjects of the Seoul National University School of Dentistry reviewed and approved the data collection protocol (S-D20240019). The University of the Pacific (UOP) Human Subjects Protection Office of Research and Sponsored Programs approved the research protocol, and the project received an exempt review (UOP IRB 2023-28).

Longitudinal Serial Growth Data Collection From Two Different Population Data Sources

The longitudinal serial growth datasets of Korean children were presented by Moon et al.2 The study included 410 children, comprising 236 girls and 174 boys, with an average age of 11.5 years. The children were selected through the collection and collation of dental records from 25,810 patients who sought orthodontic treatment between January 2002 and December 2022 at the Department of Orthodontics, Seoul National University Dental Hospital. These children did not undergo any orthodontic or orthopedic treatment but had serial cephalograms taken. Among the 410 subjects, 90 (22%) subjects had cephalograms taken more than three times. Consequently, 935 cephalograms were taken for 679 serial growth datasets. On each of the 935 images, 78 cephalometric landmarks, consisting of 46 hard tissue and 32 soft tissue landmarks, were manually identified by a single examiner (SJL).

The longitudinal serial growth datasets of American children were collected from the American Association of Orthodontist Foundation Craniofacial Growth Legacy Collection homepage. Among the various datasets, lateral cephalograms collected by the UOP Mathews Growth Study were used. The American children were of northern European origin. On the 296 lateral cephalogram images from 33 subjects, comprising 21 girls and 12 boys aged 7.4–19.8 years, 78 cephalometric landmarks were manually identified by the same examiner (SJL). These UOP Mathews Growth Study data included 1257 serial growth datasets, which is twice as much data as the 679 Korean serial growth datasets. Table 1 summarizes the characteristics of the Korean and American longitudinal serial growth data.

Table 1. Characteristics of the Korean and American Subjects Whose Longitudinal Serial Craniofacial Growth Data Were Used in the Present Study
Table 1.

Development and Application of Two Different Growth Prediction Models

The input variables included sex, age when the observation started, age when the growth observation ended, Angle classification, and 154 x and y coordinates of 78 cephalometric landmarks. The output variables were 154 x and y coordinates of 78 cephalometric landmarks after the growth observation period. The landmark information can be found elsewhere.11,16

When the 78 landmarks were manually identified twice by the same examiner and another examiner on 283 validation images, the intraexaminer and interexaminer reliability measures, in terms of the intraclass correlation coefficient, were 0.9944 and 0.9823, respectively.

The Cartesian coordinate system was used, with its origin (0,0) at Sella. The horizontal reference plane was established with a line 7° downward from the Sella-Nasion line by applying an automated superimposition method suggested by Moon et al.17,18

Two growth prediction models were developed based on Korean and American data. When developing the prediction models, to avoid overfitting, the PLS method with 30 PLS components was applied.

After developing the prediction models, the growth of each individual was predicted by applying the two models developed using Korean and American data. When testing and validating the prediction results, the leave-one-out cross-validation method was applied because it is particularly useful in clinical studies.12,19

Comparison of Growth Prediction Accuracy

To assess how accurate the growth predictions were, the prediction error was defined as the difference between the actual growth and the predicted position of each cephalometric landmark. Prediction errors were compared in the following three ways:

  1. The mean radial error (MRE) or Euclidean distance in millimeters between the actual growth and the predicted position of each cephalometric landmark was calculated. MRE has been widely used to indicate reliability and as a measure of the magnitude of errors.16,20–22 To compare MRE between the prediction methods, the paired t-test with the Bonferroni correction of α values was conducted.

  2. MRE always has a positive value, and therefore, its comparisons cannot provide information regarding bias and error patterns. To visualize the direction, degree, and pattern of errors, scatter plots with 95% confidence ellipses were depicted. Mathematically, a 95% confidence ellipse follows the χ2 distribution and is particularly helpful for visualizing multidimensional errors and reliability measures.23,24

  3. A certain predicted landmark position might show considerable deviation in terms of MRE and/or in scatter plots. However, it might not necessarily indicate an error if the predicted position is located within the profile curves.25 To provide a more realistic presentation of results, 32 soft tissue landmarks were connected from the forehead to the terminal point below the neck using natural cubic spline functions and overlaid on cephalometric images of example subjects using Python programming (Python Software Foundation, Wilmington, Del).

All statistical analyses were performed using R (Vienna, Austria),26 and the significance level was set at 0.05.

RESULTS

The growth prediction errors for Korean and American subjects with prediction models developed using data from the same and different populations are summarized in Table 2. In general, the growth prediction results of the prediction model developed using data from the same population were more accurate than those of the prediction model developed using data from a different population (P < .0001).

Table 2. Growth Prediction Errors of Korean and American Subjects With Prediction Models Developed Using Data From the Same and Different Populationsa
Table 2.

Figure 1 illustrates several representative scatter plots of prediction errors for the posterior nasal spine, nasal bone tip, nose tip (pronasale), and upper lip. In general, the 95% confidence ellipses of the growth prediction errors for Korean children were larger than those of the growth prediction errors for American children, indicating that the prediction results were more accurate in American children than in Korean children.

Figure 1.Figure 1.Figure 1.
Figure 1. Scatterplots representing the patterns of growth prediction errors.

Citation: The Angle Orthodontist 95, 2; 10.2319/052324-399.1

In general, the prediction errors were greater when growth of Korean children was predicted by the prediction model developed using American data than when growth of American children was predicted by the prediction model developed using Korean data. When Korean children’s growth was predicted by the prediction model developed using American data, the bias indicated by the black circle was positioned downward and anteriorly, which implied that craniofacial growth was overestimated. On the contrary, when American children’s growth was predicted by the prediction model developed using Korean data, the bias marked by the red diamond was located in the three-quarter plane, which implied that craniofacial growth was underestimated (Figure 1).

Real-case examples comparing real growth with the prediction results generated by prediction models developed using data from the same and different populations are shown in Figure 2. The prediction results generated by prediction models developed using data from the same population (blue dotted lines) were more accurate and closer to the actual growth than the prediction results generated by prediction models developed using data from a different population (red dotted lines). The red dotted lines show that the growth of American children was underestimated, and the growth of Korean children was overestimated.

Figure 2.Figure 2.Figure 2.
Figure 2. Comparison of actual growth and prediction results.

Citation: The Angle Orthodontist 95, 2; 10.2319/052324-399.1

DISCUSSION

In the present study, we compare growth prediction results generated by prediction models developed using data from the same and different populations. Our specific aim was to highlight the importance of considering population differences when predicting craniofacial growth. The growth prediction results for Korean children were more accurate when the prediction model developed using Korean data was applied than when the prediction model developed using American data was applied. Likewise, the growth prediction results for American children were more accurate when the prediction model developed using American data was applied than when the prediction model developed using Korean data was applied. These results were unsurprising. Nonetheless, we are the first to confirm and visualize the discrepancies in craniofacial growth prediction results when ethnic differences are ignored or neglected.

The prediction model developed using American data yielded more accurate results than that developed using Korean data. The prediction model developed using Korean data underestimated the growth of American children, specifically in the upper-posterior direction. However, an ethnic difference should be inferred with caution because unknown or confounding factors may exist. For example, the results might have been influenced by the data characteristics. PLS is a machine-learning method, and its accuracy is dependent on the number of training or learning sample sizes.27,28 The American data were collected from 33 children, while the Korean data were collected from 410 children (Table 1). In contrast, 1257 American datasets and 679 Korean datasets were collected. In other words, the American data were more homogeneous and cohesive, with a greater sample size than the Korean data, which included many more subjects and had a smaller sample size, thereby increasing heterogeneity. Recently, Lee et al.27 investigated factors influencing the performance of artificial intelligence and found that growth prediction errors increased with an increasing number of subjects.

In addition, 51.5% and 48.5% of American subjects had Class I and II malocclusion, respectively, and none had Class III malocclusion. However, 43.2% and 34.1% of Korean subjects had Class II and III malocclusion, respectively. Authors of epidemiological studies have consistently shown that Koreans have significantly higher rates of Class III malocclusion than other ethnicities.29,30 Authors of previous studies have reported that children with Class III malocclusion showed greater growth prediction errors than children with Class II malocclusions.2,3

The two population datasets exhibited significant disparities in sample sizes, particularly regarding the number of subjects, longitudinal serial datasets, malocclusion proportions, and age range of the subjects. For example, the ages (minimum–maximum years) were 3.4–31.3 and 7.4–19.8 for Koreans and Americans, respectively. The difference in prediction accuracy might have partly originated from such subject characteristics, ie, a possible influence of malocclusion proportions and the subject age range on growth prediction accuracy.

The two-dimensional scatter plots for selected landmarks in Figure 1 show that the growth prediction errors increased when the landmark position was further away from the cranial base, and soft tissue landmarks demonstrated greater prediction errors than hard tissue landmarks. In addition, girls and subjects with Class II malocclusion had more accurate growth prediction results than boys and subjects with Class I malocclusion. All these results were in line with the growth prediction results of Moon et al.2,3 Determining how much latent growth exists for patients with Class II and III malocclusions employing different cephalometric values for diverse ethnic groups is critical for determining an appropriate treatment plan. A decade ago, Haskell and Segal1 strongly recommended the application of computer programs that can visualize treatment objectives. The drawn illustrations of growth prediction results, as shown in Figure 2, might help clinicians offer treatment choices for a growing child.

The results of the present study do not imply that craniofacial growth prediction results could be perfect or satisfactory. Nonetheless, some methods might be better than others. It is advisable to apply a growth prediction model developed using data from the same population that represents a specific ethnicity.

CONCLUSIONS

  • This study confirmed and highlighted the importance of considering population differences when predicting craniofacial growth. Although growth cannot be perfectly predicted, some methods may be better than others. Therefore, it may be desirable to apply a growth prediction model generated using data from the same population.

ACKNOWLEDGMENTS

Some of the data presented in the current study were included as part of a master’s thesis (Jeffrey Roseth). This research was partly supported by a grant from the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (Grant No. HI22C1518).

DISCLOSURE

The authors report no conflicts of interest.

REFERENCES

  • 1.
    Haskell BS, Segal ES. Ethnic and ethical challenges in treatment planning: dealing with diversity in the 21st century. Angle Orthod. 2014;84:380382.
  • 2.
    Moon JH, Shin HK, Lee JM, et al. Comparison of individualized facial growth prediction models based on the partial least squares and artificial intelligence. Angle Orthod. 2024;94:207215.
  • 3.
    Moon JH, Kim MG, Hwang HW, Cho SJ, Donatelli RE, Lee SJ. Evaluation of an individualized facial growth prediction model based on the multivariate partial least squares method. Angle Orthod. 2022;92:705713.
  • 4.
    Roseth J. Comparison of Individualized Facial Growth Prediction Models Based on Artificial Intelligence and Partial Least Squares—With Longitudinal Growth Data From Mathews Growth Collection [thesis].
    San Francisco, Calif
    :
    University of the Pacific Arthur A. Dugoni School of Dentistry
    ; 2024.
  • 5.
    Lee SJ, An H, Ahn SJ, Kim YH, Pak S, Lee JW. Early stature prediction method using stature growth parameters. Ann Hum Biol. 2008;35:509517.
  • 6.
    Lee SJ, Kim YJ, Kim TW, Ahn SJ. New evaluation chart of stature and weight for Koreans. Korean J Orthod. 2006;36:153160.
  • 7.
    Lee YS, Lee SJ, An H, Donatelli RE, Kim SH. Do Class III patients have a different growth spurt than the general population? Am J Orthod Dentofacial Orthop. 2012;142:679689.
  • 8.
    Yu W, Lee SJ, Cho H. Partial least squares regression trees for multivariate response data with multicollinear predictors. IEEE Access. 2024;12:3663636644.
  • 9.
    Kim K, Lee SJ, Eo SH, Cho SJ, Lee JW. Modified partial least squares method implementing mixed-effect model. Commun Stat Appl Methods. 2023;30:6573.
  • 10.
    Suh HY, Lee SJ, Lee YS, et al. A more accurate method of predicting soft tissue changes after mandibular setback surgery. J Oral Maxillofac Surg. 2012;70:e553e562.
  • 11.
    Suh HY, Lee HJ, Lee YS, Eo SH, Donatelli RE, Lee SJ. Predicting soft tissue changes after orthognathic surgery: the sparse partial least squares method. Angle Orthod. 2019;89:910916.
  • 12.
    Yoon KS, Lee HJ, Lee SJ, Donatelli RE. Testing a better method of predicting postsurgery soft tissue response in Class II patients: a prospective study and validity assessment. Angle Orthod. 2015;85:597603.
  • 13.
    Lee YS, Suh HY, Lee SJ, Donatelli RE. A more accurate soft-tissue prediction model for Class III 2-jaw surgeries. Am J Orthod Dentofacial Orthop. 2014;146:724733.
  • 14.
    Lee HJ, Suh HY, Lee YS, et al. A better statistical method of predicting postsurgery soft tissue response in Class II patients. Angle Orthod. 2014;84:322328.
  • 15.
    Cho SJ, Moon JH, Ko DY, et al. Orthodontic treatment outcome predictive performance differences between artificial intelligence and conventional methods. Angle Orthod. 2024;94:557565.
  • 16.
    Hwang HW, Park JH, Moon JH, et al. Automated identification of cephalometric landmarks: part 2—might it be better than human? Angle Orthod. 2020;90:6976.
  • 17.
    Kim MG, Moon JH, Hwang HW, Cho SJ, Donatelli RE, Lee SJ. Evaluation of an automated superimposition method based on multiple landmarks for growing patients. Angle Orthod. 2022;92:226232.
  • 18.
    Moon JH, Kim MG, Cho SJ, et al. Evaluation of automated photograph-cephalogram image integration using artificial intelligence models. Angle Orthod. 2024;94:595601.
  • 19.
    Donatelli RE, Lee SJ. How to test validity in orthodontic research: a mixed dentition analysis example. Am J Orthod Dentofacial Orthop. 2015;147:272279.
  • 20.
    Hwang HW, Moon JH, Kim MG, Donatelli RE, Lee SJ. Evaluation of automated cephalometric analysis based on the latest deep learning method. Angle Orthod. 2021;91:329335.
  • 21.
    Donatelli RE, Lee SJ. How to report reliability in orthodontic research: part 1. Am J Orthod Dentofacial Orthop. 2013;144:156161.
  • 22.
    Donatelli RE, Lee SJ. How to report reliability in orthodontic research: part 2. Am J Orthod Dentofacial Orthop. 2013;144:315318.
  • 23.
    Moon JH, Lee JM, Park JA, Suh H, Lee SJ. Reliability statistics every orthodontist should know. Semin Orthod. 2024;30:4549.
  • 24.
    Donatelli RE, Park JA, Alghamdi YMA, Pandis N, Lee SJ. Assessment of reliability in orthodontic literature. Angle Orthod. 2022;92:409414.
  • 25.
    Park JA, Moon JH, Lee JM, et al. Does artificial intelligence predict orthognathic surgical outcomes better than conventional linear regression methods? Angle Orthod. 2024;94:549556.
  • 26.
    R Core Team . R: A Language and Environment for Statistical Computing.
    Vienna, Austria
    :
    R Foundation for Statistical Computing
    ; 2024. https://www.R-project.org/
  • 27.
    Lee JM, Moon JH, Park JA, Kim JH, Lee SJ. Factors influencing the development of artificial intelligence in orthodontics. Orthod Craniofac Res. 2024;27 Suppl 2:612.
  • 28.
    Moon JH, Hwang HW, Yu Y, Kim MG, Donatelli RE, Lee SJ. How much deep learning is enough for automatic identification to be reliable? A cephalometric example. Angle Orthod. 2020;90:823830.
  • 29.
    Dehesa-Santos A, Park JA, Lee SJ, Iglesias-Linares A. East Asian and southern European craniofacial Class III phenotype: two sides of the same coin? Clin Oral Investig. 2024;28:84.
  • 30.
    Donatelli RE, Park JA, Mathews SM, Lee SJ. Time series analysis. Am J Orthod Dentofacial Orthop. 2022;161:605608.
Copyright: © 2025 by The EH Angle Education and Research Foundation, Inc.
Figure 1.
Figure 1.

Scatterplots representing the patterns of growth prediction errors.


Figure 2.
Figure 2.

Comparison of actual growth and prediction results.


Contributor Notes

Graduate Student (Ph.D), Seoul National University, Seoul, Korea.
Private Practice, Cheonan, Korea.
Resident, Department of Orthodontics, Arthur A. Dugoni School of Dentistry, University of the Pacific, San Francisco, CA, USA.
Assistant Professor, Department of Orthodontics, Arthur A. Dugoni School of Dentistry, University of the Pacific, San Francisco, CA, USA.
Professor and Chair, Department of Orthodontics, Arthur A. Dugoni School of Dentistry, University of the Pacific, San Francisco, CA, USA.
Professor, Department of Orthodontics and Dental Research Institute, Seoul National University School of Dentistry, Seoul, Korea.

The first two authors contributed equally to this work.

Corresponding author: Dr Shin-Jae Lee, Professor, Department of Orthodontics and Dental Research Institute, Seoul National University School of Dentistry, 101 Daehakro, Jongro-Gu, Seoul 03080, Korea (e-mail: nonext.shinjae@gmail.com)
Received: 23 May 2024
Accepted: 05 Dec 2024
  • Download PDF