Diagnostic performance of ClinCheck, Dolphin Imaging, and 3D Slicer software for Bolton discrepancy analysis
To evaluate the diagnostic performance of ClinCheck, Dolphin Imaging orthodontic software, and 3D Slicer for the analysis of Bolton discrepancy (BD). Fifty-five pairs of early-stage digital models of patients treated with Invisalign were printed to measure the BD by manual method with a digital caliper (gold standard). The discrepancy values calculated by ClinCheck were obtained. In addition, the sample STL files were measured using Dolphin Imaging and 3D Slicer software to obtain BD values. To assess reliability, precision, and accuracy of the methods, intraclass correlation coefficients (ICCs), Dahlberg’s formula, paired t-tests, and the Bland-Altman method were used, respectively. Repeated-measures analysis of variance with Bonferroni post hoc test was used to assess the difference between groups. The three methods showed reliable measurements (ICC ≥ 0.7), with the values of anterior Bolton slightly higher than overall Bolton. Measurements for the anterior Bolton showed higher precision (Dahlberg’s formula 0.65, 0.70, and 0.55) than those for the overall Bolton. For anterior Bolton, only the measurements obtained by ClinCheck and Dolphin Imaging were accurate (P > .05, no proportion bias), while for overall Bolton, all groups had a significant difference. The Bland-Altman plots demonstrated no consistency for anterior Bolton measurements when 3D Slicer was used and for the overall Bolton. ClinCheck and Dolphin Imaging showed accuracy to quantify anterior BD. For the overall Bolton measurements, ClinCheck showed a statistical difference from the manual assessment but without relevant clinical significance.ABSTRACT
Objectives
Materials and Methods
Results
Conclusions
INTRODUCTION
To produce Invisalign clear aligners, there is a digital workflow in which the interaction between orthodontists and Align Technology takes place through a virtual tool, ClinCheck software (Align Technology, San Jose, Calif).1 This platform displays virtual patient models with predictions of sequential treatment steps to be performed with the aligners. Therefore, many studies have been carried out to establish validity between predictability and the clinical results obtained after treatment completion, while Invisalign becomes increasingly used widely among professionals.2–5
Among the information provided by ClinCheck, the Bolton discrepancy (BD) is displayed. This analysis identifies the proportions of tooth size between the upper and lower arches6 and is directly related to the amount of interproximal reduction (IPR) or resin additions that are indicated to achieve ideal occlusion. This analysis can be performed digitally using virtual three-dimensional (3D) models, or it can be assessed by the conventional technique initially described by Bolton, using physical models and calipers.7,8 For clear aligners, correct measurement of mesiodistal diameter of the teeth is even more important because it provides for the creation of adequate space to achieve desired tooth movements.9 Also, it is related to the correct indication of IPR to obtain space when necessary,10,11 minimizing iatrogenic effects in treatment with clear aligners.
Information is lacking on how the BD analysis is obtained through the ClinCheck tool. Therefore, the aim of the present study was to evaluate the diagnostic performance of ClinCheck as well as Dolphin Imaging (Dolphin Imaging and Management Solutions, Chatsworth, Calif) and 3D Slicer software (Kitware, Clifton Park, NY) to assess BD.
MATERIALS AND METHODS
The study was approved by the Research Ethics Committee of the Clementino Fraga Filho University Hospital (CEP-HUCFF) of the Federal University of Rio de Janeiro (UFRJ) under No. 5.202.503.
Sample Selection
A priori sample size calculations were performed using data from a pilot study (n = 10), following two approaches: (1) calculation for comparison of anterior and overall BDs in millimeters between two groups using the paired t-test (conventional evaluation vs ClinCheck), and (2) calculation for comparison of the same outcomes among three groups using within-subjects repeated-measures analysis of variance (ANOVA; ClinCheck vs Dolphin Imaging vs 3D Slicer). The estimation that resulted in a larger sample size was used for the present study (conventional evaluation vs. ClinCheck for overall Bolton; mean difference = −0.520 ± 1.265 mm). The parameters established for the calculation were effect size dz = −0.411, a = 0.05, and power = 0.8. The estimation resulted in a total of 49 models, rounded to 55, considering possible losses during the study. Sample size estimates were performed with G*Power software version 3.1.9.6 (Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany).
The sample comprised 55 patients aged 18–60 years, with no sex predilection, who were treated with Invisalign. Selection was based on the following inclusion criteria: complete permanent dentition; Class I, II, or III molar relationship; and lower crowding of up to 6 mm. Cases that had teeth with anatomic anomalies, occlusal reduction, as well as individuals with other orofacial malformations were excluded. After selection, the dental arches were scanned using an iTero Element 2 scanner.
The STL files in were prepared using the Meshmixer software version 3.5.474 (Autodesk, Inc., San Rafael, Calif). The digital models were printed with Anycubic Photon Mono SE 3D printer (Shenzhen, China), and the BD analysis was performed.
Bolton Discrepancy Assessment
Measurements of the mesiodistal diameter of the teeth were performed using a digital caliper (Mitutoyo©, Suzano, São Paulo, Brazil) on the 3D-printed models by the same previously calibrated operator. Teeth were measured from the first molar on one side to the first molar on the opposite side, ensuring the 12 teeth of each dental arch were measured. Then the BD analysis6 was performed for the anterior region (considering incisors and canines: 3-3) and overall area (first molar to first molar: 6-6), in millimeters.
For the ClinCheck tool, the values and location referring to the BD were identified in the software and tabulated.
Finally, the BD analysis was performed using the Dolphin Imaging orthodontic software version 11.95 Premium and 3D Slicer version 5.1.0. For each software program, the STL digital models were exported, and the same Bolton measurement methodology was performed.
Method Error
Twenty percent of the sample was randomly selected to assess the method error. Tooth measurements and subsequent calculation of the anterior and overall BDs were performed on the selected models two times with an interval of at least 2 weeks. The intraclass correlation coefficient (ICC) and the Bland-Altman method (proportion bias) were used to assess the reliability and accuracy of measurements reported in millimeters.
Data Analysis
The distribution and normality of the data for the anterior and overall BDs (in millimeters) were evaluated through the Shapiro-Wilk test, histograms, and Q-Q plots.
ICC was used to assess reliability by evaluating the degree of agreement between measurements at two times or using two methods. Random and systematic errors were assessed to determine validity of the methods. Dahlberg’s formula was used to measure the random error (precision), that is, how closely the values from two measurements were. The paired t-test was used to detect significant differences between measurements using different methods. The Bland-Altman method was also used since, in addition to determining the random error, it assessed the presence of systematic error by estimating the proportion bias with its corresponding 95% confidence interval. Assessments of the BD using the conventional method were used as the reference (gold standard) for all evaluations.
Additionally, the differences in discrepancy values in millimeters between each digital method and the conventional evaluation were independently calculated. Subsequently, these differences were compared using repeated-measures ANOVA. To detect which groups were different, post hoc pairwise comparisons were performed using the Bonferroni test.
All analyses were performed in Jamovi software version 2.0 (Sydney, Australia), adopting a significance level of 5%.
RESULTS
Method Error
Measurements of BDs in millimeters using the conventional method, Dolphin Imaging software, and 3D Slicer were reliable and accurate (Table 1). The ICC for repeated measures of the anterior and overall Bolton ranged from 0.86 to 0.96 and from 0.80 to 0.91, respectively.

Main Findings
Results of the reliability, precision, and accuracy assessments of the measurements of BD in millimeters are described in Table 2. The three methods showed reliable measurements (ICC ≥ 0.7). Those obtained for the anterior Bolton analysis were slightly more reliable than those obtained for the overall Bolton. Similarly, measurements for the anterior Bolton showed higher precision compared with the overall Bolton. Although the methods showed good ICC values (high correlation between assessments), only the measurements obtained for the anterior Bolton using the ClinCheck tool and Dolphin Imaging were accurate (paired t-test P > .05; no proportion bias). These methods had a precision of 0.65 mm and 0.70 mm, respectively. All the other measurements were inaccurate, evidencing proportion bias.

The Bland-Altman plots showed that the measures for the anterior Bolton in 3D Slicer and for the overall Bolton using the three software programs were scattered widely, above zero, demonstrating no consistency with measurements of the conventional method and tending either to (1) decrease the excess in the mandibular arch; (2) increase the excess in the maxillary arch; or (3) in case of minor discrepancies, indicate that the excess was in the upper arch when it was in the lower arch (Figure 1). When the mean differences between each of the tested methods and the conventional method were compared, no significant differences were observed for the anterior Bolton between the ClinCheck and Dolphin Imaging assessments (P > .999), while measurements using 3D Slicer software did show differences with both (P < .01; Figure 2). When the overall Bolton was analyzed, no significant difference was observed among the groups (Figure 2).



Citation: The Angle Orthodontist 95, 1; 10.2319/022724-156.1



Citation: The Angle Orthodontist 95, 1; 10.2319/022724-156.1
DISCUSSION
Success with Invisalign occurs when the occlusal results predicted by ClinCheck are consistent with those obtained during actual treatment.2 Thus, some parameters of this tool have been studied, such as the predictability of overbite correction,12,13 transverse changes in the arch,14,15 accuracy of measurements performed in the anterior region such as arch length, intercanine distance, overjet, overbite, dental midline shift,1 and accuracy of tooth movements predicted by this system.5,16,17 However, information is lacking on the validity of BD data indicated by ClinCheck, and considering its importance to diagnosis and orthodontic planning procedures, the present study was undertaken.
The gold standard reference for comparison was measurements performed on printed models using stereolithographic technology since the prototype models have demonstrated similar precision and clinical application to plaster models.18,19 The prototypes were printed with a layer thickness of 50 µm for accuracy.20 The models were arranged at 0° to the printing platform, minimizing the expansion effect in the terminal region of the arch.21
The results indicated that, although the measurements obtained were reliable, only the ClinCheck tool and Dolphin Imaging software showed accuracy for BD measurements performed in the anterior region. Authors of two previous studies investigated the diagnostic performance of ClinCheck compared with manual methods.9,22
Shailendran et al9 performed segmentation of the teeth to measure their mesiodistal diameter and assess BD. The findings indicated that ClinCheck tended to underestimate tooth width measurements by an average of 0.36 mm, while for molars, the average was 0.9 mm. As ClinCheck accuracy decreased as the measurements became farther posterior in the arch, a greater probability of underestimation was found in these areas, while more accurate values were found in the anteroinferior region,9 a pattern that was also observed in the current study.
Martin et al.22 compared the values indicated by ClinCheck software with measurements performed on plaster models and found higher values provided by the software, with the largest measurement errors also in the molar region. The measurement methodology was like that of the present study, in which the actual position of the teeth in the arch was considered, along with crowding, inclination, and existing dental rotation. Such factors present increased difficulty for accurate measurement of BD23 but also represent the classic evaluations performed on plaster models, developed by Bolton himself in 1962.6
Concerning overall Bolton assessment, ClinCheck, Dolphin Imaging, and 3D Slicer showed moderate reliability, but the values were significantly different than the measurements performed on the printed models, demonstrating that the measurements performed were not accurate. However, the variation was less than 1.5 mm which, according to Proffit et al,24 would have little clinical implication.
The current findings were like those of previous studies, in which authors observed satisfactory agreement between the Bolton analysis for anterior and overall proportions when comparing measurements in 3D models using software and plaster models. In those studies, the digital models were obtained by digitalizing plaster models and not with intraoral scanning, as performed in the present study.25–28 Regarding ClinCheck, the correlation found in the study by Shailendran et al9 was moderate, with an ICC of 0.83 for overall Bolton results, like that found in the present study.
Some explanation may exist as to why the reproducibility of overall Bolton measurements was not better. Acquisition of digital measures may be easier29 and require less time than manual methods,25,26 but divergence or imprecision may occur in marking the reference points of the measurements, even when performed by the same operator over a short time interval.23,27 This limitation may be related to the 3D nature of the models which is viewed two-dimensionally,29 making it difficult to identify points, axes, and planes by the software.23,26,30 For Dolphin Imaging, 3D features are lost since the software uses a static image of the dental arch in the occlusal view to mark the points.
Another factor would be the difficulty of scanning to faithfully reproduce posterior teeth due to patient-related factors, such as salivation.31 The software fills in such regions through a mathematical algorithm, leading to changes in tooth shape that generate measurement inaccuracies.1
Finally, inherent difficulties may exist in measuring posterior regions using scanning technology. Authors of studies comparing sizes in digital vs conventional models have indicated that, despite having high reproducibility, digital models tended to decrease the intermolar width and increase the palatal depth.23 This systematic error in the reproduction of the molar surface was also described by Zilberman et al.30 For the ClinCheck tool, authors of studies indicated that the software overestimated the alignment and posterior occlusal relationships obtained after treatment.2 Further studies are needed regarding the dental information provided by ClinCheck, especially pertaining to the posterior region of the arch.
Considering common clinical application, evidence of the accuracy of the anterior Bolton values using ClinCheck and Dolphin Imaging is encouraging since intervention in this region is more frequent, either by IPR in cases of excess or by addition of resin or by restoration to modify atypical shape of teeth to achieve proper occlusion. However, in the posterior region, IPR is less often indicated because it may increase the chance of food impaction and caries risk but may still be indicated to alter abnormal shape. It is recommended that some caution may be appropriate when assessing and interpreting BD data obtained through orthodontic software.
CONCLUSIONS
-
ClinCheck and Dolphin Imaging software demonstrated precision and accuracy for anterior BD analysis measurements, demonstrating validity to be used during orthodontic diagnosis.
-
During analysis of overall Bolton relationships, ClinCheck, Dolphin Imaging, and 3D Slicer showed more variability and tended to decrease excess in the lower arch and increase excess tooth size in the upper arch. However, the variations shown did not have clinically relevant implications.

Distribution of measurements after calculating the anterior and overall Bolton discrepancy (BD) by the three methods.

Comparison between the mean differences found between measurements using each of the three tested software methods and the conventional method for anterior and overall Bolton discrepancy (BD).
Contributor Notes