AI in Dental Imaging - What it can (and can't do)

Artificial intelligence has been exploding and integrating in all parts of life including in dentistry. It has applications in various areas such as radiographic interpretation, treatment planning, intraoral scanning/prosthetic design, practice management/operations, and patient communication.

Demystifying AI in Dentistry

As a dentist, or even as part of the public, AI is likely being marketed to you constantly as something magical that will make your life easier and more efficient. But how does it actually work? And what can it actually do? This article will explain:

  • What deep learning is
  • Why training data matters
  • Limitations of models
  • Use of AI as a helpful supportive tool, not a replacement

Deep Learning

Deep learning is a type of machine learning (which is a subset of AI) that uses massive unstructured datasets and uses pattern recognition. When you interpret a radiograph, what do you do? We look for patterns in radiolucencies, radio-opacities, etc. based on both experience and repetition. Deep learning seeks to do the same with mathematics. A neural network similarly also looks for relationships between the input images (e.g. pixels) and the output labels (e.g. caries present vs caries absent). Based on the training data, it analyzes thousands or millions of images (labelled as caries present vs caries absent) and it adjusts internal parameters until it becomes better at predicting the correct label. A Convolutional Neural Network (CNN) is designed to detect spatial patterns in images such as edges, shapes, and textures. It is crucial to clarify that the CNN has no idea what a lesion actually is. It detects statistical patterns of pixel intensity and spatial arrangement that were associated with lesions in the training data.

The model doesn’t know what decay is, what enamel is, or how caries is formed. It is unable to reason clinically. It can only find correlation between patterns in pixels and labels. For example, you could show someone 10,000 bitewings labeled “caries” or “no caries”. Over time, the person would likely get better at determining caries from recognizing patterns but he/she would still have a lack a fundamental understanding of caries.

Deep learning is powerful because it can help quantify subtle patterns that we can find hard to explain. However, they function on pure pattern recognition which is important to consider when we evaluate the use of AI.

Training Data

The accuracy of a model is directly linked to the quality and quantity of its training dataset (e.g. Garbage in = Garbage out). Diversity in datasets is also important to avoid bias and ensure fairness. Lastly, another important factor is who is labeling the training data? Is it dental students? General dentists? Oral maxillofacial radiologists? Interobserver variability is the differences and inconsistencies in measurements/interpretation of the same case by different observers due to differences in experience, training, or subjective interpretation.

Overfitting

Overfitting occurs when the model learns from the training dataset, including the “noise” (irrelevant or accidental patterns), too well rather than recognizing actual patterns that can be generalized . For example, the model could be built on recognizing radiographic artifacts correlated with a pathology in an image rather than the actual features of the pathology. This results in the model performing well on training datasets but poorly on new, real-world data.

For example, imagine you had an exam the next day. You have access to an old exam and ONLY study those questions. If your new exam was similar to the old exam, you would likely score very highly. BUT, if the new exam uses different scenarios/wording, your performance would likely drop. This is because you memorized patterns in the old exam questions instead of actually understanding the content.

Now, if an AI model is trained specifically on thousands of images from a single clinic/DSO. These images likely all share the same type of sensors, exposure settings, and patient demographic. Therefore, the model may be learning patterns specific to that specific environment rather than generalizable characteristics of caries.The model may have high accuracy on that particular training dataset but may perform poorly on images from a different machine, practice setting, etc. because it’s being tested in an unfamiliar environment.

Especially in dentistry, overfitting can be an issue because datasets are often smaller than medical settings and can vary widely from different clinics. Therefore, a poorly fitting model can give false reassurance or unnecessary alarm when used in a more broad setting. Therefore, it is important for clinicians to understand the concept of overfitting and interpret AI outputs with appropriate caution.

Sensitivity, Specificity, and the Illusion of Accuracy

Model says pathology absent Model says pathology present
Pathology actually absent True negative (TN) False Positive (FP)
Pathology actually present False Negative (FN) True Positive (TP)

Remember back to statistics class? Is it enough when a model claims it is 95% accurate?

\[ \textrm{Accuracy} = \frac{\textrm{TP}+\textrm{TN}}{\textrm{TP}+\textrm{TN}+\textrm{FP}+\textrm{FN}} \]

\[ \textrm{Specificity} = \frac{\textrm{True Negatives}}{\textrm{True Negatives}+\textrm{False Positives}} \] \[ \textrm{Sensitivity} = \frac{\textrm{True Positives}}{\textrm{True Positives}+\textrm{False Negatives}} \] Sensitivity looks at how well a model is able to identify something when it is there. Specificity, on the other hand, looks at how well the model is able to identify when something is NOT there (ensuring healthy/normal isn’t flagged as diseased). Accuracy can sometimes be misleading, for example, if a pathology is present in 1% of the population, if the model labels everything in the dataset as negative (healthy), it will have a 99% accuracy rate (but 0% sensitivity). Sensitivity and specificity is needed to ensure that the model is able to catch rare cases.

Future of AI in Dentistry

AI thrives at repetitive tasks. However, it struggles with atypical presentations, integration with clinical history, and incidental findings. AI will likely be able to aid us in detection but complex interpretion remains a human task. Understanding how AI models work is important for dentists so we neither dismiss or overestimate their ability.

Cara Zou
Cara Zou

Interests include dentistry and computer science.

Related