<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.0 20040830//EN" "http://dtd.nlm.nih.gov/publishing/2.0/journalpublishing.dtd">
<article article-type="abstract" dtd-version="2.0" xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">IPROC</journal-id>
      <journal-id journal-id-type="nlm-ta">iproc</journal-id>
      <journal-title>Iproceedings</journal-title>
      <issn pub-type="epub">2369-6893</issn>
      <publisher>
        <publisher-name>JMIR Publications</publisher-name>
        <publisher-loc>Toronto, Canada</publisher-loc>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">v7i1e35391</article-id>
      <article-id pub-id-type="pmid">27762282</article-id>
      <article-id pub-id-type="doi">10.2196/35391</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Abstract</subject>
        </subj-group>
        <subj-group subj-group-type="article-type">
          <subject>Abstract</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Assessing Generalizability of Deep Learning Models Trained on Standardized and Nonstandardized Images and Their Performance Against Teledermatologists</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="editor">
          <name>
            <surname>Derrick</surname>
            <given-names>Thomas</given-names>
          </name>
        </contrib>
      </contrib-group>
      <contrib-group>
        <contrib id="contrib1" contrib-type="author" corresp="yes">
          <name name-style="western">
            <surname>Oloruntoba</surname>
            <given-names>Ibukun</given-names>
          </name>
          <xref rid="aff1" ref-type="aff">1</xref>
          <address>
            <institution>School of Public Health and Preventive Medicine</institution>
            <institution>Monash University</institution>
            <addr-line>Wellington Rd</addr-line>
            <addr-line>Clayton</addr-line>
            <addr-line>Melbourne, VIC 3800</addr-line>
            <country>Australia</country>
            <phone>61 3 9905 4000</phone>
            <email>aolo0001@student.monash.edu</email>
          </address>
          <ext-link ext-link-type="orcid">https://orcid.org/0000-0003-4694-9089</ext-link>
        </contrib>
        <contrib id="contrib2" contrib-type="author">
          <name name-style="western">
            <surname>Nguyen</surname>
            <given-names>Toan D</given-names>
          </name>
          <degrees>PhD</degrees>
          <xref rid="aff2" ref-type="aff">2</xref>
          <ext-link ext-link-type="orcid">https://orcid.org/0000-0001-6192-8601</ext-link>
        </contrib>
        <contrib id="contrib3" contrib-type="author">
          <name name-style="western">
            <surname>Ge</surname>
            <given-names>Zongyuan</given-names>
          </name>
          <degrees>PhD</degrees>
          <xref rid="aff3" ref-type="aff">3</xref>
          <ext-link ext-link-type="orcid">https://orcid.org/0000-0002-5880-8673</ext-link>
        </contrib>
        <contrib id="contrib4" contrib-type="author">
          <name name-style="western">
            <surname>Vestergaard</surname>
            <given-names>Tine</given-names>
          </name>
          <degrees>MD, PhD</degrees>
          <xref rid="aff4" ref-type="aff">4</xref>
          <ext-link ext-link-type="orcid">https://orcid.org/0000-0002-7210-8884</ext-link>
        </contrib>
        <contrib id="contrib5" contrib-type="author">
          <name name-style="western">
            <surname>Mar</surname>
            <given-names>Victoria</given-names>
          </name>
          <degrees>MBBS, FACD, PhD</degrees>
          <xref rid="aff5" ref-type="aff">5</xref>
          <ext-link ext-link-type="orcid">https://orcid.org/0000-0001-9423-3435</ext-link>
        </contrib>
      </contrib-group>
      <aff id="aff1">
        <label>1</label>
        <institution>School of Public Health and Preventive Medicine</institution>
        <institution>Monash University</institution>
        <addr-line>Melbourne</addr-line>
        <country>Australia</country>
      </aff>
      <aff id="aff2">
        <label>2</label>
        <institution>Monash University</institution>
        <addr-line>Melbourne</addr-line>
        <country>Australia</country>
      </aff>
      <aff id="aff3">
        <label>3</label>
        <institution>Monash eResearch Centre</institution>
        <institution>Monash University</institution>
        <addr-line>Melbourne</addr-line>
        <country>Australia</country>
      </aff>
      <aff id="aff4">
        <label>4</label>
        <institution>Department of Dermatology and Allergy Centre</institution>
        <institution>Odense University Hospital</institution>
        <addr-line>Odense</addr-line>
        <country>Denmark</country>
      </aff>
      <aff id="aff5">
        <label>5</label>
        <institution>Victorian Melanoma Service</institution>
        <institution>Alfred Health and School of Public Health and Preventive Medicine</institution>
        <institution>Monash University</institution>
        <addr-line>Melbourne</addr-line>
        <country>Australia</country>
      </aff>
      <author-notes>
        <corresp>Corresponding Author: Ibukun Oloruntoba <email>aolo0001@student.monash.edu</email></corresp>
      </author-notes>
      <pub-date pub-type="collection">
        <season>Jan-Dec</season>
        <year>2021</year>
      </pub-date>
      <pub-date pub-type="epub">
        <day>10</day>
        <month>12</month>
        <year>2021</year>
      </pub-date>
      <volume>7</volume>
      <issue>1</issue>
      <elocation-id>e35391</elocation-id>
      <history>
        <date date-type="received">
          <day>2</day>
          <month>12</month>
          <year>2021</year>
        </date>
        <date date-type="accepted">
          <day>3</day>
          <month>12</month>
          <year>2021</year>
        </date>
      </history>
      <copyright-statement>©Ibukun Oloruntoba, Toan D Nguyen, Zongyuan Ge, Tine Vestergaard, Victoria Mar. Originally published in Iproceedings (https://www.iproc.org), 10.12.2021.</copyright-statement>
      <copyright-year>2021</copyright-year>
      <license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
        <p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in Iproceedings, is properly cited. The complete bibliographic information, a link to the original publication on https://www.iproc.org/, as well as this copyright and license information must be included.</p>
      </license>
      <self-uri xlink:href="https://www.iproc.org/2021/1/e35391" xlink:type="simple"/>
      <abstract>
        <sec sec-type="background">
          <title>Background</title>
          <p>Convolutional neural networks (CNNs) are a type of artificial intelligence that show promise as a diagnostic aid for skin cancer. However, the majority are trained using retrospective image data sets of varying quality and image capture standardization.</p>
        </sec>
        <sec sec-type="objective">
          <title>Objective</title>
          <p>The aim of our study is to use CNN models with the same architecture, but different training image sets, and test variability in performance when classifying skin cancer images in different populations, acquired with different devices. Additionally, we wanted to assess the performance of the models against Danish teledermatologists when tested on images acquired from Denmark.</p>
        </sec>
        <sec sec-type="methods">
          <title>Methods</title>
          <p>Three CNNs with the same architecture were trained. CNN-NS was trained on 25,331 nonstandardized images taken from the International Skin Imaging Collaboration using different image capture devices. CNN-S was trained on 235,268 standardized images, and CNN-S2 was trained on 25,331 standardized images (matched for number and classes of training images to CNN-NS). Both standardized data sets (CNN-S and CNN-S2) were provided by Molemap using the same image capture device. A total of 495 Danish patients with 569 images of skin lesions predominantly involving Fitzpatrick skin types II and III were used to test the performance of the models. Four teledermatologists independently diagnosed and assessed the images taken of the lesions. Primary outcome measures were sensitivity, specificity, and area under the curve of the receiver operating characteristic (AUROC).</p>
        </sec>
        <sec sec-type="results">
          <title>Results</title>
          <p>A total of 569 images were taken from 495 patients (n=280, 57% women, n=215, 43% men; mean age 55, SD 17 years) for this study. On these images, CNN-S achieved an AUROC of 0.861 (95% CI 0.830-0.889; <italic>P</italic>&lt;.001), and CNN-S2 achieved an AUROC of 0.831 (95% CI 0.798-0.861; <italic>P</italic>=.009), with both outperforming CNN-NS, which achieved an AUROC of 0.759 (95% CI 0.722-0.794; <italic>P</italic>&lt;.001; <italic>P</italic>=.009). When the CNNs were matched to the mean sensitivity and specificity of the teledermatologists, the model’s resultant sensitivities and specificities were surpassed by the teledermatologists. However, when compared to CNN-S, the differences were not statistically significant (<italic>P</italic>=.10; <italic>P</italic>=.05). Performance across all CNN models and teledermatologists was influenced by the image quality.</p>
        </sec>
        <sec sec-type="conclusions">
          <title>Conclusions</title>
          <p>CNNs trained on standardized images had improved performance and therefore greater generalizability in skin cancer classification when applied to an unseen data set. This is an important consideration for future algorithm development, regulation, and approval. Further, when tested on these unseen test images, the teledermatologists <italic>clinically</italic> outperformed all the CNN models; however, the difference was deemed to be statistically insignificant when compared to CNN-S.</p>
        </sec>
        <sec>
          <title>Conflicts of Interest</title>
          <p>VM received speakers fees from Merck, Eli Lily, Novartis and Bristol Myers Squibb. VM is the principal investigator for a clinical trial funded by the Victorian Department of Health and Human Services with 1:1 contribution from MoleMap.</p>
        </sec>
      </abstract>
      <kwd-group>
        <kwd>teledermatology</kwd>
        <kwd>CNN</kwd>
        <kwd>artificial intelligence</kwd>
        <kwd>skin cancer</kwd>
        <kwd>Denmark</kwd>
        <kwd>Australia</kwd>
        <kwd>New Zealand</kwd>
        <kwd>image standardization</kwd>
        <kwd>generalizability</kwd>
        <kwd>classification</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body/>
  <back>
    <app-group>
      <supplementary-material id="app1">
        <label>Multimedia Appendix 1</label>
        <p>Receiver operating characteristic (ROC) curves for the three convolutional neural network (CNN) models and the performances of the teledermatologists on the Danish test set. The ROC and the area under the curve of the ROC of the CNN models in relation to the sensitivity and 1-specificity of the teledermatologists when tested on the 569 Danish test images. The teledermatologist's performance was greater than all of the CNN models.</p>
        <media xlink:href="iproc_v7i1e35391_app1.png" xlink:title="PNG File , 341 KB"/>
      </supplementary-material>
      <supplementary-material id="app2">
        <label>Multimedia Appendix 2</label>
        <p>Table 1: Sensitivity and specificity of the convolutional neural network models when matched to the average performance of the teledermatologists.</p>
        <media xlink:href="iproc_v7i1e35391_app2.png" xlink:title="PNG File , 398 KB"/>
      </supplementary-material>
    </app-group>
    <glossary>
      <title>Abbreviations</title>
      <def-list>
        <def-item>
          <term id="abb1">AUROC</term>
          <def>
            <p>area under the curve of the receiver operating characteristic</p>
          </def>
        </def-item>
        <def-item>
          <term id="abb2">CNN</term>
          <def>
            <p>convolutional neural network</p>
          </def>
        </def-item>
      </def-list>
    </glossary>
  </back>
</article>
