<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.0 20040830//EN" "http://dtd.nlm.nih.gov/publishing/2.0/journalpublishing.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" article-type="abstract" dtd-version="2.0">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">IPROC</journal-id>
      <journal-id journal-id-type="nlm-ta">iproc</journal-id>
      <journal-title>Iproceedings</journal-title>
      <issn pub-type="epub">2369-6893</issn>
      <publisher>
        <publisher-name>JMIR Publications</publisher-name>
        <publisher-loc>Toronto, Canada</publisher-loc>
      </publisher>
    </journal-meta>
    <article-meta>
    <article-id pub-id-type="publisher-id">v2i1e44</article-id>
    <article-id pub-id-type="pmid"/>
    <article-id pub-id-type="doi">10.2196/iproc.6136</article-id>
    <article-categories>
      <subj-group subj-group-type="heading">
        <subject>Poster</subject>
      </subj-group>
      <subj-group subj-group-type="article-type">
        <subject>Poster</subject>
      </subj-group>
    </article-categories>
    <title-group>
      <article-title>Toward Expert Systems in Mental Health Assessment: A Computational Approach to the Face and Voice in Dyadic Patient-Doctor Interactions</article-title>
    </title-group>
    <contrib-group>
      <contrib contrib-type="editor">
        <name>
          <surname>Hale</surname>
          <given-names>Timothy</given-names>
        </name>
      </contrib>
    </contrib-group>
<contrib-group>
<contrib contrib-type="reviewer">
<name>
<surname>CHS Scientific Program Committee</surname>
</name>
</contrib>
</contrib-group>
    <contrib-group>
      <contrib contrib-type="author" id="contrib1" corresp="yes">
      <name name-style="western">
        <surname>Baker</surname>
        <given-names>Justin T</given-names>
      </name>
      <degrees>MD, PhD</degrees>
      <xref rid="aff1" ref-type="aff">1</xref>
      <xref rid="aff2" ref-type="aff">2</xref>
      <address>
        <institution>Department of Psychiatry</institution>
        <institution>Harvard Medical School</institution>
        <addr-line>25 Shattuck St</addr-line>
        <addr-line>Boston, MA, 02115</addr-line>
        <country>United States</country>
        <phone>1 617 855 3913</phone>
        <fax>1 617 855 0000</fax>
        <email>jtbaker@partners.org</email>
      </address> </contrib>
      <contrib contrib-type="author" id="contrib2">
        <name name-style="western">
          <surname>Pennant</surname>
          <given-names>Luciana</given-names>
        </name>
        <xref rid="aff1" ref-type="aff">1</xref>
      </contrib>
      <contrib contrib-type="author" id="contrib3">
        <name name-style="western">
          <surname>Baltrušaitis</surname>
          <given-names>Tadas</given-names>
        </name>
        <degrees>PhD</degrees>
        <xref rid="aff3" ref-type="aff">3</xref>
      </contrib>
      <contrib contrib-type="author" id="contrib4">
        <name name-style="western">
          <surname>Vijay</surname>
          <given-names>Supriya</given-names>
        </name>
        <xref rid="aff3" ref-type="aff">3</xref>
      </contrib>
      <contrib contrib-type="author" id="contrib5">
        <name name-style="western">
          <surname>Liebson</surname>
          <given-names>Elizabeth S</given-names>
        </name>
        <degrees>MD</degrees>
        <xref rid="aff1" ref-type="aff">1</xref>
        <xref rid="aff2" ref-type="aff">2</xref>
      </contrib>
      <contrib contrib-type="author" id="contrib6">
        <name name-style="western">
          <surname>Ongur</surname>
          <given-names>Dost</given-names>
        </name>
        <degrees>MD, PhD</degrees>
        <xref rid="aff1" ref-type="aff">1</xref>
        <xref rid="aff2" ref-type="aff">2</xref>
      </contrib>
      <contrib contrib-type="author" id="contrib7">
        <name name-style="western">
          <surname>Morency</surname>
          <given-names>Louis-Philippe</given-names>
        </name>
        <degrees>PhD</degrees>
        <xref rid="aff3" ref-type="aff">3</xref>
      </contrib>
    </contrib-group>
    <aff id="aff1">
    <sup>1</sup>
    <institution>Psychotic Disorders Division</institution>
    <institution>McLean Hospital</institution>  
    <addr-line>Belmont, MA</addr-line>
    <country>United States</country></aff>
    <aff id="aff2">
    <sup>2</sup>
    <institution>Department of Psychiatry</institution>
    <institution>Harvard Medical School</institution>  
    <addr-line>Boston, MA</addr-line>
    <country>United States</country></aff>
    <aff id="aff3">
    <sup>3</sup>
    <institution>Language Technical Institute</institution>
    <institution>School of Computer Science</institution>  
    <institution>Carnegie Mellon University</institution>  
    <addr-line>Pittsburg, PA</addr-line>
    <country>United States</country></aff>
    <author-notes>
      <corresp>Corresponding Author: Justin T Baker 
      <email>jtbaker@partners.org</email></corresp>
    </author-notes>
    <pub-date pub-type="collection"><season>Jan-Dec</season><year>2016</year></pub-date>
    <pub-date pub-type="epub">
      <day>30</day>
      <month>12</month>
      <year>2016</year>
    </pub-date>
    <volume>2</volume>
    <issue>1</issue>
    <elocation-id>e44</elocation-id>
    <!--history from ojs - api-xml-->
    <history>
      <date date-type="received">
        <day>5</day>
        <month>6</month>
        <year>2016</year>
      </date>
      <date date-type="accepted">
        <day>2</day>
        <month>8</month>
        <year>2016</year>
      </date>
    </history>
    <!--(c) the authors - correct author names and publication date here if necessary. Date in form ', dd.mm.yyyy' after jmir.org-->
    <copyright-statement>©Justin T Baker, Luciana Pennant, Tadas Baltrušaitis, Supriya Vijay, Elizabeth S Liebson, Dost Ongur, Louis-Philippe Morency. Originally published in Iproceedings (http://www.iproc.org), 30.12.2016.</copyright-statement>
    <copyright-year>2017</copyright-year>
    <license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/2.0/">
      <p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in Iproceedings, is properly cited. The complete bibliographic information, a link to the original publication on http://www.iproc.org/, as well as this copyright and license information must be included.</p>
    </license>  
    <self-uri xlink:href="http://www.iproc.org/2016/1/e44/" xlink:type="simple"/>
    <abstract>
      <sec sec-type="background">
        <title>Background</title>
        <p>Computational approaches to measure naturalistic behavior in clinical settings could provide an objective backstop for mental health assessment and disease monitoring, both of which are costly and unreliable using traditional methods.</p>
      </sec>
      <sec sec-type="objective">
        <title>Objective</title>
        <p>The objective of this pilot study was to determine which parts of the mental status exam could be reliably predicted by a combination of facial and vocal features extracted from a recorded interview using a combination of computer-assisted methods, in order to assess feasibility of our approach to quantify behavior for a longitudinal study of patients receiving psychiatric treatment.</p>
      </sec>
      <sec sec-type="methods">
        <title>Methods</title>
        <p>A total of 18 patients carrying diagnoses of schizophrenia, bipolar disorder, and related conditions were recruited from an inpatient psychiatric unit and participated in a total of 24 semi-structured interviews lasting 5-15 minutes (modeled after clinical rounds). Synchronized audio and video data were acquired from both patient and doctor during each encounter using 1080p webcams focused on the face and upper torso and cardioid headset microphones. Standardized psychiatric symptom scales was obtained after each recorded interview. Behavioral features, including facial action units (AUs), gaze, and speech characteristics (eg, prosody, pitch, tone, texture) were computed automatically using in-house and publicly available software. To predict clinical scales we trained a linear kernel support vector regressor (SVR) using features from both the entire session (ie, global mean) and each experimental epoch (eg, means during time spent alone and each individual question), leading to 15 predictors for each clinical scale item and scale totals. We used leave-one-out validation on the training data (maximizing the Pearson correlation coefficient) to determine the C parameter for the SVR models; for testing, we used leave-one-subject-out cross-validation (ie, leaving 17 participants for training/validation in each fold).</p>
      </sec>
      <sec sec-type="results">
        <title>Results</title>
        <p>Providing evidence of our approach's ability to capture and quantify relevant signal that confirms or verifies clearly visible psychopathology, we found that parameters such as brow furrowing (AU4, R=0.744) and eye widening (AU5, R=–0.601) were correlated with depression measures on the BPRS. In many cases, these effects were specific to the question or experimental epoch. For instance, unusual thought content was most evident in increased frequency of brow flashes (AU2, R=0.752) and greater smile variability (R=0.656) that occurred while participants were alone in the room. Individuals with higher ratings of delusions also showed increased brow flashes in response to a question about their self confidence (R=0.739). Many relationships showed a “dose effect” with midrange scores corresponding with moderate psychopathology.</p>
      </sec>
      <sec sec-type="conclusions">
        <title>Conclusions</title>
        <p>Our experiments show that automatically detected facial action units and speech properties can be used to predict and quantify a number of psychiatric symptoms from multiple domains of psychopathology, including both mood and psychosis. We demonstrate the importance of analyzing behaviors in the appropriate context (ie, while participants are alone or prompted with a specific question) in order to optimally extract clinically relevant information from objective indices of behavior. Thus, quantitative assessment of behavior in naturalistic settings is both feasible and informative as an adjunct to traditional methods of mental status assessment.</p>
      </sec>
    </abstract>
    <kwd-group>
      <kwd>human interaction</kwd>
      <kwd>facial expression</kwd>
      <kwd>voice</kwd>
      <kwd>depression</kwd>
      <kwd>bipolar disorder</kwd>
      <kwd>schizophrenia</kwd>
      <kwd>patient-physician relationship</kwd>
    </kwd-group></article-meta>
  </front>
  <body>
    <p>This poster was presented at the Connected Health Symposium 2016, October 20-21, Boston, MA, United States. A photo of the poster is displayed as an image in <xref ref-type="fig" rid="figure1">Figure 1</xref> and as a higher resolution image in <xref ref-type="app" rid="app1">Multimedia Appendix 1</xref>.</p>
    <fig id="figure1" position="float">
      <label>Figure 1</label>
      <caption>
        <p>Poster.</p>
      </caption>
      <graphic xlink:href="iproc_v2i1e44_fig1.png" alt-version="no" mimetype="image" position="float" xlink:type="simple"/>
    </fig>
  </body>
  <back>
    <app-group>
      <app id="app1">
        <title>Multimedia Appendix 1</title>
        <p>Poster.</p>
        <media xlink:href="iproc_v2i1e44_app1.JPG" xlink:title="JPG File, 2MB"/>
      </app>
    </app-group>
  </back>
</article>
