1 |
Multimodal Clustering with Role Induced Constraints for Speaker Diarization ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Neural Speech Decoding During Audition, Imagination and Production
|
|
|
|
In: IEEE (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Automated Quality Assessment of Cognitive Behavioral Therapy Sessions Through Highly Contextualized Language Representations ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Annotation and Evaluation of Coreference Resolution in Screenplays ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
An Automated Quality Evaluation Framework of Psychotherapy Conversations with Local Quality Estimates ...
|
|
|
|
Abstract:
Text-based computational approaches for assessing the quality of psychotherapy are being developed to support quality assurance and clinical training. However, due to the long durations of typical conversation based therapy sessions, and due to limited annotated modeling resources, computational methods largely rely on frequency-based lexical features or dialogue acts to assess the overall session level characteristics. In this work, we propose a hierarchical framework to automatically evaluate the quality of transcribed Cognitive Behavioral Therapy (CBT) interactions. Given the richly dynamic nature of the spoken dialog within a talk therapy session, to evaluate the overall session level quality, we propose to consider modeling it as a function of local variations across the interaction. To implement that empirically, we divide each psychotherapy session into conversation segments and initialize the segment-level qualities with the session-level scores. First, we produce segment embeddings by fine-tuning a ... : Accepted by Computer Speech & Language ...
|
|
Keyword:
Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering
|
|
URL: https://dx.doi.org/10.48550/arxiv.2106.07922 https://arxiv.org/abs/2106.07922
|
|
BASE
|
|
Hide details
|
|
9 |
Deblurring for Spiral Real-Time MRI Using Convolutional Neural Networks ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Derivation of Fitts' law from the Task Dynamics model of speech production ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Feature Fusion Strategies for End-to-End Evaluation of Cognitive Behavior Therapy Sessions ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Screenplay Quality Assessment: Can We Predict Who Gets Nominated? ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
An empirical analysis of information encoded in disentangled neural speaker representations ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Sentence level estimation of psycholinguistic norms using joint multidimensional annotations ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Variability in individual constriction contributions to third formant values in American English /ɹ/a)
|
|
|
|
In: J Acoust Soc Am (2020)
|
|
BASE
|
|
Show details
|
|
18 |
How an aglossic speaker produces an alveolar-like percept without a functional tongue tip
|
|
|
|
In: J Acoust Soc Am (2020)
|
|
BASE
|
|
Show details
|
|
19 |
Machine learning and natural language processing in psychotherapy research: Alliance as example use case
|
|
|
|
In: J Couns Psychol (2020)
|
|
BASE
|
|
Show details
|
|
20 |
Deblurring for Spiral Real-Time MRI Using Convolutional Neural Networks
|
|
|
|
In: Magn Reson Med (2020)
|
|
BASE
|
|
Show details
|
|
|
|