PsyProxy
Datasets·Sentiment·08__dialogue_emotion__empathetic_dialogues__context

Empathetic Dialogues context labels

Empathetic Dialogues is a public corpus from Facebook AI of conversational utterances paired with one of 32 emotion-context labels, designed to train empathetic chat agents. The texts in this corpus frequently explore themes of personal achievements , emotional support , and interpersonal relationships , reflecting a range of experiences and sentiments. Common topics include academic milestones , such as graduation and job acquisition, alongside expressions of concern for loved ones facing challenges, like health issues or accidents. Complaints about trust and reliability in relationships also emerge, as do moments of joy and celebration related to personal events or milestones. Additionally, the texts convey a sense of community through shared experiences and advice, highlighting the importance of empathy and understanding in everyday interactions. [Summary on 50 random texts by ChatGPT 4o Mini].

Distribution of Emotion context (32 classes)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
3,441 at floor2,871 at ceiling
107,216
items
21,444
holdout n
Emotion context (32 classes)
target
Multi-class
kind
13
systems compared
Criterion validity

Reported holdout systems from the verified card

Multiclass classification uses Macro-F1 as the task-primary metric. Secondary columns keep the companion metrics visible so binary, ordinal, regression, and multiclass cards are not compared through one flattened score.

Source podium · Macro-F1 · 9 families
Gold
PsyProxy
0.257
Silver
Topic models
0.097
Bronze
LIWC
0.085
Model-family mix
PsyProxy · 4Topic model · 2Lexicon · 2Baseline · 5
SystemFamilyVariantMacro-F1AccuracyMacro-AUCPrimary scale
psyproxyPsyProxy — Technology Lens v0.5 · 800d
PsyProxypermissive0.2570.263-1.000
psyproxyPsyProxy — Social Economics Lens v0.5 · 1000d
PsyProxypermissive0.2480.257-1.000
psyproxyPsyProxy — Behavioral Sciences Lens v0.5 · 1000d
PsyProxypermissive0.2420.253-1.000
psyproxyPsyProxy — Health Lens v0.9 · 1100d
PsyProxypermissive0.2390.247-1.000
topicBERTopic
Topic modelpermissive0.0970.1150.644
lexLinguistic Inquiry and Word Count (LIWC)
Lexiconpermissive0.0850.104-1.000
baselineEmpath
Baselinepermissive0.0620.083-1.000
baselineTool for the Automatic Analysis of Syntactic Sophistication and Complexity (TAASSC)
Baselinepermissive0.0520.077-1.000
topicHierarchical Dirichlet Process (tomotopy HDP)
Topic modelpermissive0.0490.0800.610
baselineTextDescriptives
Baselinepermissive0.0200.053-1.000
lexValence Aware Dictionary and sEntiment Reasoner (VADER)
Lexiconpermissive0.0140.060-1.000
baselineTool for the Automatic Analysis of Lexical Sophistication (TAALES)
Baselinepermissive0.0110.052-1.000
baselineTool for the Automatic Analysis of Cohesion (TAACO)
Baselinepermissive0.0070.051-1.000