Commit

updating READMEs
svenbuechel committed May 18, 2022
1 parent 07f42d3 commit 7ad8cae
Showing 2 changed files with 4 additions and 1 deletion.
1 change: 1 addition & 0 deletions README.md
@@ -4,6 +4,7 @@
This repository contains EmoBank, a large-scale text corpus manually annotated with emotion according to the psychological Valence-Arousal-Dominance scheme. It was built at JULIE Lab, Jena University and is described in detail in our papers from EACL 2017 and LAW 2017 (see Citation). The repository contains two folders: "corpus", which contains the actual EmoBank data (described in the EACL paper), and "pilot", which contains the data from our pilot study (described in the LAW paper). See the readme files in the respective folders for more detailed information regarding the data format.

## News
+ * May 2022. We added the individual, per-annotator ratings for the reader and the writer perspective. The data can be found in `EmoBank/corpus/individual_reader_ratings.csv` and `EmoBank/corpus/individual_writer_ratings.csv`, respectively. We also included a notebook (`EmoBank/corpus/aggregation.ipynb`) illustrating how the individual ratings were aggregated.
* December 2019. We added a train-dev-test split to the dataset which can be found in `EmoBank/corpus/emobank.csv`. The data split is stratified with respect to text category (fiction, letters, newspaper,...). The code for creating the split can be found in `EmoBank/corpus/adding_data_split.ipynb`. We recommend using this split for model evaluation to increase comparability.
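
The recommended split could be consumed along the following lines. This is only a minimal sketch: the column names (`id`, `split`, `V`, `A`, `D`, `text`), the split labels, and the helper `rows_by_split` are assumptions made for illustration, and the rows are placeholders rather than real EmoBank data. Check the header of `EmoBank/corpus/emobank.csv` before relying on any of them.

```python
import csv
import io
from collections import defaultdict

# Placeholder rows standing in for EmoBank/corpus/emobank.csv.
# Column names and split labels are assumptions, not taken from the real file.
SAMPLE_CSV = """id,split,V,A,D,text
ex_1,train,3.0,3.1,3.2,First placeholder sentence.
ex_2,dev,2.5,3.4,2.9,Second placeholder sentence.
ex_3,test,3.6,2.8,3.3,Third placeholder sentence.
ex_4,train,2.9,3.0,3.1,Fourth placeholder sentence.
"""


def rows_by_split(csv_file, split_column="split"):
    """Group the rows of an open CSV file by the value in split_column."""
    groups = defaultdict(list)
    for row in csv.DictReader(csv_file):
        groups[row[split_column]].append(row)
    return dict(groups)


splits = rows_by_split(io.StringIO(SAMPLE_CSV))
for name in ("train", "dev", "test"):
    # Report how many rows fall into each partition.
    print(name, len(splits.get(name, [])))
```

To run this against the real file, pass an open file handle instead of the in-memory sample, for example `open("EmoBank/corpus/emobank.csv", newline="", encoding="utf-8")`.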

## Characteristics
4 changes: 3 additions & 1 deletion corpus/README.md
@@ -4,7 +4,9 @@ This folder contains raw and meta data of EmoBank. In particular, it contains
* ```raw.csv```: The raw textual data.
* ```meta.tsv```: The source and genre meta-data.
* ```reader.csv```: The gold ratings from the reader perspective
- * ```writer.csv```: The gold ratings fromt the writer perspective
+ * ```writer.csv```: The gold ratings from the writer perspective
+ * ```individual_reader_ratings.csv```: The original, per-annotator ratings from the reader perspective
+ * ```individual_writer_ratings.csv```: The original, per-annotator ratings from the writer perspective
* ```emobank.csv```: Weighted average of reader and writer annotations. **Use this file by default.**
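
As a rough illustration of how per-annotator ratings relate to a single value per item, the sketch below takes an unweighted mean per item and dimension. It is not the authors' aggregation procedure, which is the one documented in `aggregation.ipynb` and may filter or weight ratings differently. The long-format layout (`id`, `annotator`, `V`, `A`, `D`), the helper `mean_ratings`, and the sample rows are all assumptions made for illustration.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Placeholder per-annotator ratings; the layout is an assumption,
# not the actual format of individual_reader_ratings.csv.
SAMPLE = """id,annotator,V,A,D
ex_1,a1,3,2,3
ex_1,a2,4,3,3
ex_2,a1,2,4,2
ex_2,a3,3,4,3
"""


def mean_ratings(csv_file, id_column="id", dims=("V", "A", "D")):
    """Average each dimension over all annotators who rated the same item."""
    per_item = defaultdict(lambda: defaultdict(list))
    for row in csv.DictReader(csv_file):
        for dim in dims:
            per_item[row[id_column]][dim].append(float(row[dim]))
    return {
        item: {dim: mean(values) for dim, values in by_dim.items()}
        for item, by_dim in per_item.items()
    }


gold = mean_ratings(io.StringIO(SAMPLE))
print(gold["ex_1"])
```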

# 1 Loading EmoBank