GitHub - cooper12121/DIE-EC

DIE-EC

project for paper Enhancing Cross-Document Event Coreference Resolution by Discourse Structure and Semantic Information (LREC-COLING 2024))

Prerequisites

Python 3.8
#>pip install -r requirements.txt
#>export PYTHONPATH=<ROOT_PROJECT_FOLDER>

Preprocessing

The main train process require the mentions pairs and embeddings from each set.

constrcut RST trees and Lexical chains

We first construct RST tress for each documents, When generate mention pair, we construct cross-document lexical chains.

WEC-Eng and WEC-Zh

Since WEC-Eng/WEC-Zh train set contains many mentions, generating all negative pairs is very resource and time consuming. To that end, we added a control for the negative:positive ratio.

#>python src/preprocess_gen_pairs.py

Generate Embeddings

To generate the embeddings for WEC-Eng/WEC-Zh run the following script and provide the slit files location, for example:

#>python src/preprocess_embed.py

Initialize node

We use the generated embeddings to initialize node
#>python src/preprocess_edu_embed.py

Training

See train.py file header for the complete set of script parameters. Model file will be saved at output folder (for each iteration that improves).

For training over WEC-Eng/WEC-Zh:

#> python src/train.py

Inference

#> python src/inference.py

Cluster

#> python src/custer.py

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
ChineseDiscourseParser		ChineseDiscourseParser
data_process		data_process
dataset		dataset
experiments		experiments
gatv2		gatv2
sota_end2end_parser		sota_end2end_parser
sota_end2end_parser_1		sota_end2end_parser_1
src		src
.gitattributes		.gitattributes
README.md		README.md
coling2024.png		coling2024.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DIE-EC

Prerequisites

Preprocessing

constrcut RST trees and Lexical chains

WEC-Eng and WEC-Zh

Generate Embeddings

Initialize node

Training

Inference

Cluster

About

Releases

Packages

Languages

cooper12121/DIE-EC

Folders and files

Latest commit

History

Repository files navigation

DIE-EC

Prerequisites

Preprocessing

constrcut RST trees and Lexical chains

WEC-Eng and WEC-Zh

Generate Embeddings

Initialize node

Training

Inference

Cluster

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages