Spearman Correlations for Table-4 #22

Atharva-Phatak · 2022-07-09T20:47:19Z

In Table-4 of the paper, you report COH, FAC, FLU, and INFO for the SummEval dataset. I wanted to know which variants of BARTScore you used.

From my understanding of the paper, for factuality (FAC) you must have used BARTScore(s->h), i.e. source -> hypothesis.

But I am not clear about FLU, COH, and INFO.

If you could please elaborate, that would be really helpful.

yyy-Apple · 2022-07-10T03:21:50Z

On the SummEval dataset, for FLU, COH and INFO, we also used BARTScore(s->h).
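To illustrate what using one variant for every aspect means in practice, here is a minimal sketch: a single s->h score per summary is correlated separately against each of the four human ratings. All numbers are made up for illustration and are not from the paper or from SummEval, the helper names `average_ranks` and `spearman` are hypothetical, and the exact aggregation behind Table-4 is not stated in this thread.

```python
def average_ranks(values):
    """Return 1-based ranks, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman's rho: the Pearson correlation of the two rank vectors."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Made-up numbers for illustration only (assumption, not real data).
bartscore_s2h = [-2.1, -3.4, -1.8, -2.9, -4.0]  # one s->h score per summary
human_ratings = {
    "COH": [4.0, 2.5, 4.5, 3.0, 1.5],
    "FAC": [5.0, 4.0, 4.5, 3.0, 2.0],
    "FLU": [4.5, 4.5, 5.0, 4.0, 3.0],
    "INFO": [3.5, 2.0, 4.0, 3.5, 1.0],
}

# The same metric scores are reused for every aspect; only the human
# ratings they are compared against change.
correlations = {
    aspect: spearman(bartscore_s2h, ratings)
    for aspect, ratings in human_ratings.items()
}
for aspect, rho in correlations.items():
    print(f"{aspect}: {rho:.3f}")
```

With real data, `scipy.stats.spearmanr` would be the usual choice; the hand-written version above only keeps the sketch free of dependencies.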

Atharva-Phatak · 2022-07-10T14:47:03Z

So what was the reason for using a single score (s->h)? Does BARTScore holistically measure the quality of generated text?

For example, can you report the s->h variant of BARTScore and say, on the basis of that score alone, that the summary generated by Model A is better overall than the one generated by Model B?

Also, how do you decide which BARTScore variant to use for a particular dataset to measure COH, FLU, INFO, and FAC?

Please let me know.

yyy-Apple · 2022-07-14T01:56:21Z

Here are the rules we followed when deciding which BARTScore variant to use:

1. The definition of the evaluation perspective (for example, factuality must rely on the source document).
2. The modalities and languages supported by the PLM (for example, for data-to-text we can only use h<->r, because the source and the hypothesis are in different modalities).
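For reference, the arrow notation used in this thread can be read against the general form of the score. The block below is a sketch from my reading of the paper rather than a quotation of it: the symbols x (the conditioning text), y (the text being scored), the token weights ω_t, and the model parameters θ should be checked against the paper before being relied on.

```latex
\mathrm{BARTScore}(\mathbf{x} \to \mathbf{y})
  = \sum_{t=1}^{m} \omega_t \,\log p\bigl(y_t \mid \mathbf{y}_{<t}, \mathbf{x}, \theta\bigr)
```

Under this reading, s->h sets x to the source document and y to the hypothesis, which is why it fits perspectives that must be judged against the source. The h<->r variant combines the two directions between hypothesis and reference and never conditions on the source, so it remains usable when the source is not text the model can read.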

However, we agree that designing a metric with multiple interpretable dimensions would be promising future work.
