About the true labels for calculating the F1 scores #15

li000678 · 2021-02-13T22:07:22Z

Hi,
I am interested in testing PARC on more datasets. What is the source of the true labels? Do you use the labels provided by the original papers, for example the labels in "clusters.csv" from the clustering analysis of the PBMC dataset provided by 10xGenomics (pbmc_68k)?

Thank you!
Yijia

ShobiStassen · 2021-02-15T00:41:26Z

Hi Yijia,

The PBMC labels are based on the annotations made by the authors of the original paper. Their GitHub page provides the R code they use to annotate the mixed PBMCs based on pure PBMC populations. This is how we got the annotations provided in PARC (they can be downloaded from the link in the PARC readme, or you can run the R code by Zheng et al.).
Hope that helps.

li000678 · 2021-02-17T21:28:44Z

Hi Shobi,
Thank you. I looked into how Zheng et al. did the clustering: first use k-means to generate 9 clusters, then divide cluster No. 9 into two clusters. I personally think the predefined clusters produced by k-means may not be accurate (this is obvious when comparing 'the ground truth' with the results from PARC; PARC seems to do a better job). What do you think? I am planning to validate it on more datasets, though the labels of many other datasets are also annotations based on clusters identified by clustering algorithms.
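
For anyone comparing clusterings against reference annotations like these, here is a minimal sketch of one common way to score them: map each predicted cluster to its most frequent reference label, then compute a per-label F1. This is an illustration only and not necessarily how PARC computes its reported F1 scores; the function name `majority_vote_f1` and the toy labels are hypothetical.

```python
from collections import Counter, defaultdict


def majority_vote_f1(reference, predicted):
    """Per-label F1 after mapping each cluster to its majority reference label.

    Hypothetical helper for illustration; not taken from the PARC code base.
    """
    if len(reference) != len(predicted):
        raise ValueError("reference and predicted must have the same length")

    # Collect the reference labels of the cells in each predicted cluster.
    members = defaultdict(list)
    for ref, cluster in zip(reference, predicted):
        members[cluster].append(ref)

    # Assign each cluster the reference label that occurs most often in it.
    cluster_to_label = {
        cluster: Counter(refs).most_common(1)[0][0]
        for cluster, refs in members.items()
    }
    mapped = [cluster_to_label[cluster] for cluster in predicted]

    # F1 = 2*TP / (2*TP + FP + FN), computed separately for each reference label.
    scores = {}
    for label in sorted(set(reference)):
        tp = sum(1 for r, m in zip(reference, mapped) if r == label and m == label)
        fp = sum(1 for r, m in zip(reference, mapped) if r != label and m == label)
        fn = sum(1 for r, m in zip(reference, mapped) if r == label and m != label)
        denom = 2 * tp + fp + fn
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores


# Toy example: six cells, two reference populations, two predicted clusters.
reference = ["B", "B", "B", "T", "T", "T"]
predicted = [0, 0, 1, 1, 1, 1]
scores = majority_vote_f1(reference, predicted)
mean_f1 = sum(scores.values()) / len(scores)
```

Note that a score like this only measures agreement with the reference annotations. If those annotations themselves come from a clustering step, a low F1 can reflect errors in the reference rather than in the method being tested.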
