Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the true labels for calculating the F1 scores #15

Open
li000678 opened this issue Feb 13, 2021 · 2 comments
Open

About the true labels for calculating the F1 scores #15

li000678 opened this issue Feb 13, 2021 · 2 comments

Comments

@li000678
Copy link

Hi,
I am interested in testing the PARC in more datasets. I am wondering what is the source of the true labels? do you use the labels provided by the original papers? For example, the labels in the ''clusters.csv" from the clustering analysis of the PBMC dataset provided by 10xGenomics: pbmc_68k?

Thank you!
Yijia

@ShobiStassen
Copy link
Owner

Hi Yijia,

The PBMC labels are based on the annotations made by the authors of the original paper. You can check out their GitHub page which provides the Rcode for how they annotate the mixed PBMCs based on pure PBMC populations. This is how we got the annotations provided in PARC (the annotations can be downloaded from the PARC readme link or you can run the Rcode by Zheng et al)
Hope that helps

@li000678
Copy link
Author

li000678 commented Feb 17, 2021

Hi Shobi,
Thank you, I looked into the procedures of how Zheng et al did for the clustering is: firstly use k-means to generate 9 clusters and then divide cluster No.9 into two clusters. I personally think the predefined cluster produced by k-means may not be accurate (it's obvious when comparing 'the ground truth' with the results from PARC, PARC seems to do a better job). What do you think? I am planning to validate it in more datasets, though the labels of many other datasets are also annotations based on clusters identified by clustering algorithms.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants