Providing init to the phateR implementation only saves up time if only the parameter t is changing but not other parameters? #143

erzakiev · 2024-04-16T13:38:29Z

I was hoping on saving some time on computation as each time phateR::phate runs, ~75% of time goes to PCA and KNN graph calculation:

Calculating PHATE...
  Running PHATE on 52398 observations and 32344 variables.
  Calculating graph and diffusion operator...
    Calculating PCA...
    Calculated PCA in 143.55 seconds.
    Calculating KNN search...
    Calculated KNN search in 53.70 seconds.
    Calculating affinities...
    Calculated affinities in 75.59 seconds.
  Calculated graph and diffusion operator in 273.90 seconds.
  Calculating landmark operator...
    Calculating SVD...
    Calculated SVD in 48.39 seconds.
    Calculating KMeans...
    Calculated KMeans in 13.31 seconds.
  Calculated landmark operator in 65.04 seconds.
  Calculating optimal t...
    Automatically selected t = 23
  Calculated optimal t in 4.87 seconds.
  Calculating diffusion potential...
  Calculated diffusion potential in 0.93 seconds.
  Calculating metric MDS...
  Calculated metric MDS in 4.46 seconds.
Calculated PHATE in 349.23 seconds.

In case of a fixed knn and re-calculation of the map with different parameters of e.g. decay or gamma, this is a purely redundant overhead that could be avoided. I thought passing to init a previous phate object with at least the same knn parameters would help the issue, but apparently the only place where it speeds up the things is when we recompute using a different t.

If i want to avoid to recalculate PCA and KNN graph each time, should i pass a pre-computed affinity matrix to the data variable, as instructed in the man page of phateR::phate? If so, can I use knn graph, generated by Seurat's FindNeighbors function for that purpose?

Thank you in advance.

The text was updated successfully, but these errors were encountered:

erzakiev · 2024-04-16T14:00:46Z

can I use knn graph, generated by Seurat's FindNeighbors function for that purpose?

When doing so, I encounter a cryptic error message 'ValueError: assignment destination is read-only':

> phateR::phate(data = as.matrix(pbmc3k@graphs$RNA_nn),
 knn.dist.method='precomputed')
Calculating PHATE...
  Running PHATE on precomputed affinity matrix with 52398 observations.
  Calculating graph and diffusion operator...
  Calculated graph and diffusion operator in 1.98 seconds.
Calculated PHATE in 1.98 seconds.
Error in py_call_impl(callable, call_args$unnamed, call_args$named) : 
  ValueError: assignment destination is read-only

Can you point me to where i can start to scratch the surface of the problem, please?

erzakiev added the question label Apr 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Providing init to the phateR implementation only saves up time if only the parameter t is changing but not other parameters? #143

Providing init to the phateR implementation only saves up time if only the parameter t is changing but not other parameters? #143

erzakiev commented Apr 16, 2024

erzakiev commented Apr 16, 2024

Providing init to the phateR implementation only saves up time if only the parameter t is changing but not other parameters? #143

Providing init to the phateR implementation only saves up time if only the parameter t is changing but not other parameters? #143

Comments

erzakiev commented Apr 16, 2024

erzakiev commented Apr 16, 2024