Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Remove # of changes from priority calculation
In practice, including the number of changes in the priority calculation has been more harmful than helpful. While some pages return the same HTML if their content hasn't changed, others return slightly different HTML for every response of for every different session (and every Wayback Memento we have is typically a unique session). That means that some pages have fewer changes over the week because they legitimately had few changes, while others have *many* changes over the course of the week even when they didn't change at all, massively skewing our priority numbers. The way we were *trying* to use the change count could be useful, but we'd need a more practical way to compare versions than by exact byte equality, which is all we have the capacity for right now. This commit removes the change count from priority to avoid the issue. Fixes #2.
- Loading branch information