Context
I use the package to distinguish crawlers from human users in an HTTP server. The goal is to prevent crawlers from "spoiling" one-time links shared in Discord and similar chats, which fetch every link posted to a chat in order to build a preview. Because the link is one-time, the crawler's request consumes it, and it no longer opens when a human user clicks it. I solved this by blocking access from crawlers to such links. If you need more details, please see starius/pasta#8.
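To illustrate the setup, here is a minimal sketch of such a handler. The `isCrawler` function is a hypothetical stand-in for whatever User-Agent matching this package actually exposes; only the blocking logic around it reflects the approach described above.

```go
package main

import "net/http"

// isCrawler is a placeholder for the package's User-Agent matcher;
// it is assumed here, not part of this issue.
func isCrawler(userAgent string) bool {
	// ... match userAgent against the crawler patterns ...
	return false
}

func oneTimeLinkHandler(w http.ResponseWriter, r *http.Request) {
	if isCrawler(r.UserAgent()) {
		// A chat preview bot must not consume the one-time link,
		// so refuse the request instead of serving the content.
		http.Error(w, "previews are not served for one-time links", http.StatusForbidden)
		return
	}
	// ... serve the content and invalidate the link ...
}
```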
Danger of false positives
If a legitimate browser sends a User-Agent that accidentally matches one of the patterns, the user won't be able to access the link, because the site will treat the request as originating from a crawler.
I expect other users of this package would also benefit from minimizing false positives.
Proposed solution
Let's add a CI test that runs the most common User-Agents through the patterns and fails if any of them matches; a sketch of such a test follows below.
The list of User-Agents can be loaded from here: https://github.com/microlinkhq/top-user-agents/tree/master/src
If somebody adds a pattern that matches any of them, the problem will be detected early and prevented.
Likewise, if a popular browser starts sending a User-Agent that accidentally matches one of the patterns, that will also trigger a test failure.
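A rough sketch of what that test could look like, assuming the User-Agent list has been vendored into `testdata/top-user-agents.json` as a JSON array of strings (the format used by microlinkhq/top-user-agents) and reusing the hypothetical `isCrawler` matcher from above:

```go
package crawlertest

import (
	"encoding/json"
	"os"
	"testing"
)

func TestNoFalsePositivesOnTopUserAgents(t *testing.T) {
	// Load the vendored list of the most common real-browser User-Agents.
	data, err := os.ReadFile("testdata/top-user-agents.json")
	if err != nil {
		t.Fatalf("loading User-Agent list: %v", err)
	}

	var userAgents []string
	if err := json.Unmarshal(data, &userAgents); err != nil {
		t.Fatalf("parsing User-Agent list: %v", err)
	}

	for _, ua := range userAgents {
		if isCrawler(ua) {
			// A popular browser User-Agent matched a crawler pattern:
			// exactly the false positive this test is meant to catch.
			t.Errorf("User-Agent wrongly classified as crawler: %q", ua)
		}
	}
}
```

Whether the list is vendored or downloaded during CI is a separate choice; vendoring keeps the test deterministic, while fetching the latest list also catches newly popular browsers.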