Look into applying some more rigorous approach to generating experiment results #84

hellais · 2024-08-26T12:39:36Z

Currently experiment results are semi-manually coded using bayesian style reasoning to come up with the weights.

It's however possible to do this using a more rigorous approach that makes use of well established graph based modeling systems such as bayesian networks.

Work on this has started already since a few months and had a very fruitful conversation about this topic with Joss who provided key insight.

As part of this activity the plan is to move this forward by doing some more modeling using bayes networks and see how it works.

Some sub-activities as part of this might include:

Coming up with labeled data (probably enriched with what we have from the feedback reporting system) to validate the model and/or bootstrap/train it
- Build some kind of web interface to make it easier to label data quickly (currently it's too many clicks to do it via explorer for many measurements)
Refine and experiment with different features for the bayes net
Iterate on various configurations of the bayes network
Consider extending the observation data format to make it easier to extract the necessary features

hellais · 2024-08-26T12:45:49Z

Some work in progress on this front is being done on this branch: #85

In particular see the notebook which implements an early stage version of the bayes net: https://github.com/ooni/data/blob/bayes-net/oonipipeline/notebooks/web-analysis-bn.ipynb

There are still a few critical theoretical hurdles that need to be overcome, which are questions I would like to pose to people that have more experience about this, namely:

What are some best-practices or rules of thumb to determine optimal cardinality for the nodes and when it's appropriate to split a particular proposition into more sub-propositions?
How do you deal with the fact that the state of a particular proposition might be undefined? Is it OK for it to just be T | F or is it recommended to explicitly add the "unknown" state?
Are there best practices on the optimal cardinality of the CPD tables? (pgmpy has a hard limit of 32, but manually populating tables even of width 10+ is extremely tedious) Are there tricks to try and split the nodes up in a such a way to keep the cardinality low?

hellais · 2024-11-15T14:22:43Z

After more experimentation with the bayesian network approach and having a working PoC of it, I came to the conclusion that for the moment the performance of running this is not going to scale well to our use case without some significant work to re-engineer the analysis pipeline.

This lead to the conclusion that it was probably best for the time being to rollback to an approach that's simpler and closer to what we had done before, by using a fuzzy logic rule-based style classifier. Put in simpler terms this is just a list of IF THEN clauses that lead to the confidence estimates we have in a particular outcome being true. Through these we are effectively encoding the knowledge we have about certain signals in the measurements being a sign of blocking or not blocking.

In terms of implementation it's done directly as SQL queries which has the benefit of both being more performant than having to carry data in and out of python, but also allows to inspect and update the rules more easily as they all live in one place.

Work related to this is done inside of the following PR: #99, specifically the web_analysis.py contains the mega-sql query to perform the analysis.

I will be following up with some more extensive documentation explaining how this whole system works.

kopekC · 2024-12-29T18:19:12Z

Probably a dumb question but still worth asking, why not use embeddings in order to do some pre/post grouping, and apply labels based on the clusters that get formed ?

hellais · 2025-01-23T11:55:29Z

why not use embeddings in order to do some pre/post grouping, and apply labels based on the clusters that get formed

That's kind of what we are doing, though the clustering and labeling process is being done at moment using fuzzy rule based system.

You can find the list of what you could call embeddings in this mega SQL query which are recomputed every day based on the observations: https://github.com/ooni/data/blob/main/oonipipeline/src/oonipipeline/analysis/web_analysis.py#L111.

Examples of these are things like:

dns_blocking_country_consistent
length(dns_answers)
dns_tls_consistent
etc.

In the future it would be interesting to apply some ML to these feature vectors to see if it's possible to automatically generate the labels/outcomes, however the biggest challenge in doing so is the labelling through some form of ground truth.

hellais · 2025-01-23T11:56:24Z

I think that the approach we have at the moment is working OK for the intents and purposes we need, so I am going to close this issue as done. Follow up issues shall be created as more progress is made on this front.

hellais added the priority/medium Normal priority issue label Aug 26, 2024

hellais self-assigned this Aug 26, 2024

hellais added funder/drl2022-2024 research labels Aug 26, 2024

hellais added this to Roadmap Jan 7, 2025

hellais moved this to In Progress in Roadmap Jan 8, 2025

hellais closed this as completed Jan 23, 2025

github-project-automation bot moved this from In Progress to Done in Roadmap Jan 23, 2025

hellais mentioned this issue Jan 27, 2025

Autodetection - setup system for validating the experiment results ooni/ooni.org#1450

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Look into applying some more rigorous approach to generating experiment results #84

Look into applying some more rigorous approach to generating experiment results #84

hellais commented Aug 26, 2024 •

edited

Loading

hellais commented Aug 26, 2024

hellais commented Nov 15, 2024

kopekC commented Dec 29, 2024

hellais commented Jan 23, 2025

hellais commented Jan 23, 2025

Look into applying some more rigorous approach to generating experiment results #84

Look into applying some more rigorous approach to generating experiment results #84

Comments

hellais commented Aug 26, 2024 • edited Loading

hellais commented Aug 26, 2024

hellais commented Nov 15, 2024

kopekC commented Dec 29, 2024

hellais commented Jan 23, 2025

hellais commented Jan 23, 2025

hellais commented Aug 26, 2024 •

edited

Loading