Fingerprinting SecureDrop .onion services

dachary · November 2, 2017, 7:52am

Bonjour,

In a recent paper, How Unique is Your .onion? the authors included some SecureDrop .onion sites and calculated a score based on algorithms. Among them is Effective Attacks and Provable Defenses for Website Fingerprinting.

It would be interesting to repeat the process to confirm the results of the paper. And we could also add that to the SecureDrop integration tests. I’m not saying the results in the paper are incorrect: I will only be able to formulate an opinion when I’m able to reproduce them.

Cheers

redshiftzero · November 2, 2017, 3:40pm

Note we did have a project to do this (now defunct - this would take significant effort to convert into integration tests): https://github.com/freedomofpress/fingerprint-securedrop/

dachary · November 5, 2017, 7:16pm

research into what kind of random padding would effectively mitigate this threat https://petsymposium.org/2017/papers/issue2/paper54-2017-2-source.pdf

ageis · November 11, 2017, 10:53pm

My personal opinion is figure out some random padding and just do it. For nearly two years the project has collectively been aware of the risk of website fingerprinting, a problem amplified by the small set of Tor hidden services and the fact that almost every SecureDrop site is identical. That it’s within the realm of possibility has been already borne out by the research. We invested in crawling / machine learning tools in order to figure out if we can do it ourselves and get positive results, seemingly in order to confirm what we already know. Meanwhile, during the whole time folks were doing that, SecureDrop sites have continued to reside on the onion web, vulnerable to passive attacks, with no defenses deployed. We suspect random padding at some layer will help marginally; we just don’t know how much. However, we already know that adding random padding wouldn’t necessarily hurt—IMO there is no excuse, you should just do it.

dachary · November 11, 2017, 11:52pm

Unless someone else is willing, I’ll start working on this tomorrow.

redshiftzero · November 21, 2017, 1:37am

Sorry - I missed this thread - we get notifications in the FPF slack for comments on GitHub but not yet Discourse.

However, we already know that adding random padding wouldn’t necessarily hurt—IMO there is no excuse, you should just do it.

Just limiting the conversation here to the paper in question, it states that the most distinguishing feature for SecureDrop was the large size of the site (total incoming packet size). Adding random padding does not address this. Indeed, it may make* the situation worse, as already very large sites with random padding may be even easier to fingerprint.

From the paper on SecureDrop specifically:

In particular, we noted that these pages embed images and use scripts and CSS styles that make them large and therefore distinguishable.

If we want to move toward to goal of making SecureDrop less fingerprintable, we should first decrease the size of the site, which is in line with e.g. Remove jQuery dependency by kaizensoze · Pull Request #1298 · freedomofpress/securedrop · GitHub.

If it was so simple, then we would indeed “just do it”, but this is a complicated problem as evidenced by the many academic computer science researchers working on this problem with suggestions evolving with each paper ;).

[*] note we can’t say definitely, as we have no method for testing. Furthermore, I do not think the SecureDrop engineering team should spend significant effort on developing this testing framework and instead we should focus our limited resources on addressing the much easier attacks on source anonymity that we know are currently happening.

github.com/freedomofpress/securedrop-docs

Publish journalist best practice guide

opened 02:21AM - 29 Sep 17 UTC

redshiftzero

Since this is so critical to the anonymity of SecureDrop sources, we should prov…ide more clear guidance on how journalists should be discussing and working with source materials (beyond just redacting metadata as described in freedomofpress/securedrop#1449). Some organizations may have their own internal processes worked out, but other organizations need more guidance than we are providing. It's fine if this is just linking out to other resources - if these resources exist, let's compile them in this ticket. Questions that need answers to be written down: - [ ] How should a team of journalists and editors communicate about source materials? (i.e. Signal groups, _not_ Slack or email) - [ ] What are the best practices for validating documents without leaking source identity? (there are existing resources for this, let's link to them) - [ ] Questions in freedomofpress/securedrop#1449 can go in this guide # User Stories As a journalist or editor, I want clear guidance on how I should responsibly work with SecureDrop documents to protect their identity.

github.com/freedomofpress/securedrop

Investigate potential attempt to compromise SVS

opened 12:36AM - 01 Sep 17 UTC

closed 12:08AM - 29 Nov 17 UTC

garrettr

security

Kevin Poulsen just [tweeted](https://twitter.com/kpoulsen/status/903412661404053…506) a snippet of code that he says he received on his SecureDrop. The code snippet is incomplete, but it appears to be an attempt to exfiltrate sensitive data from the airgapped Secure Viewing Station (SVS). Normally we would prefer to discuss potential security issues privately, in order to develop and deploy a fix without encouraging potential exploitation in case this really is a security vulnerability. In this case, the cat's out of the bag thanks the issue being reported publicly on Twitter, so we feel it's best to discuss it on an open forum in the interest of transparency.

dachary · November 21, 2017, 9:21am

Hey @redshiftzero

Thanks for explaining

Is there a place where I could learn more about that ? It makes perfect sense to me that a web page with a fixed size can be used to fingerprint a site (it may be the only page in all THS with this size), I don’t get how a large page can be a problem. Either it is the largest page of all THS services in which case, well yes, that makes sense. But I’d be surprised if it was the case Or its size is in the largest pages available in all THS and I understand if it would be better if its size would be closer to the average. But even in this case varying the size of the page looks like an effective way of improving the situation. I’m not trying to make a point: I’m very ignorant about all this. I’m writing my reasoning so you can better point at what’s wrong with it.

I do not think the SecureDrop engineering team should spend significant effort on developing this testing framework and instead we should focus our limited resources on addressing the much easier attacks on source anonymity that we know are currently happening.

I agree 100%. I was hoping for an easy way to improve things and made this pull request in that spirit. I’m not very excited at the idea of spending weeks working on this specific topic, specially since there is no way to repeat the conclusions found in the academic papers.

However I’m very motivated to understand why a fixed size is not the easiest fingerprinting signature.

Cheers