dodge data mining

WHAT IS DATA MINING?

Data mining is the process of discovering patterns in large data sets. In other words, let’s say you’re a company with a huge amount of information on your customers: how they find your site, what they click on, what pages they stay on longest, and whether they ultimately buy your product. You want to design your site so that the highest possible percentage of users buy your product, so you use the information collected on your site to determine what you should change or improve.

WHY YOU SHOULD CARE

Most of us probably don’t care if a car dealership tracks what links on their site we click so that they can improve their headlines and content. Unfortunately, one of the golden rules of the internet is any data that is collected on you can be accidentally made public or stolen.

Ten years ago, AOL accidentally released three months of its users’ search queries, containing keywords from 650,000 users. While these users were identified in the logs only by an anonymized user ID number, the New York Times successfully used information in the searches to identify several of the “anonymous” searchers. And some of these searches are about as personal and as embarrassing as it gets.

IT’S MORE THAN JUST YOUR SEARCHES

It’s pretty clear why advertisers would be interested in our search queries: if you’re searching for car dealerships near you, one of those dealers can serve you a targeted ad with a deal that is extremely likely to appeal to you. Google, however, doesn’t limit its data collection to what you’re searching for: it collects passwords, e-mails and other personal information from unsuspecting computer users via its Street View vehicles, and it collects and analyzes the content of Gmail and sells that to advertisers.

HOW TO DODGE DATA MINING

If you’re like many of us here at SpiderOak, and want to protect your online privacy and dodge data mining, we recommend using these tools:

Free browser add-ons to limit tracking:

  • Privacy Badger detects third-party domains and blocks if it can determine they are tracking you.
  • Disconnect blocks third-party trackers collecting, retaining or sharing user data. It also organizes blocked tracking requests into groups: advertising, analytics, social media and content.
  • RedMorph blocks every tracking signal it detects, without discrimination between the good and the bad.
  • HTTPS Everywhere by the EFF, a browser extension that encrypts your communications with major websites
  • AdBlocker Plus “Surf the web without annoying ads!
  • Privacy Badger also by the EFF, this tool blocks spying ads and invisible trackers

Search Engine:

  • Duck Duck Go: “The Search Engine That Doesn’t Track You”

Privacy-friendly web browsers:

  • Epicis a browser that takes a minimalistic approach to browsing in order to maximize privacy. Both cookies and trackers are deleted after each session and that all browser searches are proxied through their own servers, meaning that there is no way to connect an IP address to a search.
  • Dooble is a stripped back Chromium-based browser that is able to run on Windows, Linux and OS X, Dooble offers strict privacy features. But be warned: it will disable insecure Web-based interfaces such as Flash and Javascript, which will make some Web pages harder to read.

Other apps to consider:

VPNs:

  • NordVPN’s servers operate under the jurisdiction of Panama because, in Panama, internet providers are not obliged to monitor traffic or keep any data logs. It has servers in almost 50 countries and provides double data encryption, and an automatic “kill switch” to protect sensitive information in one click.
  • PureVPN is a VPN service provider that is fast and secure. The company is based in Hong Kong but will let you connect to more 500 servers across 141 countries via a 256-bit grade encryption.

For a dozens of more options, including many more technical solutions, check out Privacytools.io.

What about you? What are some of your favorite privacy tools? Let us know on Twitter @spideroak.

Last updated 2017/11/14