Research shows Google collects 20x more data from Android than Apple collects from iOS. Block this non-consensual telemetry using pihole blocklists.

Overview

pihole-antitelemetry

Research shows Google collects 20x more data from Android than Apple collects from iOS. Block both using these pihole lists.

Project Goals and FAQ

The purpose and goals of this blocklist are outliined in the FAQ

The lists

For a list of domains that should not break anything, use telemetry-domains.txt.

If you want to also block domains that are not well-tested, add telemetry-domains_beta.txt to the list.

Domains that are known to break user-facing services or apps are moved to telemetry-domains_borked.txt for use at your own risk.

These lists are maintained with help from the community. Any efforts to help sort domains into the correct blocklists will help others better block non-consensual telemetry.

Existing lists

Comments
  • `bag.itunes.apple.com` necessary for iOS App Store

    `bag.itunes.apple.com` necessary for iOS App Store

    Hey, thank you for these awesome blocklists!

    I started using the beta blocklist file and recognized that bag.itunes.apple.com is needed for the iOS app store to be able to load.

    opened by puresick 2
  • Document how to import blocklists

    Document how to import blocklists

    It took me some time to figure out how to add blocklists to Pi-hole. After you know it it pretty obvious and straight forward but you need to know where to look first :)

    opened by danifr 2
  • Telemetry file is empty

    Telemetry file is empty

    Hi! Why is the file empty? https://github.com/MoralCode/pihole-antitelemetry/blob/main/telemetry-domains.txt It doesn't make sense to block nothing... Thanks

    opened by SeriousM 2
  • Move xp.apple.com and humb.apple.com to borked list

    Move xp.apple.com and humb.apple.com to borked list

    Found two issues via trail and error:

    1. xp.apple.com prevents me to upgrade from 14.4.x to 14.5 via iOS beta preview developer profile
    2. humb.apple.com prevented me afterwards enabling my Apple Watch for unlocking my iPhone 11 Pro, when wearing a mask

    By the way, running python3 update-lists.py didn't remove the two domains in question from telemetry-domains_beta.txt, albeit it inserts them into telemetry-domains_borked.txt.

    opened by jk 1
  • Check additional information PDF for more domains

    Check additional information PDF for more domains

    there is a pdf (https://www.dropbox.com/s/qaazwyaj2ihj4qa/apple_google_additional_material.pdf) linked in the first article to more information about the connections made, which may contain more domains than are in the initial article.

    Would like to have these included in domains.csv too

    help wanted good first issue 
    opened by MoralCode 1
  • organize domains and catalog what information each one collects

    organize domains and catalog what information each one collects

    Would like to add information about what each domain is used for/what information is collected according to the results of the paper. This can help people understand which domains they want to blocked and what else may be affected.

    help wanted 
    opened by MoralCode 0
  • Domain name formatting

    Domain name formatting

    as mentioned in the UPDATING file, there are some domains in the list that represent the same domain but differ by a commonly confused character (like dashes, em dashes, and en dashes). Theres also a few that look like two characters smushed together (I think there's one like that for fi in a couple places). These are likely due to copy-paste errors or from parsing the PDF, however I want to be sure that these different domains have no chance of being valid or in-use domains at all before removing them as this may be a tactic used by google/Apple .etc to evade domain-based blocks using common confusable characters.

    For this issue I would like to:

    • [ ] look into the validity criteria for a domain name to determine what unicode/funky characters are or are not allowed by the spec
    • [ ] replace these characters with the canonically correct version(s) and update the id field for any groups of entries that represent the same domain and are merged with an existing category

    This will likely be necessary to have complete before #10 will be able to work properly as that relies on having a consistent domain list

    help wanted good first issue 
    opened by MoralCode 0
  • Create a script to help automate simple changes to the csv domain list

    Create a script to help automate simple changes to the csv domain list

    the UPDATING guide in this repo details some quirks and weird workarounds regarding how the CSV should best be edited.

    It may be helpful to create an additional python script that allows a domain name and list name to be passed as a parameter so it can automatically update the list of every instance of that domain to the new list.

    Related: #9

    opened by MoralCode 3
  • What other blocklist formats should be supported?

    What other blocklist formats should be supported?

    What other blocklist formats are out there that people would like to see supported?

    This is primarily a voting issue for people to leave a comment to mention their preferred format and upvote ones they would like to see added.

    question 
    opened by MoralCode 0
  • Validation of block lists

    Validation of block lists

    I think it would be both interesting and useful to replicate some of the methods used from the linked papers as a means of validating domains in the beta and borked lists to gather some evidence to promote them to the main list.

    That said there are other ways to help improve these lists, namely just by adding the beta and/or borked lists to your own pihole installation and report any domains that are in a list that they should not be in (i.e. domains in beta that break apps and services).

    Since this repository is mainly intended as a community resource to gather information on these domains, any contributions you can make will ultimately benefit other pihole users and blocklist creators.

    help wanted 
    opened by MoralCode 0
  • find a better way to manage and maintain the spreadsheet

    find a better way to manage and maintain the spreadsheet

    as seen by #4, the domains.csv is a spreadsheet of all the domains that were either part of the original paper or were submitted by users to this repo or on Reddit. Maintaining it in a CSV seems a bit cumbersome, so I'm looking for alternative ways to maintain this list as a community resource, ideally under version control.

    Looking for ideas for how to do this. Using a shared, public-comment google doc seems pretty counter to the goals of this repo (and Id prefer an open, or at least minimally harmful-to-users platform anyway). Currently thinking of using a GitHub Wiki for this but maintaining it as a markdown table doesn't seem very portable/usable to other blocklist maintainers.

    documentation help wanted 
    opened by MoralCode 1
Owner
Adrian Edwards
Adrian Edwards
Easily pull telemetry data and create beautiful visualizations for analysis.

This repository is a work in progress. Anything and everything is subject to change. Porpo Table of Contents Porpo Table of Contents General Informati

Ryan Dawes 33 Nov 30, 2022
Example-custom-ml-block-keras - Custom Keras ML block example for Edge Impulse

Custom Keras ML block example for Edge Impulse This repository is an example on

Edge Impulse 8 Nov 2, 2022
Much faster than SORT(Simple Online and Realtime Tracking), a little worse than SORT

QSORT QSORT(Quick + Simple Online and Realtime Tracking) is a simple online and realtime tracking algorithm for 2D multiple object tracking in video s

Yonghye Kwon 8 Jul 27, 2022
A tutorial showing how to train, convert, and run TensorFlow Lite object detection models on Android devices, the Raspberry Pi, and more!

A tutorial showing how to train, convert, and run TensorFlow Lite object detection models on Android devices, the Raspberry Pi, and more!

Evan 1.3k Jan 2, 2023
A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

Confluence: A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection 1. 介绍 用以替代 NMS,在所有 bbox 中挑选出最优的集合。 NMS 仅考虑了 bbox 的得分,然后根据 IOU 来

null 44 Sep 15, 2022
A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

Fast Symbolic Regression Symbolic Regression is a non-linear, non-parametric Machine Learning method capable of modeling complex data sets. fastsr aim

VAMSHI CHOWDARY 3 Jun 22, 2022
SIEM Logstash parsing for more than hundred technologies

LogIndexer Pipeline Logstash Parsing Configurations for Elastisearch SIEM and OpenDistro for Elasticsearch SIEM Why this project exists The overhead o

null 146 Dec 29, 2022
[CVPR 2021] Teachers Do More Than Teach: Compressing Image-to-Image Models (CAT)

CAT arXiv Pytorch implementation of our method for compressing image-to-image models. Teachers Do More Than Teach: Compressing Image-to-Image Models Q

Snap Research 160 Dec 9, 2022
We are More than Our JOints: Predicting How 3D Bodies Move

We are More than Our JOints: Predicting How 3D Bodies Move Citation This repo contains the official implementation of our paper MOJO: @inproceedings{Z

null 72 Oct 20, 2022
An AI Assistant More Than a Toolkit

tymon An AI Assistant More Than a Toolkit The reason for creating framework tymon is simple. making AI more like an assistant, helping us to complete

TymonXie 46 Oct 24, 2022
HMLLDB is a collection of LLDB commands to assist in the debugging of iOS apps.

HMLLDB is a collection of LLDB commands to assist in the debugging of iOS apps. 中文介绍 Features Non-intrusive. Your iOS project does not need to be modi

mao2020 47 Oct 22, 2022
An algorithm study of the 6th iOS 10 set of Boost Camp Web Mobile

알고리즘 스터디 ?? 부스트캠프 웹모바일 6기 iOS 10조의 알고리즘 스터디 입니다. 개인적인 사정 등으로 S034, S055만 참가하였습니다. 스터디 목적 상진: 코테 합격 + 부캠끝나고 아침에 일어나기 위해 필요한 사이클 기완: 꾸준하게 자리에 앉아 공부하기 +

null 2 Jan 11, 2022
Applicator Kit for Modo allow you to apply Apple ARKit Face Tracking data from your iPhone or iPad to your characters in Modo.

Applicator Kit for Modo Applicator Kit for Modo allow you to apply Apple ARKit Face Tracking data from your iPhone or iPad with a TrueDepth camera to

Andrew Buttigieg 3 Aug 24, 2021
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

Detectron is deprecated. Please see detectron2, a ground-up rewrite of Detectron in PyTorch. Detectron Detectron is Facebook AI Research's software sy

Facebook Research 25.5k Jan 7, 2023
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

MMF is a modular framework for vision and language multimodal research from Facebook AI Research. MMF contains reference implementations of state-of-t

Facebook Research 5.1k Jan 4, 2023
OHLC Average Prediction of Apple Inc. Using LSTM Recurrent Neural Network

Stock Price Prediction of Apple Inc. Using Recurrent Neural Network OHLC Average Prediction of Apple Inc. Using LSTM Recurrent Neural Network Dataset:

Nouroz Rahman 410 Jan 5, 2023
This Jupyter notebook shows one way to implement a simple first-order low-pass filter on sampled data in discrete time.

How to Implement a First-Order Low-Pass Filter in Discrete Time We often teach or learn about filters in continuous time, but then need to implement t

Joshua Marshall 4 Aug 24, 2022
Pytorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Parallel Tacotron2 Pytorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Keon Lee 170 Dec 27, 2022