Filters to block and remove copycat-websites from DuckDuckGo and Google. Specific to dev websites like StackOverflow or GitHub.

Overview

uBlock-Origin-dev-filter

Filters to block and remove copycat-websites from DuckDuckGo and Google. Specific to dev websites like StackOverflow or GitHub.

Import into uBlock Origin

  1. Open uBlock Origin settings
  2. Under the "Filter" tab, scroll to the bottom where it says “Custom” and click the “Import” checkbox to reveal the custom URL textbox
  3. Append the URL https://raw.githubusercontent.com/quenhus/uBlock-Origin-dev-filter/main/dist/google_duckduckgo/all.txt in the textbox
  4. Press Apply Changes in the upper right

Adding URL's

Please create a pull-request or start an issue with evidence against the "copycats".

Sources

Do your own

  1. List URL that you want to block in a .txt in the data/ folder
  2. Use src/generate.py, which generate files in dist/ you can use as uBlock filters

Manual Google filter

  1. Take an URL *.abc.example.com or iam.copycat.com/*
  2. Remove * prefix or /* suffix
  3. Surround the URL with google.*##.g:has(a[href*=" and "])
  4. For example, it gives google.*##.g:has(a[href*=".abc.example.com"]) and google.*##.g:has(a[href*="iam.copycat.com"])

Manual DuckDuckGo filter

  1. Take an URL *.abc.example.com or iam.copycat.com/*
  2. Remove * prefix or /* suffix
  3. Surround the URL with duckduckgo.*##.results > div:has(a[href*=" and "])
  4. For example, it gives duckduckgo.*##.results > div:has(a[href*=".abc.example.com"]) and duckduckgo.*##.results > div:has(a[href*="iam.copycat.com"])
Comments
  • add more stackoverflow copycats

    add more stackoverflow copycats

    i started creating my own list and found there are some missing in your list as well. but i didn’t filtered by wiki/github copycats, so maybe there are not 100% stackoverflow copycats.

    opened by Maybach91 9
  • Request: add Wikipedia clones

    Request: add Wikipedia clones

    Another category of copycat sites that I find maddening are Wikipedia clones. Wikipedia is freely licensed and even allows database dumps, so there are a ton of annoying mirrors out there. These sites can show up in search results for just about any topic. Here are a few:

    *://www.wikiwand.com/*
    *://wiki2.org/*
    *://worddisk.com/wiki/*
    *://thereaderwiki.com/*
    *://www.absoluteastronomy.com/*
    *://encyclopedia.thefreedictionary.com/*
    *://peoplepill.com/*
    *://www.algebra.com/algebra/about/history/*.wikipedia
    

    You can see that these are clones from the following links:

    • https://www.wikiwand.com/
    • https://wiki2.org/
    • https://worddisk.com/wiki/search/
    • https://thereaderwiki.com/en/
    • http://www.absoluteastronomy.com/ (just click any link and scroll to the bottom to see the attribution)
    • https://encyclopedia.thefreedictionary.com/
    • https://peoplepill.com/ (just click any link and scroll to the bottom to see the attribution)
    • https://www.algebra.com/algebra/about/history/Main_Page.wikipedia

    There are a lot more copycats out there (Wikipedia maintains a giant list, but it includes many sites that have just copied a small amount of content and not the entire site, and I'm sure there are some dead/outdated entries as well), but I figured I'd just post a few to start.

    block-request 
    opened by oxguy3 9
  • Question: Is it fine to add URLs from another list (with evidences)?

    Question: Is it fine to add URLs from another list (with evidences)?

    Hello.

    Is it fine to add URLs from another list (with evidences)?

    For example - will it be OK to submit a block request issue based on https://github.com/vladgba/Back2source/blob/master/back2source.user.js content (if I will provide evidence for each site in this list)?

    I don't see any license requirements from the origin of this list, but how about you?

    Thank you.

    opened by S-trace 8
  • Request: add bikehike.org + others to the filter

    Request: add bikehike.org + others to the filter

    All sites:

    • https://bikehike.org/what-muscles-does-cycling-tone/
    • https://allfamousbirthday.com/faqs/does-cycling-give-you-a-big-bum/
    • https://iron-set.com/us/how-do-you-build-your-outer-thigh-muscle/
    • https://wikipedikia.org/is-20-minutes-of-cycling-a-day-enough/

    Original: https://www.welovecycling.com/wide/2018/06/21/embracing-strong-legs-how-cycling-does-doesnt-change-a-womans-body/

    block-request 
    opened by purpleyuan 5
  • Request: add tutorialmeta.com to the filter

    Request: add tutorialmeta.com to the filter

    Evidence: https://tutorialmeta.com/question/bookmarklet-containing-a-button Original: https://stackoverflow.com/questions/50040152/bookmarklet-containing-a-button

    Doesn't seem like this site has fully copied SOF yet, as the domain is probably very new, this year. Here's a funny, at the bottom of each page: "© 2022 All rights reserved"

    block-request 
    opened by realAzazello 5
  • OK to include in the letsblock.it project?

    OK to include in the letsblock.it project?

    Hello @quenhus,

    I am the maintainer of https://letsblock.it, a uBlock Origin list generator that allows users to pick and customize filter templates to filter out low-value content. The most requested feature so far is to extend the hide websites from search results template with presets to hide Github and Stackoverflow copycats. Instead of duplicating work, I'd love to reuse the data from your project and import it as user-selectable presets.

    I am currently working on the implementation PR: https://github.com/xvello/letsblockit/pull/64 and will deploy a staging version when the frontend is ready. I'd love to have your input on it, whether that's questions, ideas or concerns.

    opened by xvello 5
  • Fix syntax

    Fix syntax

    The Python script uses ||example.com$all, which seems to blocking any domain containing example.com. In my opinion, this syntax is better:

    ||example.com^$all
    

    This is the syntax used in uBo’s own filter lists (example) and many other filter lists for completely blocking a domain. Docs: https://github.com/gorhill/uBlock/wiki/Static-filter-syntax#static-network-filtering

    opened by iam-py-test 4
  • data: add Wikipedia mirrors

    data: add Wikipedia mirrors

    Original: https://en.wikipedia.org/wiki/First_five-year_plan (revision unknown)

    Evidence(*): https://paperzz.com/doc/7738264/first-five-year-plan--soviet-union-

    Notice the image to the right and its caption. Fortunately it's unchanged in the original wikipedia article at the time of writing. Based on this, further URLs were discovered.

    URLs:

    Query term/needle: "One of the primary objectives of Stalin's First Five-Year Plan was to build up the country's heavy industry." DDG results:

    • https://www.studymode.com/essays/Stalin-5-Year-Plan-1777540.html
    • https://www.liquisearch.com/first_five-year_plan_soviet_union
    • https://paperzz.com/doc/7738264/first-five-year-plan--soviet-union-
    • https://edufrogs.com/stalin-5-year-plan-essay/
    • https://patrickkidd3.typepad.com/blog/food-and-drink/ (** obvious SEO spam)
    • http://josehist.weebly.com/industrialization-5-year-plans-1928-1941.html (** any link on this page yields a redirect suggestion)

    Query term/needle: "The First Five-Year Plan, or 1st Five-Year Plan, of the Union of Soviet Socialist Republics (USSR) was a list of economic goals," DDG results:

    • https://www.primidi.com/first_five-year_plan_soviet_union

    Query term/needle: "The First Five-Year Plan, or 1st Five-Year Plan, of the Union of Soviet Socialist Republics (USSR) was a list of economic goals, created by Joseph Stalin and based off his policy of Socialism in One Country, that was designed to strengthen the country's economy between 1928 and 1932." Google results:

    • https://wn.com/First_Five-Year_Plan
    • https://actingcollegeses.com/library/acting-questions/read/61685-what-did-lil-wayne-mean-by-lasagna (** SEO spam)
    opened by trashman9000 4
  • Remove down websites from the filter

    Remove down websites from the filter

    I don't want to keep useless block rules in the filter. I created a tool to help detect domains that are down (/src/clean_data/main.py).

    If anyone want to help me with that :D

    I think we can remove domains without A/AAAA DNS response. However it would be great to use a Google "site:DOMAIN linux" to determine whether each domain is still used as a mirror. Don't forget to disable uBlock-Origin-dev-filter while doing so, otherwise you will only get empty responses.

    help wanted 
    opened by quenhus 4
  • Request: add javaallin.com to the filter

    Request: add javaallin.com to the filter

    Evidence: https://www.javaallin.com/code/write-avro-files-with-logicaltype-timestamp-millis-on-date-timestamps-in-java.html Original: https://stackoverflow.com/questions/62696580/write-avro-files-with-logicaltype-timestamp-millis-on-date-timestamps-in-java

    block-request website-is-down 
    opened by kegelink 4
  • Request: add many to the filter

    Request: add many to the filter

    Please block many mirrors

    2022-02-17_21-23

    Evidences: https://gitfreak.com/skyra-project https://cnpmjs.org/package/@aero/klasa/v/0.5.0-rc1 https://npmmirror.com/package/@aero/klasa Original: github npmjs

    block-request 
    opened by matteoturini 4
  • Request: add 9to5science.com to the filter

    Request: add 9to5science.com to the filter

    Evidence: https://9to5science.com/latex-warning-citation-undefined
    Original: https://tex.stackexchange.com/questions/168312/latex-warning-citation-undefined

    block-request 
    opened by daigennki 0
  • Request: add tagmerge.com to the filter

    Request: add tagmerge.com to the filter

    Evidence: https://tagmerge.com/question/rebase-failed-cannot-merge-binary-files-monopolee-xlsx

    Original: https://stackoverflow.com/questions/71813809/rebase-failed-cannot-merge-binary-files-monopolee-xlsx/71855238#71855238

    block-request 
    opened by AndrewKvalheim 0
  • Typing jank on google.com search field with

    Typing jank on google.com search field with "uBlock-Origin-dev-filter – All Search Engines – Global"

    This was originally filed in BMO as Bug 1801597 - Typing jank on google.com search field with adblocking filter "uBlock-Origin-dev-filter – All Search Engines – Global". uBO author gorhill suggested that I should report to you instead (extra info with some pointers here and here)

    I copy the my BMO report here:

    User Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:109.0) Gecko/20100101 Firefox/109.0 Build ID: 20221120214001

    Steps to Reproduce

    1. In a new profile, install uBlock Origin (currently 1.45.2).
    2. Subscribe uBlock-Origin-dev-filter – All Search Engines – Global(AG). (Or use this permalink url)
    3. Go to https://www.google.com/search?q=f. (So it skips the seemingly unaffected search field at google.com homepage.)
    4. Press and hold a key to type rapidly into google.com search field. Adding some spaces inbetween seems to make the issue worse.

    Expected Results

    Typing should be smooth and there should be little janks in performance profiles.

    Actual Results

    There are some janks/slowness as shown in the attached video and profiles (attached in section Screenshot). The situation, however, is much worse in my main profile. I can see the jank even if I type a search phrase normally.

    Additional Notes

    This issue seems to be related to google.com's autocomplete feature. If I make an extremely long search phrase, autocomplete flyout will not be activiated and the jank is gone. The jank is back after clicking the X in the search field to clear the search phrase and type again. However, Google will temporarily suspend autocomplete (https://www.google.com/complete/search?q=* requests return 403) for an unknown period of time (few hours max?) if I spam the autocomplete. I cannot reproduce the jank when autocomplete is suspended. Changing IP address or set up a new profile may lift the suspension.

    System configuration

    OS version: Windows 10 64bit 22H2 19045.2251 GPU model: Nvidia GTX 760 driver 473.81 (Aug 02, 2022) Number of cores: i7-4771 4 cores 8 threads Amount of memory (RAM): 16GB SSD drive Tested uBO versions: 1.45.3b7, 1.45.2, 1.35.2 (earliest version available in AMO) Tested Firefox versions: Nightly 2022-11-23, 2022-11-20, Firefox 107

    Performance profiles

    Captured in Firefox 107, new profile, uBO 1.45.3b7 with AG: https://share.firefox.dev/3Eug06G Disabling AG: https://share.firefox.dev/3hY0ZSX

    Video demonstrations of the issue

    https://user-images.githubusercontent.com/771216/203749338-8fbdf4cb-8be9-4aee-91f3-2e16fc18b2f4.mp4

    Performance recording (profile)

    Profile URL: These are the profiles as shown in the above video in a new Nightly profile with uBO 1.45.2: 1st [0:01 - 0:06], AG enabled: https://share.firefox.dev/3UYMj4H 2nd [0:18 - 0:21], reload google.com and try again: https://share.firefox.dev/3gnyUUS 3rd [0:37 - 0:42], AG disabled. No janks: https://share.firefox.dev/3V3ZuB4 4th [0:59 - 1:04], AG re-enabled. Janks again: https://share.firefox.dev/3V88ZyY 5th (not shown in the video), Accessibility is force-disabled to eliminate bug 1800028: https://share.firefox.dev/3Gu6HX7

    about:support of this profile: uBO dev filter about support.txt


    https://user-images.githubusercontent.com/771216/203750719-032083d7-b893-40ec-a2cc-ac1c3bd78db3.mp4

    The jank is much more noticeable in my main profile. Granted there should are other factors causing the jank, but disabling AG vastly improves the jank to an acceptable level.

    Profile captured during the video; AG enabled: https://share.firefox.dev/3ENTdUH AG disabled (captured at a much later time and browser restarted, PIDs had changed): https://share.firefox.dev/3Aw8WWa

    [main profile] about support.txt


    https://user-images.githubusercontent.com/771216/203756860-64705082-4e52-4ec7-bf84-c28e379c2eb4.mp4

    I can still reproduce the jank with a modern gaming laptop using (essentially) the same profile as my main profile.

    The laptop was plugged in with a high performance power profile. iGPU was used in the video but dGPU had the jank too.

    Profile URLs: 1st, AG enabled: https://share.firefox.dev/3ErP8Es 2nd, AG disabled: https://share.firefox.dev/3U2qqQU 3rd, AG re-enabled: https://share.firefox.dev/3GDDvwZ

    During the 3rd profile the jank was gone when autocompletes returned 400 (not 403) if the search phrase was too long.

    Thank you.

    opened by Fanolian 3
Owner
null
Query Amalgamator over StackOverflow and YouTube

QASY Query Amalgamator over StackOverflow and YouTube Decription A software you can use to save your valuable time of googling the errors you encounte

null 1 Nov 7, 2021
Stackoverflow Telegram Bot With Python

Template for Telegram Bot Template to create a telegram bot in python. How to Run Set your telegram bot token as environment variable TELEGRAM_BOT_TOK

PyTopia 10 Mar 7, 2022
First Party data integration solution built for marketing teams to enable audience and conversion onboarding into Google Marketing products (Google Ads, Campaign Manager, Google Analytics).

Megalista Sample integration code for onboarding offline/CRM data from BigQuery as custom audiences or offline conversions in Google Ads, Google Analy

Google 76 Dec 29, 2022
Image-Bot-Discord - This Is a discord bot that shows the specific image you search from Google

Advanced Discord.py Image Bot CREDITS Made by RLX and Mathiscool README by Milrato Installation Guide in .env Adjust the TOKEN python main.py to start

RLX 3 Jan 16, 2022
Support for Competitive Coding badges to add in Github readme or portfolio websites.

Support for Competitive Coding badges to add in Github readme or portfolio websites.

Akshat Aggarwal 2 Feb 14, 2022
A tool for exporting Telegram group chats into static websites, preserving chat history like mailing list archives.

tg-archive is a tool for exporting Telegram group chats into static websites, preserving chat history like mailing list archives. Preview The @fossuni

Kailash Nadh 400 Dec 27, 2022
Telegram hack bot [ For Dev ]

Telegram hack bot [ For Dev ]

Alison Parker 1 Jul 4, 2022
A twitter bot that simply replies with a beautiful screenshot of the tweet, powered by beautify.dhravya.dev

Poet this! Replies with a beautiful screenshot of the tweet, powered by poet.so Installation git clone https://github.com/dhravya/poet-this.git cd po

Dhravya Shah 30 Dec 4, 2022
Pinopoly is a tool to remove the "banker" player and replace them with a digitalized system

Pinopoly is a tool to remove the "banker" player and replace them with a digitalized system. It is intended to be used on a Raspberry Pi but can be used in the command line as well.

Alex Overstreet 11 Jul 9, 2022
A Telegram Music Tag Editor Bot that can remove almost all usernames in the music tags and add own username instead.

Music Tag Editor Bot A Telegram Music Tag Editor Bot that can remove almost all usernames in the music tags and add own username instead. It can also

null 14 Oct 21, 2022
Download apps and remove icloud

Download apps and remove icloud

0x 70 Dec 29, 2022
A simple telegram bot to help you to remove forward tag from post from any messages . Maded in python3 using @Pyrogram . Developed by @Kunal-Diwan

Frwd-Tag-Remover Telegram Bot to Remove forward tag from any Post . If you need any more modes in repo or If you find out any bugs, mention in @Develo

Kunal Diwan 2 Oct 14, 2022
A Bot To remove forwarded messages

Forward-Mess-Remover A Bot To remove forwarded messages. uses Remove forwarded messages from Group. Deploy To Heroku

SpamShield 5 Oct 14, 2022
An simple python script for remove rockstar account for fivem, very useful for cheating or change account for unban from an server, remember change steam account.

This tool is used for be unbanned from servers, u need disconnect the discord, use other steam account and uninstall xbox for be unbanned 100%, it only work for unban in server, not global

MiguDEV 4 Oct 10, 2022
An advanced Filter Bot with nearly unlimitted filters!

Unlimited Filter Bot ㅤㅤㅤㅤㅤㅤㅤ ㅤㅤㅤㅤㅤㅤㅤ An advanced Filter Bot with nearly unlimitted filters! Features Nearly unlimited filters Supports all type of fil

TroJanzHEX 445 Jan 3, 2023
Rotates Amazon Personalize filters on a schedule based on dynamic templates

Amazon Personalize Filter Rotation This project contains the source code and supporting files for deploying a serverless application that provides aut

James Jory 2 Nov 12, 2021
An advanced Filter Bot with nearly unlimitted filters!

Unlimited Filter Bot ㅤㅤㅤㅤㅤㅤㅤ ㅤㅤㅤㅤㅤㅤㅤ An advanced Filter Bot with nearly unlimitted filters! Features Nearly unlimited filters Supports all type of fil

null 1 Nov 20, 2021
An advanced Filter Bot with nearly unlimitted filters

Telegram MTProto API Framework for Python Documentation • Releases • Community Pyrogram from pyrogram import Client, filters app = Client("my_account

Pyrogram 3.2k Jan 5, 2023
PyFIR - Python implementations of Finite Impulse Response (FIR) filters

pyFIR Python implementations of Finite Impulse Response (FIR) filters. The algorithms are mainly (but not strictly) the ones described in WEFERS, Fran

Davi Carvalho 4 Feb 12, 2022