A PyPI mirror client according to PEP 381 http://www.python.org/dev/peps/pep-0381/


This is a PyPI mirror client according to PEP 381 + PEP 503 http://www.python.org/dev/peps/pep-0381/.

  • bandersnatch >=4.0 supports Linux, MacOSX + Windows

bandersnatch maintainers are looking for more help! Please refer to our MAINTAINER documentation to see the roles and responsibilities. We would also ask you to read our Mission Statement to ensure it aligns with your thoughts for this project.

The following instructions will place the bandersnatch executable in a virtualenv under bandersnatch/bin/bandersnatch.

  • bandersnatch requires >= Python 3.8.0


This will pull the latest build. Please use a specific tag if desired.

  • The Docker image includes /bandersnatch/src/runner.py to periodically run a bandersnatch mirror
    • Run /bandersnatch/src/runner.py --help for usage
  • With Docker, we recommend bind mounting a read-only bandersnatch.conf
    • Defaults to /conf/bandersnatch.conf

    docker pull pypa/bandersnatch
    docker run pypa/bandersnatch bandersnatch --help


This installs the latest stable, released version.

    python3 -m venv bandersnatch
    bandersnatch/bin/pip install bandersnatch
    bandersnatch/bin/bandersnatch --help


  • Run bandersnatch mirror - it will create an empty configuration file for you in /etc/bandersnatch.conf.
  • Review /etc/bandersnatch.conf and adapt to your needs.
  • Run bandersnatch mirror again. It will populate your mirror with the current status of all PyPI packages. Current mirror package size can be seen here: https://pypi.org/stats/
  • A blocklist or allowlist can be created to cut down your mirror size. You might want to Analyze PyPI downloads to determine which packages to add to your list.
  • Run bandersnatch mirror regularly to update your mirror with any intermediate changes.
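
As a rough illustration of the review step, the sketch below parses a small configuration with Python's standard configparser and lists the packages an allowlist would keep. The section and option names ([mirror], [plugins], [allowlist], allowlist_project) and the package names are assumptions made for this example; check them against the file that bandersnatch generates in /etc/bandersnatch.conf before relying on them.

```python
import configparser
from typing import List

# Hypothetical configuration text. The section and option names are
# assumptions for illustration, not copied from a generated config file.
EXAMPLE_CONF = """\
[mirror]
directory = /srv/pypi
master = https://pypi.org

[plugins]
enabled =
    allowlist_project

[allowlist]
packages =
    requests
    django
"""


def allowlisted_packages(conf_text: str) -> List[str]:
    """Return the package names listed under [allowlist] packages."""
    parser = configparser.ConfigParser()
    parser.read_string(conf_text)
    raw = parser.get("allowlist", "packages", fallback="")
    # Multi-line values come back as one string; keep the non-empty lines.
    return [line.strip() for line in raw.splitlines() if line.strip()]


if __name__ == "__main__":
    print(allowlisted_packages(EXAMPLE_CONF))
```

A check like this can help confirm that an allowlist contains what you expect before starting a long sync.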


Configure your webserver to serve the web/ sub-directory of the mirror. For nginx it should look something like this:

    server {
        listen [::1]:80;
        server_name <mymirrorname>;
        root <path-to-mirror>/web;
        autoindex on;
        charset utf-8;
    }

  • Note that it is a good idea to have your webserver publish the HTML index files with UTF-8 as the charset. The index pages will work without it, but non-ASCII characters will be displayed incorrectly when humans look at the pages.

  • Make sure that the webserver uses UTF-8 to look up unicode path names. nginx gets this right by default - not sure about others.
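
One way to confirm the charset setting is to look at the Content-Type header the webserver returns for an index page. The following is a minimal sketch using only the Python standard library; the function names are hypothetical, and the URL you pass in depends on your own server_name.

```python
from email.message import Message
from typing import Optional
import urllib.request


def declared_charset(content_type: str) -> Optional[str]:
    """Extract the charset parameter from a Content-Type header value."""
    msg = Message()
    msg["Content-Type"] = content_type
    # Returns the charset in lower case, or None if none is declared.
    return msg.get_content_charset()


def index_charset(url: str) -> Optional[str]:
    """Fetch `url` and report the charset the webserver declares for it."""
    with urllib.request.urlopen(url) as response:
        return declared_charset(response.headers.get("Content-Type", ""))


if __name__ == "__main__":
    # No network access is needed for the header parsing itself.
    print(declared_charset("text/html; charset=UTF-8"))
```

Calling index_charset() on your mirror's simple/ index should report utf-8 if the charset directive from the nginx example is in effect.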

For more information, visit our official documentation for instructions on how to use an NGINX example Docker image.

Cron jobs

You need to set up one cron job to run the mirror itself.

Here's a sample that you could place in /etc/cron.d/bandersnatch:

    */2 * * * * root bandersnatch mirror 2>&1 | logger -t 'bandersnatch[mirror]'

This assumes that you have a logger utility installed that will convert the output of the commands to syslog entries.

systemd timers are another alternative.


bandersnatch does not keep much local state in addition to the mirrored data. In general you can just keep rerunning bandersnatch mirror to make it fix errors.

If you want to force bandersnatch to check everything against the master PyPI:

  • run bandersnatch mirror --force-check to move status files, if they exist in your mirror directory, in order to get a full sync.

Be aware that full syncs likely take hours depending on PyPI's performance and your network latency and bandwidth.

Other Commands

  • bandersnatch delete --help - Allows you to specify package(s) to be removed from your mirror (dangerous)
  • bandersnatch verify --help - Crawls your repo and fixes any missed files + deletes any unowned files found (dangerous)

Operational notes

Case-sensitive filesystem needed

You need to run bandersnatch on a case-sensitive filesystem.

OS X handles this natively even though its filesystem is not strictly case-sensitive, and bandersnatch will work fine when running on OS X. However, tarring a bandersnatch data directory and moving it to, e.g., Linux with a case-sensitive filesystem will lead to inconsistencies. You can fix those by deleting the status files and having bandersnatch run a full check on your data.
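
If you are unsure how a given volume behaves, a quick probe can tell you. This is a minimal sketch, not part of bandersnatch: it creates a file with a lower-case name in a temporary directory and checks whether the upper-case spelling resolves to the same file. The function name is hypothetical.

```python
import os
import tempfile


def is_case_sensitive(directory: str) -> bool:
    """Return True if `directory` treats 'name' and 'NAME' as different files."""
    with tempfile.TemporaryDirectory(dir=directory) as tmp:
        lower = os.path.join(tmp, "casecheck")
        upper = os.path.join(tmp, "CASECHECK")
        with open(lower, "w"):
            pass
        # On a case-insensitive filesystem the upper-case path finds the same file.
        return not os.path.exists(upper)


if __name__ == "__main__":
    print(is_case_sensitive(tempfile.gettempdir()))
```

Point it at the directory that will hold the mirror, since different mount points on one machine can behave differently.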

Windows requires elevated prompt

Bandersnatch makes use of symbolic links. On Windows, this permission is turned off by default for non-admin users. In order to run bandersnatch on Windows either call it from an elevated command prompt (i.e. right-click, run-as Administrator) or give yourself symlink permissions in the group policy editor.
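
To check whether the current user is allowed to create symbolic links before starting a sync, something like the following sketch can be used. It is an illustration with a hypothetical function name, not a bandersnatch command.

```python
import os
import tempfile


def can_create_symlinks(directory: str) -> bool:
    """Try to create a symbolic link inside `directory` and report success."""
    with tempfile.TemporaryDirectory(dir=directory) as tmp:
        target = os.path.join(tmp, "target")
        link = os.path.join(tmp, "link")
        with open(target, "w"):
            pass
        try:
            os.symlink(target, link)
        except (OSError, NotImplementedError):
            # Typically raised on Windows when the privilege is missing.
            return False
        return os.path.islink(link)


if __name__ == "__main__":
    print(can_create_symlinks(tempfile.gettempdir()))
```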

Many sub-directories needed

PyPI has an extensive list of packages that we need to maintain in a flat directory. Filesystems with small limits on the number of sub-directories per directory can run into a problem like this:

    2013-07-09 16:11:33,331 ERROR: Error syncing package: zweb@<serial>
    OSError: [Errno 31] Too many links: '../pypi/web/simple/zweb'

Specifically, we recommend avoiding ext3. ext4 and newer filesystems do not have the limit of 32k sub-directories.
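
To see how close an existing directory is to such a limit, you can count its immediate sub-directories. The sketch below is illustrative only: the constant uses the rounded 32k figure from the text above rather than an exact filesystem value, and the function names and the 90% margin are assumptions.

```python
import os

# Rounded figure taken from the text above; not an exact filesystem constant.
EXT3_SUBDIR_LIMIT = 32000


def count_subdirectories(path: str) -> int:
    """Count the immediate sub-directories of `path` (symlinks are not followed)."""
    with os.scandir(path) as entries:
        return sum(1 for entry in entries if entry.is_dir(follow_symlinks=False))


def near_subdir_limit(path: str, limit: int = EXT3_SUBDIR_LIMIT, margin: float = 0.9) -> bool:
    """Return True once the count reaches `margin` (a fraction) of `limit`."""
    return count_subdirectories(path) >= limit * margin


if __name__ == "__main__":
    print(count_subdirectories("."))
```

Running count_subdirectories() against the web/simple directory of a mirror shows how many project directories it currently holds.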

Client Compatibility

A bandersnatch static mirror is compatible only with the "static", cacheable parts of PyPI that are needed to support package installation. It does not support the more dynamic APIs of PyPI that may be used by various clients for other purposes.

An example of an unsupported API is PyPI's XML-RPC interface, which is used when running pip search.
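
The static part that installers rely on is the simple index, where each project lives under its PEP 503 normalized name. As a sketch of how a client would locate a project page on a mirror, the code below applies the normalization rule from PEP 503; the mirror base URL and the function names are hypothetical.

```python
import re


def normalize_name(name: str) -> str:
    """Normalize a project name as described in PEP 503."""
    return re.sub(r"[-_.]+", "-", name).lower()


def simple_index_url(mirror_base: str, project: str) -> str:
    """Build the simple-index URL for `project` on a mirror rooted at `mirror_base`."""
    return f"{mirror_base.rstrip('/')}/simple/{normalize_name(project)}/"


if __name__ == "__main__":
    # "http://mirror.example" is a placeholder, not a real mirror.
    print(simple_index_url("http://mirror.example", "Zope.Interface"))
```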

Bandersnatch Mission

The bandersnatch project strives to:

  • Mirror all static objects of the Python Package Index (https://pypi.org/)
  • bandersnatch's main goal is to support the main global index to local syncing only
  • This will allow organizations to have lower latency access to PyPI and save bandwidth on their WAN connections and more importantly the PyPI CDN
  • Custom features and requests may be accepted if they can be implemented as plugins
    • e.g. refer to the blocklist and allowlist plugins


If you have questions or comments, please submit a bug report to https://github.com/pypa/bandersnatch/issues/new

  • IRC: #bandersnatch on Freenode (You can use webchat if you don't have an IRC client)

Code of Conduct

Everyone interacting in the bandersnatch project's codebases, issue trackers, chat rooms, and mailing lists is expected to follow the PSF Code of Conduct.


This client is based on the original pep381client by Martin v. Loewis.

Richard Jones was very patient answering questions at PyCon 2013 and made the protocol more reliable by implementing some PyPI enhancements.

Thanks to Christian Theune for creating and maintaining bandersnatch for many years!

  • 5.0.0(Apr 28, 2021)

    New Features

    • bandersnatch is now a >= 3.8 Python project
    • New size_project_metadata filter plugin, which can deny download of projects larger than defined threshold - PR #806
    • Add option to compare file size and upload time instead of sha256sum for downloading - PR #822
    • Add optional uvloop support - PR #891 - Thanks cooperlees
    • Move to official docker upload action w/arm64 images uploaded - PR #896 - Thanks cooperlees


    • blacklist/whitelist will no longer work in bandersnatch configuration - PR #897 - Thanks cooperlees
      • Please use allowlist/denylist respectively

    Bug Fixes

    • Unused storage plugins are loaded and cause non-fatal errors if dependencies are missing - PR #799 - Thanks electricworry
    • Replaced usages of asynctest with unittest.mock in tests - PR #807 and PR #856 - Thanks ichard26
    • Remove debugging line that loads entire files into memory. - PR #858 - Thanks asrp
    • Removed terrible isinstance check of unittest.Mock in mirror.py - PR #859 - Thanks ichard26
    • Put potential time consuming IO operations into executor - PR #877
    • Migrated Markdown documentation from recommonmark to MyST-Parser + docs config clean up - PR #879 - Thanks ichard26
    • Use shutil.move() for temp file management - PR #883 - Thanks happyaron
    • Fixed logging bug in SizeProjectMetadataFilter to show it activated - PR #889 - Thanks cooperlees
    • Attempt to wrap all potentially blocking calls in a ThreadPoolExecutor - PR #894 - Thanks cooperlees
  • 4.4.0(Dec 31, 2020)

    New Features

    • Build a swift and non swift docker image - PR #754
    • Split Docker Build to accept build args to optionally include swift support - PR #741 - Thanks nlaurance-pyie
    • Slimmer docker image - PR #738 - Thanks nlaurance-pyie
    • Renamed black/white to block/allow lists - PR #737 - Thanks nlaurance-pyie
    • packages allowlist can be defined from requirements like files - PR #739 - Thanks nlaurance-pyie
    • Simplify logging around filters - PR #678 - Thanks @dralley

    Bug Fixes

    • Handling of timeouts that can occur in verify. - PR #785 - Thanks electricworry
    • Added retry logic on timeouts when fetching metadata - PR #773 - Thanks gerrod3
    • Fix links, improve docs CI, and improve external object linking - PR #776 - Thanks ichard26
    • Handle 404 status for json verify - PR #763 - Thanks electricworry
    • Clean up isort config after upgrade to 5+ - PR #767 - Thanks ichard26
    • Remove duplicate max() target serial finding code + update typing - PR #745
    • swift.py: use BaseFileLock's lock_file property - PR #699 - Thanks hauntsaninja
    • Move to latest isort + mypy fixes - PR #706
    • Update change log url in project metadata - PR #673 - Thanks @abn
  • 4.3.0(Aug 25, 2020)

    New Features

    • Add SOCKS proxy support to aiohttp via aiohttp-socks - PR #668
    • Add support for skipping mirroring release files (metadata only) - PR #670 - Thanks @abn

    Bug Fixes

    • Move GitHub actions to v2 tags - PR #666 - Thanks @ryuichi1208
  • 4.2.0(Aug 21, 2020)

    New Features

    Thanks to Red Hat engineers @dralley + @gerrod3 for all this refactor work in PR #591

    • New generic Mirror class to perform Python metadata syncing
      • (previous Mirror class has been renamed to BandersnatchMirror)
    • Package's filter methods are now part of its public API
    • New errors.py file to house Bandersnatch specific errors

    Internal API Changes

    • Old Mirror class has been renamed to BandersnatchMirror. Performs same functionality with use of new Mirror API.
    • BandersnatchMirror now performs all filesystem operations throughout the sync process including the ones previously in Package.
    • Package no longer performs filesystem operations. Properties json_file, json_pypi_symlink, simple_directory and methods save_json_metadata, sync_release_files, gen_data_requires_python, generate_simple_page, sync_simple_page, _save_simple_page_version, _prepare_versions_path, _file_url_to_local_url, _file_url_to_local_path, download_file have all been moved into BandersnatchMirror. Package's sync has been refactored into Bandersnatch's process_package.
    • Package class is no longer created with an instance of Mirror
    • StaleMetadata exception has been moved to new errors.py file
    • PackageNotFound exception has been moved to new errors.py file

    Bug Fixes

    • Fix latest_release plugin to ensure latest version is included - PR #660 - Thanks @serverwentdown
  • 4.1.1(Aug 12, 2020)

  • 4.1.0(Aug 10, 2020)

    New Features

    • bandersnatch is now 100% type annotated - PRs #546 #561 #592 #593 - Thanks @ichard26 + @rkm
    • Move to storage abstraction - PR #445 - Thanks @techalchemy
      • Can now support more than just filesystem e.g. swift
    • Add sync subcommand to force a sync on a particular PyPI package - PR #572 - Thanks @z4yx
    • Added new allowlist filter - PR #626 - Thanks @gerrod3
    • Make webdir/pypi/json/PKG symlinks relative - PR #637 - Thanks @indrat
      • Makes mirror files more portable
    • Add main and program name override to ArgumentParser - PR #643 - Thanks @rkm
      • Allow non pkg_resources install to work

    Internal API Changes

    • Refactored the removal of releases for release_plugins to happen inside of Package PR #608 - Thanks @gerrod3
    • Minor refactor of Package class PR #606 - Thanks @dralley
    • Refactored filter loading into separate class PR #599 - Thanks @gerrod3
    • Move legacy directory cleanup to mirror.py PR #586
    • Move verify to use Master for HTTP calls - PR #555
    • Move http request code for package metadata to master.py - PRs #550 - Thanks @dralley

    Bug Fixes

    • Fixed allow/blocklist release filtering pre-releases - PR #641 - Thanks @gerrod3
    • Casefold (normalize per PEP503) package names in blacklist/whitelist plugins config - PR #629 - Thanks @lepaperwan
    • Fix passing package info to filters in verify action. PR #638 - Thanks @indrat
    • Fix todo file removal - PR #571
    • Introduce a new global-timeout config option for aiohttp coroutines - Default 5 hours - PR #540 - Thanks @techalchemy
    • Many doc fixes - PRs #542 #551 #557 #605 #628 #630 - Thanks @pgrimaud + @ichard26 + @hugovk
    • Move to setting timeout only on session + 10 * total_timeout (over sock timeouts) - PR #535
    • Stop using include_package_data option in setup.cfg to get config files included in more installs - PR #519
  • 4.0.3(May 7, 2020)

    • Change aiohttp-xmlrpc to use Master.session to ensure config shared - PR #506 - Thanks @alebourdoulous for reporting
      • e.g. Maintain trust of proxy server environment variables
  • 4.0.2(Apr 26, 2020)

    • Raise for error HTML response on all aiohttp session requests - PR #494 / #496 - Thanks @windtail
    • Pass str to shutil.move due to Python bug - PR #497 - Thanks @SanketDG
    • Some more type hints added to verify.py - PR #488 - Thanks @SanketDG
    • Ignore atime on stat in test test_package_sync_does_not_touch_existing_local_file comparison as it causes stat compare to fail on a slower run - PR #487 - Thanks @SanketDG
  • 4.0.1(Apr 5, 2020)

  • 4.0.0(Mar 29, 2020)

    • Replace requests with aiohttp - PR #440
    • Replace xmlrpc2 with aiohttp-xmlrpc - PR #404
    • Only store PEP503 Normalized Simple API directories - PR #465 + #455
    • Flag errors when KeyboardInterrupt raised during sync - PR #421
    • Finish Windows Support + Add CI - PRs #469 + #471 - Thanks @FaustinCarter
    • Autobuild Docker images with master - PR #88 - Thanks @abitrolly
    • Only print conf deprecations if found in config - PR #327
    • Add PyPI metadata and Python version plugin filters - PR #391 - Thanks @TemptorSent
    • Add in GitHub Actions CI for Linux (Ubuntu), MacOSX + Windows
  • 3.6.0(Sep 24, 2019)

  • 3.5.0(Sep 14, 2019)

  • 3.4.0(May 30, 2019)

  • 3.3.0(Apr 12, 2019)

  • 3.2.0(Jan 25, 2019)

  • 3.1.3(Dec 26, 2018)

  • 3.1.2(Dec 3, 2018)

  • 3.1.1(Nov 26, 2018)

  • 3.1.0(Nov 26, 2018)

  • 3.0.1(Oct 30, 2018)

