Wrapper for the Swiss Parliament API for Python

Stefan Oderbolz

Last update: Jun 13, 2022

Related tags

Overview

swissparlpy

This module provides easy access to the data of the OData webservice of the Swiss parliament.

Installation
Usage
Credits
Release

Installation

swissparlpy is available on PyPI, so to install it simply use:

$ pip install swissparlpy

Usage

See the examples directory for more scripts.

Get tables and their variables

>>> import swissparlpy as spp
>>> spp.get_tables()[:5] # get first 5 tables
['MemberParty', 'Party', 'Person', 'PersonAddress', 'PersonCommunication']
>>> spp.get_variables('Party') # get variables of table `Party`
['ID', 'Language', 'PartyNumber', 'PartyName', 'StartDate', 'EndDate', 'Modified', 'PartyAbbreviation']

Get data of a table

>>> import swissparlpy as spp
>>> data = spp.get_data('Canton', Language='DE')
>>> data
<swissparlpy.client.SwissParlResponse object at 0x7f8e38baa610>
>>> data.count
26
>>> data[0]
{'ID': 2, 'Language': 'DE', 'CantonNumber': 2, 'CantonName': 'Bern', 'CantonAbbreviation': 'BE'}
>>> [d['CantonName'] for d in data]
['Bern', 'Neuenburg', 'Genf', 'Wallis', 'Uri', 'Schaffhausen', 'Jura', 'Basel-Stadt', 'St. Gallen', 'Obwalden', 'Appenzell A.-Rh.', 'Solothurn', 'Waadt', 'Zug', 'Aargau', 'Basel-Landschaft', 'Luzern', 'Thurgau', 'Freiburg', 'Appenzell I.-Rh.', 'Schwyz', 'Graubünden', 'Glarus', 'Tessin', 'Zürich', 'Nidwalden']

The return value of get_data is iterable, so you can easily loop over it. Or you can use indices to access elements, e.g. data[1] to get the second element, or data[-1] to get the last one.

Even slicing is supported, so you can do things like only iterate over the first 5 elements using

for rec in data[:5]:
   print(rec)

Use together with `pandas`

To create a pandas DataFrame from get_data simply pass the return value to the constructor:

>>> import swissparlpy as spp
>>> import pandas as pd
>>> parties = spp.get_data('Party', Language='DE')
>>> parties_df = pd.DataFrame(parties)
>>> parties_df
      ID Language  PartyNumber  ...                   EndDate                         Modified PartyAbbreviation
0     12       DE           12  ... 2000-01-01 00:00:00+00:00 2010-12-26 13:05:26.430000+00:00                SP
1     13       DE           13  ... 2000-01-01 00:00:00+00:00 2010-12-26 13:05:26.430000+00:00               SVP
2     14       DE           14  ... 2000-01-01 00:00:00+00:00 2010-12-26 13:05:26.430000+00:00               CVP
3     15       DE           15  ... 2000-01-01 00:00:00+00:00 2010-12-26 13:05:26.430000+00:00      FDP-Liberale
4     16       DE           16  ... 2000-01-01 00:00:00+00:00 2010-12-26 13:05:26.430000+00:00               LDP
..   ...      ...          ...  ...                       ...                              ...               ...
78  1582       DE         1582  ... 2000-01-01 00:00:00+00:00 2015-12-03 08:48:38.250000+00:00             BastA
79  1583       DE         1583  ... 2000-01-01 00:00:00+00:00 2019-03-07 17:24:15.013000+00:00              CVPO
80  1584       DE         1584  ... 2000-01-01 00:00:00+00:00 2019-11-08 17:28:43.947000+00:00                Al
81  1585       DE         1585  ... 2000-01-01 00:00:00+00:00 2019-11-08 17:41:39.513000+00:00               EàG
82  1586       DE         1586  ... 2000-01-01 00:00:00+00:00 2021-08-12 07:59:22.627000+00:00               M-E

[83 rows x 8 columns]

Substrings

If you want to query for substrings there are two main operators to use:

__startswith:

>>> import swissparlpy as spp
>>> persons = spp.get_data("Person", Language="DE", LastName__startswith='Bal')
>>> persons.count
12

__contains

>>> import swissparlpy as spp
>>> co2_business = spp.get_data("Business", Title__contains="CO2", Language = "DE")
>>> co2_business.count
265

You can suffix any field with those operators to query the data.

Large queries

Large queries (especially the tables Voting and Transcripts) may result in server-side errors (500 Internal Server Error). In these cases it is recommended to download the data in smaller batches, save the individual blocks and combine them after the download.

This is an example script to download all votes of the legislative period 50, session by session, and combine them afterwards in one DataFrame:

import swissparlpy as spp
import pandas as pd
import os

__location__ = os.path.realpath(os.getcwd())
path = os.path.join(__location__, "voting50")

# download votes of one session and save as pickled DataFrame
def save_votes_of_session(id, path):
    if not os.path.exists(path):
        os.mkdir(path)
    data = spp.get_data("Voting", Language="DE", IdSession=id)
    print(f"{data.count} rows loaded.")
    df = pd.DataFrame(data)
    pickle_path = os.path.join(path, f'{id}.pks')
    df.to_pickle(pickle_path)
    print(f"Saved pickle at {pickle_path}")


# get all session of the 50 legislative period
sessions50 = spp.get_data("Session", Language="DE", LegislativePeriodNumber=50)
sessions50.count

for session in sessions50:
    print(f"Loading session {session['ID']}")
    save_votes_of_session(session['ID'], path)

# Combine to one dataframe
df_voting50 = pd.concat([pd.read_pickle(os.path.join(path, x)) for x in os.listdir(path)])

Credits

This library is inspired by the R package swissparl of David Zumbach. Ralph Straumann initial asked about a Python version of swissparl on Twitter, which led to this project.

Release

To create a new release, follow these steps (please respect Semantic Versioning):

Adapt the version number in swissparlpy/__init__.py
Update the CHANGELOG with the version
Create a pull request to merge develop into main (make sure the tests pass!)
Create a new release/tag on GitHub (on the main branch)
The publication on PyPI happens via GitHub Actions on every tagged commit

EpikCord.py - This is an API Wrapper for Discord's API for Python

EpikCord.py - This is an API Wrapper for Discord's API for Python! We've decided not to fork discord.py and start completely from scratch for a new, better structuring system!

28 Oct 10, 2022

A simple Python API wrapper for Cloudflare Stream's API.

python-cloudflare-stream A basic Python API wrapper for working with Cloudflare Stream. Arbington.com started off using Cloudflare Stream. We used the

3 Sep 8, 2022

Discord-Wrapper - Discord Websocket Wrapper in python

This does not currently work and is in development Discord Websocket Wrapper in

3 Oct 25, 2022

An API wrapper around the pythonanywhere's API.

pyaww An API wrapper around the pythonanywhere's API. The name stands for pythonanywherewrapper. 100% api coverage most of the codebase is documented

7 Dec 11, 2022

An API Wrapper for Gofile API

Gofile2 from gofile2 import Gofile g_a = Gofile() print(g_a.upload(file="/home/itz-fork/photo.png")) An API Wrapper for Gofile API. About API Gofile

16 Dec 10, 2022

A simple API wrapper for the Tenor API

Gifpy A simple API wrapper for the Tenor API Installation Python 3.9 or higher is recommended python3 -m pip install gifpy Clone repository: $ git cl

4 Dec 22, 2021

An API wrapper around Discord API.

NeoCord This project is work in progress not for production use. An asynchronous API wrapper around Discord API written in Python. Features Modern API

14 Jan 3, 2022

A wrapper for The Movie Database API v3 and v4 that only uses the read access token (not api key).

fulltmdb A wrapper for The Movie Database API v3 and v4 that only uses the read access token (not api key). Installation Use the package manager pip t

2 Sep 26, 2021

An API wrapper around the pythonanywhere's API.

pyaww An API wrapper around the pythonanywhere's API. The name stands for pythonanywherewrapper. 100% API coverage Most of the codebase is documented

7 Dec 11, 2022

Comments

Ein Datum wird bei bestimmten Sessionen nicht richtig geparst
Danke vielmals für das tolle Swissparlpy-Modul! Ich habe versucht, Dein Beispiel für Votes auf den Download von Reden (Table ’Transcript’) anzuwenden, bekomme aber ein Fehler beim Parsen eines Datums. Die Session 5002 (spp.get_data("Transcript", IdSession=5002)) geht interessanterweise, aber die Sessionen 5001 und 5003 nicht (z.B. spp.get_data("Transcript", IdSession=5001)), da bekomme ich jeweils den Fehler, dass strptime ein Datum nicht Parsen kann. Hier mein Code:

import swissparlpy as spp import pandas as pd import os __location__ = os.path.realpath(os.getcwd()) path2 = os.path.join(__location__, "transcripts50") # download transcripts of one session and save as pickled DataFrame def save_transcripts_of_session(id, path2): if not os.path.exists(path2): os.mkdir(path2) data = spp.get_data("Transcript", IdSession=id) print(f"{data.count} rows loaded.") df = pd.DataFrame(data) pickle_path = os.path.join(path2, f'{id}.pks') df.to_pickle(pickle_path) print(f"Saved pickle at {pickle_path}") save_transcripts_of_session(5001, path2)

Hier die Fehlermeldung: Fehler_Transcripts.txt Mein Problem ist, dass nicht ersichtlich wird, welche Variable der Fehler betrifft und ob die Einträge mit diesem Datum schlicht ignoriert werden können?
opened by bwueest 5
Added documentation for swissparAPY tables

Provided some documentation using dbdiagram.io to have an explanation of the roles of the fields and tables as well as their relationships.

Added

docs folder containing the code and visualization for the documentation.

Updated

README.md to include the visualization as well as a link to the code.

opened by peterbonnesoeur 2
Discrepancy between count and returned entities
Hello,

Thank you for your work on the library !

I think there might be an issue either from the parlement api or from the library. Basically the returned count is different from the number of entities. Here are the steps to reproduce:

import swissparlpy as spp person = spp.get_data("Person") print(person.count) print(len(person.entities))

Result:

18170 1000

Any Idea why this is the case?

Thanks!
opened by Ahmedjjj 1

Releases(v0.2.1)

v0.2.1(Jan 31, 2022)
[0.2.1] - 2022-01-31

Fixed

In order to fix issue #17 a bug in pyodata had to be fixed. pyodata 1.9.0 contains the bugfix and is now specified as the minimum version.

Source code(tar.gz)
Source code(zip)
v0.2.0(Oct 14, 2021)
[0.2.0] - 2021-10-14

Added

Jupyter notebook with examples

New examples for advanced filters

Support for more advanced filters

Changed

Update README with examples

Source code(tar.gz)
Source code(zip)
v0.1.1(Sep 27, 2021)
[0.1.1] - 2021-09-27

Fixed

Typo in publish workflow

Source code(tar.gz)
Source code(zip)
v0.1.0(Sep 27, 2021)
[0.1.0] - 2021-09-27

Added

Test with fixtures

Linter for all code

Usage example in README

Fixed

Fixed get_variables call

Make sure get_tables returns a list

Source code(tar.gz)
Source code(zip)
v0.0.2(Sep 27, 2021)
[0.0.2] - 2021-09-27

Added

Added CHANGELOG file

Changed

Use flit to manage pypi package

Source code(tar.gz)
Source code(zip)
v0.0.1(Sep 27, 2021)
[0.0.1] - 2021-09-26

Added

Initial release of swissparlpy

Source code(tar.gz)
Source code(zip)

Owner

Stefan Oderbolz

GitHub

🚀 An asynchronous python API wrapper meant to replace discord.py - Snappy discord api wrapper written with aiohttp & websockets

Pincer An asynchronous python API wrapper meant to replace discord.py ❗ The package is currently within the planning phase ?? Links ｜Join the discord

125 Dec 26, 2022

Wrapper for the Swiss Parliament API for Python

Related tags

Overview

swissparlpy

Table of Contents

Installation

Usage

Get tables and their variables

Get data of a table

Use together with pandas

Substrings

Large queries

Credits

Release

You might also like...

EpikCord.py - This is an API Wrapper for Discord's API for Python

A simple Python API wrapper for Cloudflare Stream's API.

Discord-Wrapper - Discord Websocket Wrapper in python

An API wrapper around the pythonanywhere's API.

An API Wrapper for Gofile API

A simple API wrapper for the Tenor API

An API wrapper around Discord API.

A wrapper for The Movie Database API v3 and v4 that only uses the read access token (not api key).

An API wrapper around the pythonanywhere's API.

Comments

Ein Datum wird bei bestimmten Sessionen nicht richtig geparst

Added documentation for swissparAPY tables

Added

Updated

Discrepancy between count and returned entities

Releases(v0.2.1)

v0.2.1(Jan 31, 2022)

[0.2.1] - 2022-01-31

Fixed

v0.2.0(Oct 14, 2021)

[0.2.0] - 2021-10-14

Added

Changed

v0.1.1(Sep 27, 2021)

[0.1.1] - 2021-09-27

Fixed

v0.1.0(Sep 27, 2021)

[0.1.0] - 2021-09-27

Added

Fixed

v0.0.2(Sep 27, 2021)

[0.0.2] - 2021-09-27

Added

Changed

v0.0.1(Sep 27, 2021)

[0.0.1] - 2021-09-26

Added

Owner

Stefan Oderbolz

🚀 An asynchronous python API wrapper meant to replace discord.py - Snappy discord api wrapper written with aiohttp & websockets

Aws-lambda-requests-wrapper - Request/Response wrapper for AWS Lambda with API Gateway

PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.

PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.

Python API wrapper around Trello's API

Async ready API wrapper for Revolt API written in Python.

A Python API wrapper for the Twitter API!

Python API wrapper library for Convex Value API

This an API wrapper library for the OpenSea API written in Python 3.

YARSAW is an Async Python API Wrapper for the Random Stuff API.

Use together with `pandas`