softcatala-web-dataset
This repository contains Sofcatalà web site content (articles and programs descriptions).
Dataset are available in the dataset directory.
Dataset size:
- articles.json contains 623 articles with 366915 words
- programes.json contains 330 program descripctions with 49110 words
The license of the data is Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)