The idea is to have classes for the different types of licenses, with each license having a subset of the available parameters: name, type, version, link, description.
This entails replacing:
_LICENSE = "CC BY-NC-SA"
_LICENSE = "CC BY-NC-SA 3.0"
with
from bigbio.utils import license
_LICENSE = license.CreativeCommons(type="BY-NC-SA")
_LICENSE = license.CreativeCommons(type="BY-NC-SA", version=3.0)
and converting the license to a string when building the DatasetInfo:
return datasets.DatasetInfo(
    description=_DESCRIPTION,
    features=features,
    homepage=_HOMEPAGE,
    license=str(_LICENSE),
    citation=_CITATION,
)
Two special classes are Custom, for dataset-specific licenses, and PubliclyAvailable, for datasets that can be freely downloaded but do not provide license information.
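
For illustration, here is a minimal sketch of how such classes might look. It is not the actual contents of bigbio.utils.license: the dataclass layout, the default names, and the string format produced by __str__ are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class License:
    """Hypothetical base class; every parameter is optional."""

    name: Optional[str] = None
    type: Optional[str] = None
    version: Optional[str] = None
    link: Optional[str] = None
    description: Optional[str] = None

    def __str__(self) -> str:
        # Join whichever of name, type and version are set, separated by spaces.
        parts = [self.name, self.type, self.version]
        return " ".join(str(p) for p in parts if p is not None)


@dataclass
class CreativeCommons(License):
    # Assumed default so that only type and version need to be passed.
    name: Optional[str] = "CC"


@dataclass
class Custom(License):
    # For dataset-specific licenses; details would go in link/description.
    name: Optional[str] = "Custom"


@dataclass
class PubliclyAvailable(License):
    # For datasets that are freely downloadable but state no license.
    name: Optional[str] = "Publicly available"


if __name__ == "__main__":
    print(str(CreativeCommons(type="BY-NC-SA")))
    print(str(CreativeCommons(type="BY-NC-SA", version=3.0)))
```

Under these assumptions, str(_LICENSE) reproduces the plain strings that were used before, so the value passed to DatasetInfo stays the same.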