Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)

Megagon Labs

Last update: Dec 23, 2022

Related tags

Text Data & NLP bunkai

Overview

Bunkai

Bunkai is a sentence boundary (SB) disambiguation tool for Japanese texts.

Quick Start

$ pip install bunkai
$ echo -e '宿を予約しました♪!まだ2ヶ月も先だけど。早すぎかな(笑)楽しみです★\n2文書目の先頭行です。▁改行はU+2581で表現します。' \
    | bunkai
宿を予約しました♪!│まだ2ヶ月も先だけど。│早すぎかな(笑)│楽しみです★
2文書目の先頭行です。▁│改行はU+2581で表現します。

Feed a document as one line by using ▁ (U+2581) for line breaks. The output shows sentence boundaries with │ (U+2502).

If you want to disambiguate sentence boundaries for line breaks, please add a --model option with the path to the model. Trained models are available here.

$ wget https://github.com/megagonlabs/bunkai/releases/download/v1.0.1/bunkai-model-bccwj-wo-pn-20210421.zip
$ unzip bunkai-model-bccwj-wo-pn-20210421.zip
$ echo -e "文の途中で改行を▁入れる文章ってありますよね▁それも対象です。" | bunkai --model bunkai-model-bccwj-wo-pn-20210421
文の途中で改行を▁入れる文章ってありますよね▁│それも対象です。

For more information, see examples or documents.

References

Yuta Hayashibe and Kensuke Mitsuzawa. Sentence Boundary Detection on Line Breaks in Japanese. Proceedings of The 6th Workshop on Noisy User-generated Text (W-NUT 2020), pp.71-75. November 2020. [PDF] [bib]

License

Apache License 2.0

Comments

Bump transformers from 4.9.2 to 4.10.0
Bumps transformers from 4.9.2 to 4.10.0.

Release notes

Sourced from transformers's releases.

v4.10.0: LayoutLM-v2, LayoutXLM, BEiT

LayoutLM-v2 and LayoutXLM

Four new models are released as part of the LatourLM-v2 implementation: LayoutLMv2ForSequenceClassification, LayoutLMv2Model, LayoutLMv2ForTokenClassification and LayoutLMv2ForQuestionAnswering, in PyTorch.

The LayoutLMV2 model was proposed in LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding by Yang Xu, Yiheng Xu, Tengchao Lv, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang, Wanxiang Che, Min Zhang, Lidong Zhou. LayoutLMV2 improves LayoutLM to obtain state-of-the-art results across several document image understanding benchmarks:

Add LayoutLMv2 + LayoutXLM #12604 (@NielsRogge)

Compatible checkpoints can be found on the Hub: https://huggingface.co/models?filter=layoutlmv2

BEiT

Three new models are released as part of the BEiT implementation: BeitModel, BeitForMaskedImageModeling, and BeitForImageClassification, in PyTorch.

The BEiT model was proposed in BEiT: BERT Pre-Training of Image Transformers by Hangbo Bao, Li Dong and Furu Wei. Inspired by BERT, BEiT is the first paper that makes self-supervised pre-training of Vision Transformers (ViTs) outperform supervised pre-training. Rather than pre-training the model to predict the class of an image (as done in the original ViT paper), BEiT models are pre-trained to predict visual tokens from the codebook of OpenAI’s DALL-E model given masked patches.

Add BEiT #12994 (@NielsRogge)

Compatible checkpoints can be found on the Hub: https://huggingface.co/models?filter=beit

Speech improvements

The Wav2Vec2 and HuBERT models now have a sequence classification head available.

Add Wav2Vec2 & Hubert ForSequenceClassification #13153 (@anton-l)

DeBERTa in TensorFlow (@kamalkraj)

The DeBERTa and DeBERTa-v2 models have been converted from PyTorch to TensorFlow.

Deberta tf #12972 (@kamalkraj)

Deberta_v2 tf #13120 (@kamalkraj)

Flax model additions

EncoderDecoder, DistilBERT, and ALBERT, now have support in Flax!

FlaxEncoderDecoder allowing Bert2Bert and Bert2GPT2 in Flax #13008 (@ydshieh)

FlaxDistilBERT #13324 (@kamalkraj)

FlaxAlBERT #13294 (@kamalkraj)

TensorFlow examples

A new example has been added in TensorFlow: multiple choice! Data collators have become framework agnostic and can now work for both TensorFlow and NumPy on top of PyTorch.

Add TF multiple choice example #12865 (@Rocketknight1)

TF/Numpy variants for all DataCollator classes #13105 (@Rocketknight1)

... (truncated)

Commits

d12bbe4 Release: v4.10.0

642e193 [GitHub Runner] Fix flax runner (#13357)

c76de10 Add generate kwargs to Seq2SeqTrainingArguments (#13339)

702f4a4 Fixed CLM model still using MODEL_FOR_MASKED_LM_MAPPING (#13002)

aa08a34 [Flax tests] NVIDIA-SMI failure should continue

854260c TF/Numpy variants for all DataCollator classes (#13105)

74b3344 Clean up test file

ef8d6f2 Set missing seq_length variable when using inputs_embeds with ALBERT & Remove...

180c6de docs: fix minor typo (#13289)

066fd04 correct TP implementation resources (#13248)

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies
opened by dependabot[bot] 3
Bump emoji from 1.6.1 to 1.6.3
Bumps emoji from 1.6.1 to 1.6.3.

Release notes

Sourced from emoji's releases.

v1.6.3

Added support for counting unique emojis

v1.6.2

Improve performance of demojize()

Added more tests

Added warning when someone uses any other language than 'en' with use_aliases=True in emojize()

Changelog

Sourced from emoji's changelog.

1.6.3

Added support for counting unique emojis

1.6.2

Improve performance of demojize()

Added more tests

Added warning when someone uses any other language than 'en' with use_aliases=True in emojize()

Commits

17e7a1e v1.6.3

efbb3e1 Merge pull request #199 from tomwojcik/add-count-unique-emojis

5aebf67 add support for count unique

e35fc45 Merge pull request #197 from cvzi/demojize_performance

b3b0dc6 small fixes

424acc5 use_aliases=True overrides language='...' (this restores the behaviour of pre...

148d531 Code style

bd9d89d More tests

cfe81a4 Improve performance of demojize

290529c Merge pull request #193 from cvzi/multiplealiases-ghpages

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies
opened by dependabot[bot] 2
Bump transformers from 4.9.2 to 4.10.2
Bumps transformers from 4.9.2 to 4.10.2.

Release notes

Sourced from transformers's releases.

v4.10.2: Patch release

[Wav2Vec2] Fix dtype 64 bug #13517 (@patrickvonplaten)

v4.10.1: Patch release

[Wav2Vec2] Fix normalization for non-padded tensors #13512 (@patrickvonplaten)

Fixing backward compatiblity for non prefixed tokens (B-, I-). #13493 (@Narsil)

Fixing #13381 #13400 (@Narsil)

v4.10.0: LayoutLM-v2, LayoutXLM, BEiT

LayoutLM-v2 and LayoutXLM

Four new models are released as part of the LatourLM-v2 implementation: LayoutLMv2ForSequenceClassification, LayoutLMv2Model, LayoutLMv2ForTokenClassification and LayoutLMv2ForQuestionAnswering, in PyTorch.

The LayoutLMV2 model was proposed in LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding by Yang Xu, Yiheng Xu, Tengchao Lv, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang, Wanxiang Che, Min Zhang, Lidong Zhou. LayoutLMV2 improves LayoutLM to obtain state-of-the-art results across several document image understanding benchmarks:

Add LayoutLMv2 + LayoutXLM #12604 (@NielsRogge)

Compatible checkpoints can be found on the Hub: https://huggingface.co/models?filter=layoutlmv2

BEiT

Three new models are released as part of the BEiT implementation: BeitModel, BeitForMaskedImageModeling, and BeitForImageClassification, in PyTorch.

The BEiT model was proposed in BEiT: BERT Pre-Training of Image Transformers by Hangbo Bao, Li Dong and Furu Wei. Inspired by BERT, BEiT is the first paper that makes self-supervised pre-training of Vision Transformers (ViTs) outperform supervised pre-training. Rather than pre-training the model to predict the class of an image (as done in the original ViT paper), BEiT models are pre-trained to predict visual tokens from the codebook of OpenAI’s DALL-E model given masked patches.

Add BEiT #12994 (@NielsRogge)

Compatible checkpoints can be found on the Hub: https://huggingface.co/models?filter=beit

Speech improvements

The Wav2Vec2 and HuBERT models now have a sequence classification head available.

Add Wav2Vec2 & Hubert ForSequenceClassification #13153 (@anton-l)

DeBERTa in TensorFlow (@kamalkraj)

The DeBERTa and DeBERTa-v2 models have been converted from PyTorch to TensorFlow.

Deberta tf #12972 (@kamalkraj)

Deberta_v2 tf #13120 (@kamalkraj)

Flax model additions

EncoderDecoder, DistilBERT, and ALBERT, now have support in Flax!

FlaxEncoderDecoder allowing Bert2Bert and Bert2GPT2 in Flax #13008 (@ydshieh)

FlaxDistilBERT #13324 (@kamalkraj)

FlaxAlBERT #13294 (@kamalkraj)

... (truncated)

Commits

a5fc344 [Wav2Vec2] Fix dtype 64 bug (#13517)

2c51442 Release: v4.10.2

28e2787 Release: 4.10.1

e5e0452 Fixing #13381 (#13400)

4afbd7e Fixing backward compatiblity for non prefixed tokens (B-, I-). (#13493)

60eb416 [Wav2Vec2] Fix normalization for non-padded tensors (#13512)

d12bbe4 Release: v4.10.0

642e193 [GitHub Runner] Fix flax runner (#13357)

c76de10 Add generate kwargs to Seq2SeqTrainingArguments (#13339)

702f4a4 Fixed CLM model still using MODEL_FOR_MASKED_LM_MAPPING (#13002)

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies
opened by dependabot[bot] 2
Bump coverage from 7.0.0 to 7.0.1
Bumps coverage from 7.0.0 to 7.0.1.

Changelog

Sourced from coverage's changelog.

Version 7.0.1 — 2022-12-23

When checking if a file mapping resolved to a file that exists, we weren't considering files in .whl files. This is now fixed, closing issue 1511_.

File pattern rules were too strict, forbidding plus signs and curly braces in directory and file names. This is now fixed, closing issue 1513_.

Unusual Unicode or control characters in source files could prevent reporting. This is now fixed, closing issue 1512_.

The PyPy wheel now installs on PyPy 3.7, 3.8, and 3.9, closing issue 1510_.

.. _issue 1510: nedbat/coveragepy#1510 .. _issue 1511: nedbat/coveragepy#1511 .. _issue 1512: nedbat/coveragepy#1512 .. _issue 1513: nedbat/coveragepy#1513

.. _changes_7-0-0:

Commits

c5cda3a docs: releases take a little bit longer now

9d4226e docs: latest sample HTML report

8c77758 docs: prep for 7.0.1

da1b282 fix: also look into .whl files for source

d327a70 fix: more information when mapping rules aren't working right.

35e249f fix: certain strange characters caused reporting to fail. #1512

152cdc7 fix: don't forbid plus signs in file names. #1513

31513b4 chore: make upgrade

873b059 test: don't run tests on Windows PyPy-3.9

5c5caa2 build: PyPy wheel now installs on 3.7, 3.8, and 3.9. #1510

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies python
opened by dependabot[bot] 1
Bump torch from 1.13.0 to 1.13.1
Bumps torch from 1.13.0 to 1.13.1.

Release notes

Sourced from torch's releases.

PyTorch 1.13.1 Release, small bug fix release

This release is meant to fix the following issues (regressions / silent correctness):

RuntimeError by torch.nn.modules.activation.MultiheadAttention with bias=False and batch_first=True #88669

Installation via pip on Amazon Linux 2, regression #88869

Installation using poetry on Mac M1, failure #88049

Missing masked tensor documentation #89734

torch.jit.annotations.parse_type_line is not safe (command injection) #88868

Use the Python frame safely in _pythonCallstack #88993

Double-backward with full_backward_hook causes RuntimeError #88312

Fix logical error in get_default_qat_qconfig #88876

Fix cuda/cpu check on NoneType and unit test #88854 and #88970

Onnx ATen Fallback for BUILD_CAFFE2=0 for ONNX-only ops #88504

Onnx operator_export_type on the new registry #87735

torchrun AttributeError caused by file_based_local_timer on Windows #85427

The release tracker should contain all relevant pull requests related to this release as well as links to related issues

Changelog

Sourced from torch's changelog.

Releasing PyTorch

General Overview

Cutting a release branch preparations

Cutting release branches

pytorch/pytorch

pytorch/builder / PyTorch domain libraries

Making release branch specific changes for PyTorch

Making release branch specific changes for domain libraries

Drafting RCs (https://github.com/pytorch/pytorch/blob/master/Release Candidates) for PyTorch and domain libraries

Release Candidate Storage

Release Candidate health validation

Cherry Picking Fixes

Promoting RCs to Stable

Additional Steps to prepare for release day

Modify release matrix

Open Google Colab issue

Patch Releases

Patch Release Criteria

Patch Release Process

Triage

Issue Tracker for Patch releases

Building a release schedule / cherry picking

Building Binaries / Promotion to Stable

Hardware / Software Support in Binary Build Matrix

Python

TL;DR

Accelerator Software

Special support cases

Special Topics

Updating submodules for a release

General Overview

Releasing a new version of PyTorch generally entails 3 major steps:

Cutting a release branch preparations

Cutting a release branch and making release branch specific changes

Drafting RCs (Release Candidates), and merging cherry picks

Promoting RCs to stable and performing release day tasks

Cutting a release branch preparations

Following Requirements needs to be met prior to final RC Cut:

Resolve all outstanding issues in the milestones(for example 1.11.0)before first RC cut is completed. After RC cut is completed following script should be executed from builder repo in order to validate the presence of the fixes in the release branch :

... (truncated)

Commits

49444c3 [BE] Do not package caffe2 in wheel (#87986) (#90433)

56de8a3 Add manual cuda deps search logic (#90411) (#90426)

a4d16e0 Fix ATen Fallback for BUILD_CAFFE2=0 for ONNX-only ops (#88504) (#90104)

80abad3 Handle Tensor.deepcopy via clone(), on IPU (#89129) (#89999)

73a852a [Release only change] Fix rocm5.1.1 docker image (#90321)

029ec16 Add platform markers for linux only extra_install_requires (#88826) (#89924)

197c5c0 Fix cuda/cpu check on NoneType (#88854) (#90068)

aadbeb7 Make TorchElastic timer importable on Windows (#88522) (#90045)

aa94433 Mark IPU device as not supports_as_strided (#89130) (#89998)

59b4f3b Use the Python frame safely in _pythonCallstack (#89997)

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies python
opened by dependabot[bot] 1
Bump pyright from 1.1.277 to 1.1.278
Bumps pyright from 1.1.277 to 1.1.278.

Release notes

Sourced from pyright's releases.

Published 1.1.278

Behavior Change: Changed the reportUnusedImport check to not report an error for "from y import x as x" since x is considered to be re-exported in this case. Previously, this case was exempted only for type stubs.

Enhancement: Expand enums in negative narrowing of x in y.

Bug Fix: Fixed bug that resulted in incorrect type evaluation when an f-string uses expressions that are unions of literal strings. The result should be LiteralString, not str.

Enhancement: Improved the reportUnnecessaryComparison diagnostic check to detect (and properly report) cases where a comparison if literals is always true or always false.

Bug Fix: Fixed bug that resulted in incorrect type evaluation when creating a union from two generic functions that are parameterized by different ParamSpecs but are otherwise identical.

Behavior Change: Updated the default Python version from 3.10 to 3.11 now that 3.11 has been released in its final form. Pyright assumes the default version if it is not otherwise told which version to use (explicitly as a setting or command-line argument or implicitly via the selected Python environment).

Enhancement: Enhanced --verifytypes so it honors the --pythonversion and --pythonplatform command-line arguments and falls back to the selected Python environment to get these values if they are unspecified.

Bug Fix: Fixed a bug in the type evaluator related to TypeVarTuple that caused it to incorrectly report a type incompatibility between *Ts and *tuple[*Ts]. These are the same type, so they should be compatible.

Bug Fix: Fixed a bug that results in incorrect specialization of a function or method that uses a TypeVarTuple parameter. The internal flags were being lost including the tracking of class methods and static methods.

Bug Fix: Added support for unpacked tuple arguments passed to an *args parameter declared with a TypeVarTuple.

Commits

0fdcb59 Published 1.1.278

See full diff in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies javascript
opened by dependabot[bot] 1
Bump pyright from 1.1.274 to 1.1.275
Bumps pyright from 1.1.274 to 1.1.275.

Release notes

Sourced from pyright's releases.

Published 1.1.275

Bug Fix: Fixed a bug that resulted in an incorrect type evaluation when a TypeVar bound to an Enum class is instantiated.

Bug Fix: Added a global "cache manager" to track heap usage for all type analyzer service instances. This prevents out-of-memory conditions when many roots are used in a multi-root workspace.

Behavior Change (from Pylance): Don't treat classes decorated with dataclass_transform as dataclasses for purposes of determining which fields should be represented in the synthesized __init__ method. This was an ambiguity in PEP 681, which was recently clarified.

Bug Fix: Fixed a bug that resulted in a false negative when using illegal annotation expression forms within a TypedDict inlined (call-based) definition.

Bug Fix: Fixed a bug in the code flow engine that resulted in incorrect type evaluation in certain cases involving nested loops.

Bug Fix: Fixed a bug that resulted in a false positive error when a call targets an object with a __call__ that is itself an object with a __call__ method.

Bug Fix: Added a check for a return statement with an explicit return value within an async generator function. This is considered a syntax error by the Python compiler, so it should be flagged at static analysis time.

Bug Fix: Fixed a bug that resulted in incorrect handling of binary files (e.g. ".so" or ".pyd" files) within a library when creating stub files.

Behavior Change: Enhanced the heuristics used to infer the type of an unannotated function parameter with a default value. If that value evaluates to an instance of a class with a private name (i.e. it starts with an underscore), pyright will now assume that the value is potentially a singleton sentinel value similar to None and will infer the type to be a union of that type and Unknown.

Commits

779b872 Published 1.1.275

See full diff in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies javascript
opened by dependabot[bot] 1
Bump numpy from 1.23.3 to 1.23.4
Bumps numpy from 1.23.3 to 1.23.4.

Release notes

Sourced from numpy's releases.

v1.23.4

NumPy 1.23.4 Release Notes

NumPy 1.23.4 is a maintenance release that fixes bugs discovered after the 1.23.3 release and keeps the build infrastructure current. The main improvements are fixes for some annotation corner cases, a fix for a long time nested_iters memory leak, and a fix of complex vector dot for very large arrays. The Python versions supported for this release are 3.8-3.11.

Note that the mypy version needs to be 0.981+ if you test using Python 3.10.7, otherwise the typing tests will fail.

Contributors

A total of 8 people contributed to this release. People with a "+" by their names contributed a patch for the first time.

Bas van Beek

Charles Harris

Matthew Barber

Matti Picus

Ralf Gommers

Ross Barnowski

Sebastian Berg

Sicheng Zeng +

Pull requests merged

A total of 13 pull requests were merged for this release.

#22368: BUG: Add __array_api_version__ to numpy.array_api namespace

#22370: MAINT: update sde toolkit to 9.0, fix download link

#22382: BLD: use macos-11 image on azure, macos-1015 is deprecated

#22383: MAINT: random: remove get_info from "extending with Cython"...

#22384: BUG: Fix complex vector dot with more than NPY_CBLAS_CHUNK elements

#22387: REV: Loosen lookfor's import try/except again

#22388: TYP,ENH: Mark numpy.typing protocols as runtime checkable

#22389: TYP,MAINT: Change more overloads to play nice with pyright

#22390: TST,TYP: Bump mypy to 0.981

#22391: DOC: Update delimiter param description.

#22392: BUG: Memory leaks in numpy.nested_iters

#22413: REL: Prepare for the NumPy 1.23.4 release.

#22424: TST: Fix failing aarch64 wheel builds.

Checksums

MD5

90a3d95982490cfeeef22c0f7cbd874f numpy-1.23.4-cp310-cp310-macosx_10_9_x86_64.whl

... (truncated)

Commits

f802155 Merge pull request #22425 from charris/prepare-1.23.4-release_1

298174b REL: Prepare for the NumPy 1.23.4 release (1)

631072f Merge pull request #22424 from charris/backport-22418

f4a3195 TST, BLD: Fix failing aarch64 wheel builds.

1cbd127 Merge pull request #22413 from charris/prepare-for-1.23.4

8cfc899 REL: Prepare for the NumPy 1.23.4 release.

22a41b5 Merge pull request #22392 from charris/backport-22296

f6a3c11 Adding missing Py_DECREF call on iter

8274a16 Merge pull request #22391 from charris/backport-22372

fa16a0c Merge pull request #22390 from charris/backport-22360

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies python
opened by dependabot[bot] 1
Bump transformers from 4.22.2 to 4.23.0
Bumps transformers from 4.22.2 to 4.23.0.

Release notes

Sourced from transformers's releases.

v4.23.0: Whisper, Time series, Conditional DETR, MSN, MarkupLM, safetensors

Whisper

The Whisper model was proposed in Robust Speech Recognition via Large-Scale Weak Supervision by Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey, Ilya Sutskever.

The abstract from the paper is the following:

We study the capabilities of speech processing systems trained simply to predict large amounts of transcripts of audio on the internet. When scaled to 680,000 hours of multilingual and multitask supervision, the resulting models generalize well to standard benchmarks and are often competitive with prior fully supervised results but in a zeroshot transfer setting without the need for any finetuning. When compared to humans, the models approach their accuracy and robustness. We are releasing models and inference code to serve as a foundation for further work on robust speech processing.

Add WhisperModel to transformers by @ArthurZucker in #19166

Add TF whisper by @amyeroberts in #19378

Time series

The Time Series Transformer model is a vanilla encoder-decoder Transformer for time series forecasting.

:warning: This is a recently introduced model and modality, so the API hasn't been tested extensively. There may be some bugs or slight breaking changes to fix it in the future. If you see something strange, file a Github Issue.

time series forecasting model by @kashif in #17965

Conditional DETR

The Conditional DETR model was proposed in Conditional DETR for Fast Training Convergence by Depu Meng, Xiaokang Chen, Zejia Fan, Gang Zeng, Houqiang Li, Yuhui Yuan, Lei Sun, Jingdong Wang. Conditional DETR presents a conditional cross-attention mechanism for fast DETR training. Conditional DETR converges 6.7× to 10× faster than DETR.

The abstract from the paper is the following:

The recently-developed DETR approach applies the transformer encoder and decoder architecture to object detection and achieves promising performance. In this paper, we handle the critical issue, slow training convergence, and present a conditional cross-attention mechanism for fast DETR training. Our approach is motivated by that the cross-attention in DETR relies highly on the content embeddings for localizing the four extremities and predicting the box, which increases the need for high-quality content embeddings and thus the training difficulty. Our approach, named conditional DETR, learns a conditional spatial query from the decoder embedding for decoder multi-head cross-attention. The benefit is that through the conditional spatial query, each cross-attention head is able to attend to a band containing a distinct region, e.g., one object extremity or a region inside the object box. This narrows down the spatial range for localizing the distinct regions for object classification and box regression, thus relaxing the dependence on the content embeddings and easing the training. Empirical results show that conditional DETR converges 6.7× faster for the backbones R50 and R101 and 10× faster for stronger backbones DC5-R50 and DC5-R101.

Add support for conditional detr by @DeppMeng in #18948

Improve conditional detr docs by @NielsRogge in #19154

Masked Siamese Networks

The ViTMSN model was proposed in Masked Siamese Networks for Label-Efficient Learning by Mahmoud Assran, Mathilde Caron, Ishan Misra, Piotr Bojanowski, Florian Bordes, Pascal Vincent, Armand Joulin, Michael Rabbat, Nicolas Ballas. The paper presents a joint-embedding architecture to match the prototypes of masked patches with that of the unmasked patches. With this setup, their method yields excellent performance in the low-shot and extreme low-shot regimes.

The abstract from the paper is the following:

We propose Masked Siamese Networks (MSN), a self-supervised learning framework for learning image representations. Our approach matches the representation of an image view containing randomly masked patches to the representation of the original unmasked image. This self-supervised pre-training strategy is particularly scalable when applied to Vision Transformers since only the unmasked patches are processed by the network. As a result, MSNs improve the scalability of joint-embedding architectures, while producing representations of a high semantic level that perform competitively on low-shot image classification. For instance, on ImageNet-1K, with only 5,000 annotated images, our base MSN model achieves 72.4% top-1 accuracy, and with 1% of ImageNet-1K labels, we achieve 75.7% top-1 accuracy, setting a new state-of-the-art for self-supervised learning on this benchmark.

MSN (Masked Siamese Networks) for ViT by @sayakpaul in #18815

MarkupLM

The MarkupLM model was proposed in MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding by Junlong Li, Yiheng Xu, Lei Cui, Furu Wei. MarkupLM is BERT, but applied to HTML pages instead of raw text documents. The model incorporates additional embedding layers to improve performance, similar to LayoutLM.

The model can be used for tasks like question answering on web pages or information extraction from web pages. It obtains state-of-the-art results on 2 important benchmarks:

WebSRC, a dataset for Web-Based Structual Reading Comprehension (a bit like SQuAD but for web pages) SWDE, a dataset for information extraction from web pages (basically named-entity recogntion on web pages) The abstract from the paper is the following:

... (truncated)

Commits

9ae22fe Release: v4.23.0

df2f281 wrap forward passes with torch.no_grad() (#19412)

5f5e264 wrap forward passes with torch.no_grad() (#19413)

c6a928c wrap forward passes with torch.no_grad() (#19414)

d739a70 wrap forward passes with torch.no_grad() (#19416)

870a954 wrap forward passes with torch.no_grad() (#19438)

692c5be wrap forward passes with torch.no_grad() (#19439)

a7bc422 fix (#19469)

25cfd91 Fixed a non-working hyperlink in the README.md file (#19434)

9df953a Fix misspelled word in docstring (#19415)

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies python
opened by dependabot[bot] 1
Bump black from 22.8.0 to 22.10.0
Bumps black from 22.8.0 to 22.10.0.

Release notes

Sourced from black's releases.

22.10.0

Highlights

Runtime support for Python 3.6 has been removed. Formatting 3.6 code will still be supported until further notice.

Stable style

Fix a crash when # fmt: on is used on a different block level than # fmt: off (#3281)

Preview style

Fix a crash when formatting some dicts with parenthesis-wrapped long string keys (#3262)

Configuration

.ipynb_checkpoints directories are now excluded by default (#3293)

Add --skip-source-first-line / -x option to ignore the first line of source code while formatting (#3299)

Packaging

Executables made with PyInstaller will no longer crash when formatting several files at once on macOS. Native x86-64 executables for macOS are available once again. (#3275)

Hatchling is now used as the build backend. This will not have any effect for users who install Black with its wheels from PyPI. (#3233)

Faster compiled wheels are now available for CPython 3.11 (#3276)

Blackd

Windows style (CRLF) newlines will be preserved (#3257).

Integrations

Vim plugin: add flag (g:black_preview) to enable/disable the preview style (#3246)

Update GitHub Action to support formatting of Jupyter Notebook files via a jupyter option (#3282)

Update GitHub Action to support use of version specifiers (e.g. <23) for Black version (#3265)

Changelog

Sourced from black's changelog.

22.10.0

Highlights

Runtime support for Python 3.6 has been removed. Formatting 3.6 code will still be supported until further notice.

Stable style

Fix a crash when # fmt: on is used on a different block level than # fmt: off (#3281)

Preview style

Fix a crash when formatting some dicts with parenthesis-wrapped long string keys (#3262)

Configuration

.ipynb_checkpoints directories are now excluded by default (#3293)

Add --skip-source-first-line / -x option to ignore the first line of source code while formatting (#3299)

Packaging

Executables made with PyInstaller will no longer crash when formatting several files at once on macOS. Native x86-64 executables for macOS are available once again. (#3275)

Hatchling is now used as the build backend. This will not have any effect for users who install Black with its wheels from PyPI. (#3233)

Faster compiled wheels are now available for CPython 3.11 (#3276)

Blackd

Windows style (CRLF) newlines will be preserved (#3257).

Integrations

Vim plugin: add flag (g:black_preview) to enable/disable the preview style (#3246)

Update GitHub Action to support formatting of Jupyter Notebook files via a jupyter option (#3282)

Update GitHub Action to support use of version specifiers (e.g. <23) for Black version (#3265)

Commits

27d2014 Prepare release 22.10.0 (#3311)

4da0851 Add option to skip the first line of source code (#3299)

0359b85 Preserve crlf line endings in blackd (#3257)

27d7ea4 Bump docutils from 0.18.1 to 0.19 in /docs (#3161)

1a20c4d Bump sphinx from 5.2.1 to 5.2.3 in /docs (#3305)

980997f Bump furo from 2022.9.15 to 2022.9.29 in /docs (#3304)

b1077aa Bump myst-parser from 0.18.0 to 0.18.1 in /docs (#3303)

956bf39 Add .ipynb_checkpoints to DEFAULT_EXCLUDES (#3293)

141291a Enable build isolation under CIWB (#3297)

ddb9924 Bump pypa/cibuildwheel from 2.10.0 to 2.10.2 (#3290)

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies python
opened by dependabot[bot] 1
Bump emoji from 2.0.0 to 2.1.0
Bumps emoji from 2.0.0 to 2.1.0.

Release notes

Sourced from emoji's releases.

v.2.1.0

Added Farsi support

Added Trove classifiers for Python 3.10 and 3.11

Changelog

Sourced from emoji's changelog.

2.1.0

Added Farsi support

Added Trove classifiers for Python 3.10 and 3.11

Commits

83cec96 Update init.py

1b84734 Update CHANGES.md

1aa2e13 Update CHANGES.md

6869664 Merge pull request #230 from hugovk/rm-travis

3cb6da6 Merge pull request #229 from hugovk/add-3.10

4096998 Merge pull request #227 from AliNajafi1998/farsi-support

8be12eb Bump Development Status classifier to 5 - Production/Stable

df8cd31 Remove redundant Travis CI config

53a88af Fix GitHub typos

d1f0066 Fix PyPI typos

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies python
opened by dependabot[bot] 1
Bump coverage from 7.0.0 to 7.0.2
Bumps coverage from 7.0.0 to 7.0.2.

Changelog

Sourced from coverage's changelog.

Version 7.0.2 — 2023-01-02

Fix: when using the [run] relative_files = True setting, a relative [paths] pattern was still being made absolute. This is now fixed, closing issue 1519_.

Fix: if Python doesn't provide tomllib, then TOML configuration files can only be read if coverage.py is installed with the [toml] extra. Coverage.py will raise an error if TOML support is not installed when it sees your settings are in a .toml file. But it didn't understand that [tools.coverage] was a valid section header, so the error wasn't reported if you used that header, and settings were silently ignored. This is now fixed, closing issue 1516_.

Fix: adjusted how decorators are traced on PyPy 7.3.10, fixing issue 1515_.

Fix: the coverage lcov report did not properly implement the --fail-under=MIN option. This has been fixed.

Refactor: added many type annotations, including a number of refactorings. This should not affect outward behavior, but they were a bit invasive in some places, so keep your eyes peeled for oddities.

Refactor: removed the vestigial and long untested support for Jython and IronPython.

.. _issue 1515: nedbat/coveragepy#1515 .. _issue 1516: nedbat/coveragepy#1516 .. _issue 1519: nedbat/coveragepy#1519

.. _changes_7-0-1:

Version 7.0.1 — 2022-12-23

When checking if a file mapping resolved to a file that exists, we weren't considering files in .whl files. This is now fixed, closing issue 1511_.

File pattern rules were too strict, forbidding plus signs and curly braces in directory and file names. This is now fixed, closing issue 1513_.

Unusual Unicode or control characters in source files could prevent reporting. This is now fixed, closing issue 1512_.

The PyPy wheel now installs on PyPy 3.7, 3.8, and 3.9, closing issue 1510_.

.. _issue 1510: nedbat/coveragepy#1510 .. _issue 1511: nedbat/coveragepy#1511

... (truncated)

Commits

2f731e2 docs: sample HTML

dbbd5b7 docs: prep for 7.0.2

d08e6d0 fix: relative_files should keep relative path maps. #1519

3f0bce2 mypy: partial debug.py and pytracer.py

ffc701a mypy: test_xml.py

5580cf8 mypy: xmlreport.py

0c9b5e0 mypy: check collector.py and plugin_support.py

8f4d404 refactor: a better way to filter coverage debug pybehave

a3f3841 mypy: add cmdline.py and test_cmdline.py

09f9188 mypy: add env.py

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies python
opened by dependabot[bot] 0
Bump pydocstyle from 6.1.1 to 6.2.0
Bumps pydocstyle from 6.1.1 to 6.2.0.

Release notes

Sourced from pydocstyle's releases.

6.2.0

What's Changed

Gitpod-ify the repository by @samj1912 in PyCQA/pydocstyle#548

Updating readme to reflect support for Python 3.9 by @sanders41 in PyCQA/pydocstyle#545

Exempt properties from D401 by @TomFryers in PyCQA/pydocstyle#546

Add support for Python 3.10 by @hugovk in PyCQA/pydocstyle#554

Fix CI for python 3.10 by @samj1912 in PyCQA/pydocstyle#565

Add D419: Add and switch to "Docstring is empty" error code by @thejcannon in PyCQA/pydocstyle#559

Allow for hanging indent when documenting args in Google style by @rmorshea in PyCQA/pydocstyle#564

Fix match option to only consider basename when given a path argument by @oczkoisse in PyCQA/pydocstyle#550

Typo fix in docs/error_codes.rst by @Daniel-Mietchen in PyCQA/pydocstyle#578

docs: Fix double-dash converted to en-dash by @wjandrea in PyCQA/pydocstyle#579

Fix test warnings and improve lint configuration by @aphedges in PyCQA/pydocstyle#584

Fix broken CI checks by @aphedges in PyCQA/pydocstyle#585

Move tool configs to pyproject.toml by @Mr-Pepe in PyCQA/pydocstyle#588

Add github action to be able to release on published tag by @Pierre-Sassoulas in PyCQA/pydocstyle#582

Typo by @Fokko in PyCQA/pydocstyle#598

Move to poetry and automated relases via Github UI by @samj1912 in PyCQA/pydocstyle#614

New Contributors

@sanders41 made their first contribution in PyCQA/pydocstyle#545

@TomFryers made their first contribution in PyCQA/pydocstyle#546

@thejcannon made their first contribution in PyCQA/pydocstyle#559

@rmorshea made their first contribution in PyCQA/pydocstyle#564

@oczkoisse made their first contribution in PyCQA/pydocstyle#550

@Daniel-Mietchen made their first contribution in PyCQA/pydocstyle#578

@wjandrea made their first contribution in PyCQA/pydocstyle#579

@aphedges made their first contribution in PyCQA/pydocstyle#584

@Mr-Pepe made their first contribution in PyCQA/pydocstyle#588

@Pierre-Sassoulas made their first contribution in PyCQA/pydocstyle#582

@Fokko made their first contribution in PyCQA/pydocstyle#598

Full Changelog: https://github.com/PyCQA/pydocstyle/compare/6.1.1...6.2.0

Changelog

Sourced from pydocstyle's changelog.

6.2.0 - January 2nd, 2023

New Features

Allow for hanging indent when documenting args in Google style. (#449)

Add support for property_decorators config to ignore D401.

Add support for Python 3.10 (#554).

Replace D10X errors with D419 if docstring exists but is empty (#559).

Bug Fixes

Fix --match option to only consider filename when matching full paths (#550).

Commits

05b92ba Add testpypi to poetry repositories

7007961 Move to poetry and automated relases via Github UI (#614)

b6cb2c6 Fix a typo in the readme (#598)

f1dc7be Add github action to be able to release on published tag (#582)

a6fe422 Move tool configs to pyproject.toml (#588)

50894da Fix broken CI checks (#585)

03404c8 Fix test warnings and improve lint configuration (#584)

187bac7 docs: Fix double-dash converted to en-dash (#579)

27c6477 Remove duplicate "of" in documentation

bd49933 Fix match option to only consider basename when given a path argument (#550)

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies python
opened by dependabot[bot] 0
Bump mock from 4.0.3 to 5.0.0
Bumps mock from 4.0.3 to 5.0.0.

Changelog

Sourced from mock's changelog.

5.0.0

gh-98624: Add a mutex to unittest.mock.NonCallableMock to protect concurrent access to mock attributes.

bpo-43478: Mocks can no longer be used as the specs for other Mocks. As a result, an already-mocked object cannot have an attribute mocked using autospec=True or be the subject of a create_autospec(...) call. This can uncover bugs in tests since these Mock-derived Mocks will always pass certain tests (e.g. isinstance) and builtin assert functions (e.g. assert_called_once_with) will unconditionally pass.

bpo-45156: Fixes infinite loop on :func:unittest.mock.seal of mocks created by :func:~unittest.create_autospec.

bpo-41403: Make :meth:mock.patch raise a :exc:TypeError with a relevant error message on invalid arg. Previously it allowed a cryptic :exc:AttributeError to escape.

gh-91803: Fix an error when using a method of objects mocked with :func:unittest.mock.create_autospec after it was sealed with :func:unittest.mock.seal function.

bpo-41877: AttributeError for suspected misspellings of assertions on mocks are now pointing out that the cause are misspelled assertions and also what to do if the misspelling is actually an intended attribute name. The unittest.mock document is also updated to reflect the current set of recognised misspellings.

bpo-43478: Mocks can no longer be provided as the specs for other Mocks. As a result, an already-mocked object cannot be passed to mock.Mock(). This can uncover bugs in tests since these Mock-derived Mocks will always pass certain tests (e.g. isinstance) and builtin assert functions (e.g. assert_called_once_with) will unconditionally pass.

bpo-45010: Remove support of special method __div__ in :mod:unittest.mock. It is not used in Python 3.

gh-84753: :func:inspect.iscoroutinefunction now properly returns True when an instance of :class:unittest.mock.AsyncMock is passed to it. This makes it consistent with behavior of :func:asyncio.iscoroutinefunction. Patch by Mehdi ABAAKOUK.

bpo-46852: Remove the undocumented private float.__set_format__() method, previously known as float.__setformat__() in Python 3.7. Its docstring said: "You probably don't want to use this function. It exists mainly to be used in Python's test suite." Patch by Victor Stinner.

gh-98086: Make sure patch.dict() can be applied on async functions.

... (truncated)

Commits

6d40148 sudo no longer needed

172a36c Add support for gh issues to release.py.

c4dd650 Preparing for 5.0.0 release.

3a70a8c Update historical issue references to be bpo- prefixed

85a196e Fixup reverting backported rev example.

726e5ee latest sync point

5462a55 Fix mock code coverage. (#100580)

78fed8a gh-100287: Fix unittest.mock.seal with AsyncMock (#100496)

36e4a68 gh-83076: 3.8x speed improvement in (Async)Mock instantiation (#100252)

a2cb0be gh-98086: Now patch.dict can decorate async functions (#98095)

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies python
opened by dependabot[bot] 0
Bump pyright from 1.1.285 to 1.1.286
Bumps pyright from 1.1.285 to 1.1.286.

Release notes

Sourced from pyright's releases.

Published 1.1.286

Bug Fix: Reverted a recent update to the TOML parser that resulted in a regression. This reversion means that some TOML 1.0 features will not be handled correctly.

Bug Fix: Fixed a bug that resulted in incorrect handling of literals in the TypeVar constraint solver. This involved a pretty significant change to the constraint solver logic — one that eliminated some heuristics and special cases.

Bug Fix: Fixed a bug that caused target expressions within a chained assignment to be evaluated in the wrong order (right to left instead of left to right). This resulted in false positives and negatives in some cases where one target referred to another target. This change also makes it illegal to use a Python 2-style type comment on a line containing a chained assignment statement, reflecting the fact that Python 3-style variable type annotations are not legal here either.

Enhancement: Improved handling of TypeVarTuple constraint solving. Previously, if a TypeVarTuple appeared more than once, the corresponding tuple types needed to be identical. The constraint solver now supports the same sort of narrowing/widening within the tuple entries to find the best solution.

Bug Fix: Fixed a bug that led to a false negative during protocol matching if the protocol class refers to itself within an invariant type argument.

Enhancement: Improved handling of generic functions passed as arguments to generic higher-order functions. Pyright is now able to solve the type variables for both the generic callback and the called function.

Enhancement: Updated typeshed stubs to the latest version.

Enhancement: Improved handling of generic functions passed as arguments to generic higher-order functions that use a ParamSpec. Pyright is now able to solve the type variables for both the generic callback and the called function.

Bug Fix: Fixed a bug in the code flow engine that resulted in incorrect type evaluation in some cases involving double nested loops.

Bug Fix: Improved the method override consistency checks to detect the case where an override uses an *args parameter that is not type compatible with the overridden method's parameter types. Thanks to @mehdigmira for this contribution.

Enhancement: Improved handling of TypeVars that appear only within a Callable within a return type annotation for a function. By a strict reading of PEP 484, these should be bound to the function's scope, but practically, they are bound to the Callable. This allows a function to return a generic callable type. When TypeVars are rescoped in this manner, the TypeVar cannot be referenced within the function body because it is no longer in scope in that context.

Enhancement: Improved error handling for NewType calls

Enhancement: Completed initial implementation of PEP 696. Added support for default TypeVar types that refer to other TypeVars.

Commits

e4472b3 Published 1.1.286

See full diff in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies javascript
opened by dependabot[bot] 0
Bump numpy from 1.24.0 to 1.24.1
Bumps numpy from 1.24.0 to 1.24.1.

Release notes

Sourced from numpy's releases.

v1.24.1

NumPy 1.24.1 Release Notes

NumPy 1.24.1 is a maintenance release that fixes bugs and regressions discovered after the 1.24.0 release. The Python versions supported by this release are 3.8-3.11.

Contributors

A total of 12 people contributed to this release. People with a "+" by their names contributed a patch for the first time.

Andrew Nelson

Ben Greiner +

Charles Harris

Clément Robert

Matteo Raso

Matti Picus

Melissa Weber Mendonça

Miles Cranmer

Ralf Gommers

Rohit Goswami

Sayed Adel

Sebastian Berg

Pull requests merged

A total of 18 pull requests were merged for this release.

#22820: BLD: add workaround in setup.py for newer setuptools

#22830: BLD: CIRRUS_TAG redux

#22831: DOC: fix a couple typos in 1.23 notes

#22832: BUG: Fix refcounting errors found using pytest-leaks

#22834: BUG, SIMD: Fix invalid value encountered in several ufuncs

#22837: TST: ignore more np.distutils.log imports

#22839: BUG: Do not use getdata() in np.ma.masked_invalid

#22847: BUG: Ensure correct behavior for rows ending in delimiter in...

#22848: BUG, SIMD: Fix the bitmask of the boolean comparison

#22857: BLD: Help raspian arm + clang 13 about __builtin_mul_overflow

#22858: API: Ensure a full mask is returned for masked_invalid

#22866: BUG: Polynomials now copy properly (#22669)

#22867: BUG, SIMD: Fix memory overlap in ufunc comparison loops

#22868: BUG: Fortify string casts against floating point warnings

#22875: TST: Ignore nan-warnings in randomized out tests

#22883: MAINT: restore npymath implementations needed for freebsd

#22884: BUG: Fix integer overflow in in1d for mixed integer dtypes #22877

#22887: BUG: Use whole file for encoding checks with charset_normalizer.

Checksums

... (truncated)

Commits

a28f4f2 Merge pull request #22888 from charris/prepare-1.24.1-release

f8fea39 REL: Prepare for the NumPY 1.24.1 release.

6f491e0 Merge pull request #22887 from charris/backport-22872

48f5fe4 BUG: Use whole file for encoding checks with charset_normalizer [f2py] (#22...

0f3484a Merge pull request #22883 from charris/backport-22882

002c60d Merge pull request #22884 from charris/backport-22878

38ef9ce BUG: Fix integer overflow in in1d for mixed integer dtypes #22877 (#22878)

bb00c68 MAINT: restore npymath implementations needed for freebsd

64e09c3 Merge pull request #22875 from charris/backport-22869

dc7bac6 TST: Ignore nan-warnings in randomized out tests

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies python
opened by dependabot[bot] 0
Bump crate-ci/typos from 1.13.4 to 1.13.6
Bumps crate-ci/typos from 1.13.4 to 1.13.6.

Release notes

Sourced from crate-ci/typos's releases.

v1.13.6

[1.13.6] - 2022-12-20

Features

(precommit) Add a docker variant

v1.13.5

[1.13.5] - 2022-12-19

(docker) Ensure correct libc is available

(precommit) Restrict what stages it runs during

Changelog

Sourced from crate-ci/typos's changelog.

[1.13.6] - 2022-12-20

Features

(precommit) Add a docker variant

[1.13.5] - 2022-12-19

(docker) Ensure correct libc is available

(precommit) Restrict what stages it runs during

Commits

6efbdc3 chore: Release

629a9bf docs: Update changelog

93d01ae Merge pull request #640 from scop/feat/pre-commit-docker

ec3ffa5 feat: Add Docker pre-commit hook

f268a89 chore: Release

822d759 docs: Update changelog

51f0b65 Merge pull request #639 from scop/feat/docker-entrypoint

87d2236 Merge pull request #638 from scop/fix/docker-debian-sync

7f0e8c7 chore: Make Debian dist a Docker build argument

5737842 chore: Use rust:bullseye as Docker builder

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Type: Dependencies github_actions
opened by dependabot[bot] 0

Releases(v1.5.5)

v1.5.5(Sep 15, 2022)
Fix for transformers v4.22.0

Source code(tar.gz)
Source code(zip)
v1.5.4(Jul 19, 2022)
Fix for emoji 2.0.0

Source code(tar.gz)
Source code(zip)
v1.5.2(Apr 14, 2022)
FIx import error (#105)

Source code(tar.gz)
Source code(zip)
v1.5.1(Apr 14, 2022)
Use train extra for training models

Source code(tar.gz)
Source code(zip)
v1.5.0(Apr 13, 2022)
⚠️ Breaking changes

If you want to disambiguate sentence boundaries for line breaks, you need to install bunkai with lb option.

$ pip install -U bunkai[lb]
Source code(tar.gz)
Source code(zip)
v1.4.5(Feb 2, 2022)
Drop Python 3.7 support (#90)

Source code(tar.gz)
Source code(zip)
v1.4.3(Jul 28, 2021)
Bug fix

Fix indirect_quote_exception_annotator.py (#33)

Source code(tar.gz)
Source code(zip)
v1.4.2(Jul 27, 2021)
Bug fix

Fix emoji annotator (#32)

Source code(tar.gz)
Source code(zip)
v1.4.1(Jul 15, 2021)
New features

Added --version Option

Source code(tar.gz)
Source code(zip)
v1.4.0(Jul 9, 2021)
Bug fix

Fix IndexError when an input text ends with No. (#27, Thanks to @t-yamamura)

Source code(tar.gz)
Source code(zip)
v1.3.0(Jun 1, 2021)
Breaking changes

Renamed _eos() to eos()

New features

Added --ma option to show morphological analysis results in CLI

Source code(tar.gz)
Source code(zip)
v1.2.0(May 31, 2021)
Breaking changes

Use Path instead of str for arguments of constructors and methods

New features

Added an alias to call bunkai.algorithm.bunkai_sbd.bunkai_sbd.BunkaiSentenceBoundaryDisambiguation simply

from bunkai import Bunkai bunkai = Bunkai() #Alias for BunkaiSentenceBoundaryDisambiguation
Source code(tar.gz)
Source code(zip)
v1.1.1(Apr 26, 2021)
Please use --setup option for setup to disambiguate sentence boundaries for line breaks like bunkai --model bunkai-model-directory --setup.

The attached file is trained on BCCWJ by National Institute for Japanese Language and Linguistics, and Megagon Labs. It is licensed under Apache-2.0 License.

Source code(tar.gz)
Source code(zip)
bunkai-model-setup-20210426.zip(320.38 MB)