Open source annotation tool for machine learning practitioners.

Overview

doccano

Codacy Badge doccano CI

doccano is an open source text annotation tool for humans. It provides annotation features for text classification, sequence labeling and sequence to sequence tasks. So, you can create labeled data for sentiment analysis, named entity recognition, text summarization and so on. Just create a project, upload data and start annotating. You can build a dataset in hours.

Demo

You can try the annotation demo.

Demo image

Features

  • Collaborative annotation
  • Multi-language support
  • Mobile support
  • Emoji 😄 support
  • Dark theme
  • RESTful API

Usage

Three options to run doccano:

  • pip(experimental)
  • Docker
  • Docker Compose
    • production
    • development

For docker and docker compose, you need to install the following dependencies:

pip installation

To install doccano, simply run:

pip install doccano

After installation, run the following commands:

# Initialize database.
doccano init
# Create a super user.
doccano createuser --username admin --password pass
# Start a web server.
doccano webserver --port 8000

In another terminal, run the following command:

# Start the task queue to handle file upload/download.
doccano task

Go to http://127.0.0.1:8000/.

By default, sqlite3 is used for the default database. If you want to use PostgreSQL, install the additional dependency:

pip install 'doccano[postgresql]'

Create an .env file with variables in the following format, each on a new line:

POSTGRES_USER=doccano
POSTGRES_PASSWORD=doccano
POSTGRES_DB=doccano

Then, pass it to docker run with the --env-file flag:

docker run --rm -d \
    -p 5432:5432 \
    -v postgres-data:/var/lib/postgresql/data \
    --env-file .env \
    postgres:13.3-alpine

And set DATABASE_URL environment variable:

# Please replace each variable.
DATABASE_URL=postgres://${POSTGRES_USER}:${POSTGRES_PASSWORD}@localhost:5432/${POSTGRES_DB}?sslmode=disable

Now run the command as before:

doccano init
doccano createuser --username admin --password pass
doccano webserver --port 8000

# In another terminal.
# Don't forget to set DATABASE_URL
doccano task

Docker

As a one-time setup, create a Docker container as follows:

docker pull doccano/doccano
docker container create --name doccano \
  -e "ADMIN_USERNAME=admin" \
  -e "[email protected]" \
  -e "ADMIN_PASSWORD=password" \
  -p 8000:8000 doccano/doccano

Next, start doccano by running the container:

docker container start doccano

To stop the container, run docker container stop doccano -t 5. All data created in the container will persist across restarts.

Go to http://127.0.0.1:8000/.

Docker Compose

You need to clone the repository:

git clone https://github.com/doccano/doccano.git
cd doccano

Note for Windows developers: Be sure to configure git to correctly handle line endings or you may encounter status code 127 errors while running the services in future steps. Running with the git config options below will ensure your git directory correctly handles line endings.

git clone https://github.com/doccano/doccano.git --config core.autocrlf=input

Then, create an .env file with variables in the following format(see ./config/.env.example):

# platform settings
ADMIN_USERNAME=admin
ADMIN_PASSWORD=password
[email protected]

# rabbit mq settings
RABBITMQ_DEFAULT_USER=doccano
RABBITMQ_DEFAULT_PASS=doccano

# database settings
POSTGRES_USER=doccano
POSTGRES_PASSWORD=doccano
POSTGRES_DB=doccano

Production

After running the following command, access http://0.0.0.0/.

docker-compose -f docker-compose.prod.yml --env-file ./config/.env.example up

Development

After running the following command, access http://127.0.0.1:3000/. If you want to use the admin site, please access http://127.0.0.1:8000/admin/.

docker-compose -f docker-compose.dev.yml --env-file ./config/.env.example up

You can run the the test codes for the backend with the following command:

docker exec doccano_backend_1 python backend/manage.py test api

One-click Deployment

Service Button
AWS1 AWS CloudFormation Launch Stack SVG Button
Heroku Deploy

FAQ

See the documentation for details.

Contribution

As with any software, doccano is under continuous development. If you have requests for features, please file an issue describing your request. Also, if you want to see work towards a specific feature, feel free to contribute by working towards it. The standard procedure is to fork the repository, add a feature, fix a bug, then file a pull request that your changes are to be merged into the main repository and included in the next release.

Here are some tips might be helpful. How to Contribute to Doccano Project

Citation

@misc{doccano,
  title={{doccano}: Text Annotation Tool for Human},
  url={https://github.com/doccano/doccano},
  note={Software available from https://github.com/doccano/doccano},
  author={
    Hiroki Nakayama and
    Takahiro Kubo and
    Junya Kamura and
    Yasufumi Taniguchi and
    Xu Liang},
  year={2018},
}

Contact

For help and feedback, please feel free to contact the author.

Footnotes

  1. (1) EC2 KeyPair cannot be created automatically, so make sure you have an existing EC2 KeyPair in one region. Or create one yourself. (2) If you want to access doccano via HTTPS in AWS, here is an instruction.

Comments
  • Import labeled dataset

    Import labeled dataset

    Feature Request: import labeled data sets in BIO format. Like:

    SOCCER	O
    -	O
    JAPAN	B-LOC
    GET	O
    LUCKY	O
    WIN	O
    ,	O
    CHINA	B-PER
    IN	O
    SURPRISE	O
    DEFEAT	O
    .	O
    
    Nadim	B-PER
    Ladki	I-PER
    
    AL-AIN	B-LOC
    ,	O
    United	B-LOC
    Arab	I-LOC
    Emirates	I-LOC
    1996-12-06 O
    

    Btw, I love your tool, thanks for doing it open source

    enhancement 
    opened by ismaeIfm 30
  • Okta OIDC social login is not available

    Okta OIDC social login is not available

    Problem

    The documentation for activating Okta OIDC social authentication indicates that when I set the three following environment variables, the login screen will have an option to authenticate using Okta OIDC:

    1. OAUTH_OKTA_OPENIDCONNECT_KEY
    2. OAUTH_OKTA_OPENIDCONNECT_SECRET
    3. OAUTH_OKTA_OPENIDCONNECT_API_URL

    This is not happening.

    How to reproduce the behaviour

    1. Run doccano:1.2.1 image with the following command:
    docker run --rm --interactive --tty
        --env SECRET_KEY=REDACTED \
        --env [email protected] \
        --env ADMIN_PASSWORD=REDACTED \
        --env ADMIN_USERNAME=admin \
        --env OAUTH_OKTA_OPENIDCONNECT_KEY=REDACTED \
        --env OAUTH_OKTA_OPENIDCONNECT_SECRET=REDACTED \
        --env OAUTH_OKTA_OPENIDCONNECT_API_URL=https://my-identity.okta.com/api \
        --publish 8000:8000 \
        doccano/doccano:1.2.1
    
    1. Navigate to doccano in your browser at: http://locahost:8000
    2. Select the "login" button at the top-right of the page

    Expected behavior

    I expect to see the "Login with Okta" option as depicted in this image

    Actual behavior

    I end up seeing the same login screen/options that I get when I do not configure Okta OIDC social authentication.

    My environment

    • I am running the doccano/doccano:1.2.1 Docker image
    • I am running behind a enterprise firewall with a SSL packet inspection and enterprise certificate authority. Other applications can perform Okta OIDC SSO in this environment, but app eventually needs to recognize system installed trusted CA chain. I do not expect this to be the source of this problem before we actually submit login credentials, however - there is a possibility.
    • docker-desktop v20.10.5 on Windows 10
    • Above command issued from WSL version 1, running WSL (version 1 with Ubuntu 18.04 LTS) bash:
    $ uname -a
    Linux MY-PC 4.4.0-18362-Microsoft #1049-Microsoft Thu Aug 14 12:01:00 PST 2020 x86_64 x86_64 x86_64 GNU/Linux
    
    bug 
    opened by yoshiya8 21
  • can't annotate after uploading the dataset

    can't annotate after uploading the dataset

    If you open a GitHub issue, here is our policy:

    1. It must be a bug, a feature request, or a significant problem with documentation (for small docs fixes please send a PR instead).
    2. The form below must be filled out.

    System information

    • Windows 10
    • Python 3.6.5

    Describe the problem

    For text classification, after I uploaded the text documents and defined the labels per the instruction, I clicked the annotate data tab, but it only showed an empty page, any clues what went wrong?

    Source code / logs

    This is a snapshot of what is shown when I clicked annotate data tab image

    bug 
    opened by xuanlv2 21
  • Does Doccano support relation marking between entities?

    Does Doccano support relation marking between entities?

    Hi, I am working on Joint Relation and Entity problem. I want to extract relationship between the entities. So, in order to do so, I need to annotate the data with NER and Relationship between them. Is relationship annotation possible?

    Please help. Thanks

    question 
    opened by karndeepsingh 19
  • `Create` button does not do anything

    `Create` button does not do anything

    How to reproduce the behaviour

    After following the tutorial and setting up the localhost server, I logged in and tried to create a toy sequence labeling project, and then when I click Create nothing happens. I see no errors on the terminal windows either.

    I saw an issue that was closed a couple days ago with the same problem but I have no idea what the solution was.

    Your Environment

    • Operating System: macOS BigSur 11.6
    • Python Version Used: 3.8.0
    • When you install doccano: yesterday
    • How did you install doccano (Heroku button etc): pip within a venv environement
    bug 
    opened by slkh 18
  • How to connect to a local REST API for auto annotation?

    How to connect to a local REST API for auto annotation?

    How to reproduce the behaviour

    I want to use a local auto-annotator using spaCy. So I made a simple API using FastAPI as follows - NOTE: Right now it does not annotate, but the idea is to be able to communicate first. Atleast for now I should get the input text back as it is. Also, since the rest api is local, I exposed port 7000 of container to 7000 of my host

    from fastapi import FastAPI
    import uvicorn
    from pydantic import BaseModel
    
    
    class TextToAnnotate(BaseModel):
        text: str
    
    app = FastAPI()
    
    @app.post("/auto_annotate")
    async def auto_annotate(doc: TextToAnnotate):
        return {"message": doc.text}
    
    if __name__=="__main__":
        uvicorn.run("auto_annotate:app", host='0.0.0.0', port=7000)
    

    I checked this API from Postman by sending a simple text from body and was able to get the message back - image

    Then in doccano, I tried to setup the Auto Annotator as follows -

    1. Config Template - Custom REST Request image

    2. Parameters, headers, Body and Testing url -0.0.0.0:7000/auto_annotate method - POST image

    3. Params - Nothing

    4. Headers - Content-Type - application/json

    5. Body - Just provided the key as text image

    6. Test the parameters - Sample Text - "Does this work?" image I don't get any output. it says that it Failed to establish a connection

    Your Environment

    • Operating System: Windows 10
    • Python Version Used: Whatever ships with the Docker image
    • When you install doccano: 17th June 2021
    • How did you install doccano (Heroku button etc): Dockerfile
    opened by armsp 18
  • Login found——Error: Request failed with status code 500

    Login found——Error: Request failed with status code 500

    step1: run: docker-compose -f docker-compose.prod.yml up image

    step2: login username:admin password:password image

    then remind me status code 500. image

    terminate response. image

    Then can create dataset and operate other. but can't annotation, i see Home->step5. annotate the dataset, but now i try annotate found not operate. image

    bug 
    opened by zhanwen 17
  • Convert label to label-ID when exporting

    Convert label to label-ID when exporting

    System information

    • Mac / Docker / latest chakkiworks/doccano image

    Describe the problem

    I´m not able to figure out which label-ID corresponds too which tag-name when exporting document classification in JSONP.

    There have been related issues in the past and a patch, but I can´t seem to figure out how to get the labels in the json1 format.

    Cheers

    enhancement 
    opened by tovare 17
  • RTL language Support

    RTL language Support

    Hi, i'm not sure whether this should be a feature request or a bug (apologies in advance), but the annotations boxes in sequence labeling tasks get wrong positions and doesn't align with the actual order of the words, in case of a RTL language. more specifically i'm using doccano for Arabic project and here's an example:

    original text rendered right(correct order) image

    after adding a label: image

    hope this example makes it clear (focus on the order of the words and the part "جملة اختبار")

    Thanks!

    enhancement 
    opened by muhnashX 16
  • can not export data,there is nothing in the export compressed file

    can not export data,there is nothing in the export compressed file

    How to reproduce the behaviour

    Your Environment

    • Operating System: ubuntu16.4
    • Python Version Used:
    • When you install doccano: 2022/3/23
    • How did you install doccano (Heroku button etc): docker
    bug 
    opened by amzfc 15
  • can not see any 'projects' in the projects page with docker compose depoly

    can not see any 'projects' in the projects page with docker compose depoly

    If you open a GitHub issue, here is our policy:

    1. It must be a bug, a feature request, or a significant problem with documentation (for small docs fixes please send a PR instead).
    2. The form below must be filled out.

    System information

    • centos 7.4:
    • docker 18.09.5:

    Describe the problem

    as the READEME describe, I deployed the doccano use the docker-compose method,after the system started,:

    1. login
    2. click 'projects' and I cannt see anything in this page like this

    and I open the developer console, a js file cannot find by the system, like this

    also I cannot find this js file in the app/server folder, but in the app/server/static

    Source code / logs

    docker node logs:

    Starting webpack
    
    > [email protected] start /src/app/server
    > cross-env HOT_RELOAD=1 DEBUG=1 webpack-dev-server
    
    ℹ 「wds」: Project is running at http://0.0.0.0:8080/
    ℹ 「wds」: webpack output is served from http://127.0.0.1:8080/
    ℹ 「wdm」: wait until bundle finished: /static/js/
    ℹ 「wdm」: Hash: 3f63afd2adca1b8d0559
    Version: webpack 4.12.0
    Time: 4963ms
    Built at: 2019-05-21 08:35:39
                              Asset      Size                        Chunks             Chunk Names
                           index.js  1.52 MiB                         index  [emitted]  index
               demo_named_entity.js  1.93 MiB             demo_named_entity  [emitted]  demo_named_entity
                demo_translation.js  1.88 MiB              demo_translation  [emitted]  demo_translation
         document_classification.js  4.19 MiB       document_classification  [emitted]  document_classification
                download_seq2seq.js   4.1 MiB              download_seq2seq  [emitted]  download_seq2seq
      download_sequence_labeling.js  4.09 MiB    download_sequence_labeling  [emitted]  download_sequence_labeling
    download_text_classification.js   4.1 MiB  download_text_classification  [emitted]  download_text_classification
                       guideline.js  1.94 MiB                     guideline  [emitted]  guideline
        demo_text_classification.js  1.91 MiB      demo_text_classification  [emitted]  demo_text_classification
                           label.js   1.9 MiB                         label  [emitted]  label
                        projects.js  1.86 MiB                      projects  [emitted]  projects
                         seq2seq.js  4.15 MiB                       seq2seq  [emitted]  seq2seq
               sequence_labeling.js  4.22 MiB             sequence_labeling  [emitted]  sequence_labeling
                           stats.js  4.42 MiB                         stats  [emitted]  stats
                  upload_seq2seq.js   4.1 MiB                upload_seq2seq  [emitted]  upload_seq2seq
        upload_sequence_labeling.js   4.1 MiB      upload_sequence_labeling  [emitted]  upload_sequence_labeling
      upload_text_classification.js   4.1 MiB    upload_text_classification  [emitted]  upload_text_classification
    Entrypoint index = index.js
    Entrypoint sequence_labeling = sequence_labeling.js
    Entrypoint document_classification = document_classification.js
    Entrypoint seq2seq = seq2seq.js
    Entrypoint projects = projects.js
    Entrypoint stats = stats.js
    Entrypoint label = label.js
    Entrypoint guideline = guideline.js
    Entrypoint demo_text_classification = demo_text_classification.js
    Entrypoint demo_named_entity = demo_named_entity.js
    Entrypoint demo_translation = demo_translation.js
    Entrypoint upload_seq2seq = upload_seq2seq.js
    Entrypoint upload_sequence_labeling = upload_sequence_labeling.js
    Entrypoint upload_text_classification = upload_text_classification.js
    Entrypoint download_seq2seq = download_seq2seq.js
    Entrypoint download_sequence_labeling = download_sequence_labeling.js
    Entrypoint download_text_classification = download_text_classification.js
     [1] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/sequence_labeling.js 52 bytes {sequence_labeling} [built]
     [2] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/document_classification.js 52 bytes {document_classification} [built]
     [3] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/seq2seq.js 52 bytes {seq2seq} [built]
     [4] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/projects.js 52 bytes {projects} [built]
     [5] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/stats.js 52 bytes {stats} [built]
     [6] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/label.js 52 bytes {label} [built]
     [7] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/guideline.js 52 bytes {guideline} [built]
     [8] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/demo/demo_text_classification.js 52 bytes {demo_text_classification} [built]
     [9] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/demo/demo_named_entity.js 52 bytes {demo_named_entity} [built]
    [10] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/demo/demo_translation.js 52 bytes {demo_translation} [built]
    [11] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/upload_seq2seq.js 52 bytes {upload_seq2seq} [built]
    [12] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/upload_sequence_labeling.js 52 bytes {upload_sequence_labeling} [built]
    [13] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/upload_text_classification.js 52 bytes {upload_text_classification} [built]
    [14] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/download_seq2seq.js 52 bytes {download_seq2seq} [built]
    [16] multi (webpack)-dev-server/client?http://0.0.0.0:8080 (webpack)/hot/dev-server.js ./static/js/download_text_classification.js 52 bytes {download_text_classification} [built]
        + 567 hidden modules
     「wdm」: Compiled successfully.
    

    python logs:

    Installing dependencies
    Requirement already satisfied: applicationinsights==0.11.7 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 1)) (0.11.7)
    Requirement already satisfied: dj-database-url==0.5.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 2)) (0.5.0)
    Requirement already satisfied: Django==2.1.7 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 3)) (2.1.7)
    Requirement already satisfied: django-filter==2.0.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 4)) (2.0.0)
    Requirement already satisfied: django-heroku==0.3.1 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 5)) (0.3.1)
    Requirement already satisfied: django-webpack-loader==0.6.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 6)) (0.6.0)
    Requirement already satisfied: django-widget-tweaks==1.4.2 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 7)) (1.4.2)
    Requirement already satisfied: django-polymorphic==2.0.3 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 8)) (2.0.3)
    Requirement already satisfied: django-rest-polymorphic==0.1.8 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 9)) (0.1.8)
    Requirement already satisfied: djangorestframework==3.8.2 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 10)) (3.8.2)
    Requirement already satisfied: djangorestframework-csv==2.1.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 11)) (2.1.0)
    Requirement already satisfied: djangorestframework-filters==0.10.2 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 12)) (0.10.2)
    Requirement already satisfied: environs==4.1.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 13)) (4.1.0)
    Requirement already satisfied: djangorestframework-xml==1.4.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 14)) (1.4.0)
    Requirement already satisfied: Faker==0.8.8 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 15)) (0.8.8)
    Requirement already satisfied: flake8==3.6.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 16)) (3.6.0)
    Requirement already satisfied: furl==2.0.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 17)) (2.0.0)
    Requirement already satisfied: gunicorn==19.9.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 18)) (19.9.0)
    Requirement already satisfied: mixer==6.1.3 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 19)) (6.1.3)
    Requirement already satisfied: model-mommy==1.6.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 20)) (1.6.0)
    Requirement already satisfied: psycopg2-binary==2.7.7 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 21)) (2.7.7)
    Requirement already satisfied: python-dateutil==2.7.3 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 22)) (2.7.3)
    Requirement already satisfied: pytz==2018.4 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 23)) (2018.4)
    Requirement already satisfied: six==1.11.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 24)) (1.11.0)
    Requirement already satisfied: seqeval==0.0.6 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 25)) (0.0.6)
    Requirement already satisfied: social-auth-app-django==3.1.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 26)) (3.1.0)
    Requirement already satisfied: social-auth-core[azuread]==3.0.0 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 27)) (3.0.0)
    Requirement already satisfied: text-unidecode==1.2 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 28)) (1.2)
    Requirement already satisfied: tornado==5.0.2 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 29)) (5.0.2)
    Requirement already satisfied: unittest-xml-reporting==2.5.1 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 30)) (2.5.1)
    Requirement already satisfied: whitenoise[brotli]==4.1.2 in /src/venv/lib/python3.6/site-packages (from -r /src/tools/../requirements.txt (line 31)) (4.1.2)
    Requirement already satisfied: psycopg2 in /src/venv/lib/python3.6/site-packages (from django-heroku==0.3.1->-r /src/tools/../requirements.txt (line 5)) (2.8.2)
    Requirement already satisfied: unicodecsv in /src/venv/lib/python3.6/site-packages (from djangorestframework-csv==2.1.0->-r /src/tools/../requirements.txt (line 11)) (0.14.1)
    Requirement already satisfied: marshmallow>=2.7.0 in /src/venv/lib/python3.6/site-packages (from environs==4.1.0->-r /src/tools/../requirements.txt (line 13)) (2.19.2)
    Requirement already satisfied: python-dotenv in /src/venv/lib/python3.6/site-packages (from environs==4.1.0->-r /src/tools/../requirements.txt (line 13)) (0.10.2)
    Requirement already satisfied: defusedxml>=0.3 in /src/venv/lib/python3.6/site-packages (from djangorestframework-xml==1.4.0->-r /src/tools/../requirements.txt (line 14)) (0.6.0)
    Requirement already satisfied: pycodestyle<2.5.0,>=2.4.0 in /src/venv/lib/python3.6/site-packages (from flake8==3.6.0->-r /src/tools/../requirements.txt (line 16)) (2.4.0)
    Requirement already satisfied: setuptools>=30 in /src/venv/lib/python3.6/site-packages (from flake8==3.6.0->-r /src/tools/../requirements.txt (line 16)) (41.0.1)
    Requirement already satisfied: mccabe<0.7.0,>=0.6.0 in /src/venv/lib/python3.6/site-packages (from flake8==3.6.0->-r /src/tools/../requirements.txt (line 16)) (0.6.1)
    Requirement already satisfied: pyflakes<2.1.0,>=2.0.0 in /src/venv/lib/python3.6/site-packages (from flake8==3.6.0->-r /src/tools/../requirements.txt (line 16)) (2.0.0)
    Requirement already satisfied: orderedmultidict>=1.0 in /src/venv/lib/python3.6/site-packages (from furl==2.0.0->-r /src/tools/../requirements.txt (line 17)) (1.0)
    Requirement already satisfied: numpy>=1.14.0 in /src/venv/lib/python3.6/site-packages (from seqeval==0.0.6->-r /src/tools/../requirements.txt (line 25)) (1.16.3)
    Requirement already satisfied: requests-oauthlib>=0.6.1 in /src/venv/lib/python3.6/site-packages (from social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (1.2.0)
    Requirement already satisfied: PyJWT>=1.4.0 in /src/venv/lib/python3.6/site-packages (from social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (1.7.1)
    Requirement already satisfied: python3-openid>=3.0.10; python_version >= "3.0" in /src/venv/lib/python3.6/site-packages (from social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (3.1.0)
    Requirement already satisfied: oauthlib>=1.0.3 in /src/venv/lib/python3.6/site-packages (from social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (3.0.1)
    Requirement already satisfied: requests>=2.9.1 in /src/venv/lib/python3.6/site-packages (from social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (2.22.0)
    Requirement already satisfied: cryptography>=2.1.1; extra == "azuread" in /src/venv/lib/python3.6/site-packages (from social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (2.6.1)
    Requirement already satisfied: Brotli; extra == "brotli" in /src/venv/lib/python3.6/site-packages (from whitenoise[brotli]==4.1.2->-r /src/tools/../requirements.txt (line 31)) (1.0.7)
    Requirement already satisfied: chardet<3.1.0,>=3.0.2 in /src/venv/lib/python3.6/site-packages (from requests>=2.9.1->social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (3.0.4)
    Requirement already satisfied: urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1 in /src/venv/lib/python3.6/site-packages (from requests>=2.9.1->social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (1.25.2)
    Requirement already satisfied: certifi>=2017.4.17 in /src/venv/lib/python3.6/site-packages (from requests>=2.9.1->social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (2019.3.9)
    Requirement already satisfied: idna<2.9,>=2.5 in /src/venv/lib/python3.6/site-packages (from requests>=2.9.1->social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (2.8)
    Requirement already satisfied: asn1crypto>=0.21.0 in /src/venv/lib/python3.6/site-packages (from cryptography>=2.1.1; extra == "azuread"->social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (0.24.0)
    Requirement already satisfied: cffi!=1.11.3,>=1.8 in /src/venv/lib/python3.6/site-packages (from cryptography>=2.1.1; extra == "azuread"->social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (1.12.3)
    Requirement already satisfied: pycparser in /src/venv/lib/python3.6/site-packages (from cffi!=1.11.3,>=1.8->cryptography>=2.1.1; extra == "azuread"->social-auth-core[azuread]==3.0.0->-r /src/tools/../requirements.txt (line 27)) (2.19)
    Initializing database
    Database unavailable on attempt 1/60: could not connect to server: Connection refused
            Is the server running on host "postgres" (172.22.0.2) and accepting
            TCP/IP connections on port 5432?
    Operations to perform:
      Apply all migrations: admin, auth, contenttypes, server, sessions, social_django
    Running migrations:
      Applying contenttypes.0001_initial... OK
      Applying auth.0001_initial... OK
      Applying admin.0001_initial... OK
      Applying admin.0002_logentry_remove_auto_add... OK
      Applying admin.0003_logentry_add_action_flag_choices... OK
      Applying contenttypes.0002_remove_content_type_name... OK
      Applying auth.0002_alter_permission_name_max_length... OK
      Applying auth.0003_alter_user_email_max_length... OK
      Applying auth.0004_alter_user_username_opts... OK
      Applying auth.0005_alter_user_last_login_null... OK
      Applying auth.0006_require_contenttypes_0002... OK
      Applying auth.0007_alter_validators_add_error_messages... OK
      Applying auth.0008_alter_user_username_max_length... OK
      Applying auth.0009_alter_user_last_name_max_length... OK
      Applying server.0001_initial... OK
      Applying sessions.0001_initial... OK
      Applying social_django.0001_initial... OK
      Applying social_django.0002_add_related_name... OK
      Applying social_django.0003_alter_email_max_length... OK
      Applying social_django.0004_auto_20160423_0400... OK
      Applying social_django.0005_auto_20160727_2333... OK
      Applying social_django.0006_partial... OK
      Applying social_django.0007_code_timestamp... OK
      Applying social_django.0008_partial_timestamp... OK
    Superuser created successfully.
    Starting django
    [21/May/2019 08:35:53] "GET / HTTP/1.1" 200 11489
    [21/May/2019 08:35:53] "GET /static/css/forum.css HTTP/1.1" 304 0
    [21/May/2019 08:35:53] "GET /static/images/logo.png HTTP/1.1" 304 0
    [21/May/2019 08:35:53] "GET /static/images/named_entity_recognition.png HTTP/1.1" 304 0
    [21/May/2019 08:35:53] "GET /static/images/sentiment_analysis.png HTTP/1.1" 304 0
    [21/May/2019 08:35:53] "GET /static/images/translation.png HTTP/1.1" 304 0
    [21/May/2019 08:35:57] "GET /login/ HTTP/1.1" 200 6069
    [21/May/2019 08:35:57] "GET /static/css/forum.css HTTP/1.1" 304 0
    [21/May/2019 08:35:57] "GET /static/images/logo.png HTTP/1.1" 304 0
    [21/May/2019 08:35:59] "POST /login/ HTTP/1.1" 302 0
    [21/May/2019 08:35:59] "GET /projects/ HTTP/1.1" 200 3766
    [21/May/2019 08:35:59] "GET /static/images/logo.png HTTP/1.1" 304 0
    [21/May/2019 08:35:59] "GET /static/css/forum.css HTTP/1.1" 304 0
    [21/May/2019 08:36:02] "GET /projects/ HTTP/1.1" 200 3766
    [21/May/2019 08:36:02] "GET /static/css/forum.css HTTP/1.1" 304 0
    [21/May/2019 08:36:02] "GET /static/images/logo.png HTTP/1.1" 304 0
    [21/May/2019 08:36:05] "GET /static/css/forum.css HTTP/1.1" 304 0
    [21/May/2019 08:36:07] "GET /projects/ HTTP/1.1" 200 3766
    [21/May/2019 08:36:07] "GET /static/css/forum.css HTTP/1.1" 304 0
    [21/May/2019 08:36:07] "GET /static/images/logo.png HTTP/1.1" 304 0
    [21/May/2019 08:36:08] "GET /static/images/favicons/favicon-32x32.png HTTP/1.1" 200 2379
    [21/May/2019 08:38:09] "GET / HTTP/1.1" 200 10860
    [21/May/2019 08:38:09] "GET /static/css/forum.css HTTP/1.1" 304 0
    [21/May/2019 08:38:09] "GET /static/images/logo.png HTTP/1.1" 304 0
    [21/May/2019 08:38:09] "GET /static/images/named_entity_recognition.png HTTP/1.1" 304 0
    [21/May/2019 08:38:09] "GET /static/images/translation.png HTTP/1.1" 304 0
    [21/May/2019 08:38:09] "GET /static/images/sentiment_analysis.png HTTP/1.1" 304 0
    [21/May/2019 08:38:12] "GET /projects/ HTTP/1.1" 200 3766
    [21/May/2019 08:38:12] "GET /static/css/forum.css HTTP/1.1" 304 0
    [21/May/2019 08:38:12] "GET /static/images/logo.png HTTP/1.1" 304 0
    [21/May/2019 08:38:36] "GET /projects/ HTTP/1.1" 200 3766
    [21/May/2019 08:38:36] "GET /static/images/logo.png HTTP/1.1" 304 0
    [21/May/2019 08:38:36] "GET /static/css/forum.css HTTP/1.1" 304 0
    [21/May/2019 08:38:37] "GET /static/images/favicons/favicon-32x32.png HTTP/1.1" 200 2379
    [21/May/2019 08:44:17] "GET /static/css/forum.css HTTP/1.1" 304 0
    [21/May/2019 08:44:19] "GET /projects/ HTTP/1.1" 200 3766
    [21/May/2019 08:44:19] "GET /static/css/forum.css HTTP/1.1" 304 0
    [21/May/2019 08:44:19] "GET /static/images/logo.png HTTP/1.1" 304 0
    [21/May/2019 08:44:20] "GET /static/images/favicons/favicon-32x32.png HTTP/1.1" 200 2379
    
    question 
    opened by littleji 15
  • Import Dataset Issues

    Import Dataset Issues

    How to reproduce the behaviour

    I have been trying to upload dataset since morning but unable to import to doccano i tried every extension format but no results could you help

    Your Environment

    • Operating System: windows 11
    • Python Version Used:3.9.13
    • When you install doccano: 29/12/2022
    • How did you install doccano (Heroku button etc): using pip and the installation process mentioned at first place # Initialize database. doccano init

    Create a super user.

    doccano createuser --username admin --password pass

    Start a web server.

    doccano webserver --port 8000

    opened by kamrankiyani777 1
  • Issue with `doccano init`: `wait_for_db` command not found

    Issue with `doccano init`: `wait_for_db` command not found

    I have an issue with the doccano installation Init doesn't work.

    How to reproduce the problem

    pip install doccano
    doccano init
    

    Error message

    [2022-12-28 10:43:27 +0000] [189] [INFO] [django_drf_filepond.apps::ready::61] App init: no django-storages backend configured, using default (local) storage backend if set, otherwise you need to manage file storage independently of this app.
    Setup Database.
    Traceback (most recent call last):
      File "/home/jeremy/miniconda3/lib/python3.9/site-packages/django/core/management/__init__.py", line 114, in call_command
        app_name = get_commands()[command_name]
    KeyError: 'wait_for_db'
    
    During handling of the above exception, another exception occurred:
    
    Traceback (most recent call last):
      File "/home/jeremy/miniconda3/bin/doccano", line 8, in <module>
        sys.exit(main())
      File "/home/jeremy/miniconda3/lib/python3.9/site-packages/backend/cli.py", line 177, in main
        args.handler(args)
      File "/home/jeremy/miniconda3/lib/python3.9/site-packages/backend/cli.py", line 74, in command_db_init
        management.call_command("wait_for_db")
      File "/home/jeremy/miniconda3/lib/python3.9/site-packages/django/core/management/__init__.py", line 116, in call_command
        raise CommandError("Unknown command: %r" % command_name)
    django.core.management.base.CommandError: Unknown command: 'wait_for_db'
    

    Your Environment

    • Operating System: Linux WSL (5.10.16.3-microsoft-standard-WSL2) Ubuntu 20.04 LTS
    • Python Version Used: Python 3.9.12
    • How did you install doccano (Heroku button etc): pip install doccano
    opened by JeremyArancio 0
  • Doccano - Docker/Kubernetes

    Doccano - Docker/Kubernetes

    We have installed the Doccano image in the Kubernetes using the below steps: https://artifacthub.io/packages/helm/curie-df-helm-charts/doccano

    Doccano version in the above image doesn't have the latest tag. This is based on the UI differences I see when compared with the latest version installed directly on Docker.

    Do you have your own Kubernetes Doccano updated image which can be used.

    Any leads/ideas in this regard is much appreciated.

    opened by jairamjidgekar 1
  • Sequence Labelling for CSV File

    Sequence Labelling for CSV File

    I am working on a project to perform Sequence Labelling on CSV file.

    Unfortunately there is no option to import a CSV File.

    As a work around, I am able to import same CSV file as TXT file and work on the annotation.

    opened by jairamjidgekar 2
  • There is no way to export selected documents.

    There is no way to export selected documents.

    Feature request

    The only selective export is for approved documents or not. How can i only export selected documents. I could not find any way to do that.

    opened by jitinmakker 1
  • django.core.exceptions.ImproperlyConfigured: Error loading psycopg2 module: /lib64/libk5crypto.so.3: undefined symbol: EVP_KDF_ctrl, version OPENSSL_1_1_1b

    django.core.exceptions.ImproperlyConfigured: Error loading psycopg2 module: /lib64/libk5crypto.so.3: undefined symbol: EVP_KDF_ctrl, version OPENSSL_1_1_1b

    I use postgres to store data. When I is executing "doccano init", he error is as follows。

    File "", line 219, in _call_with_frames_removed File "/root/miniconda3/envs/doccano/lib/python3.8/site-packages/django/contrib/auth/models.py", line 3, in from django.contrib.auth.base_user import AbstractBaseUser, BaseUserManager File "/root/miniconda3/envs/doccano/lib/python3.8/site-packages/django/contrib/auth/base_user.py", line 49, in class AbstractBaseUser(models.Model): File "/root/miniconda3/envs/doccano/lib/python3.8/site-packages/django/db/models/base.py", line 141, in new new_class.add_to_class("_meta", Options(meta, app_label)) File "/root/miniconda3/envs/doccano/lib/python3.8/site-packages/django/db/models/base.py", line 369, in add_to_class value.contribute_to_class(cls, name) File "/root/miniconda3/envs/doccano/lib/python3.8/site-packages/django/db/models/options.py", line 231, in contribute_to_class self.db_table, connection.ops.max_name_length() File "/root/miniconda3/envs/doccano/lib/python3.8/site-packages/django/utils/connection.py", line 15, in getattr return getattr(self._connections[self._alias], item) File "/root/miniconda3/envs/doccano/lib/python3.8/site-packages/django/utils/connection.py", line 62, in getitem conn = self.create_connection(alias) File "/root/miniconda3/envs/doccano/lib/python3.8/site-packages/django/db/utils.py", line 193, in create_connection backend = load_backend(db["ENGINE"]) File "/root/miniconda3/envs/doccano/lib/python3.8/site-packages/django/db/utils.py", line 113, in load_backend return import_module("%s.base" % backend_name) File "/root/miniconda3/envs/doccano/lib/python3.8/importlib/init.py", line 127, in import_module return _bootstrap._gcd_import(name[level:], package, level) File "/root/miniconda3/envs/doccano/lib/python3.8/site-packages/django/db/backends/postgresql/base.py", line 28, in raise ImproperlyConfigured("Error loading psycopg2 module: %s" % e) django.core.exceptions.ImproperlyConfigured: Error loading psycopg2 module: /lib64/libk5crypto.so.3: undefined symbol: EVP_KDF_ctrl, version OPENSSL_1_1_1b

    opened by FortuneStar 2
Releases(v1.8.3)
  • v1.8.3(Dec 8, 2022)

    What's Changed

    • Enable to set CSRF_TRUSTED_ORIGINS in debug mode by @Hironsan in https://github.com/doccano/doccano/pull/2077
    • Enable to use shortcut keys for classification by @Hironsan in https://github.com/doccano/doccano/pull/2083
    • Update cli to avoid ValueError: path must be a filename, not a directory. by @Hironsan in https://github.com/doccano/doccano/pull/2089

    Full Changelog: https://github.com/doccano/doccano/compare/v1.8.2...v1.8.3

    Source code(tar.gz)
    Source code(zip)
  • v1.8.2(Nov 29, 2022)

    What's Changed

    • Fix installation issue by @Hironsan in https://github.com/doccano/doccano/pull/2068

    Full Changelog: https://github.com/doccano/doccano/compare/v1.8.1...v1.8.2

    Source code(tar.gz)
    Source code(zip)
  • v1.8.1(Nov 28, 2022)

    What's Changed

    • Bump django from 4.0.4 to 4.0.6 in /backend by @dependabot in https://github.com/doccano/doccano/pull/1889
    • Bump lxml from 4.7.1 to 4.9.1 in /backend by @dependabot in https://github.com/doccano/doccano/pull/1892
    • Enable to access to admin site on Heroku by @Hironsan in https://github.com/doccano/doccano/pull/1907
    • Bump django-celery-results from 2.2.0 to 2.4.0 in /backend by @dependabot in https://github.com/doccano/doccano/pull/1910
    • Upgrade v-annotator by @Hironsan in https://github.com/doccano/doccano/pull/1916
    • Update mkdocs config by @Hironsan in https://github.com/doccano/doccano/pull/1917
    • [Refactor] repositories in frontend by @Hironsan in https://github.com/doccano/doccano/pull/1919
    • Export annotator comment by @CaseyZhang11 in https://github.com/doccano/doccano/pull/1897
    • [Refactoring] frontend models by @Hironsan in https://github.com/doccano/doccano/pull/1926
    • fix: install guide not acessible in the documentation website. by @abdelhak2406 in https://github.com/doccano/doccano/pull/1927
    • Update buster to bullseye by @Hironsan in https://github.com/doccano/doccano/pull/1930
    • lang: better translation for french by @abdelhak2406 in https://github.com/doccano/doccano/pull/1934
    • Update RabbitMQ version by @Hironsan in https://github.com/doccano/doccano/pull/1936
    • docs: add code consistency commands and mention by @abdelhak2406 in https://github.com/doccano/doccano/pull/1939
    • lang: add new french translation and new variables by @abdelhak2406 in https://github.com/doccano/doccano/pull/1938
    • Set z-index to the header by @Hironsan in https://github.com/doccano/doccano/pull/1942
    • Bump django from 4.0.6 to 4.0.7 in /backend by @dependabot in https://github.com/doccano/doccano/pull/1947
    • Enable to set worker processes by environment variable by @Hironsan in https://github.com/doccano/doccano/pull/1953
    • Describe the environment variables by @Hironsan in https://github.com/doccano/doccano/pull/1954
    • Fix the example file content for relation extraction by @Hironsan in https://github.com/doccano/doccano/pull/1955
    • Update documents by @Hironsan in https://github.com/doccano/doccano/pull/1956
    • Fix issue with the favicon not displaying by @Hironsan in https://github.com/doccano/doccano/pull/1958
    • Show status on dataset page by @Hironsan in https://github.com/doccano/doccano/pull/1959
    • Remove t2.small option due to OOM by @Hironsan in https://github.com/doccano/doccano/pull/1963
    • [Enhancement] Add Flower to monitor and manage Celery tasks by @Hironsan in https://github.com/doccano/doccano/pull/1972
    • Add retry setting explicitly to celery tasks by @Hironsan in https://github.com/doccano/doccano/pull/1977
    • Update poetry version to v1.2.0 by @Hironsan in https://github.com/doccano/doccano/pull/1982
    • Enable to order projects by created_at, author, name, and project type by @Hironsan in https://github.com/doccano/doccano/pull/1985
    • Fix nuxt build problem by @Hironsan in https://github.com/doccano/doccano/pull/1986
    • Add workflow to test installation and commands by @Hironsan in https://github.com/doccano/doccano/pull/1988
    • Warn if default password is not changed by @Hironsan in https://github.com/doccano/doccano/pull/1989
    • Update publish-image workflow by @Hironsan in https://github.com/doccano/doccano/pull/1991
    • [Enhancement] Use pre-build images to avoid building images in docker-compose by @Hironsan in https://github.com/doccano/doccano/pull/1992
    • Reduce the image size(doccano:backend) by @Hironsan in https://github.com/doccano/doccano/pull/1994
    • Specify platform on publish-image workflow by @Hironsan in https://github.com/doccano/doccano/pull/1995
    • Add a test case of auto labeling for nonexistent category by @Hironsan in https://github.com/doccano/doccano/pull/1996
    • Describe how to use custom rest api by @Hironsan in https://github.com/doccano/doccano/pull/1997
    • Handle JSONDecodeError on testing auto labeling API by @Hironsan in https://github.com/doccano/doccano/pull/1999
    • Improve contribution guide by @Hironsan in https://github.com/doccano/doccano/pull/2000
    • fix typo by @bsenst in https://github.com/doccano/doccano/pull/2001
    • [Enhancement] setting logging by @Hironsan in https://github.com/doccano/doccano/pull/2003
    • Update yarn install line by @Hironsan in https://github.com/doccano/doccano/pull/2004
    • Bump vuetify from 2.6.4 to 2.6.10 in /frontend by @dependabot in https://github.com/doccano/doccano/pull/2009
    • Bump django from 4.0.7 to 4.0.8 in /backend by @dependabot in https://github.com/doccano/doccano/pull/2026
    • Update python packages by @Hironsan in https://github.com/doccano/doccano/pull/2027
    • [Enhancement] order by score by @Hironsan in https://github.com/doccano/doccano/pull/2028
    • Add user creation API by @Hironsan in https://github.com/doccano/doccano/pull/2031
    • Make auto labeling working in intent detection by @Hironsan in https://github.com/doccano/doccano/pull/2053
    • Enable to expand/collapse label types card by @Hironsan in https://github.com/doccano/doccano/pull/2054
    • Trigger auto labeling automatically by @Hironsan in https://github.com/doccano/doccano/pull/2056

    New Contributors

    • @CaseyZhang11 made their first contribution in https://github.com/doccano/doccano/pull/1897
    • @abdelhak2406 made their first contribution in https://github.com/doccano/doccano/pull/1927
    • @bsenst made their first contribution in https://github.com/doccano/doccano/pull/2001

    Full Changelog: https://github.com/doccano/doccano/compare/v1.8.0...v1.8.1

    Source code(tar.gz)
    Source code(zip)
  • v1.8.0(Jul 11, 2022)

    What's Changed

    • Add image tasks by @Hironsan in https://github.com/doccano/doccano/pull/1899
      • Image captioning
      • Segmentation
      • Object detection
    • [Enhancement] documentation by @Hironsan in https://github.com/doccano/doccano/pull/1830
    • [Refactoring] data import by @Hironsan in https://github.com/doccano/doccano/pull/1832
    • Fix the position of the toolbar by @Hironsan in https://github.com/doccano/doccano/pull/1837
    • [Enhancement] Apply prettier to frontend and add it to workflow by @Hironsan in https://github.com/doccano/doccano/pull/1839
    • Enable to specify the number of thread in windows by @Hironsan in https://github.com/doccano/doccano/pull/1842
    • [Enhancement] keyboard shortcut by @Hironsan in https://github.com/doccano/doccano/pull/1843
    • [Enhancement] refactor data import by @Hironsan in https://github.com/doccano/doccano/pull/1845
    • Bump waitress from 2.1.1 to 2.1.2 in /backend by @dependabot in https://github.com/doccano/doccano/pull/1853
    • fix de, fr and zh messages by @morioka in https://github.com/doccano/doccano/pull/1869
    • Fixed #1816 by @morioka in https://github.com/doccano/doccano/pull/1866
    • update docker-compose: set nginx image name to doccano_nginx:prod by @zgfh in https://github.com/doccano/doccano/pull/1894

    New Contributors

    • @morioka made their first contribution in https://github.com/doccano/doccano/pull/1869
    • @zgfh made their first contribution in https://github.com/doccano/doccano/pull/1894

    Full Changelog: https://github.com/doccano/doccano/compare/v1.7.0...v1.8.0

    Source code(tar.gz)
    Source code(zip)
  • v1.7.0(May 19, 2022)

    What's Changed

    • Update LabelingMenu.vue by @Hironsan in https://github.com/doccano/doccano/pull/1738
    • Replace git:// with https:// by @Hironsan in https://github.com/doccano/doccano/pull/1739
    • fix: Replace git:// with https:// in yarn.lock by @WojciechKusa in https://github.com/doccano/doccano/pull/1742
    • Show id in database page by @Hironsan in https://github.com/doccano/doccano/pull/1743
    • Support workers option, by @Hironsan in https://github.com/doccano/doccano/pull/1744
    • Bump waitress from 2.0.0 to 2.1.1 in /backend by @dependabot in https://github.com/doccano/doccano/pull/1747
    • Update minimist version by @Hironsan in https://github.com/doccano/doccano/pull/1748
    • added encoding format while opening of file by @dhirajsuvarna in https://github.com/doccano/doccano/pull/1754
    • add confirmed_at field to ExampleStateSerializer by @federicopisanu in https://github.com/doccano/doccano/pull/1766
    • fix empty export in entity-relationship-labeling by @mkmark in https://github.com/doccano/doccano/pull/1764
    • Set nounset on Bash scripts (fixes #860) by @kuraga in https://github.com/doccano/doccano/pull/1760
    • show total progress if collaborative_annotation by @mkmark in https://github.com/doccano/doccano/pull/1770
    • [Feature] cloud upload by @Hironsan in https://github.com/doccano/doccano/pull/1779
    • Add tmp_file as a volume by @Hironsan in https://github.com/doccano/doccano/pull/1781
    • Fix dataset export problem by @Hironsan in https://github.com/doccano/doccano/pull/1783
    • Enhancement/dataset export by @Hironsan in https://github.com/doccano/doccano/pull/1799
    • Bump django from 4.0.2 to 4.0.4 in /backend by @dependabot in https://github.com/doccano/doccano/pull/1798
    • Enhancement/remove unused files by @Hironsan in https://github.com/doccano/doccano/pull/1801
    • [Enhancement] frontend linter by @Hironsan in https://github.com/doccano/doccano/pull/1802
    • Add hadolint as a job by @Hironsan in https://github.com/doccano/doccano/pull/1803
    • [Enhancement] relation import by @Hironsan in https://github.com/doccano/doccano/pull/1823
    • Fix relation export by @Hironsan in https://github.com/doccano/doccano/pull/1826
    • [Enhancement] pypi workflow by @Hironsan in https://github.com/doccano/doccano/pull/1827

    New Contributors

    • @WojciechKusa made their first contribution in https://github.com/doccano/doccano/pull/1742
    • @dhirajsuvarna made their first contribution in https://github.com/doccano/doccano/pull/1754
    • @federicopisanu made their first contribution in https://github.com/doccano/doccano/pull/1766
    • @mkmark made their first contribution in https://github.com/doccano/doccano/pull/1764

    Full Changelog: https://github.com/doccano/doccano/compare/v1.6.2...v1.7.0

    Source code(tar.gz)
    Source code(zip)
  • v1.6.2(Mar 14, 2022)

    What's Changed

    • Enable to view dataset page for project staffs by @Hironsan in https://github.com/doccano/doccano/pull/1733

    Full Changelog: https://github.com/doccano/doccano/compare/v1.6.1...v1.6.2

    Source code(tar.gz)
    Source code(zip)
  • v1.6.1(Mar 14, 2022)

    What's Changed

    • Upgrade npm packages by @Hironsan in https://github.com/doccano/doccano/pull/1730
    • Fix project update by @Hironsan in https://github.com/doccano/doccano/pull/1731

    Full Changelog: https://github.com/doccano/doccano/compare/v1.6.0...v1.6.1

    Source code(tar.gz)
    Source code(zip)
  • v1.6.0(Mar 11, 2022)

    Pickup

    • Refactoring project structure
    • #1703 Relation labeling with sequence labeling
    • #1619 Text classification and named entity recognition at the same time

    What's Changed

    • Speed up fetching comment by @Hironsan in https://github.com/doccano/doccano/pull/1585
    • Speed up statistics API by @Hironsan in https://github.com/doccano/doccano/pull/1587
    • Add document for auto labeling by @Hironsan in https://github.com/doccano/doccano/pull/1590
    • [Enhancement] Refactor Annotation API by @Hironsan in https://github.com/doccano/doccano/pull/1592
    • [Add a new task] Text classification and named entity recognition at the same time by @Hironsan in https://github.com/doccano/doccano/pull/1619
    • [Enhancement] Separate applications by @Hironsan in https://github.com/doccano/doccano/pull/1627
    • [Refactoring] move files by @Hironsan in https://github.com/doccano/doccano/pull/1631
    • [Enhancement] refactor project model and serializer by @Hironsan in https://github.com/doccano/doccano/pull/1632
    • Update django and celery by @Hironsan in https://github.com/doccano/doccano/pull/1633
    • Upgrade node packages by @Hironsan in https://github.com/doccano/doccano/pull/1634
    • Simplify frontend model as many as possible by @Hironsan in https://github.com/doccano/doccano/pull/1635
    • [Enhancement] create metrics app by @Hironsan in https://github.com/doccano/doccano/pull/1636
    • Rename tasks to celery_tasks by @Hironsan in https://github.com/doccano/doccano/pull/1637
    • Remove DocumentAPI by @Hironsan in https://github.com/doccano/doccano/pull/1638
    • [Enhancement]Add user app by @Hironsan in https://github.com/doccano/doccano/pull/1639
    • [Enhancement]Separate data import app by @Hironsan in https://github.com/doccano/doccano/pull/1640
    • Separate data export app by @Hironsan in https://github.com/doccano/doccano/pull/1641
    • [Enhancement]Remove server and authentication app by @Hironsan in https://github.com/doccano/doccano/pull/1642
    • [Enhancement]Remove users from project by @Hironsan in https://github.com/doccano/doccano/pull/1644
    • [Enhancement] Separate auto labeling app by @Hironsan in https://github.com/doccano/doccano/pull/1646
    • Remove unused Approve API by @Hironsan in https://github.com/doccano/doccano/pull/1647
    • [Enhancement] Remove unnecessary CommentListDoc API by @Hironsan in https://github.com/doccano/doccano/pull/1649
    • [Enhancement] Auto labeling by @Hironsan in https://github.com/doccano/doccano/pull/1650
    • [Enhancement] Separate label app by @Hironsan in https://github.com/doccano/doccano/pull/1652
    • Update clean method in Member by @Hironsan in https://github.com/doccano/doccano/pull/1654
    • [Enhancement] Separate label type app by @Hironsan in https://github.com/doccano/doccano/pull/1655
    • [Enhancement] Separate example app by @Hironsan in https://github.com/doccano/doccano/pull/1656
    • [Enhancement] Use django-health-check instead of own code by @Hironsan in https://github.com/doccano/doccano/pull/1657
    • Remove IsStaff permission by @Hironsan in https://github.com/doccano/doccano/pull/1658
    • Remove unused code and data by @Hironsan in https://github.com/doccano/doccano/pull/1659
    • [Enhancement] Move cloud templates by @Hironsan in https://github.com/doccano/doccano/pull/1660
    • Change permission names by @Hironsan in https://github.com/doccano/doccano/pull/1661
    • Improve test helper by @Hironsan in https://github.com/doccano/doccano/pull/1662
    • Fix #1648 by @Hironsan in https://github.com/doccano/doccano/pull/1664
    • Fix image import by @Hironsan in https://github.com/doccano/doccano/pull/1665
    • Enable to show images and audios in the case of pip and Docker by @Hironsan in https://github.com/doccano/doccano/pull/1667
    • Enable to paginate projects by @Hironsan in https://github.com/doccano/doccano/pull/1668
    • Fix a migration by @Hironsan in https://github.com/doccano/doccano/pull/1670
    • [Enhancement] Extract project app by @Hironsan in https://github.com/doccano/doccano/pull/1671
    • [Enhancement] Include black to workflow by @Hironsan in https://github.com/doccano/doccano/pull/1674
    • [Enhancement] Add mypy to the workflow by @Hironsan in https://github.com/doccano/doccano/pull/1675
    • [Enhancement] Add new labeling option for sequence labeling by @Hironsan in https://github.com/doccano/doccano/pull/1678
    • [Enhancement] Show progress by @Hironsan in https://github.com/doccano/doccano/pull/1679
    • [Enhancement] Label creation process by @Hironsan in https://github.com/doccano/doccano/pull/1680
    • [Bugfix] Update label types after save and add another by @Hironsan in https://github.com/doccano/doccano/pull/1681
    • Update Dockerfile by @Hironsan in https://github.com/doccano/doccano/pull/1682
    • Rename directory name from app to config by @Hironsan in https://github.com/doccano/doccano/pull/1683
    • [Enhancement] Split setting files by @Hironsan in https://github.com/doccano/doccano/pull/1684
    • Simplify and correct use instructions [ci skip] by @kuraga in https://github.com/doccano/doccano/pull/1677
    • [Enhancement] Separate dockerfiles by @Hironsan in https://github.com/doccano/doccano/pull/1686
    • Fix the path of docker-compose.prod.yml in CloudFormation template by @Hironsan in https://github.com/doccano/doccano/pull/1687
    • [Enhancement] Replace Pipenv with Poetry by @Hironsan in https://github.com/doccano/doccano/pull/1689
    • [Enhancement] mkdocs setting to docs by @Hironsan in https://github.com/doccano/doccano/pull/1690
    • Remove unused package.json by @Hironsan in https://github.com/doccano/doccano/pull/1691
    • Fix mkdocs workflow to use the same directory by @Hironsan in https://github.com/doccano/doccano/pull/1693
    • Fix project migrations by @Hironsan in https://github.com/doccano/doccano/pull/1694
    • [Enhancement] Add label type import page by @Hironsan in https://github.com/doccano/doccano/pull/1695
    • Add create-package.sh by @Hironsan in https://github.com/doccano/doccano/pull/1696
    • Replace subprocess with call_command by @Hironsan in https://github.com/doccano/doccano/pull/1697
    • Add migrate command to cli by @Hironsan in https://github.com/doccano/doccano/pull/1698
    • [Enhancement] documentation by @Hironsan in https://github.com/doccano/doccano/pull/1699
    • Relation labeling with sequence labeling by @Hironsan in https://github.com/doccano/doccano/pull/1703
    • Update docker compose instruction by @Hironsan in https://github.com/doccano/doccano/pull/1709
    • Update default database location in cli by @Hironsan in https://github.com/doccano/doccano/pull/1710
    • Fix state of label selection on LabelingMenu for sequenceLabeling by @jhoobergs in https://github.com/doccano/doccano/pull/1712
    • [Enhancement] Remove duplication on distribution component by @Hironsan in https://github.com/doccano/doccano/pull/1713
    • [Enhancement] Dataset import page by @Hironsan in https://github.com/doccano/doccano/pull/1714
    • Fix migration when adding uuids by @rolisz in https://github.com/doccano/doccano/pull/1716
    • [Enhancement] Fix minor code issues by @Hironsan in https://github.com/doccano/doccano/pull/1717
    • Remove duplicated zh locale by @Hironsan in https://github.com/doccano/doccano/pull/1718
    • [Enhancement] data export page by @Hironsan in https://github.com/doccano/doccano/pull/1719
    • Update tutorial document by @Hironsan in https://github.com/doccano/doccano/pull/1721
    • Bug fix: Specify project when retrieving labels in celery task by @youichiro in https://github.com/doccano/doccano/pull/1723
    • Output log from gunicorn by @Hironsan in https://github.com/doccano/doccano/pull/1727
    • [Enhancement] project creation page by @Hironsan in https://github.com/doccano/doccano/pull/1728

    New Contributors

    • @jhoobergs made their first contribution in https://github.com/doccano/doccano/pull/1712

    Full Changelog: https://github.com/doccano/doccano/compare/v1.5.5...v1.6.0

    Source code(tar.gz)
    Source code(zip)
  • v1.5.5(Dec 3, 2021)

    What's Changed

    • [Hotfix] Update unique constraint of Span by @Hironsan in https://github.com/doccano/doccano/pull/1583
    • Add an autocomplete in annotation menu by @Hironsan in https://github.com/doccano/doccano/pull/1581
    • Update Dockerfile to change the default value of DEBUG by @Hironsan in https://github.com/doccano/doccano/pull/1582
    • Show number of deleting rows only in confirm dialog by @Hironsan in https://github.com/doccano/doccano/pull/1584

    Full Changelog: https://github.com/doccano/doccano/compare/v1.5.4...v1.5.5

    Source code(tar.gz)
    Source code(zip)
  • v1.5.4(Dec 1, 2021)

    What's Changed

    • [fix] Update unique constraint by @Hironsan in https://github.com/doccano/doccano/pull/1580

    Full Changelog: https://github.com/doccano/doccano/compare/v1.5.3...v1.5.4

    Source code(tar.gz)
    Source code(zip)
  • v1.5.3(Nov 30, 2021)

    What's Changed

    • Specify image names in compose files by @Hironsan in https://github.com/doccano/doccano/pull/1554
    • Update CsvWriter to fix repeated metadata column when exported as csv by @Hironsan in https://github.com/doccano/doccano/pull/1557
    • Sort exported labels by @Hironsan in https://github.com/doccano/doccano/pull/1558
    • Add how to use PostgreSQL by @Hironsan in https://github.com/doccano/doccano/pull/1563
    • Assign label colors automatically by @Hironsan in https://github.com/doccano/doccano/pull/1564
    • Fix typo: injest -> ingest by @Hironsan in https://github.com/doccano/doccano/pull/1565
    • Add keyboard shortcut back to accept button by @FynnSu in https://github.com/doccano/doccano/pull/1562
    • Add PostgreSQL related env in docker compose mode by @YuMS in https://github.com/doccano/doccano/pull/1566
    • Enable to delete relation if one of the entities are deleted in demo NER by @Hironsan in https://github.com/doccano/doccano/pull/1570
    • Improve error messages when uplading dataset by @Hironsan in https://github.com/doccano/doccano/pull/1568
    • Support offline environment by @Hironsan in https://github.com/doccano/doccano/pull/1575
    • Update publish-image.yml by @Hironsan in https://github.com/doccano/doccano/pull/1578

    New Contributors

    • @FynnSu made their first contribution in https://github.com/doccano/doccano/pull/1562
    • @YuMS made their first contribution in https://github.com/doccano/doccano/pull/1566

    Full Changelog: https://github.com/doccano/doccano/compare/v1.5.2...v1.5.3

    Source code(tar.gz)
    Source code(zip)
  • v1.5.2(Nov 9, 2021)

    What's Changed

    • Revert primary key change by @Hironsan in https://github.com/doccano/doccano/pull/1550
    • Support limited import/export on windows by @Hironsan in https://github.com/doccano/doccano/pull/1552

    Full Changelog: https://github.com/doccano/doccano/compare/v1.5.1...v1.5.2

    Source code(tar.gz)
    Source code(zip)
  • v1.5.1(Nov 5, 2021)

    What's Changed

    • Add chardet as a dependency by @Hironsan in https://github.com/doccano/doccano/pull/1530
    • Update cli by @Hironsan in https://github.com/doccano/doccano/pull/1531
    • Add explanation on create user by @Hironsan in https://github.com/doccano/doccano/pull/1533
    • Add how to change password section to FAQ by @Hironsan in https://github.com/doccano/doccano/pull/1538
    • Display guideline by @Hironsan in https://github.com/doccano/doccano/pull/1542
    • Update docker-compose.dev.yml by @Hironsan in https://github.com/doccano/doccano/pull/1543
    • Preserve the order of imported data and show pre-labeled data by @Hironsan in https://github.com/doccano/doccano/pull/1544

    Full Changelog: https://github.com/doccano/doccano/compare/v1.5.0...v1.5.1

    Source code(tar.gz)
    Source code(zip)
  • v1.5.0(Oct 25, 2021)

    Pickup

    • #1511 Support nested named entity recognition
    • #1414 Fix pip installation on Windows

    What's Changed

    • Fix docker-compose.prod.yml by @Hironsan in https://github.com/doccano/doccano/pull/1379
    • Fix permission for /backend by @koxx009 in https://github.com/doccano/doccano/pull/1382
    • Relations annotation added to SequenceLabeling task by @DavideAllavena in https://github.com/doccano/doccano/pull/1384
    • Update Dockerfiles by @Hironsan in https://github.com/doccano/doccano/pull/1391
    • Feature/speech2text by @Hironsan in https://github.com/doccano/doccano/pull/1395
    • Remove t2.micro from cloudformation template by @Hironsan in https://github.com/doccano/doccano/pull/1396
    • Hide label and relation from the sidebar by @Hironsan in https://github.com/doccano/doccano/pull/1400
    • [Enhancement] Add state model by @Hironsan in https://github.com/doccano/doccano/pull/1402
    • [Enhancement] Support image and speech auto labeling by @Hironsan in https://github.com/doccano/doccano/pull/1413
    • Fix typo of exception name by @youichiro in https://github.com/doccano/doccano/pull/1449
    • Fix: Show all Project Types (de, fr, zh) in Add Project Interface by @ghontolux in https://github.com/doccano/doccano/pull/1458
    • Restrict project creation to staff users by @youichiro in https://github.com/doccano/doccano/pull/1453
    • put variables to .env file by @scheiblr in https://github.com/doccano/doccano/pull/1412
    • support image display and url handling by @SauceCat in https://github.com/doccano/doccano/pull/1430
    • Display dataset page for all user by @youichiro in https://github.com/doccano/doccano/pull/1504
    • move later to copy files in Dockerfile.prod by @youichiro in https://github.com/doccano/doccano/pull/1431
    • Add admin interface for AutoLabelingConfigs. Solves #1423 by @rolisz in https://github.com/doccano/doccano/pull/1438
    • Return a Response with a status if the task is not yet ready. by @rolisz in https://github.com/doccano/doccano/pull/1440
    • Bump django from 3.2.4 to 3.2.5 by @dependabot in https://github.com/doccano/doccano/pull/1503
    • iss1348: fix colors when importing labels by @dimidloc in https://github.com/doccano/doccano/pull/1459
    • Sequence labelling: fix background color in dark mode by @colinux in https://github.com/doccano/doccano/pull/1474
    • Increase the allowed max length for uploaded dataset filepath by @YosuaMichael in https://github.com/doccano/doccano/pull/1500
    • Fixing Data Annotation Issues by @zanussbaum in https://github.com/doccano/doccano/pull/1432
    • update cloudformation template to modify the sample env file by @daleevans in https://github.com/doccano/doccano/pull/1469
    • Updated various dependency and image versions by @RSCArt in https://github.com/doccano/doccano/pull/1462
    • Feature/annotate from comment list by @ayanamizuta in https://github.com/doccano/doccano/pull/1433
    • Remove unneeded query by @rolisz in https://github.com/doccano/doccano/pull/1442
    • Display a check button even for annotators by @youichiro in https://github.com/doccano/doccano/pull/1473
    • Update progress graph of statistics by @youichiro in https://github.com/doccano/doccano/pull/1495
    • Fix CI problem by @Hironsan in https://github.com/doccano/doccano/pull/1520
    • Changed the title from Sentiment Analysis to Text Classification by @oztalha in https://github.com/doccano/doccano/pull/1521
    • Support nested named entity recognition, relation labeling, RTL language, improve speed and grapheme mode by @Hironsan in https://github.com/doccano/doccano/pull/1511
    • Support doccano init on windows by @Hironsan in https://github.com/doccano/doccano/pull/1529

    New Contributors

    • @koxx009 made their first contribution in https://github.com/doccano/doccano/pull/1382
    • @DavideAllavena made their first contribution in https://github.com/doccano/doccano/pull/1384
    • @youichiro made their first contribution in https://github.com/doccano/doccano/pull/1449
    • @ghontolux made their first contribution in https://github.com/doccano/doccano/pull/1458
    • @scheiblr made their first contribution in https://github.com/doccano/doccano/pull/1412
    • @SauceCat made their first contribution in https://github.com/doccano/doccano/pull/1430
    • @dimidloc made their first contribution in https://github.com/doccano/doccano/pull/1459
    • @colinux made their first contribution in https://github.com/doccano/doccano/pull/1474
    • @YosuaMichael made their first contribution in https://github.com/doccano/doccano/pull/1500
    • @zanussbaum made their first contribution in https://github.com/doccano/doccano/pull/1432
    • @daleevans made their first contribution in https://github.com/doccano/doccano/pull/1469
    • @RSCArt made their first contribution in https://github.com/doccano/doccano/pull/1462
    • @ayanamizuta made their first contribution in https://github.com/doccano/doccano/pull/1433
    • @oztalha made their first contribution in https://github.com/doccano/doccano/pull/1521

    Full Changelog: https://github.com/doccano/doccano/compare/v1.4.1...v1.5.0

    Source code(tar.gz)
    Source code(zip)
  • v1.4.1(May 25, 2021)

  • v1.4.0(May 18, 2021)

    Feature

    • https://github.com/doccano/doccano/pull/1370 Add a new task: image classification

    Fix

    • https://github.com/doccano/doccano/pull/1368 Fix auto labeling function
    • https://github.com/doccano/doccano/pull/1340 Fix download dataset
    • https://github.com/doccano/doccano/pull/1358 Update documentation
    • https://github.com/doccano/doccano/pull/1349 Improve test code
    • https://github.com/doccano/doccano/pull/1367 Security update
    Source code(tar.gz)
    Source code(zip)
  • v1.3.1(Apr 26, 2021)

    Small updates

    • https://github.com/doccano/doccano/pull/1333 Update pagination
    • https://github.com/doccano/doccano/pull/1331 Replace PostgreSQL URL for Heroku
    • https://github.com/doccano/doccano/pull/1325 Show new lines in a text classification project
    • https://github.com/doccano/doccano/pull/1324 Fix rows per page display error
    • https://github.com/doccano/doccano/pull/1323 Fix pip usage documentation
    • Fix infinite loop if the specified encoding is Auto and the file is small
    • Hide shortcut circle if suffix key is empty
    • Update demo page's metadata
    Source code(tar.gz)
    Source code(zip)
  • v1.3.0(Apr 21, 2021)

    Mainly improve upload and download features:

    Upload

    • [x] Support a large file upload
    • [x] Support a folder upload
    • [x] Support multiple files upload
    • [x] Support asynchronous upload
    • [x] Enable to show upload progress

    Ingestion

    • [x] Support batch import to speed up the process
    • [x] Support file validation
    • [x] Import as much of the file's content as possible
    • [x] Provide feedback on which file and what line is wrong and why.
    • [x] Support 90 over encodings
    • [x] Support auto encoding detection
    • [x] Support saving filename

    File format

    • [x] Expand the available formats
    • [x] Support for specifying columns for labels and text
    • [x] Support for specifying schema of CoNLL(IOB2, IOE2, IOBES, BILOU)
    • [x] Support for specifying separator(CSV, CoNLL)

    Download

    • [x] Support a large file download
    • [x] Support asynchronous download
    • [x] Support zip download
    • [x] Support JSON download

    Others

    • [x] Support project tags
    Source code(tar.gz)
    Source code(zip)
  • v1.2.4(Mar 30, 2021)

  • v1.2.3(Mar 30, 2021)

    Bugfix

    • #1274 Replace TokenAuthentication with SessionAuthentication
    • #1271 Update libpq-dev version to avoid build failure
    • #1260 Add v-shortkey to text classification page

    Enhancement

    • #1277 Update getting-started.md
    • #1263 Display shortcut keys
    • #1261 Support single label classification
    • #1253 Change the limit length of label name from 30 to 100 characters
    Source code(tar.gz)
    Source code(zip)
  • v1.2.2(Mar 16, 2021)

  • v1.2.1(Feb 19, 2021)

  • v1.2.0(Feb 19, 2021)

    Support Auto Labeling

    This PR allows users to label text automatically. I think this enables users to speed up annotation.

    How it works

    This feature enables automatic labeling by calling the Web API from doccano. Therefore, you can use any commercial service (e.g. Google Natural Language API, Amazon Comprehend, Watson, etc.) or your own server for labeling, as long as you can call the API from doccano. Notice that there is no learning function for now. This is an issue for the future.

    How to use

    Configuration

    1. Select "Settings" on the side menu.

    image

    1. Select "Auto Labeling" tab and press "Create" button.

    image

    1. Select a configuration template. Some tasks have predefined templates to simplify the configuration.

    image

    1. Enter the parameters required to use the API.

    image

    1. Write a mapping template to extract labels from API responses. If you select the predefined template, you can skip this process.

    image

    1. Map the label fetched from the API to a label defined by you.

    image

    Turn on the feature

    1. Go to the annotation page.
    2. Select the settings button.
    3. Turn on the feature.

    Notice that if you don't have any configuration, you can't use this feature.

    Future works

    • Assigning a configuration to each user
    • Allow the admin to set throttling for each user
    • Allow merging of responses from multiple APIs
    • Implementing the training feature
    • Increase the number of the predefined templates(https://github.com/doccano/auto-labeling-pipeline)

    close #191

    Source code(tar.gz)
    Source code(zip)
  • v1.1.1(Jan 27, 2021)

  • v1.1.0(Jan 26, 2021)

    Enhancements

    Improvements on import/export

    • #1175 Increase max upload size to 100MB
    • #1163 Support bulk file import
    • #1156 Support fastText format export
    • #1138 Add option to export only approved datasets
    • #1073 FastText import functionality
    • #861 Feature/import unlabeled dataset
    • #1116 Enhancement/dynamic filename + changed json1 naming to jsonl

    Improvements in environment

    • #1147 Switch requirements.txt to Pipfile for Docker and Docker Compose
    • #1088 Dockerfile supports doccano v1.x
    • #1090 Support pip Installation
    • #1110 Improve build speed on production compose
    • #1120 Add GitHub actions for unit testing
    • #1121 Add a workflow to deploy mkdocs
    • #1127 Proper tagging of docker containers
    • #1122 Remove root requirements.txt to fix mkdocs
    • #987 Bring both run.sh scripts to parity
    • #1172 Add documentation for API testing

    Enhancements on multiple languages support

    • #1167 Support German UI
    • #1039 Support Chinese UI
    • #976 Support multiple language UI

    Enhancements on annotation

    • #1155 Support comment feature
    • #1115 Clear annotations and bulk delete functionality
    • #1071 Show only available shortcut keys
    • #981 Feature/first last page buttons
    • #980 Adding keyboard shortcut for approval button
    • #932 Hide check button from annotators

    Others

    • #945 Enable project delete button only for project administrators
    • #973 Move style tag to editor.css
    • #960 Adding style tag to frontend/layouts/project.vue
    • #964 Add timestamps to annotation serializers

    Bug fixes

    • #1177 Fix a method that counts the number of characters
    • #1164 Fix Heroku deployment
    • #1161 Fix 500 error after creating a project
    • #1143 Update settings and move commands
    • #1128 Fix python3 compatibility problem
    • #1092 Fix backend container fail
    • #1100 Fix pg_config error on pip installation
    • #1104 Fix one click deployment for GCP Cloud Run
    • #1108 Fix Invalid syntax error
    • #1065 Fix: AnnotateButton now links to the right docs
    • #1072 Deduplicate data on randomized order option
    • #1075 Enable to login on remote machine
    • #1047 Update nginx config to access swagger in production
    • #1051 Clear text in label selector after typing
    • #1064 Fix approve annotation documents function
    • #978 Update cloudformation template
    • #1007 Update run.sh
    • #1033 Fix Create project failed with error message "Bad Request"
    Source code(tar.gz)
    Source code(zip)
  • v1.0.5(Jul 22, 2020)

    Enhancements, Bug Fixes, and Other Changes

    Enhancements

    Update packages

    • #821 Update Django
    • #823 Update websocket-extensions
    • #898 Update lodash
    • #909 Replace django-pyodbc-azure with django-mssql-backend
    • #677 Fix mkdocs and mkdocs-material version

    Improve the documentation

    • #729 Update FAQ to include how to logout
    • #779 Improve installation documentation
    • #742 Update rebuild FAQ
    • #832 FAQ correction
    • #878 Rewrite FAQ about changing port number
    • #912 Add link to the documentation
    • #684 Clean document structure
    • #780 Update project_structure.md
    • #781 Adding annotators documentation improvement

    Improve annotation

    • #489 Single class classification
    • #495 Speech to text transcription
    • #817 Display newlines for sequence-labeling projects in old frontend
    • #908 Keep the new line in the text classification
    • #818 Show thumbs up icon for doc approved status on the left side view
    • #766 Change of permission for annotation edits

    Improve a project page

    • #678 Updated statistics page user data graph
    • #671 Clarified input format
    • #872 Feature/excel conll import
    • #899 Highlight current page in the side bar
    • #911 Update error message on login failure
    • #838 Improve home page text
    • #904 Fix missing icon

    Improve setup and middleware

    • #301 Added middleware that enables automatic login based on HTTP headers
    • #795 Use Alpine as root image
    • #791 Remove legacy endpoint
    • #812 Dockerfile cleanup
    • #801 Add simple /v1/health endpoint
    • #774 Added flag for allowing downgrades while using apt
    • #673 Remove middleware duplication
    • #843 Turn off caching on apk and pip
    • #787 Avoid username is taken
    • #862 Make google-analytics configurable

    Bug fixes

    • #841 Fix aws deploy
    • #855 Fix incorrect current user name
    • #885 Fix choosing filter twice results in 404 error
    • #897 Fix pagination and transition issues
    • #903 Fix 400 error on text classification
    • #863 Fix error on Django Admin search
    • #864 Fix problem to reset label color to #FF0000
    • #865 Fix unworked cancel button in label page
    • #900 Enable to close a dialog
    • #916 Allow annotator to update/delete other's annotations
    • #918 Fix ApproveLabelsAPI not to show other's annotations if it's checked
    • #680 Added username in localstorage
    • #693 Add CORS whitelist
    Source code(tar.gz)
    Source code(zip)
  • v1.0.4(Apr 6, 2020)

    Bug Fixes, Enhancements, and Other Changes

    Bug fixes

    • #513 Enable to display non-ASCII metadata
    • #518 Enable to redirect to root page
    • #587 Fix the annotation toolbar vanishment
    • #530 Fix filter initial state
    • #545, #613 Fix login/logout problem

    Enhancements

    • #604 Enable to show the current user name
    Source code(tar.gz)
    Source code(zip)
  • v1.0.3(Mar 31, 2020)

    Bug Fixes, Enhancements, and Other Changes

    Bug fixes

    • #654 Enable to handle multiline document in sequence labeling
    • #640 Enable to handle real enter event
    • #611 Fix broken links in tutorial.md
    • #607 Fix duplicate key error
    • #532 Fix flags set on project creation are discarded
    • #539 Fix build failure
    • #527 Fix collaborative annotation for stats and filter

    Enhancements

    • #647 Add API document generator
    • #558 Add a docker-compose file for development
    • #569 Add project structure documentation
    • #474 Avoid redirects with custom port dropped
    • #549 Disable left/right shortcut key in input and textarea
    • #480 Enable to move any page
    • #651 Enable to show error messages on importing labels
    • #606 Enable to show menu by default
    • #603 Enable to use of digits as shortcut keys
    • #535 Enable to import of CSVs with arbitrary column order
    • #633 Update nginx.conf to enhance security
    • #504 Verification emails on Azure

    Other changes

    • #602 Update package.json
    • #582 Update repository name
    • #580 Update libcurl3 version
    • #567 Update webpack version
    Source code(tar.gz)
    Source code(zip)
  • v1.0.2(Nov 29, 2019)

    Bug Fixes and Other Changes

    • https://github.com/chakki-works/doccano/pull/441 Add volume for SQLite DB in doccano container
    • https://github.com/chakki-works/doccano/pull/473 Fix error on first annotation page open
    • https://github.com/chakki-works/doccano/pull/470 Fix error on creating a label with an empty shortcut key
    • https://github.com/chakki-works/doccano/pull/456 Fix error on shortcut key in text classification
    • https://github.com/chakki-works/doccano/pull/465 Remove unused dependencies
    • https://github.com/chakki-works/doccano/pull/422 Supports MySQL
    Source code(tar.gz)
    Source code(zip)
  • v1.0.1(Nov 22, 2019)

    Bug Fixes and Other Changes

    • Adds X-Frame-Options header to prevent clickjacking.
    • Adds DEBUG=False to docker-compose.prod.yml.
    • Updates Docker Compose file version to 3.7.
    • Updates README.md for v1.0.0
    • https://github.com/chakki-works/doccano/pull/350 Adds settings required for SSL.
    • https://github.com/chakki-works/doccano/pull/373 Adds linter for Dockerfile.
    • https://github.com/chakki-works/doccano/pull/388 Adds GCP Run option as one-click deployment.
    • https://github.com/chakki-works/doccano/pull/418 Adds label nonfield error messages.
    • https://github.com/chakki-works/doccano/pull/443 Adds label import/export features.
    • https://github.com/chakki-works/doccano/pull/363 Configure DJANGO_SETTINGS_MODULE properly.
    • https://github.com/chakki-works/doccano/pull/319 Enable sorting of documents in frontend.
    • https://github.com/chakki-works/doccano/pull/398 Enable superuser to delete user annotations.
    • https://github.com/chakki-works/doccano/pull/390 Improve annotation creation performance.
    • https://github.com/chakki-works/doccano/pull/448 Updates tornado dependency on 5.1.1 to not be affected by DDOS.
    • https://github.com/chakki-works/doccano/pull/450 Updates project creation form for v1.0.0.
    Source code(tar.gz)
    Source code(zip)
ElegantRL is featured with lightweight, efficient and stable, for researchers and practitioners.

Lightweight, efficient and stable implementations of deep reinforcement learning algorithms using PyTorch. ??

AI4Finance 2.5k Jan 8, 2023
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

About This repository provides data and code for the paper: Scalable Data Annotation Pipeline for High-Quality Large Speech Datasets Development (subm

Appen Repos 86 Dec 7, 2022
PaddleRobotics is an open-source algorithm library for robots based on Paddle, including open-source parts such as human-robot interaction, complex motion control, environment perception, SLAM positioning, and navigation.

简体中文 | English PaddleRobotics paddleRobotics是基于paddle的机器人开源算法库集,包括人机交互、复杂运动控制、环境感知、slam定位导航等开源算法部分。 人机交互 主动多模交互技术TFVT-HRI 主动多模交互技术是通过视觉、语音、触摸传感器等输入机器人

null 185 Dec 26, 2022
A embed able annotation tool for end to end cross document co-reference

CoRefi CoRefi is an emebedable web component and stand alone suite for exaughstive Within Document and Cross Document Coreference Anntoation. For a de

PythicCoder 39 Dec 12, 2022
A Data Annotation Tool for Semantic Segmentation, Object Detection and Lane Line Detection.(In Development Stage)

Data-Annotation-Tool How to Run this Tool? To run this software, follow the steps: git clone https://github.com/Autonomous-Car-Project/Data-Annotation

TiVRA AI 13 Aug 18, 2022
A graphical Semi-automatic annotation tool based on labelImg and Yolov5

??YOLOV5 semi-automatic annotation tool (Based on labelImg)

EricFang 247 Jan 5, 2023
OpenCVのGrabCut()を利用したセマンティックセグメンテーション向けアノテーションツール(Annotation tool using GrabCut() of OpenCV. It can be used to create datasets for semantic segmentation.)

[Japanese/English] GrabCut-Annotation-Tool GrabCut-Annotation-Tool.mp4 OpenCVのGrabCut()を利用したアノテーションツールです。 セマンティックセグメンテーション向けのデータセット作成にご使用いただけます。 ※Grab

KazuhitoTakahashi 30 Nov 18, 2022
ObjectDrawer-ToolBox: a graphical image annotation tool to generate ground plane masks for a 3D object reconstruction system

ObjectDrawer-ToolBox is a graphical image annotation tool to generate ground plane masks for a 3D object reconstruction system, Object Drawer.

null 77 Jan 5, 2023
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Master status: Development status: Package information: TPOT stands for Tree-based Pipeline Optimization Tool. Consider TPOT your Data Science Assista

Epistasis Lab at UPenn 8.9k Dec 30, 2022
O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning (CoRL 2021)

O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning Object-object Interaction Affordance Learning. For a given object-object int

Kaichun Mo 26 Nov 4, 2022
An Open Source Machine Learning Framework for Everyone

Documentation TensorFlow is an end-to-end open source platform for machine learning. It has a comprehensive, flexible ecosystem of tools, libraries, a

null 170.1k Jan 4, 2023
An Open Source Machine Learning Framework for Everyone

Documentation TensorFlow is an end-to-end open source platform for machine learning. It has a comprehensive, flexible ecosystem of tools, libraries, a

null 170.1k Jan 5, 2023
An open source machine learning library for performing regression tasks using RVM technique.

Introduction neonrvm is an open source machine learning library for performing regression tasks using RVM technique. It is written in C programming la

Siavash Eliasi 33 May 31, 2022
An Open Source Machine Learning Framework for Everyone

Documentation TensorFlow is an end-to-end open source platform for machine learning. It has a comprehensive, flexible ecosystem of tools, libraries, a

null 153.2k Feb 13, 2021
FEDn is an open-source, modular and ML-framework agnostic framework for Federated Machine Learning

FEDn is an open-source, modular and ML-framework agnostic framework for Federated Machine Learning (FedML) developed and maintained by Scaleout Systems. FEDn enables highly scalable cross-silo and cross-device use-cases over FEDn networks.

Scaleout 75 Nov 9, 2022
This is an open source library implementing hyperbox-based machine learning algorithms

hyperbox-brain is a Python open source toolbox implementing hyperbox-based machine learning algorithms built on top of scikit-learn and is distributed

Complex Adaptive Systems (CAS) Lab - University of Technology Sydney 21 Dec 14, 2022
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

Machine Learning From Scratch About Python implementations of some of the fundamental Machine Learning models and algorithms from scratch. The purpose

Erik Linder-Norén 21.8k Jan 9, 2023