Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts)

Clova AI Research

Last update: Dec 23, 2022

Related tags

Deep Learning font machine-learning deep-learning pytorch generative-models font-generation mx-font

Overview

Introduction

Pytorch implementation of Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Expert. | paper

Song Park¹, Sanghyuk Chun^{2, 3}, Junbum Cha³, Bado Lee³, Hyunjung Shim¹
¹ _{School of Integrated Technology, Yonsei university}
² _{NAVER AI Lab}
³ _{NAVER CLOVA}

A few-shot font generation (FFG) method has to satisfy two objectives: the generated images should preserve the underlying global structure of the target character and present the diverse local reference style. Existing FFG methods aim to disentangle content and style either by extracting a universal representation style or extracting multiple component-wise style representations. However, previous methods either fail to capture diverse local styles or cannot be generalized to a character with unseen components, e.g., unseen language systems. To mitigate the issues, we propose a novel FFG method, named Multiple Localized Experts Few-shot Font Generation Network (MX-Font). MX-Font extracts multiple style features not explicitly conditioned on component labels, but automatically by multiple experts to represent different local concepts, e.g., left-side sub-glyph. Owing to the multiple experts, MX-Font can capture diverse local concepts and show the generalizability to unseen languages. During training, we utilize component labels as weak supervision to guide each expert to be specialized for different local concepts. We formulate the component assign problem to each expert as the graph matching problem, and solve it by the Hungarian algorithm. We also employ the independence loss and the content-style adversarial loss to impose the content-style disentanglement. In our experiments, MX-Font outperforms previous state-of-the-art FFG methods in the Chinese generation and cross-lingual, e.g., Chinese to Korean, generation.

You can find more related projects on the few-shot font generation at the following links:

clovaai/dmfont (ECCV'20)
clovaai/lffont (AAAI'21)

Prerequisites

Python > 3.6

Using conda is recommended: https://docs.anaconda.com/anaconda/install/linux/
pytorch >= 1.5

To install: https://pytorch.org/get-started/locally/
sconf, numpy, scipy, scikit-image, tqdm, jsonlib, fonttools

conda install numpy scipy scikit-image tqdm jsonlib-python3 fonttools

Usage

Note that, we only provide the example font files; not the font files used for the training the provided weight (generator.pth). The example font files are downloaded from here.

Preparing Data

The examples of datasets are in (./data)

Font files (.ttf)

Prepare the TrueType font files(.ttf) to use for the training and the validation.
Put the training font files and validation font files into separate directories.

The text files containing the available characters of .ttf files (.txt)

If you have the available character list of a .ttf file, save its available characters list to a text file (.txt) with the same name in the same directory with the ttf file.
- (example) TTF file: data/ttfs/train/MaShanZheng-Regular.ttf, its available characters: data/ttfs/train/MaShanZheng-Regular.txt
You can also generate the available characters files automatically using the get_chars_from_ttf.py

# Generating the available characters file

python get_chars_from_ttf.py --root_dir path/to/ttf/dir

--root_dir: The root directory to find the .ttf files. All the .ttf files under this directory and its subdirectories will be processed.

The json files with decomposition information (.json)

The files for the decomposition information are needed.
- The files for the Chinese characters are provided. (data/chn_decomposition.json, data/primals.json)
- If you want to train the model with a language other than Chinese, the files for the decomposition rule (see below) are also needed.
  - Decomposition rule
    - structure: dict (in json format)
    - format: {char: [list of components]}
    - example: {'㐬': ['亠', '厶', '川'], '㐭': ['亠', '囗', '口']}
  - Primals
    - structure: list (in json format)
    - format: [All the components in the decomposition rule file]
    - example: ['亠', '厶', '川', '囗', '口']

Training

Modify the configuration file (cfgs/train.yaml)

- use_ddp:  whether to use DataDistributedParallel, for multi-GPUs.
- port:  the port for the DataDistributedParallel training.

- work_dir:  the directory to save checkpoints, validation images, and the log.
- decomposition:  path to the "decomposition rule" file.
- primals:  path to the "primals" file.

- dset:  (leave blank)
  - train:  (leave blank)
    - data_dir : path to .ttf files for the training
  - val: (leave blank)
    - data_dir : path to .ttf files for the validation
    - source_font : path to .ttf file used as the source font during the validation

Run training

python train.py cfgs/train.yaml

arguments
- path/to/config (first argument): path to configration file.
- --resume (optional) : path to checkpoint to resume.

Test

Preparing the reference images

Prepare the reference images and the .ttf file to use as the source font.
The reference images are should be placed in this format:

    * data_dir
    |-- font1
        |-- char1.png
        |-- char2.png
        |-- char3.png
    |-- font2
        |-- char1.png
        |-- char2.png
            .
            .
            .

The names of the directory or the image files are not important, however, the images with the same reference style are should be grouped with the same directory.
If you want to generate only specific characters, prepare the file containing the list of the characters to generate.
- The example file is provided. (data/chn_gen.json)

Modify the configuration file (cfgs/eval.yaml)

- dset:  (leave blank)
  - test:  (leave blank)
    - data_dir: path to reference images
    - source_font: path to .ttf file used as the source font during the generation
    - gen_chars_file: path to file of the characters to generate. Leave blank if you want to generate all the available characters in the source font.

Run test

python eval.py \
    cfgs/eval.yaml \
    --weight generator.pth \
    --result_dir path/to/save/images

arguments
- path/to/config (first argument): path to configration file.
- --weight : path to saved weight to test.
- --result_dir: path to save generated images.

Code license

This project is distributed under MIT license, except modules.py which is adopted from https://github.com/NVlabs/FUNIT.

MX-Font
Copyright (c) 2021-present NAVER Corp.

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in
all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
THE SOFTWARE.

Acknowledgement

This project is based on clovaai/dmfont and clovaai/lffont.

How to cite

@article{park2021mxfont,
    title={Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts},
    author={Park, Song and Chun, Sanghyuk and Cha, Junbum and Lee, Bado and Shim, Hyunjung},
    year={2021},
    journal={arXiv preprint arXiv:2104.00887},
}

Comments

about training

Thanks for your amazing work. I tried to train mxfont model following the default configuration, but how I get reference image and what the dataset format is.

please give the detail description. Thanks.

opened by XuyangPan 9
About inference...

I noticed that there is a source_path in inference.ipynb that needs to specify a ttf your data folder does not contain, does it use the same font as your lffont's content_font. If not, how do I get it?

opened by zha-hengfeng 4
a json file Includes all Chinese characters in the data folder

I want to ask if there is a json file，similar to chn_primals.json and chn_gen.json in the data folder，which Includes all Chinese characters？ can you send me such file？ @8uos @SanghyukChun

opened by pengyaru 4
trainning stragegy

Hello, thank you for your impressive work. I noticed that you set'max_iter' to 800,000, and the training data set contains 439 different styles (each has nearly 6000 characters). When I set 'batch' to 8, the epoch is between 2 and 3 (which makes me feel strange). Am I right?

opened by ecnuycxie 4
abort program aborting

Hello， the 313 fonts were collected and were made into dataset. During training mxfont model the program aborted.

Some losses were very abnormal. The log was recorded in the following.

opened by XuyangPan 4
get_defined_chars function seems to not work for some ttf files

Hello, thanks for the this amazing work in few-shot font generation!

When I want to train the model on my own ttf files, I met a problem. When I tried to utilize the get_chars_from_ttf.py, some of my ttf files can only produces the alphabets some punctuations, and no chinese chars is produced. However, the ttfs does have included chinese chars. I am not sure if there is some limitation of fontTools when it read in the ttf files. Could you help me to explain the reasons? Thanks for your help!

opened by YIYANGCAI 3
Training iteration

HI, thanks for your impressive work. I am trying training your code from the begining, so I was wondering how many GPUS you used and how long will it spends for training?

I noticed that you set'max_iter' to 800,000 and batch_size is 8. But in paper you said the iter is 650,000 and mini_batch size is 24. So which is correct?

When I set 'batch size' to 24, can I reduce the iteration to 650k/3 ? can I get the same result? I am looking forward to your response.Thanks

opened by ZYJ-JMF 3
problem about reproducing the mxfont and FUNIT

Thanks for your wonderful works. I am trying to reproduce the results of MXFont and FUNIT. According to your reply in #6, I choose the best AC_g_acc_c and AC_g_acc_s as the stop iteration. When I try to reproduce FUNIT (128*128), mode collapse happens after about 10000 iterations. Have you encountered this problem?

opened by ecnuycxie 1
about FID

do you measure the style-aware and content-aware FID using all the generated images instead of measuring the FID of each generated style and calculating the average?

opened by ecnuycxie 1
There are some smudge in the infer result image

This is an interesting work.But i got some issue when i try inference with the weight you provide . There are some dark line in the result image. and this is my refer style image

opened by kbaicai 1
while eval, get_defined_chars function can not get any Chinese character data! There is no output.

I gave the font file I downloaded myself, the font is correct and can be installed, but while testing, I found that there is no output pictures, I found that when using “get_defined_chars(fontfile)” function, there are only symbols like “！@#￥%…&×（）” in "chars". So the gen_chars and data_list is empty! I'm so sorry, but why

opened by sAuDoisy 0

Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts)

Related tags

Overview

Introduction

Prerequisites

Usage

Preparing Data

Font files (.ttf)

The text files containing the available characters of .ttf files (.txt)

The json files with decomposition information (.json)

Training

Modify the configuration file (cfgs/train.yaml)

Run training

Test

Preparing the reference images

Modify the configuration file (cfgs/eval.yaml)

Run test

Code license

Acknowledgement

How to cite

Comments

Owner

Clova AI Research

The pytorch implementation of DG-Font: Deformable Generative Networks for Unsupervised Font Generation

Official code for "Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer. ICCV2021".

PyTorch implementation of D2C: Diffuison-Decoding Models for Few-shot Conditional Generation.

Official repository for Few-shot Image Generation via Cross-domain Correspondence (CVPR '21)

Few-NERD: Not Only a Few-shot NER Dataset

Source code for paper "Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling", AAAI 2021

Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering

Official PyTorch Implementation of Hypercorrelation Squeeze for Few-Shot Segmentation, arXiv 2021

(ICCV'21) Official PyTorch implementation of Relational Embedding for Few-Shot Classification

Pytorch implementation of paper: "NeurMiPs: Neural Mixture of Planar Experts for View Synthesis"

Much faster than SORT(Simple Online and Realtime Tracking), a little worse than SORT

Official Implementation of Few-shot Visual Relationship Co-localization

The official implementation of the CVPR 2021 paper FAPIS: a Few-shot Anchor-free Part-based Instance Segmenter

Distributed Asynchronous Hyperparameter Optimization better than HyperOpt.

[ICLR 2021] Is Attention Better Than Matrix Decomposition?

Code of PVTv2 is released! PVTv2 largely improves PVTv1 and works better than Swin Transformer with ImageNet-1K pre-training.

[NeurIPS 2021] Better Safe Than Sorry: Preventing Delusive Adversaries with Adversarial Training

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)