Converting CPT to BERT form for use

黄辉

Last update: Oct 14, 2021

cpt-encoder

Using CPT in BERT form

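Notes

I just came across a newly released model, CPT. According to the paper, it outperforms MacBERT on many Chinese tasks, so I could not wait to try it out.

After studying the source code, I found that for NLU modeling the model mainly relies on its encoder, which is a BERT. I therefore converted the encoder weights to the BERT weight format so that they are easy to use for NLU tasks.

Of course, to get the full performance of CPT you still need to use the official code in a generative fashion, for example with prompts.

Performance has not been benchmarked yet; after the first epoch, results look roughly on par with RoBERTa.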

Loading

The weights can be loaded with Hugging Face transformers, in the same way as BERT.
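A minimal sketch of what loading might look like, assuming the downloaded archive has been unpacked into a local directory that contains the usual BERT files (`config.json`, `vocab.txt`, and the weight file). The function name `load_cpt_encoder` and the directory path are hypothetical and not part of this repository.

```python
from pathlib import Path


def load_cpt_encoder(model_dir):
    """Load converted CPT encoder weights as a plain BERT model.

    `model_dir` is a hypothetical local directory; it is assumed to hold
    config.json, vocab.txt and the converted weight file.
    """
    model_dir = Path(model_dir)
    if not model_dir.is_dir():
        raise FileNotFoundError(f"model directory not found: {model_dir}")

    # Imported lazily so the path check above works without transformers installed.
    from transformers import BertModel, BertTokenizer

    tokenizer = BertTokenizer.from_pretrained(str(model_dir))
    model = BertModel.from_pretrained(str(model_dir))
    model.eval()  # disable dropout for feature extraction
    return tokenizer, model


# Example usage (requires the unpacked weights, so it is left commented out):
# tokenizer, model = load_cpt_encoder("./cpt-encoder-base")
# inputs = tokenizer("今天天气很好", return_tensors="pt")
# hidden = model(**inputs).last_hidden_state  # shape: (1, seq_len, hidden_size)
```

For fine-tuning on a classification task, the same directory could presumably be passed to `BertForSequenceClassification.from_pretrained` instead of `BertModel.from_pretrained`.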

Conversion code

See convert_cpt_to_bert.py
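The general idea of such a conversion can be sketched as a key-renaming pass over the checkpoint's state dict. This is not the contents of convert_cpt_to_bert.py; it is an illustration under the assumption that the encoder parameters sit under a common prefix (here `model.encoder.`) and that the target names should start with `bert.`. Both prefixes and the function name `extract_encoder_weights` are hypothetical.

```python
def extract_encoder_weights(state_dict, src_prefix="model.encoder.", dst_prefix="bert."):
    """Keep only encoder parameters and rename them to BERT-style keys.

    Both prefixes are assumptions for illustration; inspect the real
    checkpoint's keys before relying on them.
    """
    converted = {}
    for name, tensor in state_dict.items():
        if name.startswith(src_prefix):
            # Swap the source prefix for the target prefix, keep the rest of the name.
            converted[dst_prefix + name[len(src_prefix):]] = tensor
        # Everything else (e.g. decoder parameters) is dropped.
    return converted


# Toy example with placeholder values standing in for tensors.
toy_checkpoint = {
    "model.encoder.embeddings.word_embeddings.weight": "W_emb",
    "model.encoder.encoder.layer.0.attention.self.query.weight": "W_q",
    "model.decoder.layers.0.self_attn.k_proj.weight": "W_dec",
}
bert_style = extract_encoder_weights(toy_checkpoint)
print(sorted(bert_style))
```

With a real checkpoint, the input would typically be the dict returned by `torch.load` and the result would be written back with `torch.save`, alongside a BERT-style `config.json`.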

Converted weights

cpt-encoder-base: https://pan.baidu.com/s/1PqUAWNczX9vVcFtRHcE5cg (extraction code: 2fo2)

cpt-encoder-large: https://pan.baidu.com/s/1KwumkF1NRL6wX7aifnq4xA (extraction code: ke7o)

Official links

Paper: CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation

GitHub: CPT
