Implementation of the final project of the course DDA6309 Probabilistic Graphical Models

Peng

Last update: Dec 26, 2021


Overview

Task-aware Joint CWS and POS (TCwsPos)

This is the implementation of the final project of the course DDA6309 Probabilistic Graphical Models, The Chinese University of Hong Kong (Shenzhen).

Please contact us at {pengsong,leqitian}@link.cuhk.edu.cn if you have any questions.

Requirements

Our code works with the following environment.

python=3.6
pytorch=1.1

Downloading BERT

In our paper, we use BERT as the encoder.

Please download the pre-trained BERT-Base Chinese model from Google or from HuggingFace. If you download it from Google, you need to convert the model from the TensorFlow format to the PyTorch format before use.
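Before training, it can help to confirm which format a downloaded BERT directory is in. The following is a minimal sketch, not part of this repository: the function name `detect_bert_format` is hypothetical, and the file names it looks for are assumptions based on the usual layout of the Google TensorFlow release and of PyTorch checkpoints.

```python
import os

# Assumed file names (not documented by this repository):
# the config may be named either way depending on the source of the download.
CONFIG_NAMES = {"config.json", "bert_config.json"}
PYTORCH_WEIGHTS = "pytorch_model.bin"      # typical PyTorch weight file
TF_INDEX = "bert_model.ckpt.index"         # typical TensorFlow checkpoint index


def detect_bert_format(model_dir):
    """Return 'pytorch', 'tensorflow', or 'unknown' for a BERT directory."""
    files = set(os.listdir(model_dir))
    has_config = bool(files & CONFIG_NAMES)
    # Both formats need a vocabulary and a config file.
    if "vocab.txt" not in files or not has_config:
        return "unknown"
    if PYTORCH_WEIGHTS in files:
        return "pytorch"
    if TF_INDEX in files:
        return "tensorflow"
    return "unknown"
```

If the function reports `tensorflow`, the checkpoint still needs to be converted before it can be passed to --bert_model.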

Running on Sample Data

Run run_sample.sh to train a model on the small sample data under the sample_data folder.

Datasets

We use Universal Dependencies 2.4 (UD) in our paper.

To obtain and pre-process the data, go to the data_preprocessing directory and run getdata.sh. This script downloads and processes the official data from UD.

All processed data will appear in the data directory, organized by dataset. Each dataset folder contains files with the same names as those under the sample_data directory.

Training and Testing

You can find the command lines to train and test a model on a specific dataset in run.sh.

Here are some important parameters:

--do_train: train the model
--do_test: test the model
--use_bert: use BERT as encoder
--bert_model: the directory of pre-trained BERT model
--model_name: the name of the model to save

Predicting

run_sample.sh contains the command line to segment and tag the sentences in an input file (./sample_data/sentence.txt).

Here are some important parameters:

--do_predict: segment and tag the sentences using a pre-trained TCwsPos model.
--input_file: the file containing the sentences to be segmented and tagged. Each line contains one sentence; refer to the sample input file (./sample_data/sentence.txt) for the input format.
--output_file: the path of the output file. Words are separated by a space; the POS label is attached to each word with an underscore ("_").
--eval_model: the pre-trained TCwsPos model to be used to segment and tag the sentences in the input file.
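The output format described above (space-separated words, each followed by `_` and its POS label) can be read back with a few lines of Python. This is a minimal sketch: the function name `parse_tagged_line` is hypothetical, and it assumes POS labels never contain an underscore, so the split is made at the last `_` in each token.

```python
def parse_tagged_line(line):
    """Split one output line into a list of (word, pos_label) pairs."""
    pairs = []
    for token in line.split():
        # Split at the LAST underscore so words containing "_" stay intact.
        word, sep, tag = token.rpartition("_")
        if not sep:
            # No underscore found: keep the token as a word with no label.
            pairs.append((token, None))
        else:
            pairs.append((word, tag))
    return pairs


def words_only(line):
    """Return only the segmented words of one output line."""
    return [word for word, _ in parse_tagged_line(line)]
```

For instance, a line such as `我_PRON 爱_VERB 北京_PROPN` would yield three (word, label) pairs, and joining the words without spaces gives back the original unsegmented sentence.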

To-do List

Regular maintenance

You can leave comments in the Issues section if you want us to implement any additional functions.
