Honors thesis project analyzing whether the GPT-2 model generates free-verse or structured poetry more effectively.

Ashley Kim

Last update: Jan 9, 2022

gpt2-poetry

The following code is for my senior honors thesis project, under the guidance of Dr. Keith Holyoak at the University of California, Los Angeles.

I am currently analyzing whether the GPT-2 model can more effectively generate free-verse or structured poetry. The project uses the GPT-2 architecture (code originated from "Language Models are Unsupervised Multitask Learners" by Radford et al., paper at this link: https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf) to generate poetry after training on two different corpora: a corpus of sonnets (fourteen-line, rhymed poems) and a corpus of free-verse poems of ten to eighteen lines, selected from Poetry Magazine issues published from January 2012 through December 2021. I plan to compare the quality of the generated poems with randomly selected human-written poems from each training set through a participant survey on different characteristics of poetry.
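As a rough illustration of the corpus selection and survey sampling described above, here is a minimal sketch. It is not the project's actual code: the function names are hypothetical, and the choice to ignore blank lines (stanza breaks) when counting a poem's length is an assumption.

```python
import random


def count_lines(poem: str) -> int:
    """Count non-blank lines (assumption: stanza breaks do not count)."""
    return sum(1 for line in poem.splitlines() if line.strip())


def select_by_length(poems, min_lines=10, max_lines=18):
    """Keep poems whose line count falls in the inclusive range."""
    return [p for p in poems if min_lines <= count_lines(p) <= max_lines]


def sample_for_survey(poems, k, seed=0):
    """Draw k distinct poems at random; the seed makes the draw repeatable."""
    rng = random.Random(seed)
    return rng.sample(poems, k)
```

A fixed seed keeps the set of human-written comparison poems the same across runs, which may be useful when the survey materials need to be regenerated.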

To run: install Python 3.9.8, as well as the following modules: Fire 0.1.3, Regex 2017.4.5, Requests 2.21.0, tqdm 4.31.1, and toposort 1.5.
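To confirm that an environment matches the versions listed above, a small check along these lines may help. This is a sketch, not part of the project: the helper name `find_mismatches` is hypothetical, and the lowercase package names are assumed to be the distribution names used at install time.

```python
from importlib import metadata

# Versions taken from the list above; distribution names are assumed.
PINNED = {
    "fire": "0.1.3",
    "regex": "2017.4.5",
    "requests": "2.21.0",
    "tqdm": "4.31.1",
    "toposort": "1.5",
}


def find_mismatches(pinned, get_version=metadata.version):
    """Return {name: (wanted, found)} for missing or differing packages.

    `found` is None when the package is not installed.
    """
    problems = {}
    for name, wanted in pinned.items():
        try:
            found = get_version(name)
        except metadata.PackageNotFoundError:
            found = None
        if found != wanted:
            problems[name] = (wanted, found)
    return problems


if __name__ == "__main__":
    for name, (wanted, found) in find_mismatches(PINNED).items():
        print(f"{name}: expected {wanted}, found {found}")
```

The comparison is a plain string match, so a package that reports an equivalent version in a different format would still be flagged and should be checked by hand.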

This project is in progress, and only the free-verse portion of the data is currently uploaded to GitHub. The sonnets generated by the GPT-2 model will be uploaded soon!

Last updated: 1/5/2021
