Why this? What is the goal?
The goal of this repository is to implement all of these recurrent architectures from scratch in TensorFlow, for learning purposes. This is a work in progress: I plan to implement more architectures and publish results and performance numbers for all of them.

The inspiration came from the last paragraph of Chris Olah's post Understanding LSTMs, where he mentions two papers that studied recurrent architectures extensively; I wanted to implement all the architectures from those two papers. A short Google search showed that Jim Flemming had already done half the work here, so I decided to implement all the remaining architectures from Jozefowicz's paper. (I also updated parts of his code so that all the architectures work in the newest version of TensorFlow.) Both papers are fantastic and worth a read.

Feel free to send me a pull request if you spot an error and/or find other papers with recurrent architecture variants; as time permits, I will implement them. All the implementations are in TensorFlow (0.12).
Deep Learning Recurrent Architectures
- LSTM Network Variants: This tutorial takes a very nice approach to creating variations of LSTM networks. It is a good way to learn how to code a new network architecture and, more importantly, it offers a methodical approach to understanding the gates in an LSTM.
- Empirical Exploration of Recurrent Network Architectures
This was mainly because I wanted to learn the actual implementations of various recurrent neural network architectures and implement them from scratch, without using the predefined LSTM, GRU, etc. cells. This repository started as a direct fork of LSTM Network Variants, with the code changed to run on the most recent version of TensorFlow (0.12.0 as of this writing). I will keep this repository up to date with new changes.
This repo also adds more network architectures from Empirical Exploration of Recurrent Network Architectures.
The implementations are not optimal, in the sense that production implementations of the LSTM, GRU, and RNN cells concatenate the state and the input before the multiplications in order to reduce the number of matrix multiplications, whereas this is a direct implementation of the LSTM equations you would see in a textbook.
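For concreteness, here is a minimal NumPy sketch of the two styles (illustrative only, not the repository's code; all names and the weight layout are made up for this example). The textbook form keeps a separate weight matrix per gate, while the fused form does a single matrix multiplication on the concatenated input and state and then splits the result into the four gates.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step_textbook(x, h, c, W, U, b):
    """One LSTM step written exactly as in the textbook equations.

    W, U, b are dicts keyed by gate name ('i', 'f', 'o', 'g'); each gate
    has its own input-to-hidden (W) and hidden-to-hidden (U) matrix.
    """
    i = sigmoid(W['i'] @ x + U['i'] @ h + b['i'])   # input gate
    f = sigmoid(W['f'] @ x + U['f'] @ h + b['f'])   # forget gate
    o = sigmoid(W['o'] @ x + U['o'] @ h + b['o'])   # output gate
    g = np.tanh(W['g'] @ x + U['g'] @ h + b['g'])   # candidate cell state
    c_new = f * c + i * g
    h_new = o * np.tanh(c_new)
    return h_new, c_new

def lstm_step_fused(x, h, c, W_big, b_big):
    """The optimized form: one matmul on the concatenated [x, h].

    W_big stacks all four gate matrices, so its output splits into four
    equal chunks (input, forget, output, candidate).
    """
    z = W_big @ np.concatenate([x, h]) + b_big
    i, f, o, g = np.split(z, 4)
    c_new = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
    h_new = sigmoid(o) * np.tanh(c_new)
    return h_new, c_new
```

Functionally the two are identical; the fused form just trades eight small matrix multiplications for one big one.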
Other tutorials that are also helpful
- RNN Network Implementation in TensorFlow: A series of posts on implementing recurrent neural network architectures. A good follow-up would be the next article.
- Reinforcement Learning: A fantastic hands-on tutorial on reinforcement learning using the OpenAI platform.
Recurrent Architectures Implemented
Architectures marked with a (*) were implemented in LSTM Network Variants; the rest were implemented by me based on Empirical Exploration of Recurrent Network Architectures. The architectures I implemented also follow the conventions and syntax of Empirical Exploration of Recurrent Network Architectures. A sketch of one of the variants appears after this list.
- mut1 : Variant 1 from Empirical Exploration of Recurrent Network Architectures
- mut2 : Variant 2 from Empirical Exploration of Recurrent Network Architectures
- mut3 : Variant 3 from Empirical Exploration of Recurrent Network Architectures
- vanillaRNN : Just a vanilla RNN Network
- gru : Gated Recurrent Unit
- cifg (*) : Coupled input-forget gate
- fgr (*) : Full Gate Recurrence
- lstm (*) : Long Short Term Memory
- nfg (*) : No forget gate
- niaf (*) : No input activation function
- nig (*) : No input gate
- noaf (*) : No output activation function
- nog (*) : No output gate
- np (*) : No peephole connections
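As an example of how one of these variants changes the base cell, here is a sketch of the coupled input-forget gate (cifg) idea, reusing the sigmoid helper and made-up weight layout from the textbook sketch above (again illustrative, not the repository's code): the forget gate is no longer learned separately but is tied to the input gate as 1 - i.

```python
def lstm_step_cifg(x, h, c, W, U, b):
    """Coupled input-forget gate LSTM step.

    The forget gate is replaced by (1 - i), so the new cell state is a
    convex combination of the old state and the candidate.
    """
    i = sigmoid(W['i'] @ x + U['i'] @ h + b['i'])   # input gate
    o = sigmoid(W['o'] @ x + U['o'] @ h + b['o'])   # output gate
    g = np.tanh(W['g'] @ x + U['g'] @ h + b['g'])   # candidate cell state
    c_new = (1.0 - i) * c + i * g                   # forget gate tied to 1 - i
    h_new = o * np.tanh(c_new)
    return h_new, c_new
```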
Instructions
See the Jupyter notebook here: https://github.com/debajyotidatta/RecurrentArchitectures/blob/master/Empirical%20Exploration%20of%20Recurrent%20Network%20Architectures.ipynb