This Repository is an up-to-date version of Harvard nlp's Legacy code and a Refactoring of the jupyter notebook version as a shell script version.

신재욱

Last update: Sep 25, 2022

Related tags

Security related resources 2022_the_annotated_transformer

Overview

2022_the_annotated_transformer

Goal

This Repository is an up-to-date version of Harvard nlp's Legacy code and a Refactoring of the jupyter notebook version as a shell script version.

Key points

We have re-factored Harvard NLP's Annotated Trasformer into a shell script version.
Dataset utilized Multi30K. (The dataset is small, so you can see the results quickly even on computers with low specifications.)
We provide the Colab version along with the shell script version, making it easy to modify the model and test the method.

https://colab.research.google.com/drive/1SrRmC_Ti8IepeHFNBZBjNxl_wkTSJReC?usp=sharing
Loss Graph can be drawn.
BLEU Score can be measured.

file structure

├── models
│   ├── __init__.py
│   ├── blocks
│   │   ├── __init__.py
│   │   ├── decoder_layer.py
│   │   ├── encoder_layer.py
│   ├── embedding
│   │   ├── __init__.py
│   │   ├── positional_encoding.py
│   │   └── token_embedding.py
│   ├── layers
│   │   ├── __init__.py
│   │   ├── layer_norm.py
│   │   ├── multi_headed_attention.py
│   │   ├── position_wise_feed_forward.py
│   │   └── sublayer_connection.py
│   ├── model
│   │   ├── __init__.py
│   │   ├── decoder.py
│   │   ├── encoder_decoder.py
│   │   ├── encoder.py
│   │   ├── generator.py
│   └── util.py
├── result
│   ├── loss_graph.png
│   ├── train_loss.txt
│   └── valid_loss.txt
├── saved
├── utils
    ├── __init__.py
    ├── batch.py
    ├── batch_size_fn.py
    ├── bleu.py
    ├── data_loader.py
    ├── epoch_time.py
    ├── greedy_decode.py
    ├── label_smoothing.py
    ├── make_model.py
    ├── NoamOpt.py
    ├── run_epoch.py
    ├── simple_loss_compute.py
    └── tokenizer.py
├── README.md
├── test.py
├── train.py
├── config.py
├── data.py
└── graph.py

Training Result

Train Validation loss graph

Test set(unseen data) Translation Example

Test set(unseen data) BLEU Score Average: 35.870847920953594

Reference

https://nlp.seas.harvard.edu/2018/04/03/attention.html

https://jalammar.github.io/illustrated-transformer/

https://www.facebook.com/groups/TensorFlowKR/permalink/1618169785190740/

https://github.com/hyunwoongko/transformer

You might also like...

Script for automatic dump and brute-force passwords using Volatility Framework

Volatility-auto-hashdump Script for automatic dump and brute-force passwords using Volatility Framework

11 Apr 11, 2022

Subdomain enumeration,Web scraping and finding usernames automation script written in python

12 Nov 22, 2022

Providing DevOps and security teams script to identify cloud workloads that may be vulnerable to the Log4j vulnerability(CVE-2021-44228) in their AWS account.

We are providing DevOps and security teams script to identify cloud workloads that may be vulnerable to the Log4j vulnerability(CVE-2021-44228) in their AWS account. The script enables security teams to identify external-facing AWS assets by running the exploit on them, and thus be able to map them and quickly patch them

13 Jan 4, 2022

This Repository is an up-to-date version of Harvard nlp's Legacy code and a Refactoring of the jupyter notebook version as a shell script version.

Related tags

Overview

2022_the_annotated_transformer

Goal

Key points

file structure

Training Result

Train Validation loss graph

Test set(unseen data) Translation Example

Reference

You might also like...

Script for automatic dump and brute-force passwords using Volatility Framework

Subdomain enumeration,Web scraping and finding usernames automation script written in python

Providing DevOps and security teams script to identify cloud workloads that may be vulnerable to the Log4j vulnerability(CVE-2021-44228) in their AWS account.

Provides script to download and format public IP lists related to the Log4j exploit.

This python script will automate the testing for the Log4J vulnerability for HTTP and HTTPS connections.

A script to search, scrape and scan for Apache Log4j CVE-2021-44228 affected files using Google dorks

Yesitsme - Simple OSINT script to find Instagram profiles by name and e-mail/phone

Script to calculate Active Directory Kerberos keys (AES256 and AES128) for an account, using its plaintext password

An experimental script to perform bulk parsing of arbitrary file features with YARA and console logging.

Owner

신재욱

Unauthenticated Sqlinjection that leads to dump data base but this one impersonated Admin and drops a interactive shell

Generate MIPS reverse shell shellcodes easily !

Meterpreter Reverse shell over TOR network using hidden services

An advanced multi-threaded, multi-client python reverse shell for hacking linux systems

Vulnerability Exploitation Code Collection Repository

Dome - Subdomain Enumeration Tool. Fast and reliable python script that makes active and/or passive scan to obtain subdomains and search for open ports.

A python script to turn Ubuntu Desktop in a one stop security platform. The InfoSec Fortress installs the packages,tools, and resources to make Ubuntu 20.04 capable of both offensive and defensive security work.

This is python script that will extract the functions call in all used DLL in an executable and then provide a mapping of those functions to the attack classes defined and curated malapi.io.

Downloads SEP, Baseband and BuildManifest automatically for signed iOS version's for connected iDevice

Scans all drives for log4j jar files and gets their version from the manifest