A python gui program to generate reddit text to speech videos from the id of any post.

Overview

Reddit text to speech generator

A python gui program to generate reddit text to speech videos from the id of any post.

Current functionality

  • Generate videos for subs based on comments,(askreddit) so reading individual comments.
  • Generate videos for subs with longer posts,(entitledparents), so slicing the post into multiple frames to fit the text.
  • Easy login, all you need to do, is run main.py, you will get a link; go to the link and click allow; Thats it! the credentials are written to a json file so you dont need to repeat this step as long as the token.json file is there.
  • Optional customizability options to change the font, backdrop, position of text on the screen

Todo

Feel free to suggest any feature or bug via comments or issues

  • Get comments based on the permalink
  • Generate mp3 and jpg of the post and its comments; concatenate both of them for a clip, them combine all the clip into one file
  • Migrate to praw
  • Ability to generate tts for post based subreddits (r/nosleep or r/relationships)
  • Better post/comment formatting
    • Nsfw filter
  • Visual enhancements
    • Slicing longer posts into to frames or jpg(s)
  • Wrapping it all up into a nice tkinter pysimplegui window
    • Implemented option for reading post or comments (with title)
    • Output folder selection
    • Voice selection
    • Some clean ups and error handling

Usage

  • do pip install -r requirements.txt to install all the dependencies and then, run python main.py
  • It will prompt you to open a url, open it, then allow "post_scraper" to access your account, then you will be redirected to a browser tab, you can then close it.

Screenshots

Backburners

  • Converting links and emojis to plain text
You might also like...
Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

STEMM: Self-learning with Speech-Text Manifold Mixup for Speech Translation This is a PyTorch implementation for the ACL 2022 main conference paper ST

This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems

Proteno This is the data release associated with the corresponding NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deploymen

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration This repo contains only model Implementation of Zero-Shot Text-to-Speech for Text

glow-speak is a fast, local, neural text to speech system that uses eSpeak-ng as a text/phoneme front-end.

Glow-Speak glow-speak is a fast, local, neural text to speech system that uses eSpeak-ng as a text/phoneme front-end. Installation git clone https://g

Download videos from YouTube/Twitch/Twitter right in the Windows Explorer, without installing any shady shareware apps
Download videos from YouTube/Twitch/Twitter right in the Windows Explorer, without installing any shady shareware apps

youtube-dl and ffmpeg Windows Explorer Integration Download videos from YouTube/Twitch/Twitter and more (any platform that is supported by youtube-dl)

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

⚠️ Checkout develop branch to see what is coming in pyannote.audio 2.0: a much smaller and cleaner codebase Python-first API (the good old pyannote-au

In this repository, I have developed an end to end Automatic speech recognition project. I have developed the neural network model for automatic speech recognition with PyTorch and used MLflow to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.
Speech Recognition for Uyghur using Speech transformer

Speech Recognition for Uyghur using Speech transformer Training: this model using CTC loss and Cross Entropy loss for training. Download pretrained mo

A flask application to predict the speech emotion of any .wav file.

This is a speech emotion recognition app. It will allow you to train a modular MLP model with the RAVDESS dataset, and then use that model with a flask application to predict the speech emotion of any .wav file.

Comments
  • Screenshot for your readme?

    Screenshot for your readme?

    Hi!

    I see in your readme that you've got a GUI and indeed, the code shows you've got one... but, there's no image in your readme. I'm SURE visitors would enjoy seeing what you made.

    If you've not put an image into a readme before, it's super simple. It's as easy as embedding an image in this GitHub Issue...

    This is a simple one I made a couple of years ago. Nothing fancy that's for sure....

    image

    https://github.com/PySimpleGUI/PySimpleGUI-Reddit-Searcher

    I put my screenshots that I used in my readme for that project into an Issue and then pasted the links into my readme

    https://github.com/PySimpleGUI/PySimpleGUI-Reddit-Searcher/issues/1

    The window above was made by pasting into this comment. It made this line of text:

    ![image](https://user-images.githubusercontent.com/46163555/142887950-bf23dbca-ffee-4401-a90a-3f9816d18ab0.png) You can do the same with your program... paste a screenshot and then paste the link into your readme. Simple.... I 💗 simple

    image

    opened by PySimpleGUI 16
  • Can't make PySimpleGUI calls from threads....

    Can't make PySimpleGUI calls from threads....

    It appears you're making PySimpleGUI calls from a thread. You may even be "getting away with it". If tktiner catches you, your program will crash. PySimpleGUI wasn't written to be threadsafe either.

    There is an easy way out however. window.write_event_value() is callable from a thread. With this call you can pass data through to your event loop and then have your event loop perform the operation. I recommend using tuples for this. If you use a tuple with your events, then you can have the first item in your tuple be an indicator that the event is from your thread, then have the next item indicate the operation.

    Maybe an event like this if you want to print something to the element with the key '-ML-': ('-THREAD-', 'print', '-ML-')

    window.write_event_value(('-THREAD-', 'print', '-ML-'), "this is what I want to print")
    
    

    In your event loop you can then have

    if event[0] == '-THREAD-':
        # all of your thread based event processing...
        if event[1] == 'print':
            window[event[2]].print(values[event])
    

    I did something like this when I added threading to an image restoration project I stumbled onto: https://github.com/mizosoft/ImageRestorationAndEnhancement/issues/2

    Lots of different ways to do it. You'll find examples in the Demo Programs as well that use write_event_value image

    opened by PySimpleGUI 1
Owner
Aadvik
I automate stuff with python, and occasionally work with nodejs. Also, coffee is overrated.
Aadvik
Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple

Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple

Alexander Veysov 3.2k Dec 31, 2022
PyTorch implementation of Microsoft's text-to-speech system FastSpeech 2: Fast and High-Quality End-to-End Text to Speech.

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Chung-Ming Chien 1k Dec 30, 2022
Simple Speech to Text, Text to Speech

Simple Speech to Text, Text to Speech 1. Download Repository Opsi 1 Download repository ini, extract di lokasi yang diinginkan Opsi 2 Jika sudah famil

Habib Abdurrasyid 5 Dec 28, 2021
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

textgenrnn Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code, or quickly tr

Max Woolf 4.8k Dec 30, 2022
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

textgenrnn Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code, or quickly tr

Max Woolf 4.3k Feb 18, 2021
This is a GUI program that will generate a word search puzzle image

Word Search Puzzle Generator Table of Contents About The Project Built With Getting Started Prerequisites Installation Usage Roadmap Contributing Cont

null 11 Feb 22, 2022
Text to speech converter with GUI made in Python.

Text-to-speech-with-GUI Text to speech converter with GUI made in Python. To run this download the zip file and run the main file or clone this repo.

SidTheMiner 1 Nov 15, 2021
Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech tagging and word segmentation.

Pytorch-NLU,一个中文文本分类、序列标注工具包,支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech tagging and word segmentation.

null 186 Dec 24, 2022
A Python module made to simplify the usage of Text To Speech and Speech Recognition.

Nav Module The solution for voice related stuff in Python Nav is a Python module which simplifies voice related stuff in Python. Just import the Modul

Snm Logic 1 Dec 20, 2021
Minimal GUI for accessing the Watson Text to Speech service.

Description Minimal graphical application for accessing the Watson Text to Speech service. Requirements Python 3 plus all dependencies listed in requi

Moritz Maxeiner 1 Oct 22, 2021