Speech Recognition Database Management with Python

Overview

The main aim of this project is to recognize the user's voice as input and convert it into text.

Libraries Used Inside the Project

We used Python's SpeechRecognition module to accomplish this. It works together with PyAudio, which lets us record and play audio.

We also used the MySQL Connector module to connect our Python program to our MySQL database.
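A minimal sketch of how such a connection might be opened with the `mysql-connector-python` package. The helper names (`build_connection_config`, `open_connection`, `run_query`) and the credentials in the usage note are illustrative assumptions, not part of this project's code.

```python
def build_connection_config(user, password, database, host="localhost", port=3306):
    """Collect the keyword arguments passed to mysql.connector.connect()."""
    return {
        "host": host,
        "port": port,
        "user": user,
        "password": password,
        "database": database,
    }


def open_connection(config):
    """Open a MySQL connection from a config dict (needs a reachable server)."""
    # Imported lazily so this module still loads where the driver is missing.
    import mysql.connector  # pip install mysql-connector-python

    return mysql.connector.connect(**config)


def run_query(connection, sql):
    """Run one row-returning statement (e.g. SHOW TABLES) and return all rows."""
    cursor = connection.cursor()
    try:
        cursor.execute(sql)
        return cursor.fetchall()
    finally:
        # Always release the cursor, even if the statement fails.
        cursor.close()
```

For example, `run_query(open_connection(build_connection_config("root", "secret", "school")), "SHOW TABLES")` would list the tables of a hypothetical `school` database, assuming a MySQL server is running locally with those credentials.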

Libraries Created During the Project

We created a library named MySQLvoice, which helps our Artificial Intelligence manage and organise the databases.

The main aim of this module is to select the keywords from the given input. After selecting the keywords, our Artificial Intelligence starts working on the database and provides the required results.
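The keyword-selection step could look roughly like the sketch below. It is an illustration only: the phrases in `HYPOTHETICAL_COMMANDS` and the function `command_to_sql` are invented for this example and are not the actual MySQLvoice keywords or API.

```python
import re

# Hypothetical spoken phrases mapped to SQL templates.
# These are NOT the real MySQLvoice keywords; they only illustrate the idea.
HYPOTHETICAL_COMMANDS = {
    "show databases": "SHOW DATABASES",
    "show tables": "SHOW TABLES",
    "describe": "DESCRIBE `{name}`",
    "display": "SELECT * FROM `{name}`",
}


def command_to_sql(transcript):
    """Return the SQL for the first known phrase in the transcript, or None."""
    # Keep only letters, digits and underscores, so the table name taken
    # from the transcript cannot contain quotes or backticks.
    words = re.findall(r"[a-z0-9_]+", transcript.lower())
    padded = " " + " ".join(words) + " "
    # Check longer phrases first so the most specific phrase wins.
    for phrase in sorted(HYPOTHETICAL_COMMANDS, key=len, reverse=True):
        _, found, after = padded.partition(f" {phrase} ")
        if not found:
            continue
        template = HYPOTHETICAL_COMMANDS[phrase]
        if "{name}" not in template:
            return template
        trailing = after.split()
        if not trailing:
            return None  # the phrase needs a table name but none was spoken
        # Treat the last spoken word as the table name.
        return template.format(name=trailing[-1])
    return None
```

With this mapping, a sentence such as "Describe the table students" is reduced to the keyword `describe` plus the table name `students`, and anything without a known keyword yields `None` so the caller can ask the user to repeat the command.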

How We Converted the Voice into Text

To capture the user's voice input, we used Python's pre-built SpeechRecognition library. We took the voice input from the system's microphone and stored it in a variable. We then used the library's recognize function to recognize what the user said and stored the result in another variable.

After recognition, we printed the input as text to check the reliability of our program.
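The steps above can be sketched with the SpeechRecognition package as follows. The text does not say which recognize function was used, so `recognize_google` (which needs an internet connection) is an assumption here, and the helper names `clean_transcript` and `listen_once` are hypothetical.

```python
def clean_transcript(text):
    """Normalise recognised text: collapse whitespace and lowercase it."""
    return " ".join(text.split()).lower()


def listen_once():
    """Record one phrase from the default microphone and return it as text.

    Returns None when the speech could not be understood.
    """
    # Imported lazily so the module loads even without the audio stack.
    import speech_recognition as sr  # pip install SpeechRecognition PyAudio

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        # Sample the background briefly to reduce the effect of noise.
        recognizer.adjust_for_ambient_noise(source, duration=0.5)
        audio = recognizer.listen(source)

    try:
        # Assumption: Google's free web recogniser; other recognize_* methods exist.
        return clean_transcript(recognizer.recognize_google(audio))
    except sr.UnknownValueError:
        return None  # speech was unintelligible
    except sr.RequestError as exc:
        raise RuntimeError("Speech service could not be reached") from exc
```

Calling `print(listen_once())` on a machine with a working microphone prints the recognised sentence, which mirrors the check described above.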

Description

With the MySQLvoice library, users do not need to know SQL to make changes to their database or to look up anything in it. We have introduced eight new keywords as follows:

[Figure: list of the eight keywords]

How to Install and Run the Project

Once the MySQLvoice pip package is uploaded to PyPI, you can run "pip install MySQLvoice" in your terminal to install it on your system. After installing, you can import it in your Python program and start using it.

How to Use the Project

This project is limited to MySQL database operations, but it can be used in all regions of the world for handling databases, as it is easy to extend to regional languages. We currently work mostly in English, but it can be coded for any spoken language, such as Kannada, Korean, Japanese, Hindi, Gujarati, etc. It will help non-technical people handle databases with ease.

Advantages

  1. It supports multitasking.

  2. Users don’t need to code.

  3. Can be used in any sector of industry where we employ databases.

  4. It saves the user time, which improves work processes and economy.

Disadvantages

  1. May fail to work during hardware failure.

  2. Training the speech recognition on data may take time.

  3. Background noise can degrade the quality of the voice input.

  4. Improper pronunciation can affect the voice input.

Future Plans

We hope to add regional languages (such as Kannada, Gujarati, Marathi, etc.), which will help non-technical people handle their databases.

We plan to bring this developer tool to small-scale industries to enhance their productivity through time-saving database handling.

Conclusion

This project will help many industries and businesses, as they will be able to manage and organize their databases with their voice. It will also reduce the workload to a great extent.

This project is just a small example of Artificial Intelligence-related database management.

This project was jointly created by:

[Figure: project contributors]

Owner
Abhishek Kumar Jha