Bioinformatics tool for exploring RNA-Protein interactions

Overview

RNPFind


pytest black isort PyPI version Lines of code PyPI - Downloads PyPI - License GitHub language count GitHub top language


Explore RNA-Protein interactions.

RNPFind is a bioinformatics tool. It takes an RNA transcript as input and gives a list of RNA binding protein (RBP) binding sites on the transcript as output.

Various output formats for representing the binding sites is supported.

Usage

Use rnpfind as either a command line tool or through the webtool.

Command line tool

The full docs are available here, but a quick example for installation and use is:

pip install rnpfind
rnpfind -f bed -o ./malat1-sites malat1

which installs rnpfind from the PyPi respository, and then invokes the program to generate binding sites for the gene malat1 in the bed file format in the directory ./malat1-sites.

Note that the tool will download large data files upon first invocation (around 6.4 GB). If the memory footprint is too much for you, the web tool is probably better for you.

Web tool

Visit the RNPFind website and the rest is pretty self-explanatory.

How does it work?

RNPFind collects its data from three main databases:

Each database stores binding sites of RBPs deduced through various methods, such as various experimental methods as well as computational methods. RBPDB and ATTRACT provide binding site patterns for RBPs, so we scan the input gene's sequence to deduce the binding sites. POSTAR stores experimentally deduced binding sites for the transcriptome for all RBPs, so we simply query.

Acknowledgements

This tool was produced at iYLab at CMUQ to help both my molecular biology research project as well as that of others in the lab. Originally a CLI tool that needed to be installed on the user's computer, it was made into a webapp later for ease of use and access.

You might also like...
erdantic is a simple tool for drawing entity relationship diagrams (ERDs) for Python data model classes
erdantic is a simple tool for drawing entity relationship diagrams (ERDs) for Python data model classes

erdantic is a simple tool for drawing entity relationship diagrams (ERDs) for Python data model classes. Diagrams are rendered using the venerable Graphviz library.

BGraph is a tool designed to generate dependencies graphs from Android.bp soong files.

BGraph BGraph is a tool designed to generate dependencies graphs from Android.bp soong files. Overview BGraph (for Build-Graphs) is a project aimed at

Yata is a fast, simple and easy Data Visulaization tool, running on python dash
Yata is a fast, simple and easy Data Visulaization tool, running on python dash

Yata is a fast, simple and easy Data Visulaization tool, running on python dash. The main goal of Yata is to provide a easy way for persons with little programming knowledge to visualize their data easily.

Squidpy is a tool for the analysis and visualization of spatial molecular data.
Squidpy is a tool for the analysis and visualization of spatial molecular data.

Squidpy is a tool for the analysis and visualization of spatial molecular data. It builds on top of scanpy and anndata, from which it inherits modularity and scalability. It provides analysis tools that leverages the spatial coordinates of the data, as well as tissue images if available.

A simple python tool for explore your object detection dataset
A simple python tool for explore your object detection dataset

A simple tool for explore your object detection dataset. The goal of this library is to provide simple and intuitive visualizations from your dataset and automatically find the best parameters for generating a specific grid of anchors that can fit you data characteristics

Simple spectra visualization tool for astronomers

SpecViewer A simple visualization tool for astronomers. Dependencies Python = 3.7.4 PyQt5 = 5.15.4 pyqtgraph == 0.10.0 numpy = 1.19.4 How to use py

A visualization tool made in Pygame for various pathfinding algorithms.
A visualization tool made in Pygame for various pathfinding algorithms.

Pathfinding-Visualizer 🚀 A visualization tool made in Pygame for various pathfinding algorithms. Pathfinding is closely related to the shortest path

A command line tool for visualizing CSV/spreadsheet-like data
A command line tool for visualizing CSV/spreadsheet-like data

PerfPlotter Read data from CSV files using pandas and generate interactive plots using bokeh, which can then be embedded into HTML pages and served by

simple tool to paint axis x and y
simple tool to paint axis x and y

simple tool to paint axis x and y

Comments
  • Are genomic coordinates in bed file comparable to ensembl coordinates?

    Are genomic coordinates in bed file comparable to ensembl coordinates?

    Currently it is a little confusing as to whether the genomic coordinates returned from all 3 data sources (attract, postar and rbpdb) are comparable against ensembl due to the [displacement] variable (https://github.com/mnahinkhan/rnpfind/blob/dev/cli/src/rnpfind/populate_rbp_binding_sites_script.py#L41) applied.

    As far as I understand the binding sites reported following sequence screening using pwm_scan are 0 based e.g. a binding site 1-5 binds from the 2nd to the 6th base of the sequence inclusive.

    However I'm unsure as to whether the displacement variable applied to convert this into a genomic coordinate is correct?

    In addition does the returned genomic start need be + 1 to compare against ensembl coordinates due to ensembl being 1 based not 0 based (https://m.ensembl.org/Help/Faq?id=286)

    opened by jsleight1 3
Owner
Nahin Khan
Computational biologist, programming enthusiast
Nahin Khan
ScisorWiz: Differential Isoform Visualizer for Long-Read RNA Sequencing Data

ScisorWiz: Vizualizer for Differential Isoform Expression README ScisorWiz is a linux-based R-package for visualizing differential isoform expression

Alexander Stein 6 Oct 4, 2022
Rubrix is a free and open-source tool for exploring and iterating on data for artificial intelligence projects.

Open-source tool for exploring, labeling, and monitoring data for AI projects

Recognai 1.5k Jan 7, 2023
Smoking Simulation is an app to simulate the spreading of smokers and non-smokers, their interactions and population during certain amount of time.

Smoking Simulation is an app to simulate the spreading of smokers and non-smokers, their interactions and population during certain

Bohdan Ruban 5 Nov 8, 2022
Library for exploring and validating machine learning data

TensorFlow Data Validation TensorFlow Data Validation (TFDV) is a library for exploring and validating machine learning data. It is designed to be hig

null 688 Jan 3, 2023
Library for exploring and validating machine learning data

TensorFlow Data Validation TensorFlow Data Validation (TFDV) is a library for exploring and validating machine learning data. It is designed to be hig

null 520 Feb 17, 2021
Splore - a simple graphical interface for scrolling through and exploring data sets of molecules

Scroll through and exPLORE molecule sets The splore framework aims to offer a si

null 3 Jun 18, 2022
Pebble is a stat's visualization tool, this will provide a skeleton to develop a monitoring tool.

Pebble is a stat's visualization tool, this will provide a skeleton to develop a monitoring tool.

Aravind Kumar G 2 Nov 17, 2021
The open-source tool for building high-quality datasets and computer vision models

The open-source tool for building high-quality datasets and computer vision models. Website • Docs • Try it Now • Tutorials • Examples • Blog • Commun

Voxel51 2.4k Jan 7, 2023
The open-source tool for building high-quality datasets and computer vision models

The open-source tool for building high-quality datasets and computer vision models. Website • Docs • Try it Now • Tutorials • Examples • Blog • Commun

Voxel51 209 Feb 17, 2021
SummVis is an interactive visualization tool for text summarization.

SummVis is an interactive visualization tool for analyzing abstractive summarization model outputs and datasets.

Robustness Gym 246 Dec 8, 2022