Sample data associated with the Aurora-BP study

Microsoft

Last update: Dec 12, 2022

Related tags

Text Data & NLP aurorabp-sample-data

Overview

The Aurora-BP Study and Dataset

This repository contains sample code, sample data, and explanatory information for working with the Aurora-BP dataset released alongside the publication of the Aurora-BP study, i.e., Mieloszyk, Rebecca, et al. "A Comparison of Wearable Tonometry, Photoplethysmography, and Electrocardiography for Cuffless Measurement of Blood Pressure in an Ambulatory Setting." IEEE Journal of Biomedical and Health Informatics (2022). The dataset includes de-identified participant information, raw sensor data aligned with each measurement, and a wide variety of features derived from sensor data. The publishing of this dataset as well as the characterization of multiple feature groups across a broad population and multiple settings are intended to aid future cardiovascular research.

Note that the data contained in this repository represent a very small sample of the full dataset, meant only to illustrate the structure of the files and allow testing with the sample code. For access to the full dataset, see the Data Use Application section below.

Navigation:

docs:
- Data file descriptions, a detailed overview of the Aurora-BP Study protocol, and supplemental results not included in the Aurora-BP Study publication
notebooks:
- Sample Jupyter notebooks and environment files for basic analyses using Aurora-BP Study data
sample:
- Example data files, to run sample Jupyter notebooks and provide researchers a direct look at the data format before application for full data access.

Citation

If you use this repository, part or all of the full dataset, and/or our paper as part of your research, please refer to the dataset as the Aurora-BP dataset and cite the publication as below:

Mieloszyk, Rebecca, et al. "A Comparison of Wearable Tonometry, Photoplethysmography, and Electrocardiography for Cuffless Measurement of Blood Pressure in an Ambulatory Setting." IEEE Journal of Biomedical and Health Informatics (2022)

Data Access

Data Access Committee

Requests for data access are reviewed by the Data Access Committee. During review, the submitting investigator and primary investigator may be contacted for verification. The information you will need to gather to submit a Data Use Application as well as a link to the form are listed below. For additional questions regarding data access, contact: [email protected]

Data Use Application

Full data files are stored separately from this repo within an Azure data lake. To gain access to these data files, a data use application (detailed below and on the data lake landing page) must be submitted. Any researcher may submit a data use application, which includes:

Principal investigator information
- Academic credentials, affiliation, contact information, curriculum vitae, signature attesting accuracy of data use application
Additional investigator information
- Academic credentials, affiliation, contact information
Research proposal
Acknowledgement to comply with data use agreement. Key points are listed below:
- No sharing of data with anyone outside of approved PI and other specified investigators. New investigators must be reviewed.
- No data use outside of stated proposal scope
- No joining of data with other data sources
- No attempt to identify participants, contact participants, or reconstruct PII
- Storage with appropriate access control and best practices
- You may publish (or present papers or articles) on your results from using the data provided that no confidential information of Microsoft and no Personal Information are included in any such publication or presentation
- Any publication or presentation resulting from use of the data should include reference to the Aurora-BP Study, with full reference to the source publication when appropriate
- Aurora-BP Study authors and Microsoft are under no obligation to provide any support or additional materials related to the use of these data
- Aurora-BP Study authors and Microsoft are not liable for any losses, damages, or harms of any kind in connection to the use of these data
- Aurora-BP Study authors and Microsoft are not responsible or liable for the accuracy, usefulness or availability of these data
- Primary Investigator will provide a signature of attestation that they have read, understood, and accept the data use agreement

You might also like...

:mag: End-to-End Framework for building natural language search interfaces to data by utilizing Transformers and the State-of-the-Art of NLP. Supporting DPR, Elasticsearch, HuggingFace’s Modelhub and much more!

Haystack is an end-to-end framework that enables you to build powerful and production-ready pipelines for different search use cases. Whether you want

1.4k Feb 18, 2021

Synthetic data for the people.

zpy: Synthetic data in Blender. Website • Install • Docs • Examples • CLI • Contribute • Licence Abstract Collecting, labeling, and cleaning data for

253 Dec 21, 2022

This project is part of Eleuther AI's quest to create a massive repository of high quality text data for training language models.

42 Dec 13, 2022

原神抽卡记录数据集-Genshin Impact gacha data

提要持续收集原神抽卡记录中可以使用抽卡记录导出工具导出抽卡记录的json，将json文件发送至[email protected]，我会在清除个人信息后将文件提交到此处。以下两种导出工具任选其一即可。一种抽卡记录导出工具 from sunfkny 使用方法演示视频另一种electron版的抽

117 Dec 27, 2022

[Preprint] Escaping the Big Data Paradigm with Compact Transformers, 2021

Compact Transformers Preprint Link: Escaping the Big Data Paradigm with Compact Transformers By Ali Hassani[1]*, Steven Walton[1]*, Nikhil Shah[1], Ab

367 Dec 31, 2022

Coreference resolution for English, German and Polish, optimised for limited training data and easily extensible for further languages

Coreferee Author: Richard Paul Hudson, msg systems ag 1. Introduction 1.1 The basic idea 1.2 Getting started 1.2.1 English 1.2.2 German 1.2.3 Polish 1

169 Dec 21, 2022

🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

15k Jan 2, 2023

Data manipulation and transformation for audio signal processing, powered by PyTorch

torchaudio: an audio library for PyTorch The aim of torchaudio is to apply PyTorch to the audio domain. By supporting PyTorch, torchaudio follows the

1.9k Jan 8, 2023

A framework for cleaning Chinese dialog data

136 Dec 20, 2022

Comments

This repo is missing important files

There are important files that Microsoft projects should all have that are not present in this repository. A pull request has been opened to add the missing file(s). When the pr is merged this issue will be closed automatically.

Microsoft teams can learn more about this effort and share feedback within the open source guidance available internally.

Merge this pull request

opened by microsoft-github-policy-service[bot] 0
Adding Microsoft SECURITY.MD

Please accept this contribution adding the standard Microsoft SECURITY.MD :lock: file to help the community understand the security policy and how to safely report security issues. GitHub uses the presence of this file to light-up security reminders and a link to the file. This pull request commits the latest official SECURITY.MD file from https://github.com/microsoft/repo-templates/blob/main/shared/SECURITY.md.

Microsoft teams can learn more about this effort and share feedback within the open source guidance available internally.

opened by microsoft-github-policy-service[bot] 0
Feature/dataprovenance

Replaced the old data provenance figure with a new one: one participant's discard step was misclassified. The new figure represents the actual data files in the full dataset.

opened by twede 0
Environment lock files
Changes:

Modified notebooks to use sample data in the repo

Renamed SAMPLE_DIR to DATA_DIR

Sorted import statements

Changed python version from 3.6 to 3.8

Generated conda-lock files for x64 Windows, Linux, and Mac in additional to ARM Mac.

Added README for setting up environment and running Jupyter

Testing:

Tested by creating the environment using lock files, running Jupyter and executing all cells of all notebooks

Tested on the following environments:

Windows (x64)

WSL2 Ubuntu (x64)

Mac OSX (x64)
opened by gabe-microsoft 0

Sample data associated with the Aurora-BP study

Related tags

Overview

The Aurora-BP Study and Dataset

Navigation:

Citation

Data Access

Data Access Committee

Data Use Application

You might also like...

:mag: End-to-End Framework for building natural language search interfaces to data by utilizing Transformers and the State-of-the-Art of NLP. Supporting DPR, Elasticsearch, HuggingFace’s Modelhub and much more!

Synthetic data for the people.

This project is part of Eleuther AI's quest to create a massive repository of high quality text data for training language models.

原神抽卡记录数据集-Genshin Impact gacha data

[Preprint] Escaping the Big Data Paradigm with Compact Transformers, 2021

Coreference resolution for English, German and Polish, optimised for limited training data and easily extensible for further languages

🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Data manipulation and transformation for audio signal processing, powered by PyTorch

A framework for cleaning Chinese dialog data

Comments

This repo is missing important files

Adding Microsoft SECURITY.MD

Feature/dataprovenance

Environment lock files

Changes:

Testing:

Owner

Microsoft

Predict an emoji that is associated with a text

Associated Repository for "Translation between Molecules and Natural Language"

Finally, some decent sample sentences

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Study German declensions (dER nettE Mann, ein nettER Mann, mit dEM nettEN Mann, ohne dEN nettEN Mann ...) Generate as many exercises as you want using the incredible power of SPACY!

This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems

This repository is home to the Optimus data transformation plugins for various data processing needs.

Tools, wrappers, etc... for data science with a concentration on text processing

Data loaders and abstractions for text and NLP

Data loaders and abstractions for text and NLP