Hello,
Could I ask you for some help with the commissioner-level data cleaning?
I have created a script that does all the heavy lifting automatically, but it is time-consuming to run it for all 71 files. The cleaning itself only takes a couple of seconds per file; the annoying part is that for each file you need to change the file path, as shown in the code below.
The cleaning script is called 'Commissioner data clean.r' and is located in the R folder of the repository.
To run it, pull the latest changes from the repository and change the following lines:
path <- "data/com_data/2021/OCTOBER-2021-CANCER-WAITING-TIMES-COMMISSIONER-WORKBOOK-PROVISIONAL.xlsx"
period <- as.Date('2021-10-01') #yyyy-mm-dd
newName <- "data/com_done/october-21.csv"
The first line is the path to the file you want to clean.
The second is the date of the file (this is essential so our time series stays precise and in exactly the same format).
The third line is the name of the new output file; please keep it in the format 'month-21.csv'. There is an example for another month below.
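For example, to process the November 2021 workbook instead, the three lines might look like the sketch below. The exact workbook filename here is an assumption on my part, so please check the actual name in data/com_data/2021/ before running.
path <- "data/com_data/2021/NOVEMBER-2021-CANCER-WAITING-TIMES-COMMISSIONER-WORKBOOK-PROVISIONAL.xlsx" # assumed filename - verify in the folder
period <- as.Date('2021-11-01') #yyyy-mm-dd
newName <- "data/com_done/november-21.csv"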
The data files themselves are located in /Data/com_data/ in the repository. Extracted files will be saved in /Data/com_done/.
I have created a sign-up sheet where you can indicate which files you are taking and the state of your work, so we can make sure no one is doing duplicate work.
To upload the data back to the repository, commit your changes and push them to GitHub as you would with ordinary files or your code. Before committing, please make sure you deselect 'Commissioner data clean.r' from the commit, as including it might cause a conflict (conflicts usually happen when multiple people edit the same file). Alternatively, you can copy and paste the original code from this page back into the script before committing.
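If you are committing from the command line rather than a Git client, one way to leave the script out of the commit is to stage only your output file, e.g. git add data/com_done/october-21.csv, then git commit and git push as usual (adjust the filename to whichever month you processed).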
Thank you for your help, it's much appreciated.