CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation

Last update: Dec 25, 2022

Related tags

Deep Learning CoTr

Overview

CoTr: Efficient 3D Medical Image Segmentation by bridging CNN and Transformer

This is the official pytorch implementation of the CoTr:

Paper: CoTr: Efficient 3D Medical Image Segmentation by bridging CNN and Transformer.

Requirements

CUDA 11.0
Python 3.7
Pytorch 1.7
Torchvision 0.8.2

Usage

0. Installation

Install Pytorch1.7, nnUNet and CoTr as below

pip install torch==1.7.1+cu110 torchvision==0.8.2+cu110 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html

cd nnUNet
pip install -e .

cd CoTr_package
pip install -e .

1. Data Preparation

Download BCV dataset
Preprocess the BCV dataset according to nnU-Net.
Training and Testing ID are in data/splits_final.pkl.

2. Training

cd CoTr_package/CoTr/run

Run nohup python run_training.py -gpu='0' -outpath='CoTr' 2>&1 & for training.

3. Testing

Run nohup python run_training.py -gpu='0' -outpath='CoTr' -val --val_folder='validation_output' 2>&1 & for validation.

4. Citation

If this code is helpful for your study, please cite:

@article{xie2021cotr,
  title={CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation},
  author={Xie, Yutong and Zhang, Jianpeng and Shen, Chunhua and Xia, Yong},
  journal={arXiv preprint arXiv:2103.03024},
  year={2021}
}

5. Acknowledgements

Part of codes are reused from the nnU-Net. Thanks to Fabian Isensee for the codes of nnU-Net.

Contact

Yutong Xie ([email protected])

You might also like...

Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxiang Wang, Han Zhao, Bo Li.

Bridging Multi-Task Learning and Meta-Learning Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Trainin

57 Dec 15, 2022

Official implementation of "Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets" (CVPR2021)

Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets This is the official implementation of "Towards Good Pract

52 Nov 22, 2022

Medical Image Segmentation using Squeeze-and-Expansion Transformers

Medical Image Segmentation using Squeeze-and-Expansion Transformers Introduction This repository contains the code of the IJCAI'2021 paper 'Medical Im

172 Dec 20, 2022

Semi Supervised Learning for Medical Image Segmentation, a collection of literature reviews and code implementations.

Semi-supervised-learning-for-medical-image-segmentation. Recently, semi-supervised image segmentation has become a hot topic in medical image computin

1.3k Jan 3, 2023

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)

S2-BNN (Self-supervised Binary Neural Networks Using Distillation Loss) This is the official pytorch implementation of our paper: "S2-BNN: Bridging th

52 Dec 24, 2022

Bridging Vision and Language Model

BriVL BriVL (Bridging Vision and Language Model) 是首个中文通用图文多模态大规模预训练模型。BriVL模型在图文检索任务上有着优异的效果，超过了同期其他常见的多模态预训练模型（例如UNITER、CLIP）。 BriVL论文：WenLan: Bridgi

235 Dec 27, 2022

This repo is the code release of EMNLP 2021 conference paper "Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories".

Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories This repo is the code release of EMNLP 2021 con

12 Nov 22, 2022

Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation

Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation Official PyTorch implementation for the paper Look

20 Nov 24, 2022

This repo holds code for TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

TransUNet This repo holds code for TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation Usage

1.4k Jan 4, 2023

Comments

reference points解释

作者，你好，可以解释下这段代码的意思吗？ def get_reference_points(spatial_shapes, valid_ratios, device): reference_points_list = [] for lvl, (D_, H_, W_) in enumerate(spatial_shapes):

        ref_d, ref_y, ref_x = torch.meshgrid(torch.linspace(0.5, D_ - 0.5, D_, dtype=torch.float32, device=device),
                                             torch.linspace(0.5, H_ - 0.5, H_, dtype=torch.float32, device=device),
                                             torch.linspace(0.5, W_ - 0.5, W_, dtype=torch.float32, device=device))

        ref_d = ref_d.reshape(-1)[None] / (valid_ratios[:, None, lvl, 0] * D_)
        ref_y = ref_y.reshape(-1)[None] / (valid_ratios[:, None, lvl, 2] * H_)
        ref_x = ref_x.reshape(-1)[None] / (valid_ratios[:, None, lvl, 1] * W_)

        ref = torch.stack((ref_d, ref_x, ref_y), -1)   # D W H
        reference_points_list.append(ref)
    reference_points = torch.cat(reference_points_list, 1)
    reference_points = reference_points[:, :, None] * valid_ratios[:, None]
    return reference_points

opened by 60wanjinbing 0

CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation

Related tags

Overview

CoTr: Efficient 3D Medical Image Segmentation by bridging CNN and Transformer

Requirements

Usage

0. Installation

1. Data Preparation

2. Training

3. Testing

4. Citation

5. Acknowledgements

Contact

You might also like...

Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxiang Wang, Han Zhao, Bo Li.

Official implementation of "Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets" (CVPR2021)

Medical Image Segmentation using Squeeze-and-Expansion Transformers

Semi Supervised Learning for Medical Image Segmentation, a collection of literature reviews and code implementations.

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)

Bridging Vision and Language Model

This repo is the code release of EMNLP 2021 conference paper "Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories".

Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation

This repo holds code for TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

Comments

reference points解释

Owner

Pytorch Code for "Medical Transformer: Gated Axial-Attention for Medical Image Segmentation"

The Medical Detection Toolkit contains 2D + 3D implementations of prevalent object detectors such as Mask R-CNN, Retina Net, Retina U-Net, as well as a training and inference framework focused on dealing with medical images.

The codes for the work "Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation"

MISSFormer: An Effective Medical Image Segmentation Transformer

Multi-atlas segmentation (MAS) is a promising framework for medical image segmentation

NFT-Price-Prediction-CNN - Using visual feature extraction, prices of NFTs are predicted via CNN (Alexnet and Resnet) architectures.

Copy Paste positive polyp using poisson image blending for medical image segmentation

Build a medical knowledge graph based on Unified Language Medical System (UMLS)

nnFormer: Interleaved Transformer for Volumetric Segmentation Code for paper "nnFormer: Interleaved Transformer for Volumetric Segmentation "

Vision Transformer for 3D medical image registration (Pytorch).