Data labels and scripts for fastMRI.org

Last update: Dec 22, 2022

Related tags

Overview

fastMRI+: Clinical pathology annotations for the fastMRI dataset

The fastMRI dataset is a publicly available MRI raw (k-space) dataset. It has been used widely to train machine learning models for image reconstruction and has been used in reconstruction challenges.

This repo includes clinical pathology annotations for this dataset. The entire knee dataset and approximately 1000 brain datasets have been labeled. The goal of providing these labels is to enable developers of image reconstruction models and algorithms to evaluate the performance of the developed techniques with a focus on the sections or regions that could contain clinical pathology.

Limitations

Each image has labeled by a single radiologist and without the benefit of looking at other views and angles of the same subject, and should therefore be considered in that context. Specifically, the labels should not be considered clinical ground truth or an exhaustive list of all lesions but rather an indicatition of where a pathology could be present.

Obtaining fastMRI raw data and images

The fastMRI raw data and reference images can be obtained from fastmri.org. You will be able to download and use the data for academic purposes after signing the data sharing agreement. If you are looking for automation for downloading the dataset and training fastMRI models, please see the InnerEye Deep Learning Toolkit.

Labeling procedure and generating DICOM images from fastMRI data

In order to label the data, DICOM files were generated from the fastMRI dataset, and we are providing a fastmri_to_dicom.py to document the procedure. This script can be used like this:

python fastmri_to_dicom.py --filename fastmridatafile.h5

Note: In the process of converting the images to DICOM, the pixel arrays were flipped (up/down) to provide a view that was closer to DICOM orientation and assist with labeling. This should be taken into consideration when using the labels.

The labeling was performed by experienced radiologists using MD.ai.

Working with the annotations

The Annotations folder contains a label file for each of the knee (knee.csv and brain (brain.csv datasets. The files contain one line for each annotation (bounding box) that was labeled by the radiologists. Datasets with no findings (no annotations) are not represented in the label files, however, you can see which files were reviewed in the brain_file_list.csv and knee_file_list.csv. If a dataset (a fastMRI file) is listed in the file lists but not in the label files, it means that it has been reviewed, but there were no findings.

The repo contains an example jupyter notebook, which illustrates how to read the labels and overlay them onto the image pixels.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

Data labels and scripts for fastMRI.org

Related tags

Overview

fastMRI+: Clinical pathology annotations for the fastMRI dataset

Limitations

Obtaining fastMRI raw data and images

Labeling procedure and generating DICOM images from fastMRI data

Working with the annotations

Contributing

Trademarks

Owner

Microsoft

Implementation of ConvMixer-Patches Are All You Need? in TensorFlow and Keras

Arquitetura e Desenho de Software.

Single cell current best practices tutorial case study for the paper:Luecken and Theis, "Current best practices in single-cell RNA-seq analysis: a tutorial"

CodeContests is a competitive programming dataset for machine-learning

Gesture Volume Control v.2

Semi-supervised semantic segmentation needs strong, varied perturbations

Modified fork of Xuebin Qin's U-2-Net Repository. Used for demonstration purposes.

Code for project: "Learning to Minimize Remainder in Supervised Learning".

Tensorflow Implementation of ECCV'18 paper: Multimodal Human Motion Synthesis

Finetuner allows one to tune the weights of any deep neural network for better embeddings on search tasks

Food recognition model using convolutional neural network & computer vision

Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search

An implementation of "MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing" (ICML 2019).

ViDT: An Efficient and Effective Fully Transformer-based Object Detector

Resources for our AAAI 2022 paper: "LOREN: Logic-Regularized Reasoning for Interpretable Fact Verification".

Official PyTorch implementation of "Edge Rewiring Goes Neural: Boosting Network Resilience via Policy Gradient".

Fully Convolutional DenseNets for semantic segmentation.

[SIGGRAPH Asia 2021] Pose with Style: Detail-Preserving Pose-Guided Image Synthesis with Conditional StyleGAN

Pytorch implementation code for [Neural Architecture Search for Spiking Neural Networks]

Official PyTorch Implementation of Embedding Transfer with Label Relaxation for Improved Metric Learning, CVPR 2021