SurvTRACE: Transformers for Survival Analysis with Competing Events

Last update: Oct 06, 2022

Overview

⭐ SurvTRACE: Transformers for Survival Analysis with Competing Events

This repo provides the implementation of SurvTRACE for survival analysis. It is easy to use with only the following codes:

from survtrace.dataset import load_data
from survtrace.model import SurvTraceSingle
from survtrace import Evaluator
from survtrace import Trainer
from survtrace import STConfig

# use METABRIC dataset
STConfig['data'] = 'metabric'
df, df_train, df_y_train, df_test, df_y_test, df_val, df_y_val = load_data(STConfig)

# initialize model
model = SurvTraceSingle(STConfig)

# execute training
trainer = Trainer(model)
trainer.fit((df_train, df_y_train), (df_val, df_y_val))

# evaluating
evaluator = Evaluator(df, df_train.index)
evaluator.eval(model, (df_test, df_y_test))

print("done!")

🔥 See the demo

Please refer to experiment_metabric.ipynb and experiment_support.ipynb !

🔥 How to config the environment

Use our pre-saved conda environment!

conda env create --name survtrace --file=survtrace.yml
conda activate survtrace

or try to install from the requirement.txt

pip3 install -r requirements.txt

🔥 How to get SEER data

Go to https://seer.cancer.gov/data/ to ask for data request from SEER following the guide there.
After complete the step one, we should have the following seerstat software for data access. Open it and sign in with the username and password sent by seer.

Use seerstat to open the ./data/seer.sl file, we shall see the following.

Click on the 'excute' icon to request from the seer database. We will obtain a csv file.

move the csv file to ./data/seer_raw.csv, then run the python script process_seer.py, as
```
python process_seer.py
```
we will obtain the processed seer data named seer_processed.csv.

📝 Functions

single event survival analysis
competing events survival analysis
multi-task learning
automatic hyperparameter grid-search

😄 If you find this result interesting, please consider to cite this paper:

@article{wang2021survtrace,
      title={Surv{TRACE}: Transformers for Survival Analysis with Competing Events}, 
      author={Zifeng Wang and Jimeng Sun},
      year={2021},
      eprint={2110.00855},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

SurvTRACE: Transformers for Survival Analysis with Competing Events

Related tags

Overview

⭐ SurvTRACE: Transformers for Survival Analysis with Competing Events

🔥 See the demo

🔥 How to config the environment

🔥 How to get SEER data

📝 Functions

😄 If you find this result interesting, please consider to cite this paper:

Owner

Zifeng

VMD Audio/Text control with natural language

Product-Review-Summarizer - Created a product review summarizer which clustered thousands of product reviews and summarized them into a maximum of 500 characters, saving precious time of customers and helping them make a wise buying decision.

Malware-Related Sentence Classification

This Project is based on NLTK It generates a RANDOM WORD from a predefined list of words, From that random word it read out the word, its meaning with parts of speech , its antonyms, its synonyms

Implementation of TTS with combination of Tacotron2 and HiFi-GAN

Phrase-Based & Neural Unsupervised Machine Translation

Code for producing Japanese GPT-2 provided by rinna Co., Ltd.

Voilà turns Jupyter notebooks into standalone web applications

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

CredData is a set of files including credentials in open source projects

Write Alphabet, Words and Sentences with your eyes.

Pipeline for fast building text classification TF-IDF + LogReg baselines.

Code for Discovering Topics in Long-tailed Corpora with Causal Intervention.

A library for finding knowledge neurons in pretrained transformer models.

This script just scrapes the most recent Nepali news from Kathmandu Post and notifies the user about current events at regular intervals.It sends out the most recent news at random!

Develop open-source Python Arabic NLP libraries that the Arab world will easily use in all Natural Language Processing applications

This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended to be used for normalizing / cleaning Bengali and English text.

✨Fast Coreference Resolution in spaCy with Neural Networks

The code from the whylogs workshop in DataTalks.Club on 29 March 2022

Tool which allow you to detect and translate text.