Update: 2025/01/22: We have created a repository for the paper titled Unveiling Discrete Clues: Superior Healthcare Predictions for Rare Diseases, which has been accepted by the WWW 2025. In this repository, we offer the original sample datasets, preprocessing scripts, and algorithm files to showcase the reproducibility of our work.
- torch==1.13.1+cu117
- dgl==1.1.2
- pyhealth==1.1.4
- seaborn==0.13.0
Owing to the copyright stipulations associated with the dataset, we are unable to provide direct upload access. However, it can be readily obtained by downloading directly from the official website: MIMIC-III, MIMIC-IV,eICU.
The structure of the data set should be like,
data
|_ DIAG
| |_ MIII
| |_ _processed
| |_ _ _datasets_pre_stand.pkl
| |_ _ rare.pkl
| |_ MIV
| |_ _ _datasets_pre_stand.pkl
|_ REC
|_ _MIII
| |_ _processed
| |_ _ _datasets_pre_stand.pkl
| |_ _ rare_patient.pkl
# run the main file
# change config.py
config = vars(UDCDIAGConfig)
config = {k:v for k,v in config.items() if not k.startswith('__')}
# please set pretrain=True (first, sencod stage)
# please set tuning=True (third stage)
python main.py
Please feel free to contact [email protected] for assistance or collaboration.