Spaces:

ml-jku
/

tox21_chemprop_classifier

Sleeping

Sonja Topf commited on Oct 29

Commit

28a9b0f

1 Parent(s): 903d2d4

file renamed

Files changed (4) hide show

README.md CHANGED Viewed

@@ -35,7 +35,7 @@ Additionally, the Space needs to implement inference in the `predict()` function
 - `checkpoints/` - the saved model that is used in `predict.py` is here.
 - `src/` - Core model & preprocessing logic:
-    - `data.py` - SMILES preprocessing pipeline
     - `evaluation.py` - compute ROC AUC metric from a csv
     - `train_model.py` - trains a single model
@@ -98,4 +98,4 @@ The output will be a nested dictionary in the format:
 - Adapting `predict.py`, `train.py`, `config/`, and `checkpoints/` is required for leaderboard submission.
-- Preprocessing (here inside `src/data.py`) must be done inside `predict.py` not just `train.py`.

 - `checkpoints/` - the saved model that is used in `predict.py` is here.
 - `src/` - Core model & preprocessing logic:
+    - `preprocess.py` - SMILES preprocessing pipeline
     - `evaluation.py` - compute ROC AUC metric from a csv
     - `train_model.py` - trains a single model
 - Adapting `predict.py`, `train.py`, `config/`, and `checkpoints/` is required for leaderboard submission.
+- Preprocessing (here inside `src/preprocess.py`) must be done inside `predict.py` not just `train.py`.

predict.py CHANGED Viewed

@@ -2,7 +2,7 @@ import torch
 import csv
 import subprocess
-from data import create_clean_smiles
 def predict(smiles_list):
     """

 import csv
 import subprocess
+from preprocess import create_clean_smiles
 def predict(smiles_list):
     """

src/{data.py → preprocess.py} RENAMED Viewed

File without changes

train.py CHANGED Viewed

@@ -1,7 +1,7 @@
 import os
 from dotenv import load_dotenv
-from src.data import clean_smiles_in_csv, get_combined_dataset_csv
 from src.train_model import train_model

 import os
 from dotenv import load_dotenv
+from preprocess import clean_smiles_in_csv, get_combined_dataset_csv
 from src.train_model import train_model