Sonja Topf committed
Commit 28a9b0f · 1 Parent(s): 903d2d4
file renamed
- README.md +2 -2
- predict.py +1 -1
- src/{data.py → preprocess.py} +0 -0
- train.py +1 -1
README.md
CHANGED
@@ -35,7 +35,7 @@ Additionally, the Space needs to implement inference in the `predict()` function
 - `checkpoints/` - the saved model that is used in `predict.py` is here.
 
 - `src/` - Core model & preprocessing logic:
-- `
+- `preprocess.py` - SMILES preprocessing pipeline
 - `evaluation.py` - compute ROC AUC metric from a csv
 - `train_model.py` - trains a single model
 
@@ -98,4 +98,4 @@ The output will be a nested dictionary in the format:
 
 - Adapting `predict.py`, `train.py`, `config/`, and `checkpoints/` is required for leaderboard submission.
 
-- Preprocessing (here inside `src/
+- Preprocessing (here inside `src/preprocess.py`) must be done inside `predict.py` not just `train.py`.
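The README requirement that preprocessing run in `predict.py` as well as `train.py` can be illustrated with a minimal sketch. Only the names `predict(smiles_list)` and `create_clean_smiles` come from the diff; the cleaning logic, the `dummy_model` scorer, and the `{smiles: {target: score}}` output shape below are assumptions made for illustration, not the repository's implementation.

```python
def create_clean_smiles(smiles_list):
    """Stand-in cleaner (hypothetical): strip whitespace and drop empty entries."""
    return [s.strip() for s in smiles_list if s and s.strip()]


def dummy_model(clean_smiles):
    """Placeholder for a trained model: scores each string by its length."""
    return [len(s) / 100.0 for s in clean_smiles]


def predict(smiles_list):
    """Clean the inputs with the same function training would use, then score them."""
    clean = create_clean_smiles(smiles_list)
    scores = dummy_model(clean)
    # Assumed output shape: {smiles: {target_name: score}}
    return {s: {"example_target": p} for s, p in zip(clean, scores)}
```

Because cleaning happens inside `predict()`, raw inputs that differ only in formatting (here, surrounding whitespace) reach the model in the same form the training data had.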
predict.py
CHANGED
@@ -2,7 +2,7 @@ import torch
 import csv
 import subprocess
 
-from
+from preprocess import create_clean_smiles
 
 def predict(smiles_list):
     """
src/{data.py → preprocess.py}
RENAMED
File without changes
train.py
CHANGED
@@ -1,7 +1,7 @@
 import os
 from dotenv import load_dotenv
 
-from
+from preprocess import clean_smiles_in_csv, get_combined_dataset_csv
 from src.train_model import train_model
 
 
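The new imports use the bare module name `preprocess`, while the renamed file lives at `src/preprocess.py` and the neighbouring import in `train.py` uses the package-qualified `src.train_model`. Whether the bare form resolves depends on how `sys.path` is set up when the scripts run, which this diff does not show. The sketch below rebuilds the layout in a temporary directory (the stub file contents are hypothetical) and checks which module names Python can locate from each search root.

```python
import importlib
import importlib.util
import sys
import tempfile
from pathlib import Path

_NAMES = ("src.preprocess", "src", "preprocess")


def resolvable(module_name: str, search_root: Path) -> bool:
    """Report whether module_name can be located when search_root is the only sys.path entry."""
    saved_path = sys.path[:]
    for name in _NAMES:
        sys.modules.pop(name, None)  # start from a clean module cache
    sys.path[:] = [str(search_root)]
    importlib.invalidate_caches()
    try:
        return importlib.util.find_spec(module_name) is not None
    except ModuleNotFoundError:
        # Raised when the parent package of a dotted name cannot be imported
        return False
    finally:
        sys.path[:] = saved_path
        for name in _NAMES:
            sys.modules.pop(name, None)


def check_layout() -> dict:
    """Create <tmp>/src/preprocess.py and test both import spellings."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "src").mkdir()
        # Hypothetical stub; the real file's contents are not shown in the diff
        (root / "src" / "preprocess.py").write_text(
            "def create_clean_smiles(smiles_list):\n    return smiles_list\n"
        )
        return {
            "root: src.preprocess": resolvable("src.preprocess", root),
            "root: preprocess": resolvable("preprocess", root),
            "src/: preprocess": resolvable("preprocess", root / "src"),
        }


if __name__ == "__main__":
    for case, found in check_layout().items():
        print(f"{case}: {found}")
```

With only the repository root on the search path, `src.preprocess` is found and the bare `preprocess` is not; the bare name is found once `src/` itself is on the path. If the scripts are launched from the repository root without further path setup, `from src.preprocess import ...` would match the style of the existing `src.train_model` import.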