Sonja Topf commited on
Commit
28a9b0f
·
1 Parent(s): 903d2d4

file renamed

Browse files
Files changed (4) hide show
  1. README.md +2 -2
  2. predict.py +1 -1
  3. src/{data.py → preprocess.py} +0 -0
  4. train.py +1 -1
README.md CHANGED
@@ -35,7 +35,7 @@ Additionally, the Space needs to implement inference in the `predict()` function
35
  - `checkpoints/` - the saved model that is used in `predict.py` is here.
36
 
37
  - `src/` - Core model & preprocessing logic:
38
- - `data.py` - SMILES preprocessing pipeline
39
  - `evaluation.py` - compute ROC AUC metric from a csv
40
  - `train_model.py` - trains a single model
41
 
@@ -98,4 +98,4 @@ The output will be a nested dictionary in the format:
98
 
99
  - Adapting `predict.py`, `train.py`, `config/`, and `checkpoints/` is required for leaderboard submission.
100
 
101
- - Preprocessing (here inside `src/data.py`) must be done inside `predict.py` not just `train.py`.
 
35
  - `checkpoints/` - the saved model that is used in `predict.py` is here.
36
 
37
  - `src/` - Core model & preprocessing logic:
38
+ - `preprocess.py` - SMILES preprocessing pipeline
39
  - `evaluation.py` - compute ROC AUC metric from a csv
40
  - `train_model.py` - trains a single model
41
 
 
98
 
99
  - Adapting `predict.py`, `train.py`, `config/`, and `checkpoints/` is required for leaderboard submission.
100
 
101
+ - Preprocessing (here inside `src/preprocess.py`) must be done inside `predict.py` not just `train.py`.
predict.py CHANGED
@@ -2,7 +2,7 @@ import torch
2
  import csv
3
  import subprocess
4
 
5
- from data import create_clean_smiles
6
 
7
  def predict(smiles_list):
8
  """
 
2
  import csv
3
  import subprocess
4
 
5
+ from preprocess import create_clean_smiles
6
 
7
  def predict(smiles_list):
8
  """
src/{data.py → preprocess.py} RENAMED
File without changes
train.py CHANGED
@@ -1,7 +1,7 @@
1
  import os
2
  from dotenv import load_dotenv
3
 
4
- from src.data import clean_smiles_in_csv, get_combined_dataset_csv
5
  from src.train_model import train_model
6
 
7
 
 
1
  import os
2
  from dotenv import load_dotenv
3
 
4
+ from preprocess import clean_smiles_in_csv, get_combined_dataset_csv
5
  from src.train_model import train_model
6
 
7