Commit
·
4c38445
1
Parent(s):
9886e8e
chore: Add model selection section to the notebook
Browse files- notebook_guide.ipynb +25 -1
notebook_guide.ipynb
CHANGED
|
@@ -95700,7 +95700,31 @@
|
|
| 95700 |
"cell_type": "markdown",
|
| 95701 |
"metadata": {},
|
| 95702 |
"source": [
|
| 95703 |
-
"## 5. Model Selection\n"
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 95704 |
]
|
| 95705 |
}
|
| 95706 |
],
|
|
|
|
| 95700 |
"cell_type": "markdown",
|
| 95701 |
"metadata": {},
|
| 95702 |
"source": [
|
| 95703 |
+
"## 5. Model Comparison and Selection\n"
|
| 95704 |
+
]
|
| 95705 |
+
},
|
| 95706 |
+
{
|
| 95707 |
+
"cell_type": "markdown",
|
| 95708 |
+
"metadata": {},
|
| 95709 |
+
"source": [
|
| 95710 |
+
"The following table summarizes the performance of all models trained in this project.\n",
|
| 95711 |
+
"\n",
|
| 95712 |
+
"| Model | Modality | Accuracy | Macro Avg F1-Score | Weighted Avg F1-Score |\n",
|
| 95713 |
+
"| :------------------ | :----------- | :------- | :----------------- | :-------------------- |\n",
|
| 95714 |
+
"| Random Forest | Text | 0.90 | 0.83 | 0.90 |\n",
|
| 95715 |
+
"| Logistic Regression | Text | 0.90 | 0.84 | 0.90 |\n",
|
| 95716 |
+
"| Random Forest | Image | 0.80 | 0.70 | 0.79 |\n",
|
| 95717 |
+
"| Random Forest | Combined | 0.89 | 0.79 | 0.89 |\n",
|
| 95718 |
+
"| Logistic Regression | Combined | 0.89 | 0.83 | 0.89 |\n",
|
| 95719 |
+
"| **MLP** | **Image** | **0.84** | **0.77** | **0.84** |\n",
|
| 95720 |
+
"| **MLP** | **Text** | **0.92** | **0.87** | **0.92** |\n",
|
| 95721 |
+
"| **MLP** | **Combined** | **0.92** | **0.85** | **0.92** |\n",
|
| 95722 |
+
"\n",
|
| 95723 |
+
"Based on the overall results, the MLP models consistently outperformed their classical machine learning counterparts. This is a common outcome when dealing with complex, high-dimensional features like the embeddings we are using. While traditional models like Random Forest and Logistic Regression are powerful, neural networks like the MLP are specifically designed to learn intricate, non-linear relationships within the data, which often leads to a performance edge on tasks like this.\n",
|
| 95724 |
+
"\n",
|
| 95725 |
+
"Both the Text MLP and Combined MLP models achieved the highest accuracy and weighted F1-score, confirming their superior ability to classify the products.\n",
|
| 95726 |
+
"\n",
|
| 95727 |
+
"For this project's final implementation, we will proceed with the **Multilayer Perceptron (MLP) models**. Their performance, particularly on the combined text and image data, makes them the ideal choice for a robust classification system that can leverage all available product information. These models are now ready to be used in a future application to perform real-time predictions.\n"
|
| 95728 |
]
|
| 95729 |
}
|
| 95730 |
],
|