Spaces:
Running
Running
metadata
title: Clinical Research Basics
emoji: π§ͺ
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 5.49.0
app_file: app.py
pinned: false
π§ͺ Clinical Research Chatbot
A lightweight, fully open-source chatbot for clinical research professionals.
Runs entirely on Hugging Face β no OpenAI dependency.
β Current Features
π¬ Chatbot Interface
- Gradio UI with chatbot + Admin Tools tab.
- Query pipeline: FAQ β Glossary β Knowledge Base β APIs (PubMed β FDA β ClinicalTrials.gov).
- Answers are clearly labeled by source.
π Knowledge Base (Docs + URLs)
- Supports ingestion of: PDF, DOCX, TXT, XLSX, JSON, HTML.
- Auto-ingests from:
/data/public_docs//data/urls.txt
- Smart chunking optimized for glossary terms + long text.
π¦ Vector Search
- FAISS +
all-MiniLM-L6-v2embeddings. - Persistent storage:
/persistent/faiss.index/persistent/faiss.index.meta.json
- Index survives restarts and can be exported/imported as
.zip.
π API Integrations
- PubMed
- FDA Drug Labels
- ClinicalTrials.gov
π§ Query Handling
- Glossary-aware normalization
(e.g., eCRF, e-CRF, electronic case report form β same match) - Glossary priority: if glossary hit exists β always returned first.
- Answer flow: FAQ β Glossary β KB β APIs.
- Clear section labels, citations, and confidence notes.
π Logging
All queries, answers, and sources saved in: