metadata
title: Clinical Research Basics
emoji: 🧪
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 5.49.0
app_file: app.py
pinned: false

🧪 Clinical Research Chatbot

A lightweight, fully open-source chatbot for clinical research professionals.
Runs entirely on Hugging Face, with no OpenAI dependency.


✅ Current Features

💬 Chatbot Interface

  • Gradio UI with chatbot + Admin Tools tab.
  • Query pipeline: FAQ → Glossary → Knowledge Base → APIs (PubMed → FDA → ClinicalTrials.gov).
  • Answers are clearly labeled by source.
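
The fallback order above can be sketched as a chain of lookups tried in sequence, with the first hit returned under its source label. This is a minimal illustration, not the app's actual code: `answer_query`, `STAGES`, and the toy dictionaries are hypothetical names, and the Knowledge Base and API stages are stubbed out.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical lookup signature: return an answer string, or None on a miss.
Lookup = Callable[[str], Optional[str]]


def answer_query(query: str, stages: List[Tuple[str, Lookup]]) -> Tuple[str, str]:
    """Try each stage in order; return (source_label, answer) from the first hit."""
    for label, lookup in stages:
        result = lookup(query)
        if result:
            return label, result
    return "No source", "No answer found."


# Toy stand-ins for the real FAQ and glossary data (illustrative only).
FAQ = {"what is an ecrf?": "An electronic case report form."}
GLOSSARY = {"sae": "Serious adverse event."}

STAGES: List[Tuple[str, Lookup]] = [
    ("FAQ", lambda q: FAQ.get(q.strip().lower())),
    ("Glossary", lambda q: GLOSSARY.get(q.strip().lower())),
    ("Knowledge Base", lambda q: None),  # placeholder: vector search would go here
    ("APIs", lambda q: None),            # placeholder: PubMed, FDA, ClinicalTrials.gov
]

print(answer_query("SAE", STAGES))
```

Keeping the stages in an ordered list makes the priority explicit and lets each answer carry the label of the stage that produced it.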

🔍 Knowledge Base (Docs + URLs)

  • Supports ingestion of: PDF, DOCX, TXT, XLSX, JSON, HTML.
  • Auto-ingests from:
    • /data/public_docs/
    • /data/urls.txt
  • Smart chunking optimized for glossary terms + long text.
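
As a rough illustration of the ingestion step, the sketch below filters files by the supported extensions and splits text into overlapping word windows. The README does not describe how the "smart chunking" works, so `chunk_words` is a generic stand-in, and the function names and default sizes are assumptions.

```python
from pathlib import Path
from typing import List

# File types the README lists as supported for ingestion.
SUPPORTED_SUFFIXES = {".pdf", ".docx", ".txt", ".xlsx", ".json", ".html"}


def is_supported(path: str) -> bool:
    """Return True if the file extension is one of the supported types."""
    return Path(path).suffix.lower() in SUPPORTED_SUFFIXES


def chunk_words(text: str, size: int = 200, overlap: int = 40) -> List[str]:
    """Split text into overlapping word windows (a generic stand-in chunker)."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be non-negative and smaller than size")
    words = text.split()
    chunks = []
    step = size - overlap
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last window already reached the end of the text
    return chunks
```

The overlap keeps a term that straddles a chunk boundary visible in both neighbouring chunks, which tends to help retrieval on long documents.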

📦 Vector Search

  • FAISS + all-MiniLM-L6-v2 embeddings.
  • Persistent storage:
    • /persistent/faiss.index
    • /persistent/faiss.index.meta.json
  • Index survives restarts and can be exported/imported as .zip.
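
The export/import step could look roughly like the following, using only the standard library. This is a sketch under the assumption that the archive simply bundles the two persisted files; `export_index` and `import_index` are hypothetical helper names, not functions from the app.

```python
import zipfile
from pathlib import Path
from typing import List


def export_index(index_path: str, meta_path: str, zip_path: str) -> None:
    """Bundle the index file and its metadata file into one .zip archive."""
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for path in (index_path, meta_path):
            zf.write(path, arcname=Path(path).name)  # store by file name only


def import_index(zip_path: str, dest_dir: str) -> List[str]:
    """Extract the archived files into dest_dir and return the restored paths."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as zf:
        names = zf.namelist()
        zf.extractall(dest)
    return [str(dest / name) for name in names]
```

For example, `export_index("/persistent/faiss.index", "/persistent/faiss.index.meta.json", "index_backup.zip")` would produce an archive (the name `index_backup.zip` is arbitrary) that `import_index` can later restore into `/persistent/`.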

🌐 API Integrations

  • PubMed
  • FDA Drug Labels
  • ClinicalTrials.gov
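
The README does not say which endpoints or parameters the app uses. As an assumption, the sketch below builds search URLs against the public NCBI E-utilities, openFDA drug label, and ClinicalTrials.gov v2 endpoints; the helper names and result limits are illustrative, and no request is actually sent.

```python
from urllib.parse import urlencode


def pubmed_search_url(term: str, max_results: int = 3) -> str:
    """Build an NCBI E-utilities esearch URL for PubMed (assumed endpoint)."""
    params = {"db": "pubmed", "term": term, "retmode": "json", "retmax": max_results}
    return "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi?" + urlencode(params)


def fda_label_url(term: str, limit: int = 1) -> str:
    """Build an openFDA drug label search URL (assumed endpoint)."""
    return "https://api.fda.gov/drug/label.json?" + urlencode({"search": term, "limit": limit})


def clinicaltrials_url(term: str, page_size: int = 3) -> str:
    """Build a ClinicalTrials.gov v2 studies search URL (assumed endpoint)."""
    params = {"query.term": term, "pageSize": page_size}
    return "https://clinicaltrials.gov/api/v2/studies?" + urlencode(params)


print(pubmed_search_url("informed consent"))
```

Building the URL separately from sending the request makes it easy to log exactly which source was queried before labeling the answer.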

🧠 Query Handling

  • Glossary-aware normalization
    (e.g., eCRF, e-CRF, electronic case report form → same match)
  • Glossary priority: if a glossary hit exists, it is always returned first.
  • Answer flow: FAQ → Glossary → KB → APIs.
  • Clear section labels, citations, and confidence notes.
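
One way to get the normalization described above is to lowercase the term, expand known long forms through an alias table, and then drop hyphens and spaces. This is a minimal sketch: `ALIASES` and `normalize_term` are hypothetical names, and the real app may normalize differently.

```python
import re

# Hypothetical alias table mapping long forms to their glossary key.
ALIASES = {"electronic case report form": "ecrf"}


def normalize_term(term: str) -> str:
    """Map spelling variants of a glossary term to one lookup key."""
    cleaned = " ".join(term.lower().split())   # lowercase, collapse whitespace
    cleaned = ALIASES.get(cleaned, cleaned)    # expand known long forms
    return re.sub(r"[\s\-]+", "", cleaned)     # drop hyphens and spaces


for variant in ("eCRF", "e-CRF", "electronic case report form"):
    print(variant, "->", normalize_term(variant))
```

All three variants reduce to the key `ecrf`, so a single glossary entry can serve every spelling.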

📜 Logging

All queries, answers, and sources saved in: