Text Classification Dataset

Type: Dataset

Dataset Healthcare NLP Medical Informatics

Medical Text Classification Dataset

Overview

This dataset contains 50,000 medical texts collected from clinical notes and medical literature, annotated for various classification tasks.

Dataset Statistics

  • Total Samples: 50,000
  • Categories: 15 disease categories
  • Format: JSON and CSV
  • License: CC BY 4.0

Use Cases

  • Disease classification
  • Symptom extraction
  • Medical entity recognition
  • Clinical decision support systems

Citation

If you use this dataset, please cite: