CardioEmbed-BGE-M3

Domain-specialized cardiology text embeddings using LoRA-adapted BGE-M3

Part of a comparative study of 10 embedding architectures for clinical cardiology.

Performance

Metric	Score
Separation Score	0.209

Usage

from transformers import AutoModel, AutoTokenizer
from peft import PeftModel

base_model = AutoModel.from_pretrained("BAAI/bge-m3")
tokenizer = AutoTokenizer.from_pretrained("BAAI/bge-m3")
model = PeftModel.from_pretrained(base_model, "richardyoung/CardioEmbed-BGE-M3")

Training

Training Data: 106,535 cardiology text pairs from medical textbooks
Method: LoRA fine-tuning (r=16, alpha=32)
Loss: Multiple Negatives Ranking Loss (InfoNCE)

Citation

@article{young2024comparative,
  title={Comparative Analysis of LoRA-Adapted Embedding Models for Clinical Cardiology Text Representation},
  author={Young, Richard J and Matthews, Alice M},
  journal={arXiv preprint},
  year={2024}
}

Downloads last month: 16

Model tree for richardyoung/CardioEmbed-BGE-M3

Base model

BAAI/bge-m3

Adapter

(4)

this model

Collection including richardyoung/CardioEmbed-BGE-M3

Medical & Healthcare AI

Collection

Models and datasets for medical AI research. Includes CardioEmbed embeddings for cardiology, medical LLMs, and synthetic patient datasets. • 9 items • Updated 11 days ago