Comparing LLMs and Traditional Machine Translation for Medical Consultation Summaries
A first-author pilot study comparing three LLMs (GPT-4o, Llama-3.1, Gemma-2) with three traditional machine translation (MT) tools for translating English medical consultation summaries into Arabic, Chinese, and Vietnamese. The traditional MT tools scored higher on surface metrics, while the LLMs showed relative strengths in semantic similarity. These results underscore the need for human review before clinical use.