Background
www.ncbi.nlm.nih.govRecent advances in large language models (LLMs), especially reasoning LLMs have demonstrated impressive reasoning capabilities in specialized domains. The purpose of this study is to evaluate the performance of new open-source reasoning LLM, DeepSeek-R1 and other contemporary LLMs on radiology board examination questions, comparing their accuracy to human radiologists.
www.ncbi.nlm.nih.govMock examinations are widely used in health professional education to assess learning and prepare candidates for national licensure. However, instructor-written multiple-choice items can vary in difficulty, coverage, and clarity. Recently, large language models (LLMs) have achieved high accuracy in medical examinations, highlighting their potential for assisting item-bank development; however, their educational quality remains insufficiently characterized.
www.ncbi.nlm.nih.govBreast MRI, particularly dynamic contrast-enhanced (DCE) MRI, offers high sensitivity in detecting breast lesions but suffers from variability in interpretation. Artificial intelligence (AI) tools like Gemini 2.0 (Google AI, Mountain View, CA) may help streamline and improve diagnostic accuracy. This study evaluates Gemini 2.0's performance in classifying breast lesion status using application programming interface (API)-based image analysis.
www.ncbi.nlm.nih.govLarge language models (LLMs) hold transformative potential for clinical decision-making in the rapidly advancing field of AI in medicine. This study evaluates how Chain-of-Thought (CoT) prompting techniques affect medical reasoning performance with consideration for clinical applicability.
www.ncbi.nlm.nih.govThis study aimed to evaluate the feasibility of general-purpose large language models (LLMs) in addressing inequities in medical licensure exam preparation for Thailand’s National Medical Licensing Examination (ThaiNLE), which currently lacks standardized public study materials.
www.ncbi.nlm.nih.govThis study aimed to explore the capabilities of advanced large language models (LLMs), including OpenAI's GPT-4 variants, Google's Gemini series, and Anthropic's Claude series, in addressing highly specialized otolaryngology board examination questions. Additionally, the study included a longitudinal assessment of GPT-3.5 Turbo, which was evaluated using the same set of questions one year ago to identify changes in its performance over time.
www.ncbi.nlm.nih.govA gemini surfactant imidazoline type, namely N-(3-(2-fatty-4,5-dihydro-1H-imidazol-1-yl) propyl) fatty amide, has been obtained from the fatty acids contained in the mango seed and used as a CO
www.ncbi.nlm.nih.govThis study set out to investigate the biological activity of monomeric surfactants dodecyltrimethylammonium bromide (DTAB) and the next generation gemini surfactant hexamethylene-1,6-bis-(
www.ncbi.nlm.nih.gov