{"works": [{"key": "/works/OL44983877W"}], "title": "Diagnosing AI: Evaluation of AI in Clinical Practice", "publishers": ["Dawning Research Press"], "publish_date": "2026", "key": "/books/OL61375300M", "type": {"key": "/type/edition"}, "identifiers": {"amazon": ["B0GQGHL3G3"]}, "covers": [15175347], "isbn_13": ["9798999832481"], "description": {"type": "/type/text", "value": "Diagnosing AI: Evaluation of AI in Clinical Practice is a vital and timely guide for clinicians, data scientists, and decision-makers at the intersection of artificial intelligence and medicine. As large language models (LLMs) and AI systems become increasingly visible in healthcare, this book provides a clear, evidence-based framework for understanding their true capabilities and limitations in clinical deployment.\r\n\r\nA Warning Against Misplaced Confidence\r\nWith the rise of AI chatbots that produce confident, fluent medical text, there is a real risk of mistaking eloquence for expertise. Dr. Milan Toma demonstrates, through empirical studies and systematic analysis, that LLMs can produce authoritative-sounding reports while masking fundamental errors and inconsistencies. The temptation to trust AI-generated prose as a substitute for rigorous clinical reasoning is not only misleading, but potentially dangerous.\r\n\r\nA Practical Framework for Evaluation\r\nDiagnosing AI goes beyond critique to offer a comprehensive, step-by-step framework for evaluating machine learning systems in medicine. You will learn to distinguish between models that have truly learned and those that merely imitate expertise, and how to translate clinical priorities into meaningful evaluation criteria.\r\n\r\nWhat You Will Learn:\r\nHow LLMs create the illusion of expertise and why their confident tone should not be mistaken for diagnostic accuracy.\r\nEmpirical evidence showing that leading multimodal language models often make critical errors and disagree on basic findings, making them unsuitable for autonomous medical image interpretation.\r\nThe landscape of machine learning in medicine, from traditional algorithms to modern transformers, and which approaches are appropriate for which diagnostic tasks.\r\nReal-world applications of proven machine learning algorithms in cardiovascular and other specialties, and the importance of validation and regulatory clearance.\r\nThe challenge of class imbalance in medical data, where rare diseases can be missed by models focused on overall accuracy, and strategies to address this.\r\nHow to develop clinically oriented evaluation protocols that reflect real-world priorities, recognizing that not all errors are equal (for example, missing a disease vs. a false alarm).\r\nWhy learning dynamics and training-validation curves are more revealing than final accuracy metrics, and how to use them to judge model reliability and generalizability.\r\nThe limitations of language models in interpreting diagnostic plots, and why human expertise remains essential in evaluating model quality.\r\nEconomic factors in AI deployment, integrating operational costs, error consequences, and implementation expenses to determine whether a system is worth investing in.\r\n\r\nFor Clinicians, Data Scientists, and Healthcare Leaders\r\nThroughout the book, Dr. Toma emphasizes a critical distinction: general-purpose language models may generate plausible medical text, but only specialized, validated machine learning systems are suitable for high-stakes clinical applications. For tasks like medical image interpretation, the gold standard remains purpose-built machine learning\u2014not general chatbots, no matter how impressive their language skills.\r\n\r\nBy the end of Diagnosing AI, readers will have a robust framework for evaluating clinical machine learning systems, from initial training dynamics through cost-effectiveness. You will understand why learning curves matter more than final metrics, why class imbalance demands special treatment, and why confident AI prose provides no guarantee of accuracy. In medicine, where the stakes are measured in human welfare, such understanding is not simply helpful; it is essential."}, "latest_revision": 4, "revision": 4, "created": {"type": "/type/datetime", "value": "2026-03-03T14:20:40.026539"}, "last_modified": {"type": "/type/datetime", "value": "2026-03-03T14:23:09.976290"}}