A NEW study has shown that large language models (LLMs ) have strong potential to evaluate medical research standards, with GPT-4 variants performing best at checking (RCTs) on artificial intelligence ...