Patronus AI Launches Industry-first LLM Benchmark for Finance to Address Hallucinations
Model evaluation shows state-of-the-art systems fail spectacularly on finance-related questions Patronus AIÂ today launched “FinanceBench”, the industry’s first benchmark for testing how LLMs perform on financial questions. Developed by AI researchers at Patronus AI and 15 financial industry domain experts, FinanceBench is a high quality, large-scale set of 10,000 question and......