Comparing LLMs for Enterprise: Interpreting 0.7% vs 20.2% and What Really Matters
https://spark-wiki.win/index.php/When_Summaries_Mislead:_Measuring_Journalism_Accuracy_in_Production_LLMs_(Vectara_vs_AA-Omniscience)
Vendor numbers are tempting: "0.7% hallucination on basic summarization" or "20.2% hallucination rate." Those figures matter, but not in the way many product decks imply.