Databricks Presents a New IDP Benchmark
Abstract
Most business documents still exist for humans first and machines second. One of our goals at Databricks is to make this human-centered data "legible" to AI and agents, so that we can gain insights and even take actions based on those insights. But AI can still struggle to understand the full range of messy, unstructured documents we produce for each other. We've created and will present a benchmark, OfficeQA, that probes the limits of current AI systems in analyzing a large public dataset of 89,000 pages.