Workshop
Document Intelligence
Nigel Duffy · Rama Akkiraju · Tania Bedrax Weiss · Paul Bennett · Hamid Reza Motahari-Nezhad
Business documents are central to the operation of business. Such documents include sales agreements, vendor contracts, mortgage terms, loan applications, purchase orders, invoices, financial statements, employment agreements and a wide many more. The information in such business documents is presented in natural language, and can be organized in a variety of ways from straight text, multi-column formats, and a wide variety of tables. Understanding these documents is made challenging due to inconsistent formats, poor quality scans and OCR, internal cross references, and complex document structure. Furthermore, these documents often reflect complex legal agreements and reference, explicitly or implicitly, regulations, legislation, case law and standard business practices.
The ability to read, understand and interpret business documents, collectively referred to here as “Document Intelligence”, is a critical and challenging application of artificial intelligence (AI) in business. While a variety of research has advanced the fundamentals of document understanding, the majority have focused on documents found on the web which fail to capture the complexity of analysis and types of understanding needed across business documents. Realizing the vision of document intelligence remains a research challenge that requires a multi-disciplinary perspective spanning not only natural language processing and understanding, but also computer vision, knowledge representation and reasoning, information retrieval, and more -- all of which have been profoundly impacted and advanced by neural network-based approaches and deep learning in the last few years.
We propose to organize a workshop for AI researchers, academics and industry practitioners to discuss the opportunities and challenges for document intelligence.
Schedule
|
Sat 8:00 a.m. - 8:10 a.m.
|
Opening Remarks
(
Discussion
)
>
|
🔗 |
|
Sat 8:10 a.m. - 9:05 a.m.
|
David Lewis: Artificial Intelligence in Legal Discovery ( Invited Talk ) > link | David Lewis 🔗 |
|
Sat 9:05 a.m. - 10:00 a.m.
|
Ndapa Nakashole: Generalizing Representations of Language for Documents Analysis across Different Domains ( Invited Talk ) > link | Ndapa Nakashole 🔗 |
|
Sat 10:00 a.m. - 10:30 a.m.
|
Coffee Break
|
🔗 |
|
Sat 10:30 a.m. - 12:05 p.m.
|
Poster Teaser Presentations ( Spotlights ) > link | 🔗 |
|
Sat 12:05 p.m. - 1:30 p.m.
|
Posters ( Poster Session / Lunch ) > link |
29 presentersTimo Denk · Ioannis Androutsopoulos · Oleg Bakhteev · Mohamed Kane · Petar Stojanov · Seunghyun Park · Bharat Mamidibathula · Kostiantyn Liepieshov · Johannes Höhne · Song Feng · Zikri Bayraktar · Kehinde Aruleba · ALEKSANDR OGALTSOV · Rita Kuznetsova · Paul Bennett · Saghar Hosseini · Kshtij Fadnis · Luis Lastras · Mehrdad Jabbarzadeh Gangeh · Christian Reisswig · Emad Elwany · Ilias Chalkidis · Jonathan DeGange · Kaixuan Zhang · Luke de Oliveira · Muhammed Koçyiğit · Haoyu Dong · Vera Liao · Wonseok Hwang |
|
Sat 1:30 p.m. - 2:30 p.m.
|
Rajasekar Krishnamurthy: Document Intelligence for Enterprise AI Applications: Requirements & Research Challenges ( Invited Talk ) > link | Rajasekar Krishnamurthy 🔗 |
|
Sat 2:30 p.m. - 3:30 p.m.
|
Asli Celikyilmaz: Learning Structure in Text Generation ( Invited Talk ) > link | Asli Celikyilmaz 🔗 |
|
Sat 3:30 p.m. - 4:00 p.m.
|
Coffee Break
|
🔗 |
|
Sat 4:00 p.m. - 5:00 p.m.
|
Discussion: Document Intelligence Research Challenges & Directions
(
Discussion
)
>
|
🔗 |
|
Sat 5:00 p.m. - 5:30 p.m.
|
Best Paper Talk: BERTGrid Contextualized Embedding for 2D Document Representation and Understanding
(
Talk
)
>
|
🔗 |
|
Sat 5:30 p.m. - 5:45 p.m.
|
Summary of Workshop and Closing Remarks
(
Discussion
)
>
|
🔗 |