Unstructured data explained: Why rule-based tools make for brittle document process automation models

When it comes to document process automation, you can write a rule to automate virtually any step in a process. But that doesn’t mean you should.

Writing rule after rule to address all the variables that may come up in a process involving numerous documents is a trap, says Slater Victoroff, CTO and founder of Indico Data.

“When you proceed in that way the ability to solve every problem is actually the biggest possible danger you could face because it sucks you into this belief that at some point your rules will be correct,” he says. The rules-focused approach assumes the problem is you haven’t written enough rules or haven’t written the right rules. “When in fact that is not the problem. The problem is that you are writing rules to begin with.”

In a recent installment of the “Unstructured Explained” video series, Victoroff discussed the issue with two Indico Data colleagues: ML Architect and Co-Founder Madison May and VP of Business Development Brandi Corbello.

Many rules make for brittle models

The problem with rules is they tend to make automation models brittle, May says. While any single rule may improve the quality of an automation application, taken together, they amount to numerous potential points of failure.

“It means when [the model] fails it fails much harder because you’re imposing stricter and stricter constraints on what your system can and cannot do,” May says. “And at a certain point it ceases to become useful to try and inject all of your preconceived knowledge into the problem and you should take a step back and let the model handle it for you.”

Corbello agrees and says a rules-based approach harkens back to the days when shared service centers were convinced that optical character recognition would solve all their problems. For an invoice processing application, for example, the solution was to have a huge file of words that may exist on an invoice and training the OCR application to look for any or all of those words.

“It’s basically like this big ‘Control F’ was happening rather than actually understanding what was on the document,” she says. “There’s a big difference between those two things.”

Artificial intelligence delivers real understanding

Fully understanding what’s on a document requires a level of intelligence inherent in artificial intelligence technologies such as machine learning, natural language processing and transfer learning. Such technologies are the foundation upon which the Indico Unstructured Data Platform is built. They give the platform the ability to read and comprehend even unstructured documents just like your employees would – only far faster and with greater accuracy.

To learn more, check out the full video below.

[addtoany]

Increase intake capacity. Drive top line revenue growth.

Schedule Demo

Resources

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Upcoming Webinar

From Automation to Agency: Indico Data Unveils the Future of Insurance with Agentic AI

Technology

Solutions

Why Indico

By Industry

By Use Case

By Role

Services

Resources

Documentation

Customer Stories

Partners

Find a Partner

Become a Partner

Partner Portal

Company

Press & Events

Careers

BLOG

Unstructured data explained: Why rule-based tools make for brittle document process automation models

Many rules make for brittle models

Artificial intelligence delivers real understanding

Increase intake capacity. Drive top line revenue growth.

Related Posts

Center of Excellence, Intelligent Process Automation, Unstructured Data, Unstructured Unlocked

Indico data debuts “Unstructured Unlocked” podcast – expert advice on intelligent automation

Confessions from a data scientist leader, Unstructured Data

How intelligent document processing machine learning is changing the unstructured data analytics game

Unstructured Data

Unstructured data explained: Why skewed data presents a big problem for intelligent document processing

See how Indico Data’s AI-driven solutions can revolutionize your decision-making processes.

Schedule
1-1 Demo

Resources

Blog

Gain insights from experts in automation, data, machine learning, and digital transformation.

Unstructured Unlocked

Enterprise leaders discuss how to unlock value from unstructured data.

YouTube Channel

Check out our YouTube channel to see clips from our podcast and more.

Upcoming Webinar

From Automation to Agency: Indico Data Unveils the Future of Insurance with Agentic AI

Technology

Solutions

Why Indico

By Industry

By Use Case

By Role

Resources

Documentation

Customer Stories

Indico Named as Major Contender and Star Performer in Everest Group's PEAK Matrix® for Intelligent Document Processing (IDP)

BLOG

Unstructured data explained: Why rule-based tools make for brittle document process automation models

Many rules make for brittle models

Artificial intelligence delivers real understanding

Increase intake capacity. Drive top line revenue growth.

Related Posts

Center of Excellence, Intelligent Process Automation, Unstructured Data, Unstructured Unlocked

Indico data debuts “Unstructured Unlocked” podcast – expert advice on intelligent automation

Confessions from a data scientist leader, Unstructured Data

How intelligent document processing machine learning is changing the unstructured data analytics game

Unstructured Data

Unstructured data explained: Why skewed data presents a big problem for intelligent document processing

See how Indico Data’s AI-driven solutions can revolutionize your decision-making processes.

Schedule1-1 Demo

Resources

Blog

Gain insights from experts in automation, data, machine learning, and digital transformation.

Unstructured Unlocked

Enterprise leaders discuss how to unlock value from unstructured data.

YouTube Channel

Check out our YouTube channel to see clips from our podcast and more.

Get our best content on intelligent automation sent to your inbox weekly!

Schedule
1-1 Demo