Everest Group IDP
             PEAK Matrix® 2022  
Indico Named as Major Contender and Star Performer in Everest Group's PEAK Matrix® for Intelligent Document Processing (IDP)
Access the Report


Is feature selection needed to be performed in lexicon-based sentiment analysis?

March 27, 2018 | Ask Slater

Back to Blog

 The two are typically at odds. Generally speaking your lexicon defines your features. So you’re performing feature selection as you build the lexicon and once you have your lexicon you stop choosing new features.

If you’re starting with a lexicon and want to add features you could add them to the lexicon you’re working with, or attempt to create some kind of ensemble approach that combines the features in your lexicon with whatever features you create yourself.

However, for sentiment analysis you really shouldn’t be manually creating features, or using lexicons. These approaches generally lead to extremely brittle models with very poor performance. The problem is that if you are engineering your features or changing your lexicon in response to test errors then you’re manually overfitting.

View original question on Quora >

Follow Slater on Quora >>

Increase intake capacity with automation

Get started with Indico

Interactive demo

Transform your own unstructured documents with our OOTB models

Live Demo

Explore firsthand the value the Indico Platform delivers

Talk with us

Discuss how the Indico Platform can help you tackle your unstructured data problems



Gain insights from experts in automation, data, machine learning, and digital transformation.

Unstructured Data Explained

Answers to the most complex questions in unstructured data.

CTO Corner

An accumulation of content straight from our co-founder and CTO.

Unstructured Unlocked

Enterprise leaders discuss how to unlock value from unstructured data.
Subscribe to our blog

Get our best content on intelligent automation sent to your inbox weekly!