Ask Slater Machine Learning Should we remove duplicates from a data-set while training a Machine Learning algorithm (shallow and/or deep methods)? February 6, 2019 by adamdev Itā¦depends. Mostly it depends on what your goals are and what your dataset looks like. There are two big divides here on both sides. Data Structured Data – here, duplicates very much come with the territory. In this situation youāve also likely got a lot of implicit ambiguity in your problem. Letās say that you […] Read more Ā»