4.1. Data SelectionΒΆ
Several criteria must be considered when deciding which of the data will be used in the analysis:
its relevance to achieving data mining objectives
the quality and technical constraints such as limits on data volume or data types
choose and explain which certain data must be included or excluded
which attributes (columns) are more important than others
which records (rows) are more important than others