Data Preparation

Submitted by valya on

Data Preparation

Data Preparation plays an important role in any OI/ML/Analytics. These steps include but are not limited to

  • choice of data format
  • choice of storage
  • naming convention
  • schema definition
  • data structures

The data formats are discussed below in a dedicated section. The current software landscape offers plenty of solution for data storage. The most common ones are Relation and non-relation (NoSQL) databases, key-value stores, variety of file systems, etc. In most analytics tasks though the majority of the cases will be covered by the aforementioned choices.