Dataset is a collection of data in an organized manner in a specific format of tables, array, CSV, JSON, XML. Dataset is used for data analysis, artificial intelligence, machine learning
Schema defines the structure in which a dataset holds the data. Each dataset model has their own schema.
Types of Dataset
Structured Dataset collects and stores the data in a structured format with data stored in tabular format with rows and columns. They have standard Schema.
Unstructured Dataset doesn’t have specific format. They constitute the largest type of dataset across the internet which comprises of text, videos and images.
Semi-Structured Dataset are structured dataset that doesn’t follow strict tabular structure.