The term 'data wrangling' usually refers to the repetitive, time-consuming tasks of data preparation, such as the acquisition, integration, manipulation, cleansing, enrichment, and transformation of data.
We created this catalog to be the first repository of data wrangling datasets. It gathers most of the datasets previously used by other data manipulation tools or presented in the literature, along with new datasets that we generated from newly collected data. Every dataset includes six examples of one particular problem, each with an input and the expected output.
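As a rough illustration of the structure described above (six input/expected-output pairs per problem), one dataset could be modelled in Python as follows. This is only a sketch: the catalog's actual file format is not described here, so the class names, field names, the `accuracy` helper, and the sample values are all hypothetical and not taken from the catalog.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Example:
    input_value: str      # raw value before wrangling
    expected_output: str  # value a correct transformation should produce


@dataclass(frozen=True)
class WranglingDataset:
    problem: str             # hypothetical label for the problem being solved
    examples: List[Example]  # six pairs per dataset, per the description above


def accuracy(dataset: WranglingDataset, transform: Callable[[str], str]) -> float:
    """Fraction of examples for which `transform` reproduces the expected output."""
    hits = sum(
        transform(ex.input_value) == ex.expected_output for ex in dataset.examples
    )
    return hits / len(dataset.examples)


# Made-up sample data for illustration only; not copied from the catalog.
demo = WranglingDataset(
    problem="extract the year from a date string",
    examples=[
        Example("25-03-1974", "1974"),
        Example("01-12-2001", "2001"),
        Example("17-07-1999", "1999"),
        Example("03-03-2010", "2010"),
        Example("30-11-1985", "1985"),
        Example("09-05-2016", "2016"),
    ],
)


def last_four(value: str) -> str:
    """Candidate transformation: keep the last four characters."""
    return value[-4:]
```

Calling `accuracy(demo, last_four)` scores a candidate transformation against the expected outputs, which is one way such input/output pairs might be used to evaluate a data wrangling tool.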
This data is made available under the Open Data Commons Attribution License: https://opendatacommons.org/licenses/by/1.0/ .
What domains are included in the catalog?