Row

Unique

The unique task removes duplicate rows from the dataset based on values in the selected column. This functionality is valuable for ensuring data integrity and eliminating redundant entries.

Example Usage

In a dataset containing customer information, the unique task can be applied to the “Email Address” column to remove duplicate email entries and ensure each customer is represented only once in the dataset.

Sort

The sort task orders rows in the dataset based on values in the selected column. This functionality is useful for organizing data in a meaningful way for analysis or presentation.

Shuffle

The shuffle task randomizes the order of rows across the entire dataset. This functionality is helpful for introducing randomness into the dataset or for conducting randomized experiments or analyses.

Example Usage

In a dataset containing survey responses, the shuffle task can be applied to randomize the order of responses before analysis, reducing potential bias that may arise from the order in which responses were collected.