Filter

The filter task selects rows based on criteria applied to column values. A row can match if a column contains or does not contain a given value, if the content is longer or shorter than a specified length, or if the content matches a type such as alphanumeric, numeric, upper case, or lower case. Matched rows can then be kept, removed, tagged, or untagged.

Example Usage

Suppose you have a dataset containing customer feedback. Using the filter task, you can select all rows where the “Sentiment” column contains the word “positive,” keeping only the positive feedback for further analysis.
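As a rough illustration of the "contains" criterion with the keep and remove actions, here is a minimal Python sketch. It is not the tool's implementation: the `filter_contains` helper, the sample rows, and the "Feedback" column are hypothetical, and rows are modeled as plain dictionaries.

```python
# Hypothetical rows; only the "Sentiment" column name comes from the example above.
rows = [
    {"Feedback": "Great service", "Sentiment": "positive"},
    {"Feedback": "Slow delivery", "Sentiment": "negative"},
    {"Feedback": "Love it", "Sentiment": "very positive"},
]

def filter_contains(rows, column, needle, keep=True):
    """Keep (keep=True) or remove (keep=False) rows whose column contains `needle`."""
    return [row for row in rows if (needle in row[column]) == keep]

positive = filter_contains(rows, "Sentiment", "positive")
print([row["Feedback"] for row in positive])
```

Passing `keep=False` returns the complementary set, which corresponds to removing the matched rows instead of keeping them.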

Trim

The trim task removes leading and trailing whitespace from values in selected columns. This keeps values consistent and improves the accuracy of later analysis, particularly for unstructured datasets.
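The effect on a single row can be sketched in Python as follows. The `trim_columns` helper and the sample data are hypothetical and only illustrate the behaviour described above.

```python
def trim_columns(rows, columns):
    """Strip leading and trailing whitespace in the selected columns only."""
    return [
        {key: (value.strip() if key in columns else value) for key, value in row.items()}
        for row in rows
    ]

rows = [{"name": "  Alice ", "city": " Oslo"}]
trimmed = trim_columns(rows, ["name"])
print(trimmed)
```

Only "name" is selected here, so the leading space in "city" is left untouched.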

Replace

The replace task substitutes specific values within selected columns. It is commonly used to correct inconsistencies, update outdated values, or standardize how data is represented.
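A small Python sketch of this kind of standardization is shown below. It assumes that replacement works on substrings (via `str.replace`); whether the tool matches substrings or whole values is not stated here, and `replace_in_columns` is a hypothetical helper.

```python
def replace_in_columns(rows, columns, old, new):
    """Replace every occurrence of `old` with `new` in the selected columns."""
    return [
        {key: (value.replace(old, new) if key in columns else value) for key, value in row.items()}
        for row in rows
    ]

rows = [{"country": "U.K.", "note": "U.K. office"}]
standardized = replace_in_columns(rows, ["country"], "U.K.", "UK")
print(standardized)
```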

Change Case

The change case task converts values in selected columns to lower or upper case, which keeps textual data uniform.
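In Python terms, the conversion might look like the sketch below. The `change_case` helper and its `mode` parameter are hypothetical names chosen for illustration.

```python
def change_case(rows, columns, mode):
    """Convert values in the selected columns to 'lower' or 'upper' case."""
    if mode not in ("lower", "upper"):
        raise ValueError("mode must be 'lower' or 'upper'")
    convert = str.lower if mode == "lower" else str.upper
    return [
        {key: (convert(value) if key in columns else value) for key, value in row.items()}
        for row in rows
    ]

rows = [{"email": "Alice@Example.COM", "name": "Alice"}]
lowered = change_case(rows, ["email"], "lower")
print(lowered)
```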

Append

The append task adds text to the beginning or end of values in selected columns, which is useful for adding prefixes or suffixes.

Example Usage

In a dataset containing product codes, the append task can add a prefix that indicates the product category, making codes easier to identify and categorize.
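The product-code scenario can be sketched in Python like this. The `append_text` helper, its `position` parameter, and the "ELEC-" prefix are assumptions made for the example, not names taken from the tool.

```python
def append_text(rows, columns, text, position="end"):
    """Add `text` to the start or end of values in the selected columns."""
    if position not in ("start", "end"):
        raise ValueError("position must be 'start' or 'end'")

    def apply(value):
        return text + value if position == "start" else value + text

    return [
        {key: (apply(value) if key in columns else value) for key, value in row.items()}
        for row in rows
    ]

rows = [{"code": "1042"}, {"code": "2231"}]
prefixed = append_text(rows, ["code"], "ELEC-", position="start")
print(prefixed)
```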

Insert

The insert task inserts values into selected columns. The inserted value can be a user-supplied string, a random string, a random number within a specified range, a hash of a column value, or a UUID4.

Example Usage

In a dataset containing customer IDs, the insert task can be used to generate and insert unique UUID4 values into the “ID” column, ensuring each customer record has a distinct identifier.
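Two of these value types, UUID4 and hashes, can be sketched with Python's standard library. The helper names are hypothetical, and SHA-256 is an assumption, since the hash algorithm used by the task is not specified here.

```python
import hashlib
import uuid

def insert_uuid4(rows, column):
    """Write a freshly generated UUID4 string into `column` for every row."""
    return [{**row, column: str(uuid.uuid4())} for row in rows]

def insert_hash(rows, source, target):
    """Write the SHA-256 hex digest of `source` into `target` (algorithm assumed)."""
    return [
        {**row, target: hashlib.sha256(row[source].encode("utf-8")).hexdigest()}
        for row in rows
    ]

rows = [{"name": "Alice"}, {"name": "Bob"}]
with_ids = insert_uuid4(rows, "ID")
with_hashes = insert_hash(rows, "name", "name_hash")
print(with_ids)
```

The UUID values differ on every run, so the printed output is not reproducible; the hash values are deterministic for a given input.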

Substring

The substring task extracts part of a column value, given a start position and a length.
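A minimal Python sketch of position-and-length extraction follows. It assumes the position is zero-based, which may differ from the tool's convention; the `substring` helper and the invoice data are hypothetical.

```python
def substring(rows, column, start, length):
    """Keep `length` characters of the column value, beginning at zero-based `start`."""
    return [{**row, column: row[column][start:start + length]} for row in rows]

rows = [{"invoice": "INV-2024-0042"}]
years = substring(rows, "invoice", start=4, length=4)
print(years)
```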

Math

The math task performs basic numeric operations on values in selected columns: addition, subtraction, multiplication, and division.
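The four operations could be modeled in Python as below. This is only a sketch: the `math_task` helper and the operation names are hypothetical, and values are assumed to be stored as text that parses as a number.

```python
import operator

# Hypothetical operation names mapped to standard-library functions.
OPERATIONS = {
    "add": operator.add,
    "subtract": operator.sub,
    "multiply": operator.mul,
    "divide": operator.truediv,
}

def math_task(rows, column, operation, operand):
    """Apply `operation` with `operand` to the numeric value in `column`."""
    apply = OPERATIONS[operation]
    return [{**row, column: apply(float(row[column]), operand)} for row in rows]

rows = [{"price": "10"}, {"price": "2.5"}]
doubled = math_task(rows, "price", "multiply", 2)
print(doubled)
```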

Reverse

The reverse task reverses the order of characters in each value of the selected column. For example, "abc" becomes "cba".
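In Python, character reversal is a one-line slice. The `reverse_column` helper below is a hypothetical illustration.

```python
def reverse_column(rows, column):
    """Reverse the characters of the value in `column` for every row."""
    return [{**row, column: row[column][::-1]} for row in rows]

rows = [{"token": "abc123"}]
reversed_rows = reverse_column(rows, "token")
print(reversed_rows)
```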

Date reformat

The date reformat task facilitates the conversion of date formats in values of selected columns. Users provide the current format and the required format, and the task handles the conversion accordingly.

Example Usage

In a dataset containing dates in the “Timestamp” column in the format “YYYY-MM-DD,” the date reformat task can be used to convert them to the format “DD/MM/YYYY” for easier interpretation and analysis.
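The same conversion can be sketched with Python's `datetime` module. The sketch assumes that "YYYY-MM-DD" and "DD/MM/YYYY" correspond to the `strptime` and `strftime` patterns `%Y-%m-%d` and `%d/%m/%Y`; the tool's own format tokens may be written differently, and `reformat_date` is a hypothetical helper.

```python
from datetime import datetime

def reformat_date(rows, column, current_format, required_format):
    """Parse each value with `current_format` and rewrite it in `required_format`."""
    return [
        {**row, column: datetime.strptime(row[column], current_format).strftime(required_format)}
        for row in rows
    ]

rows = [{"Timestamp": "2024-03-15"}]
reformatted = reformat_date(rows, "Timestamp", "%Y-%m-%d", "%d/%m/%Y")
print(reformatted)
```

In this sketch, a value that does not match the current format raises a `ValueError` rather than being silently passed through.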