Zoho DataPrep is an advanced self-service data preparation software that offers a comprehensive solution to prepare large volumes of data from multiple data sources. The Zoho DataPrep add-on contextually provides the data preparation functionality of Zoho DataPrep within Zoho Analytics. This lets you significantly improve the quality of your data by cleaning and preparing it before analysis.
If you have the Zoho DataPrep add on, you can import your data and use the Prepare Data option to cleanse and prepare your data during import.


In case you missed to check the data quality or prepare data while importing, you can always use the options under More , and click on the View Data Quality option to analyze the data quality of your table or use the Prepare Data option to perform data transformations.
This option provides an overview about the table data quality. There are three sections in the Data quality analysis pane:
Clicking the Prepare data option will launch the data preparation pane. Let us look into the components available in the DataPrep pane.
DataPrep offers numerous options to measure and improve the quality of your data. Data quality can be assessed from the following areas in the data prep pane.
A data quality bar represents the quality of data in each column. It splits data quality into valid data, invalid data, and missing values, based on data type of the column.
Green represents valid data, red represents invalid data, and grey represents the missing values. When you click on a section, DataPrep filters out appropriate rows so that you can easily deal with invalid or missing values in your dataset.
Hover over the data quality bar to get a quick look at the data quality of a column. The data quality chart specific to the column reveals the column data quality. You may choose to filter valid, invalid, and missing data in the column using the options provided.
Click Show for all columns option to display data quality for all columns.
Dataset details reveal data quality for the entire dataset using a data quality chart. The number is derived from the collective quality of the individual columns.
You will see this section for the first time when a dataset loads onto the data preparation screen, and whenever none of the columns are selected.
Dataset details display the following information.
If the imported data is small enough, the entire dataset will be imported for data preparation and the sample details will not be shown in the dataset details section.

Generating sample data based on the imported data is called Sample strategy. You can choose one of the four methods to create a sample. You can also change the strategy at any point during the data prep process. Click on the Edit icon to change the sample strategy.
The different sample strategies available are:

The data quality chart splits data into a percentage of valid data, invalid data, and missing values. Click on the sections of the chart to selectively view valid data, invalid data, and missing values in your dataset. Here's a snapshot of data quality chart.
A graphical representation of data distribution and the range of values present in a column is shown at the top each column as a Histogram. Using the histogram, you can spot outliers and anomalies in the data. You can also select a bar of the histogram to filter the data within a specific range.

A detailed version of the histogram is present under Column details. You can search and filter the column data from this histogram. You can also edit the values present in the histogram and have it changed in the entire column as shown in the screenshot below.

The Expanded column details section contains further details on the column data including auto-identified data patterns and outliers.
The following widgets are available under the expanded column details view:

Perform search operations and apply filters to the dataset using the Search & filter box. You can use the suggestions that appear as you type for faster results.
You can also select default filters with predefined conditions using the filter icon from the Search & filter box.

The Undo and Redo buttons are located to the left of the search box. You can use these options to undo the previous transforms that were applied, edited, disabled, or deleted, or redo the previous operations once more.
In the DataPrep pane, you can right-click on any column to get the list of transforms available. You can then click on one of the transforms and provide inputs to apply and make necessary changes to the column. DataPrep shows a live preview of the column during the transformation.
You can also apply the same transformation on multiple columns by selecting more columns in the Columns to apply section. Each transformation is recorded as a rule in the Ruleset area.
There are some differences in the availability of transforms between Zoho DataPrep add-on and the standalone solution while during and after the data import. The differences are explained in the table below:
| Features | DataPrep | DataPrep Add-on | |
| During Import | After import | ||
| Dataset Transforms | |||
| Deduplicate | Yes | Yes | Yes |
| Join | Yes | No | No |
| Append | Yes | No | No |
| Pivot | Yes | No | No |
| UnPivot | Yes | Yes | No |
| Filter | Yes | Yes | Yes |
| Select | Yes | Yes | No |
| Add formula | Yes | Yes | No |
| Window functions | Yes | Yes | No |
| Derive dataset | Yes | No | No |
| Column Transforms | |||
| Change data type | Yes | Yes | No |
| Replace | Yes | Yes | Yes |
| Split | Yes | Yes | No |
| Yes | Yes | No |
| Trim spaces | Yes | Yes | Yes |
| Truncate | Yes | Yes | Yes |
| Add Prefix or Suffix | Yes | Yes | Yes |
| Fill empty cells | Yes | Yes | Yes |
| Count | Yes | Yes | No |
| Extract | Yes | Yes | No |
| Change format | Yes | Yes | No |
| Round off | Yes | Yes | Yes |
| Create buckets | Yes | Yes | No |
| Cluster and Merge | Yes | Yes | Yes |
| Change case | Yes | Yes | Yes |
| Language detection | Yes | Yes | No |
| Keyword extraction | Yes | Yes | No |
| Sentiment analysis | Yes | Yes | No |
| Change date format | Yes | Yes | No |
| Unify date formats | Yes | Yes | No |
| Extract from Map | Yes | Yes | Not applicable |
| Extract from List | Yes | Yes | Not applicable |
| Sort Column | Yes | Yes | Yes |
| Rename Column | Yes | Yes | No |
| Delete Column | Yes | Yes | No |
| Move Column | Yes | Yes | No |
To learn more about transforms in the standalone product, visit the link .
Each transformation applied on the dataset is stored as a rule, in order of their execution. The ordered list of these rules is called a Ruleset. You can access the Ruleset from the Ruleset pane, where you can add, edit, preview, or remove the rules that have been applied.
To access the Ruleset:
| IBM00858 | IBM857 | ISO-8859-6 | US-ASCII | windows-1253 |
| UTF-8 | IBM862 | ISO-8859-7 | UTF-16 | windows-1254 |
| IBM437 | IBM866 | ISO-8859-8 | UTF-16BE | windows-1255 |
| IBM775 | ISO-8859-1 | ISO-8859-9 | UTF-16LE | windows-1256 |
| IBM850 | ISO-8859-2 | ISO-8859-13 | windows-1250 | windows-1257 |
| IBM852 | ISO-8859-3 | KOI8-R | windows-1251 | x-IBM737 |
| IBM855 | ISO-8859-4 | KOI8-U | windows-1252 | x-IBM874 |
| x-UTF-16LE-BOM |
Ruleset history - Click Ruleset history from the <horizontal hamburger menu> to view the changes in Ruleset's timeline. Ruleset history include the user information who created/modified the rules, the timestamp when the changes were made, and the various operations added to the Ruleset.
Search - Choose Search from the <horizontal hamburger `menu> icon to perform a search in your Ruleset. For example, you can search the column name to find all the rules applied in that column.
You can modify individual rules using the actions associated with them. Click the <vertical hamburger dots> menu on the rules to Edit, Disable , or Remove the rule. You can also use the Insert rule before option to insert a rule prior to your current selection. The option Show info is used to retrieve the rule information.
Edit - Click Edit when you want to edit the transformation applied with the rule. You can make changes to the transformation in the Operations bar and verify the output in the preview. Click Apply to save your changes. The transformation will be recomputed to reflect the changes.
Remove - Click Remove when you want to delete a rule from the Ruleset. This will remove the rule and recompute transformations on data for the rules left.
Note : You can always undo deletion in case you deleted a rule in error. The Undo button is situated at the top-left just above the data grid.
Disable - Use this option to disable a rule in your Ruleset. This will disable the rule and recompute transformations on data for the rules left. You can always Enable back the rule from the <3 vertical dots> menu.
Insert rule before - Use the Insert rule before option to include a previous step before a rule. When you click, you will see an empty placeholder. Once you apply a transformation, the rule of this transformation will be stored in the placeholder.
Show info - Click on the Show info option for information about a specific rule such as who created or modified the rules. It also includes other details such as, when was the rule created or modified, and the list of users who made the changes.
You may choose to apply these changes:
You can view the flow of your data in the Data flow section of Data sources page. Using the data flow you can learn at which stage the data has been cleansed and prepared for analytics and further Edit or Delete the transformations as necessary. 
Clicking the Edit option will take you to the DataPrep pane where you can make further changes to your data and apply those changes.
Clicking the Delete option removes the data preparation flow from the next scheduled import, and the originally imported data is retained.
Undo prepared data option allows you to undo all the transformations applied over the imported data. This is particularly useful if you made a mistake and wish to undo all the changes made to your data using DataPrep.
After applying changes to the table, the undo prepared data option will be enabled. Using this option you can undo all the changes performed in the previous prepare data session.
The Undo prepared data option will only revert the last data preparation session, but none of the prepared data sessions before that. Undoing prepared data is permanent. Only further data preparation is possible from this point.
Zoho Analytics lets you import data prepared using Zoho DataPrep seamlessly using the DataPrep connector for Zoho Analytics. With the connector, you can set up the data import and synchronization process between Zoho DataPrep and Zoho Analytics. You can learn more about the Zoho DataPrep connector for Analytics from here .

In the standalone Zoho DataPrep application, you can import data from Zoho Analytics into DataPrep seamlessly using the Analytics connector for Zoho DataPrep. With the connector you can set up data imports and configure import schedules from Analytics as shown in the screenshot below. You can learn more about importing data using the Zoho Analytics connector from here .

In the standalone Zoho DataPrep application, you can export prepared data to Zoho Analytics seamlessly using the Analytics connector for Zoho DataPrep. With the connector you can set up data exports and configure export schedules to Zoho Analytics as shown in the screenshot below. You can learn more about exporting data using the Zoho Analytics connector from here .

You can correct the errors by clicking on More and then the Undo prepared data option from the table. This will revert all the changes done in the previous data preparation session
You can fix data quality issues by using the various transforms like deduplicate, find and replace, truncate, change format, fill empty cells, unify date format, change datatype, etc.
To use the full power of Zoho DataPrep, import the data from Zoho Analytics into Zoho DataPrep, cleanse the data and then export it back into Zoho Analytics onto another table.
In this case, you will have to re-import the data from the source and reapply the transforms based on the data.
The ideal flow is where dataprep is used to ingest data after which the data is pushed to Zoho Analytics using the Export to Zoho Analytics option.
You cannot push data into an existing table presently. However, this feature is being worked on and will be available soon.
Learn how to use the best tools for sales force automation and better customer engagement from Zoho's implementation specialists.
If you'd like a personalized walk-through of our data preparation tool, please request a demo and we'll be happy to show you how to get the best out of Zoho DataPrep.
All-in-one knowledge management and training platform for your employees and customers.
You are currently viewing the help pages of Qntrl’s earlier version. Click here to view our latest version—Qntrl 3.0's help articles.