# Data Preparation with Dataprep DataPrep in Analytics Plus Cloud helps you model, cleanse, enrich, and organize large volumes of data from multiple sources. It simplifies ETL (Extract, Transfer & Load) with visual data pipelines, seamless integration, and automated scheduling — ensuring clean, analysis-ready data every time. - [DataPrep during import](#dataprep-during-import) - [Prepare Data from a Table](#prepare-data-from-a-table) - [View Data quality](#view-data-quality) - [Prepare data](#prepare-data) - [Ruleset](#ruleset) - [Data flow](#data-flow) - [Undo prepared data](#undo-prepared-data) - [DataPrep standalone versus DataPrep add-on](#dataprep-standalone-versus-dataprep-add-on) - [Troubleshooting](#troubleshooting) ## DataPrep during import If you have the DataPrep add-on, you can import your data and use the **Prepare Data** option to cleanse and prepare your data during import. The below steps explain how to invoke Prepare Data while importing: 1. Choose the file to be imported. 2. In the **Create Table (Import)** pane, click **Prepare Data**. The **Prepare Data** pane will open. 3. You can apply various transformations, such as changing number format, Find and Replace, and define rule sets to enrich the data quality. 4. Click **Apply Changes**. 5. The **Create Table (Import)** pane will open. You can review the Table settings such as Table Name, First Row Contains Column Name and make changes if needed. ![DataPrep during import](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/dataprep1.png) ![Create Table Import Pane](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/1751878439760.png) **Note:** The Format of Date Column and the CSV Settings (thousand separator and decimal separator) cannot be changed after using Prepare Data. ## Prepare Data from a Table In case you missed checking the data quality or preparing data while importing, you can use the options under **More**, and click on the **View Data Quality** option to analyze the data quality of your table or use the **Prepare Data** option to perform data transformations. ![Prepare Data from Table](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/dataprep3-1.png) ### View Data quality This option provides an overview of the table data quality. There are three sections in the Data Quality analysis pane: 1. **Summary** – Lists the number of columns, rows, and data types available. 2. **Data quality** – Displays the data quality chart of the table. 3. **Column Level Data Quality** – Lists only the columns containing invalid data. ### Prepare data Clicking the **Prepare data** option launches the DataPrep pane. The following components are available: ### Data quality bar DataPrep offers numerous options to measure and improve the quality of your data. Data quality can be assessed from the following areas: - A data quality bar represents the quality of data in each column. It splits data quality into valid data, invalid data, and missing values based on the column’s data type. - Green represents valid data, red represents invalid data, and grey represents missing values. Clicking a section filters appropriate rows. - Hover over the data quality bar to view quick insights into a column’s data quality. - Click **Show for all columns** to display data quality for all columns. ### Dataset details Dataset details reveal data quality for the entire dataset using a data quality chart derived from the collective quality of individual columns. You will see this section when: - A dataset loads onto the data preparation screen. - No columns are selected. Dataset details display: - Sample size - Sample strategy (Random, Erroneous, Column-based, Initial) - Number of rows - Number of columns - Number of data types in the dataset If the imported data is small, the entire dataset will be imported and sample details will not be shown. ![Dataset details](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/newly.png) ### Sample strategy Generating sample data based on the imported data is called **Sample strategy**. You can choose one of four methods and change the strategy at any time using the Edit icon. Available strategies: - **Initial sample** – Generated from the initial rows of the imported file. - **Random sample** – Randomly selected rows. - **Erroneous sample** – Rows containing invalid or missing entries. - **Column-based sample** – Generated from distinct values of a selected column. ![Sample strategy](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/sstra.png) ### Data quality chart The data quality chart splits data into percentages of valid, invalid, and missing values. Click sections to filter data selectively. ![Data quality chart](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/dq.png) ### Data distribution A histogram at the top of each column visually represents data distribution and value ranges. You can: - Spot outliers and anomalies. - Select bars to filter specific ranges. ### Column details Column details appear in the side panel when a column is selected: - Displays data type, unique values, and counts of missing, invalid, and valid entries. - Change data type using the edit option. - Click **Show more details** for expanded view. ![Column details](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/cd.png) ### Histogram A detailed histogram is available under Column details. You can: - Search and filter column data. - Edit values directly in the histogram and update the entire column. ![Histogram](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/hg.png) ### Intelligent Suggestions DataPrep predicts and suggests operations based on imported data: - Column-level transforms are suggested when a column is selected. - Dataset-level transforms are suggested when no column is selected. - Clicking a suggestion opens the Operations bar with a live preview. - You can edit options and conditions before applying. ![Intelligent suggestions](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/sug.png) ### Expanded column details Includes: - **Summary** – Detailed column summary. - **Statistics** – Statistical data points. - **Histogram** – Value distribution. - **Outliers** – Text length or pattern outliers. - **Column data quality** – Column-level data quality chart. - **Patterns** – Auto-identified data patterns. - **Suggestions** – Intelligent suggestions for improvement. ### Search & filter Use the **Search & filter** box to perform search operations and apply filters. - Suggestions appear as you type. - Use predefined filter conditions from the filter icon. Filter conditions: - Contains (default) - Doesn't contain - Begins with - Doesn't begin with - Ends with - Doesn't end with - Is - Is not - Each filter appears as a chip. - Remove a filter using the close button. - Click **Clear all** to remove all filters. ![Search and filter](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/redo-1.png) ### Undo and Redo Located to the left of the search box: - **Undo** – Reverts previous transformations. - **Redo** – Reapplies previous operations. ### Performing transformations - Right-click a column to access available transforms. - Provide inputs and preview live changes. - Apply transformations to multiple columns. - Each transformation is recorded in the Ruleset area. ![Performing transformations](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/performing.png) ### Ruleset Each transformation is stored as a rule. The ordered list of rules is called a **Ruleset**. ![Ruleset](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/ruleset.png) To access: 1. Click the Ruleset icon in the top-right corner. 2. The Ruleset pane opens with the data source at the top. 3. Selecting an earlier rule updates the grid to that state. 4. Subsequent rules appear grayed out. 5. New transformations in earlier states are inserted accordingly. 6. Rules are recomputed; errors are highlighted in red. 7. Example: If a deleted column is referenced later, that rule will show an error. ### Ruleset operations - **Ruleset history** – View timeline, users, timestamps, and operations. - **Search** – Search rules by column name or operation. ### Options with individual rules Access via the vertical menu on each rule: - **Edit** – Modify and reapply transformation. - **Remove** – Delete rule and recompute. - **Disable** – Temporarily disable rule. - **Insert rule before** – Add a rule prior to current. - **Show info** – View rule metadata. ## Data flow Data flow shows at which stage data has been cleansed and prepared. You can view Data flow: - In the **Create Table (Import)** pane. - From the **Data Sources** page. ![Data flow 1](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/dataflow1.png) ![Data flow 2](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/df2.png) ![Data flow 3](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/df3.png) ### Edit Click **Edit** to open the DataPrep pane and make changes. ### Delete Click **Delete** to remove the data preparation flow from the next scheduled import. Originally imported data is retained. ## Undo prepared data The **Undo prepared data** option allows you to undo all transformations applied during the last preparation session. - Enabled after applying changes. - Reverts only the most recent session. - Permanent action. - Further preparation is possible afterward. ![Undo prepared data](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/undoprep.png) ## DataPrep standalone versus DataPrep add-on There are differences in transform availability between the standalone DataPrep and the DataPrep add-on. | Features | DataPrep | During Import (Add-on) | After Import (Add-on) | |---|---|---|---| | [Dataset Transforms](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/dataset-operations/articles/dataset-operations) | | | | | Deduplicate | Yes | Yes | Yes | | Join | Yes | No | No | | Append | Yes | No | No | | Pivot | Yes | No | No | | UnPivot | Yes | Yes | No | | Filter | Yes | Yes | Yes | | Select | Yes | Yes | No | | [Add formula](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/add-formula/articles/functions-in-formula-column) | Yes | Yes | No | | Window functions | Yes | Yes | No | | Derive dataset | Yes | No | No | | [Change data type](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/change-data-type/articles/change-data-type) | Yes | Yes | No | | [Replace](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/find-and-replace/articles/replace) | Yes | Yes | Yes | | [Split](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/split/articles/split) | Yes | Yes | No | | [Merge](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/merge/articles/merge) | Yes | Yes | No | | [Trim spaces](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/trim-spaces/articles/trim-spaces) | Yes | Yes | Yes | | [Truncate](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/truncate/articles/truncate) | Yes | Yes | Yes | | [Add Prefix or Suffix](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/add-prefix-or-suffix/articles/add-prefix-or-suffix) | Yes | Yes | Yes | | [Fill empty cells](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/fill-empty-cells/articles/fill-empty-cells) | Yes | Yes | Yes | | [Count](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/count/articles/count) | Yes | Yes | No | | [Extract](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/extract-from-text/articles/extract-from-text) | Yes | Yes | No | | [Change format](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/change-format/articles/numerical-transformation) | Yes | Yes | No | | [Round off](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/round-off/articles/round-off) | Yes | Yes | Yes | | [Create buckets](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/create-buckets-number/articles/create-buckets-number) | Yes | Yes | No | | [Cluster and Merge](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/cluster-and-merge/articles/cluster-and-merge) | Yes | Yes | Yes | | [Change case](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/change-case/articles/change-case) | Yes | Yes | Yes | | [Language detection](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/language-detection/articles/language-detection) | Yes | Yes | No | | [Keyword extraction](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/keyword-extraction/articles/keyword-extraction) | Yes | Yes | No | | [Sentiment analysis](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/sentiment-analysis/articles/sentiment-analysis) | Yes | Yes | No | | [Change date format](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/change-date-format/articles/change-date-format) | Yes | Yes | No | | [Unify date formats](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/unify-date-formats/articles/unify-date-formats) | Yes | Yes | No | | [Extract from Map](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/extract-json-data/articles/extract-json-data) | Yes | Yes | Not applicable | | [Extract from List](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/extract-json-data/articles/extract-json-data) | Yes | Yes | Not applicable | | [Sort Column](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/context-menu/articles/context-menu) | Yes | Yes | Yes | | [Rename Column](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/context-menu/articles/context-menu) | Yes | Yes | No | | [Delete Column](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/context-menu/articles/context-menu) | Yes | Yes | No | | [Move Column](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-operations/context-menu/articles/context-menu) | Yes | Yes | No | | [Smart Selection](https://help.zoho.com/portal/en/kb/dataprep/data-transformation/column-transforms/smart-selection/articles/smart-selection) | Yes | Yes | No | To learn more about transforms in the standalone product, visit the [DataPrep Help](https://help.zoho.com/portal/en/kb/dataprep/). ## Troubleshooting ### 1. How to correct errors made during the data preparation process in a table? Click **More** and then select **Undo prepared data**. This reverts all changes from the previous session. ### 2. How to fix data quality issues while importing data? Use transforms such as deduplicate, find and replace, truncate, change format, fill empty cells, unify date format, and change data type. ![Troubleshooting example](https://cdn.manageengine.com/sites/meweb/images/analytics-plus/cloud-help/t2.png) ### 3. The data preparation options are limited in Analytics Plus. How can I use all DataPrep features? Import data into standalone DataPrep, cleanse it, then export it back into Analytics Plus. ### 4. What if I see a problem in the data during scheduled import? Re-import the data from the source and reapply the transforms. ### 5. How to set up a perfect data ingest flow using DataPrep? Use DataPrep to ingest and cleanse data, then push it to Analytics Plus using **Export to Analytics Plus**. ### 6. How to push data into an existing table from DataPrep? Currently, pushing data into an existing table is not supported. This feature is under development. Please contact the support team for further information.