Cluster and Merge

Cluster and merge




Zoho DataPrep helps you to perform fuzzy matching using the Cluster & Merge transform. Using this transform, you can replace multiple versions of the data which are expressed differently with the version required by the user. 

For example, if the following country names all appear in your data: U.S., U.S.A., USA.
They all refer to the same country. You can choose to replace all these variations with one term, USA

The transform can also be used to remove spelling errors in data and ensure the columns show uniformity in data. This is particularly helpful for cleaning and preparing data harvested from multiple data sources.

To perform Cluster & Merge

1. Right-click the column and select the Cluster & Merge option from the context menu.

2. Choose one of the following language model algorithms to find clusters in your data:  
  1. Metaphone 
  2. Fingerprint
  3. n-gram
  1. The metaphone algorithm groups words by pronunciation and is the default algorithm used to find the clusters. 
  2. The fingerprint and n-gram algorithms are used to check spelling errors in your column data and resolve text mismatches.
  3. The 'N' value represents the continuous sequence of N words to be found in the clusters. 
    For example, n-gram size of 'Zoho' is 1-gram, 'Zoho Corporation' is 2-gram, etc.
2. The Transform panel shows all of the clusters recognized as cards. 

3. Select the items to be replaced using the checkboxes.

4. Use the text box in each card to enter a new value. This value will replace the selected items in the columns.



You can de-select a cluster card using the checkbox at the top-right corner (bookmarked in blue). 

To apply filters

If you want to apply some filters along with this transform, you can use the filters functionality.

1. Click the Filters tab.

2. Click the   icon and add the required columns in the Filters section. You can also reorder the filters using the drag and drop method.




3. For every column added, you can select one of the following options from the drop-down:
  1. Actual: This option lets you filter rows based on the actual values in the column. Click  here to know more.
  2. Data quality: This option lets you filter rows based on the quality of data in the column. Click  here to know more.
  3. Patterns: This option helps you filter rows based on the data patterns in the selected column. Click  here to know more.
  4. Outliers: This option allows you to filter rows based on the outliers present in the data of the selected column. Click  here to know more. 
Note: The filter options are displayed based on the datatype of the column added for the filter.

4. When you add more than one filter to the  Filters  section, the logical operators, AND or OR appear next to the filters. You can click to toggle the logical operator between AND and OR.
  1. Using the logical operators, you can combine the conditions and apply logic to determine the rule of precedence. The final expression is displayed in the  Criteria expression  box. You can click  Edit  to alter the default expression using logical operators and parenthesis to specify the precedence or the sequential order as to which condition should be evaluated first. Click  Save  after making the required changes. 
  1. For example, In the expression, ((1 OR 2) AND (3 OR 4)) , at first the condition ( 1 OR 2 ) will be executed and the condition ( 3 OR 4 ) will be executed next. Thirdly, since, the AND operator is used, the filter will be applied when both the conditions are true.
5. You can further drill down to choose specific values based on the filter option selected for each filter, in the next section.



For example, in the above screenshot, the Data quality option is selected for the All columns filter in the Filters section. Based on the selection, further options to filter specific values are displayed in the All columns (Data quality) section.

6. You can choose to include or exclude the selected items in the last section.

7. If you want to remove all the filters for some reason, you can use the Clear button.

8. A live preview of the filter transform is shown as you make changes. 

9. Click the Apply button to apply the transform along with the filters.

Limitation

Zoho DataPrep can identify a maximum of 50 clusters each time you apply the transform. However, you can keep applying the transform as many times as required.

SEE ALSO

    Zoho CRM Training Programs

    Learn how to use the best tools for sales force automation and better customer engagement from Zoho's implementation specialists.

    Zoho CRM Training
      Redefine the way you work
      with Zoho Workplace

        Zoho DataPrep Personalized Demo

        If you'd like a personalized walk-through of our data preparation tool, please request a demo and we'll be happy to show you how to get the best out of Zoho DataPrep.

        Zoho CRM Training

          Create, share, and deliver

          beautiful slides from anywhere.

          Get Started Now


            Zoho Sign now offers specialized one-on-one training for both administrators and developers.

            BOOK A SESSION








                                You are currently viewing the help pages of Qntrl’s earlier version. Click here to view our latest version—Qntrl 3.0's help articles.




                                    Manage your brands on social media

                                      Zoho Desk Resources

                                      • Desk Community Learning Series


                                      • Digest


                                      • Functions


                                      • Meetups


                                      • Kbase


                                      • Resources


                                      • Glossary


                                      • Desk Marketplace


                                      • MVP Corner


                                      • Word of the Day


                                        Zoho Marketing Automation

                                          Zoho Sheet Resources

                                           

                                              Zoho Forms Resources


                                                Secure your business
                                                communication with Zoho Mail


                                                Mail on the move with
                                                Zoho Mail mobile application

                                                  Stay on top of your schedule
                                                  at all times


                                                  Carry your calendar with you
                                                  Anytime, anywhere




                                                        Zoho Sign Resources

                                                          Sign, Paperless!

                                                          Sign and send business documents on the go!

                                                          Get Started Now




                                                                  Zoho TeamInbox Resources



                                                                          Zoho DataPrep Resources



                                                                            Zoho DataPrep Demo

                                                                            Get a personalized demo or POC

                                                                            REGISTER NOW


                                                                              Design. Discuss. Deliver.

                                                                              Create visually engaging stories with Zoho Show.

                                                                              Get Started Now







                                                                                            You are currently viewing the help articles of Sprints 1.0. If you are a user of 2.0, please refer here.

                                                                                            You are currently viewing the help articles of Sprints 2.0. If you are a user of 1.0, please refer here.



                                                                                                  • Related Articles

                                                                                                  • Column Explorer

                                                                                                    The Column Explorer in Zoho DataPrep allows you to search, navigate and control the viewing options of the columns in a dataset. You can search and filter columns with invalid or missing data and fix them quickly. This helps you focus on the columns ...
                                                                                                  • Merge

                                                                                                    DataPrep allows you to merge the columns based on specified conditions. You can combine multiple columns into one column using the Merge operation. You can merge two or more columns and insert a separator between the values of the columns being ...
                                                                                                  • Duplicate Column

                                                                                                    Zoho DataPrep allows you to duplicate columns present in your dataset. To duplicate columns 1. Right-click on a column header and select the Duplicate columns option from the context menu. 2. Select the columns from the Columns to duplicate dropdown. ...
                                                                                                  • Dataset transforms

                                                                                                    Dataset transform consists of combining and transforming datasets to suit your needs. You can apply the following transforms in a dataset. Deduplicate Row-wise Column-wise Join Append Pivot Unpivot Calculated columns Derive dataset Mark as personal ...
                                                                                                  • Fill empty cells

                                                                                                    DataPrep allows you to fill the empty cells in a column using the Fill empty cells transform. To fill empty cells 1. Right-click the column name and select the Fill empty cells option from the context menu. 2. Choose one of the following options from ...
                                                                                                    Wherever you are is as good as
                                                                                                    your workplace

                                                                                                      Resources

                                                                                                      Videos

                                                                                                      Watch comprehensive videos on features and other important topics that will help you master Zoho CRM.



                                                                                                      eBooks

                                                                                                      Download free eBooks and access a range of topics to get deeper insight on successfully using Zoho CRM.



                                                                                                      Webinars

                                                                                                      Sign up for our webinars and learn the Zoho CRM basics, from customization to sales force automation and more.



                                                                                                      CRM Tips

                                                                                                      Make the most of Zoho CRM with these useful tips.



                                                                                                        Zoho Show Resources