Incremental fetch from cloud storages

Import incremental data from cloud storage services

 
Incremental data fetch is a method used to import new files from a source after the previous sync. Zoho DataPrep helps you import incremental data from the following cloud storages using the advanced selection feature.
 
  • Google Drive
  • Zoho WorkDrive
  • Amazon S3
  • Dropbox
  • Box
  • Microsoft OneDrive
  • SharePoint
 

 To import incremental data    

 
1. Open an existing pipeline or create a pipeline from the Home Page, Pipelines tab, or Workspaces tab and click the Add data option.

Info: You can also click the Import data  icon at the top of the pipeline builder and bring data from multiple sources into the pipeline.
 
 
2. Choose the Cloud storage category from the left pane and click the required cloud storage service. You can also search cloud storage services in the search box.

 
Note: If you had already added a connection earlier, click the Saved connections category from the left pane and proceed to import. To learn more about Saved connections, click here.

3. If you have already added a connection, click the existing connection and start importing data.

Note: Click the Add new link to add a new account. You can create as many connections as required.
 
 
 
4. Authenticate your Cloud storage account. You will need to authorize DataPrep to access your files when you do this for the first time.

Info: For Zoho WorkDrive, data will be directly fetched from your WorkDrive account using which you have logged into Zoho DataPrep.


 
Note: The connection configuration will be saved for importing data from your cloud storage in the future. Credentials are securely encrypted and stored.

  
Advanced selection

5. To import your data incrementally, click the Advanced selection link.




Advanced selection helps you perform dynamic file selection based on regex. This can be used for getting new or incremental data from your cloud storage account. The newly added or modified file that matches the file pattern after the previous sync will be taken from your account.

You can import using the advanced selection option from the following cloud storages:
Info:  We support only CSV, TSV, JSON, XML, and TXT file formats for incremental fetch from cloud storage.

6. Once you have completed importing data, the Visual Pipeline builder opens and you can start applying transforms. You can also right-click the stage and choose the Prepare data option and prepare your data in the DataPrep Studio page. Click here to know more about the transforms.


 
7. Once you are done creating your data flow and applying necessary transforms in your stages, you can right-click a stage and add a destination to complete your data flow.
8. After adding a destination to the pipeline, you can try executing your pipeline using a manual run at first. Once you make sure manual run works, you can then set up a schedule to automate the pipeline. Learn about the different types of runs here.

 
How incremental sync works            

You can configure how to import and fetch incremental data from your source using the Import configuration option. Incremental data import is a method used to import new or modified records after the previous sync.

In incremental file fetch, when the pipeline is run, the new or modified files will be fetched. During every successful sync, the last sync time is updated with the new value, and the file created or modified after the sync time is imported. If there is no new file in the cloud storage, no data will be imported, or the previously imported data will be fetched again according to the import configuration. Even if no data were synced, the sync time will be updated since it was given a try. In the next cycle, the file created or modified after this sync time will be fetched.

Incremental fetch for schedule run 

In incremental file fetch, when the pipeline is scheduled, the data interval of the first schedule will span from the previous one up to the current data interval. During this interval, all the new or modified files will be fetched. The second schedule will extend from the current interval to the next, with subsequent schedules following this pattern. 
 
For example, The schedule is set to 1 hour interval. The data source contains a total of 10 files, in which 5 files were uploaded an hour ago that matches the file pattern. Import configuration is set to import all files incrementally, and the batch size is set to 10.




 
When the first schedule runs, the 5 files created/modified in the last 1 hour will be fetched and exported as a single file.
 
In the second schedule, if 4 new files with the same file pattern are added in the source, only the 4 files will be fetched and exported as a single file based on the created/modified time in the source in that specific interval.
 
In the third schedule, if 13 new files are added to the source that matches the file pattern, then only the first 10 files will be fetched and exported as a single file based on the created/modified time in the source since the batch size is set as 10. The same logic will be applied for files during incremental fetch in the upcoming schedules and so on.

Click here to know about the schedule run.

Incremental fetch for backfill run

 
In an incremental file fetch, when a backfill run is configured for a pipeline, all new or modified files within the specified data interval will be fetched.


For instance, the backfill is set from August 1st to 3rd with a daily job frequency. August 1st has 4 files, August 2nd has 7 files, and August 3rd has 13 files. The import configuration is set to fetch all files incrementally with a batch size of 10.

During the first backfill job, the 4 files created or modified on August 1st will be fetched and exported as a single file. In the second backfill job, the 7 files from August 2nd will be similarly processed and exported. In the third backfill job, the first 10 files from August 3rd will be fetched and exported as a single file.

Click here to know about the backfill run.
 

 Incremental fetch for manual reload

In an incremental file fetch, when a manual reload is configured for a pipeline, all new or modified files within the specified data interval will be fetched.
 
For instance, the Manual reload is set from August 1st to 2nd.
August 1st has 3 files, August 2nd has 4 files. The import configuration is set to fetch all files incrementally with a batch size of 5 based on the oldest created time.
During the manual reload job, the 3 files created on August 1st, the first 2 files created on August 2nd will be fetched and exported as a single file.

Click here to know about the manual reload.
 
9. A pipeline job will be created at the run time. You can view the status of a job with the granular details in the Job summary.
 
In the Output tab of the Job summary, you can view the destinations added, the data quality, output stage, rows and columns exported, and the status of each export.
 You can also preview the output data and download the prepared data of a particular destination or download all the outputs as a zip file. Click here to know more about the job summary.

Note: All the newly added or modified files are incrementally fetched based on Greenwich MeanTime (GMT) / UTC.

SEE ALSO

    Zoho CRM Training Programs

    Learn how to use the best tools for sales force automation and better customer engagement from Zoho's implementation specialists.

    Zoho CRM Training
      Redefine the way you work
      with Zoho Workplace

        Zoho DataPrep Personalized Demo

        If you'd like a personalized walk-through of our data preparation tool, please request a demo and we'll be happy to show you how to get the best out of Zoho DataPrep.

        Zoho CRM Training

          Create, share, and deliver

          beautiful slides from anywhere.

          Get Started Now


            Zoho Sign now offers specialized one-on-one training for both administrators and developers.

            BOOK A SESSION











                                        You are currently viewing the help pages of Qntrl’s earlier version. Click here to view our latest version—Qntrl 3.0's help articles.




                                            Manage your brands on social media

                                              Zoho Desk Resources

                                              • Desk Community Learning Series


                                              • Digest


                                              • Functions


                                              • Meetups


                                              • Kbase


                                              • Resources


                                              • Glossary


                                              • Desk Marketplace


                                              • MVP Corner


                                              • Word of the Day


                                                Zoho Marketing Automation

                                                  Zoho Sheet Resources

                                                   

                                                      Zoho Forms Resources


                                                        Secure your business
                                                        communication with Zoho Mail


                                                        Mail on the move with
                                                        Zoho Mail mobile application

                                                          Stay on top of your schedule
                                                          at all times


                                                          Carry your calendar with you
                                                          Anytime, anywhere




                                                                Zoho Sign Resources

                                                                  Sign, Paperless!

                                                                  Sign and send business documents on the go!

                                                                  Get Started Now




                                                                          Zoho TeamInbox Resources



                                                                                  Zoho DataPrep Resources



                                                                                    Zoho DataPrep Demo

                                                                                    Get a personalized demo or POC

                                                                                    REGISTER NOW


                                                                                      Design. Discuss. Deliver.

                                                                                      Create visually engaging stories with Zoho Show.

                                                                                      Get Started Now









                                                                                                          • Related Articles

                                                                                                          • Import configuration for cloud storage services

                                                                                                            You can import data from cloud storage using simple selection or advanced selection. Look into the following sections to learn about the import configuration of each selection type. How to import data from cloud storage using simple selection? How to ...
                                                                                                          • Import data from cloud storage services

                                                                                                            DataPrep supports import from the following cloud storage services. Google Drive One Drive Box Storage Dropbox Zoho WorkDrive Amazon S3 To import data from a cloud storage service 1. Open an existing pipeline or create a pipeline from the Home Page, ...
                                                                                                          • Import data from OneDrive

                                                                                                            Zoho DataPrep supports importing data from OneDrive, a file hosting service operated by Microsoft. OneDrive enables users to store, protect, share and synchronize their files. To import data from OneDrive 1. Open an existing pipeline or create a ...
                                                                                                          • Import data from Dropbox

                                                                                                            Zoho DataPrep supports importing data from Dropbox, a file hosting service. Dropbox offers cloud storage, file synchronization, personal cloud, and client software. To import data from Dropbox 1. Open an existing pipeline or create a pipeline from ...
                                                                                                          • Import data from Box

                                                                                                            Zoho DataPrep supports importing data from Box, a cloud storage service that allows users to store, collaborate and manage files securely in an online folder system accessible from any device. To import data from Box 1. Open an existing pipeline or ...
                                                                                                            Wherever you are is as good as
                                                                                                            your workplace

                                                                                                              Resources

                                                                                                              Videos

                                                                                                              Watch comprehensive videos on features and other important topics that will help you master Zoho CRM.



                                                                                                              eBooks

                                                                                                              Download free eBooks and access a range of topics to get deeper insight on successfully using Zoho CRM.



                                                                                                              Webinars

                                                                                                              Sign up for our webinars and learn the Zoho CRM basics, from customization to sales force automation and more.



                                                                                                              CRM Tips

                                                                                                              Make the most of Zoho CRM with these useful tips.



                                                                                                                Zoho Show Resources