In DataPrep, you can import various file types, including CSV, TSV, JSON, HTML, XLS, XLSX and XML. With this connector, you can streamline your ETL workflows, enable smooth data movement, and simplify data integration between local files and Zoho DataPrep.
Note: DataPrep also supports files without an extension and files in plain text format. However, binary files cannot be parsed during import; you will have to manually convert such a file into one of the supported formats listed above.
To import a local file
1. Create a pipeline or open an existing pipeline from the Home Page, Pipelines tab or Workspaces tab and click the Add data option. You can also click the Import data option under the Workspaces tab to import data.
Info: You can also click the Import data icon at the top of the pipeline builder and bring data from multiple sources into the pipeline.
2. Choose the Files option under Choose your data source. You can also click the Files category in the left pane and upload your files using one of the options below.
3. Drag and drop your files into the upload box or click on it to browse for your files. You can also enable the Import from local network toggle if you want to import local files incrementally based on a specific file pattern.
Click here to know more.
Note: For ZIP files, only one file per archive is supported. Make sure the .zip file contains only one compressed file.
You can also click the Try sample link to import the sample dataset and see how it works.
4. The files added will be shown along with their file size. You have the option to add more files or remove the existing ones.
Note : In the case of HTML files, data selection for import depends on the order in which the tables exist within the HTML page.
5. Click Import once you have all your files. The upload progress will be shown.
6. If the files are HTML, XLS, XLSX, or password-protected files, you can enter the password, if required, and preview the files before clicking the Import button to complete the import.
7. Once all files are successfully imported, you will be taken to the Pipeline builder page, where you can start applying transforms to the ETL pipeline. You can also right-click the stage and choose the Prepare data option to prepare your data in the DataPrep Studio page.
Click here to know about transforms.
8. Once you are done creating your data flow and applying the necessary transforms in your stages, you can right-click a stage and add a destination to complete your data flow.
Note: After adding a destination to the ETL pipeline, you can first try executing your pipeline using a manual run. Once you have made sure the manual run works, you can set up a schedule to automate the pipeline and data movement. Learn about the different types of runs here.
Note:
1. Each file is created as a separate dataset inside your workspace.
2. If the file is password-protected, DataPrep will prompt you to enter the password to open the file. This password will not be saved and is only used for opening the password-protected file.
3. If you upload Excel files, DataPrep offers an option to select individual sheets in the file for import. You can also preview the sheets before choosing to import them.
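The note on Zip files in step 3 says that an archive may contain only one file. Before uploading, this can be checked locally. The following is a minimal sketch using Python's standard zipfile module; the helper names count_files_in_zip and is_single_file_zip are hypothetical and are not part of DataPrep, and the archives are built in memory purely for illustration.

```python
import io
import zipfile


def count_files_in_zip(zip_source):
    """Return the number of regular files (directories excluded) in a ZIP archive."""
    with zipfile.ZipFile(zip_source) as archive:
        return sum(1 for info in archive.infolist() if not info.is_dir())


def is_single_file_zip(zip_source):
    """True when the archive holds exactly one file, as the note requires."""
    return count_files_in_zip(zip_source) == 1


# Build two small in-memory archives to demonstrate the check (sample data only).
single = io.BytesIO()
with zipfile.ZipFile(single, "w") as archive:
    archive.writestr("sales_2024.csv", "id,amount\n1,100\n")

multiple = io.BytesIO()
with zipfile.ZipFile(multiple, "w") as archive:
    archive.writestr("sales_2023.csv", "id,amount\n1,90\n")
    archive.writestr("sales_2024.csv", "id,amount\n1,100\n")

print(is_single_file_zip(single))    # archive with one file
print(is_single_file_zip(multiple))  # archive with two files
```

The same helpers accept a file path instead of an in-memory buffer, so a local archive can be checked by passing its path.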
To import files from local network
Using the Import from local network toggle, you can select files dynamically based on a regex pattern. This can be used to get new or incremental data from your local file system. Incremental fetch is a method of importing only the files that were added to a source after the previous sync. Zoho DataPrep helps you import incremental data from your local files using Zoho Databridge.
Important: This option is not limited to incremental fetch. You can also use it to bulk import files based on the file pattern.
In DataPrep, you can incrementally import various file types, including CSV, TSV, JSON, XML, XLS, XLSX and TXT.
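To illustrate the idea of incremental fetch described above, the sketch below picks only the files that changed after a previous sync. It is a conceptual example only: how Zoho Databridge actually tracks syncs is not described here, so the use of file modification times, and the names files_modified_since and demo, are assumptions made for illustration.

```python
import os
import tempfile
import time


def files_modified_since(folder, last_sync_epoch):
    """List files in `folder` whose modification time is later than the last sync."""
    fresh = []
    for entry in os.scandir(folder):
        if entry.is_file() and entry.stat().st_mtime > last_sync_epoch:
            fresh.append(entry.name)
    return sorted(fresh)


def demo():
    """Create one 'old' and one 'new' file, then return the files newer than the sync."""
    with tempfile.TemporaryDirectory() as folder:
        old_file = os.path.join(folder, "Sales_2022.csv")
        new_file = os.path.join(folder, "Sales_2023.csv")
        for path in (old_file, new_file):
            with open(path, "w") as handle:
                handle.write("id,amount\n")

        # Assume the previous sync happened one hour ago.
        last_sync = time.time() - 3600
        # Backdate the first file so it appears to predate the last sync.
        os.utime(old_file, (last_sync - 60, last_sync - 60))

        return files_modified_since(folder, last_sync)


print(demo())
```

Only the file written after the assumed sync time is returned; the backdated file is skipped, which mirrors the "new files after the previous sync" behaviour in spirit.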
1. Enable the Import from local network toggle if you want to import local files incrementally based on a specific file pattern.
Info: Please ensure that you have installed Zoho Databridge on your machine to import local files.
Click here to know how to download and install Zoho Databridge.
2. Choose an active Databridge and provide the following details:
Folder path : The folder path where you want to search for files. Eg. D:\DataPrep\Datasets
Info: Folder path is case-sensitive.
Include subfolders : You can select this checkbox if you want to include subfolders while searching for a file.
File pattern : The pattern used to match the file names in the specified location. This supports regex type matching. You can also use the pattern ".*" to match any file in the specified path.
Merge files and import : This will merge all the files that match the pattern specified and import them as a single dataset.
File password : Enter the password if the file is password protected.
Info: File pattern is case-sensitive.
Note: The file pattern match is a simple regex type match. For example, to fetch files with names such as Sales_2022.csv, Sales_2023.csv, Sales_2024.csv, you can input the pattern Sales_.*
Similarly, to fetch files such as PublicData1.csv, PublicData2.csv, PublicData3.csv, use Public.*
If you want to import a single file, then specify the pattern using the exact file name.
Eg: leads_jan_2022.*
File type : Choose the required file format. The available formats are CSV, TSV, JSON, XLS, XLSX, XML, and TXT.
Sheet pattern : This option is available for the XLS and XLSX formats only. The pattern used to match the sheet names in the file. This supports regex type matching. You can also use the pattern ".*" to match any sheet in the file.
The sheet pattern match is also a simple regex-based match. For example, to fetch sheets with names such as Sales_2022, Sales_2023, Sales_2024, you can input the pattern Sales_.*
Similarly, to fetch sheets such as PublicData1, PublicData2, PublicData3, use Public.*
If you want to import a single sheet, then specify the pattern using the exact sheet name.
Eg: Leads_Jan_2022.*
Info: Sheet pattern is case-sensitive.
Sheet password : This option is available for XLS and XLSX formats only. Enter the password if the sheet is password protected.
Merge sheets and import : This will merge all the sheets that match the pattern specified and import them as a single dataset.
You can use this option to merge sheets together during the import itself, without having to perform unions post import.
Note: If this checkbox is unchecked, only one sheet will be fetched at a time.
3. Click the Import button.
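The file and sheet patterns described in step 2 can be tried out locally before configuring the import. The sketch below uses Python's re module and assumes that a pattern must match the whole name (which is consistent with the leads_jan_2022.* example, but is not confirmed for DataPrep's own matcher). The helper match_names and the sample names are hypothetical.

```python
import re


def match_names(pattern, names):
    """Return the names that the pattern matches in full (case-sensitive)."""
    compiled = re.compile(pattern)
    return [name for name in names if compiled.fullmatch(name)]


# Sample file names for illustration only.
files = [
    "Sales_2022.csv",
    "Sales_2023.csv",
    "sales_2024.csv",      # lowercase "s": matching is case-sensitive
    "PublicData1.csv",
    "leads_jan_2022.csv",
]

print(match_names(r"Sales_.*", files))          # the two Sales_ files
print(match_names(r"Public.*", files))          # PublicData1.csv
print(match_names(r"leads_jan_2022.*", files))  # a single file
print(match_names(r".*", files))                # every file

# Sheet names are matched the same way, without a file extension.
sheets = ["Sales_2022", "Sales_2023", "Leads_Jan_2022"]
print(match_names(r"Sales_.*", sheets))
```

Because matching is case-sensitive, sales_2024.csv is not returned for Sales_.*; this mirrors the Info notes on case sensitivity for the file and sheet patterns.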