In DataPrep, you can import various file types, including CSV, TSV, JSON, HTML, XLS, XLSX and XML.
2. Choose the Files option under Choose your data source. You can also click the Files category in the left pane and upload your files from there.
3. Drag and drop your files into the upload box, or click it to browse for your files. You can also enable the Import from local network toggle if you want to import local files incrementally based on a specific file pattern.
Click here to know more.
4. The added files are listed along with their file sizes. You can add more files or remove existing ones.
5. Click Import once you have all your files. The upload progress will be shown.
6. If the files are HTML, XLS, or XLSX files, or are password-protected, you can enter the password where required and preview the files before clicking the Import button to complete the import.
7. Once all files are successfully imported, you will be taken to the Pipeline builder page, where you can start applying transforms. You can also right-click the stage and choose the Prepare data option to prepare your data in the DataPrep Studio page.
Click here to know about transforms.
Note
1. Each file is created as a separate dataset inside your workspace.
2. If the file is password-protected, DataPrep will prompt you to enter the password to open the file. This password will not be saved and is only used for opening the password-protected file.
3. If you upload Excel files, DataPrep lets you select individual sheets in the file for import. You can also preview the sheets before choosing to import them.
Using the Import from local network toggle, you can select files dynamically based on a regex pattern. This is useful for getting new or incremental data from your local file system. Incremental fetch is a method of importing files that are new at the source since the previous sync. Zoho DataPrep helps you import incremental data from your local files using Zoho Databridge.
Important: This option is not limited to incremental fetch. You can also use it to bulk import files based on a file pattern.
In DataPrep, you can incrementally import various file types, including CSV, TSV, JSON, XML, and TXT.
1. Enable the Import from local network toggle if you want to import local files incrementally based on a specific file pattern.
Info: Please ensure that you have installed Zoho Databridge on your machine to import local files.
Click here to know how to download and install Zoho Databridge.
2. Choose an active Databridge and provide the following details:
Folder path : The folder path where you want to search for files. Eg. D:\DataPrep\Datasets
Info: Folder path is case-sensitive.
Include subfolders : You can select this checkbox if you want to include subfolders while searching for a file.
File pattern : The pattern used to match the file names in the specified location. This supports regex-type matching. You can also use the pattern ".*" to match any file in the specified path.
Info: File pattern is case-sensitive.
Note: The file pattern match is a simple regex-type match. For example, to fetch files with names such as Sales_2022.csv, Sales_2023.csv, and Sales_2024.csv, you can input the pattern Sales_.*
Similarly, to fetch files such as PublicData1.csv, PublicData2.csv, and PublicData3.csv, use Public.*
If you want to import a single file, then specify the pattern using the exact file name.
Eg: leads_jan_2022.*
Parse file as : Choose the required extension to parse the file. If your file is not in a commonly used format, you can use this option to parse it as one of the following formats so that the data is imported in a readable form. The available formats are CSV, TSV, JSON, XML, and TXT.
Merge files and import - This will merge all the files that match the pattern specified and import them as a single dataset.
You can use this option to merge files together during the import itself, without having to perform unions post import.
Info: This option can merge a maximum of 5 files at a time.
Note: If this checkbox is unchecked, only 1 file will be fetched at a time.
Eg. If your local path has 10 files, the first 5 will be merged into one dataset and imported. During the next reload, the remaining 5 files will be merged and imported.
Similarly, if your local path has 8 files, the first 5 will be merged and fetched first, followed by the next 3.
3. Click the Import button.
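To check a file pattern before entering it, the matching and merge behavior described in step 2 can be approximated locally. The following Python sketch is illustrative only and is not DataPrep's or Databridge's implementation. The function names match_files and plan_imports are hypothetical. It assumes that the pattern must match the whole file name and that files are picked up in name order; this article does not state either detail.

```python
import re

MAX_MERGE = 5  # merge limit stated in this article


def match_files(file_names, pattern):
    """Return the names that the pattern matches in full (case-sensitive)."""
    compiled = re.compile(pattern)
    # Assumption: the pattern is applied to the whole file name.
    return [name for name in file_names if compiled.fullmatch(name)]


def plan_imports(matched, merge=True):
    """Group matched files into one batch per import or reload."""
    size = MAX_MERGE if merge else 1
    # Assumption: name order; the real fetch order is not documented here.
    ordered = sorted(matched)
    return [ordered[i:i + size] for i in range(0, len(ordered), size)]


files = ["Sales_2022.csv", "Sales_2023.csv", "Sales_2024.csv",
         "sales_2021.csv", "PublicData1.csv", "leads_jan_2022.csv"]

# Case-sensitive match: sales_2021.csv is not selected.
print(match_files(files, r"Sales_.*"))

# Eight matching files with merge enabled: a batch of 5, then a batch of 3.
eight = [f"part_{n}.csv" for n in range(8)]
print([len(batch) for batch in plan_imports(eight)])
```

With the Merge files and import checkbox unchecked, the same sketch yields one file per batch when called as plan_imports(matched, merge=False).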
SEE ALSO