Working With Multiple Files

jcatoe · January 17, 2018, 12:00am

I’m trying to replicate this process with CloverETL:

- download multiple zip files from a website
- unzip the text files within; each has the same name as the zip file it’s in
- the text files get converted into tables within a Postgres database
- then some tables get joined together

My question is: do I need to create multiple paths for each file or can they be transformed together without explicitly referencing individual files?

I have been able to download all of the zipped files from the website by reading in a CSV of filenames and mapping it through the HTTPConnector, but from there I don’t know how to handle the different metadata for each file without working with each file separately.

I’m hesitant to work with them separately because there are 20-30 files and new ones get generated each year, so it would require yearly maintenance that I may not be here for, and it would get unwieldy quickly.

I was hoping that http://forum.cloveretl.com/viewtopic.php?t=3622 would help me, but it was written in CTL1, and I haven’t been able to replicate it successfully.

Thank you for the help!

the_goat · January 18, 2018, 2:41pm

Are we talking about BLOBs or CLOBs in that database? Or, as I suspect, do you need to have the files parsed into fields/columns and then loaded into the database as regular tables?

jcatoe · January 19, 2018, 3:51pm

Columns and tables. We really just drop all but one of the tables and recreate them from the text files.

PedroVazquezRosario · January 23, 2018, 5:16pm

I created an example of how you can achieve your use case. I used a combination of Jobflows and Graphs to generate the metadata and the database tables dynamically, based on the data in each file, and then inserted the data based on the table name.

MultipleFilesExample.zip
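
For readers who can't open the attachment, the general idea (derive the table name and columns from each file instead of hard-coding them) can be sketched outside CloverETL. The Python below is only an illustration under stated assumptions, not the contents of the attached jobflow: it assumes each zip holds one delimited text file whose first line is a header row, it types every column as TEXT, and the function names (`table_name_for`, `read_header`, `create_table_sql`, `plan_tables`) are hypothetical.

```python
import csv
import io
import re
import zipfile
from pathlib import Path


def table_name_for(zip_path):
    """Derive a SQL-safe table name from the zip file's base name."""
    stem = Path(zip_path).stem.lower()
    name = re.sub(r"[^a-z0-9_]", "_", stem)
    # Prefix names that start with a digit so they stay readable identifiers.
    return f"t_{name}" if name[0].isdigit() else name


def read_header(zip_path, delimiter=","):
    """Return the column names from the first line of the text file in the zip."""
    with zipfile.ZipFile(zip_path) as archive:
        member = archive.namelist()[0]  # assumption: one text file per archive
        with archive.open(member) as raw:
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            return next(csv.reader(text, delimiter=delimiter))


def create_table_sql(table, columns):
    """Build drop-and-recreate DDL; every column is TEXT (an assumption)."""
    cols = ", ".join(
        '"{}" TEXT'.format(c.strip().replace('"', '""')) for c in columns
    )
    return f'DROP TABLE IF EXISTS "{table}"; CREATE TABLE "{table}" ({cols});'


def plan_tables(zip_dir, delimiter=","):
    """Map table name -> DDL for every zip in the directory, with no per-file code."""
    plan = {}
    for zip_path in sorted(Path(zip_dir).glob("*.zip")):
        table = table_name_for(zip_path)
        plan[table] = create_table_sql(table, read_header(zip_path, delimiter))
    return plan
```

Because the loop only looks at whatever zip files are present, a new file added next year would get its own table without anyone editing the job. Loading the rows and joining the tables are left out of the sketch.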
