Exclude records that are duplicated

Dear team,

I currently try to build the following rule:

I have two Excel readers that read correctly two separated Excel sheets.
The first sheet contains the following fields:
- id
- name
- description
The second sheet contains the following fields(not equal except the id)
- id
- test 1
- test 2

I’d like to retrieve rows from the first sheet and to exclude records with id (sheet 1) that are equal to id (sheet 2).

This is an example:

Sheet 1
id name description
------------------------
1 name1 desc1
2 name2 descr2
3 name3 descr3
4 name4 descr4
5 name5 descr5

Sheet 2
id test1 test2
------------------------
2 toto tata
3 titi tete

And I’d like to obtain the following table as a result:

Result
id name description
------------------------
1 name1 desc1
4 name4 descr4
5 name5 descr5

Thanks for your help!
Jerome

Dear Jerome,

you can use following approach:

1. sort both streams of records comming from excels by id
2. send both streams to http://doc.cloveretl.com/documentation/ … ction.html
3. define transformation (probably just copying one input record)
4. on port 0 or 2 (depending on order of inputs - see documentation) you should find data you need

I hope this helps.

Absolutely perfect, thanks.
I just uploaded a screenshot of the solution.
Regards