Hi Agata,
Sorry, my first message was not very clear. Here is the situation:
Data1.FMT:
<?xml version="1.0" encoding="UTF-8"?>
Data2.FMT:
<?xml version="1.0" encoding="UTF-8"?>
Data1.txt:
1, “albert”, “usa”
2, “charles”, “usa”, “1234567890”
Data2.txt:
1, “ABC”, “M”
2, “XYZ”,“M”
When I join the two inputs with joinKey=ID and dataPolicy=Controlled, I get this join result:
1, “albert”, “usa”
2, “charles”, “usa”, “1234567890”,“ABC”, “M”
Because the first row of Data1.txt is missing its Phone field, the “carriage-return line-feed” (\r\n) at the end of the first row, together with the whole second row (2, “charles”, “usa”, “1234567890”), seems to be read as the first row's Phone value. This combined first row (1, “albert”, “usa”\r\n2, “charles”, “usa”, “1234567890”) is then joined with the first row of Data2.txt (1, “ABC”, “M”), which gives the join result above.
Is this a bug? Perhaps the correct behavior would be to ignore the first row of Data1.txt, since it is missing a field, and to join only the second row, so the correct join result would be:
2, “charles”, “usa”, “1234567890”, “XYZ”,“M”
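To make the behavior I expected concrete, here is a rough Python sketch. It is only an illustration and says nothing about how the product parses or joins internally. It assumes that Data1 has 4 fields and Data2 has 3 (based on the sample rows), that the join key ID is the first column, and that the data files use plain straight quotes. The helper names parse_rows and inner_join are made up for this example.

```python
import csv
import io


def parse_rows(text, expected_fields):
    """Parse delimited text one line at a time and drop any row whose
    field count differs from expected_fields (hypothetical helper)."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        # skipinitialspace lets the quote after ", " be recognised as a quote
        fields = next(csv.reader(io.StringIO(line), skipinitialspace=True))
        if len(fields) == expected_fields:
            rows.append(fields)
    return rows


def inner_join(left, right):
    """Join two row lists on their first column (assumed to be ID)."""
    right_by_id = {row[0]: row for row in right}
    return [l + right_by_id[l[0]][1:] for l in left if l[0] in right_by_id]


data1 = '1, "albert", "usa"\r\n2, "charles", "usa", "1234567890"\r\n'
data2 = '1, "ABC", "M"\r\n2, "XYZ","M"\r\n'

left = parse_rows(data1, expected_fields=4)   # row 1 is dropped (3 fields)
right = parse_rows(data2, expected_fields=3)
result = inner_join(left, right)
print(result)
```

Because each line is parsed on its own, the short first row can never absorb the \r\n and the row after it; it is simply rejected, and only ID 2 reaches the join.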
Any help/suggestion is greatly appreciated.
Thanks,
albert