This takes data from a specified
and writes the data out with the specified
The setup is as follows:
as the data source
as the destination.
When setting up the locations, use 2 different
respectively to configure the locations of where the data will be
read from and written to.
When writing the data, you need to specify a link
determine how to slice up the data being written (say in to number of lines per record per file
among other implementations.
Finally, you may specify a batch size for batch read and write if the record reader and writer support it.
Of note, is you can also specify multiple readers.
In which case, it will read from every stream jointly and write out the specified
will work the same with the following exceptions, you must specify
(one split per reader)
and the readers which will be read from
writing to the same record writer.
for more information here.