Setup CSV input plugin setup
Please reference http://www.embulk.org/docs/built-in.html
it's easier to get the config.yml file from guess and moving forward from there.
Change the "path_prefix" part in the following
in:
type: file
path_prefix: "/Users/randy/GitHub/embulk/try1/csv/part-"
parser:
charset: UTF-8
type: csv
out:
type: stdout
Used following command to get config.yml
embulk guess example.yml -o config.yml
Change the csv related field in config.yml
in:
type: file
path_prefix: /Users/randy/GitHub/embulk/try1/csv/part-
parser:
charset: UTF-8
type: csv
newline: CRLF
delimiter: ','
quote: '"'
escape: '"'
trim_if_not_quoted: false
allow_extra_columns: false
allow_optional_columns: false
columns:
- {name: sourceIP, type: string}
- {name: destURL, type: string}
- {name: visitDate, type: timestamp, format: '%Y-%m-%d'}
- {name: adRevenue, type: double}
- {name: userAgent, type: string}
- {name: countryCode, type: string}
- {name: languageCode, type: string}
- {name: searchWord, type: string}
- {name: duration, type: long}
Next step: setup BigObject output plugin