[Solved-1 Solution] File formats that can be read using Pig



What is a file format?

  • A file format is the layout of a file in terms of how the data within the file is organized. A program that uses the data in a file must be able to recognize and possibly access data within the file.

Problem:

  • What kinds of file formats can Pig read, and how can data be stored in different formats?

Solution 1:

  • Pig ships with a few built-in load and store functions, but they are limited:
    • BinStorage - loads and stores data in a machine-readable binary format
    • PigStorage - loads and stores text whose fields are separated by a delimiter (tab by default, or another character such as a comma)
    • TextLoader - loads plain text line by line, with each line becoming a single field; it can load but not store
  • Beyond the built-ins, piggybank is a library of community-contributed user-defined functions. It includes a number of load and store functions, among them an XML loader, but no XML storer.
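
The functions listed above might be used roughly as in the following Pig Latin sketch. All file paths, the schema, the piggybank jar location, and the 'book' tag name are hypothetical and chosen only for illustration; adjust them to your own data and installation.

```pig
-- Hypothetical input: a comma-delimited file with three fields.
-- PigStorage(',') splits each line on commas.
users = LOAD 'input/users.csv' USING PigStorage(',')
        AS (id:int, name:chararray, city:chararray);

-- Store the same relation in other formats.
-- PigStorage() with no argument writes tab-delimited text.
STORE users INTO 'output/users_tsv' USING PigStorage();
-- Any single-character delimiter can be passed instead.
STORE users INTO 'output/users_pipe' USING PigStorage('|');
-- BinStorage writes a binary format that BinStorage can load back.
STORE users INTO 'output/users_bin' USING BinStorage();

-- TextLoader reads each line as one chararray field (load only).
lines = LOAD 'input/app.log' USING TextLoader() AS (line:chararray);

-- piggybank must be registered before its functions can be used.
-- The jar path here is an assumption about the local installation.
REGISTER 'lib/piggybank.jar';
-- XMLLoader returns each <book>...</book> element as one chararray.
books = LOAD 'input/books.xml'
        USING org.apache.pig.piggybank.storage.XMLLoader('book')
        AS (doc:chararray);
```

Because piggybank has no XML storer, a relation loaded this way would have to be written back with one of the other store functions, for example PigStorage.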
