Uploaded image for project: 'CDAP'
  1. CDAP
  2. CDAP-10373

TPFSParquetSink created through hydrator pipeline is not explorable for hive 0.12

    XMLWordPrintableJSON

    Details

    • Rank:
      1|hzzi9j:

      Description

      When TPFSParquet sink is used in the pipeline, data is written to the sink however it is not explorable. Following exception is seen in the explore container logs:

      2016-07-28 05:37:15,581 - ERROR [pool-3-thread-19:o.a.h.h.q.Driver@419] - FAILED: SemanticException Unrecognized file format in STORED AS clause: parquet
      org.apache.hadoop.hive.ql.parse.SemanticException: Unrecognized file format in STORED AS clause: parquet
              at org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.handleGenericFileFormat(BaseSemanticAnalyzer.java:569)
              at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeCreateTable(SemanticAnalyzer.java:8968)
              at org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:8313)
              at org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:284)
              at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:441)
              at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:342)
              at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1000)
              at org.apache.hadoop.hive.ql.Driver.run(Driver.java:911)
              at org.apache.hive.service.cli.operation.SQLOperation.runInternal(SQLOperation.java:102)
              at org.apache.hive.service.cli.operation.SQLOperation.access$000(SQLOperation.java:62)
              at org.apache.hive.service.cli.operation.SQLOperation$1.run(SQLOperation.java:153)
              at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
              at java.util.concurrent.FutureTask.run(FutureTask.java:262)
              at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
              at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
              at java.lang.Thread.run(Thread.java:745)
      

      Syntax used for creating hive table "STORED as parquet" is not supported for Hive 0.12.

        Attachments

          Activity

            People

            • Assignee:
              sagar Sagar Kapare
              Reporter:
              sagar Sagar Kapare
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated: