Uploaded image for project: 'CDAP'
  1. CDAP
  2. CDAP-13334

About Building a Fraud Classification Machine Learning Model

    Details

    • Type: Bug
    • Status: In Progress
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 4.3.4
    • Fix Version/s: None
    • Component/s: CDAP Examples, Docs
    • Labels:
    • Rank:
      1|i00c67:

      Description

      I followed the steps & video in the walkthough for building a Fraud Classification Machine Learning Model: https://docs.cask.co/cdap/4.3.4/en/user-guide/tutorials/logistic.html

      When running the "FraudClassifier" pipeline, i got the following exception:

      org.apache.parquet.hadoop.BadConfigurationException: class org.apache.spark.sql.execution.datasources.parquet.CatalystReadSupport set in job conf at parquet.read.support.class is not a subclass of org.apache.parquet.hadoop.api.ReadSupport
      at org.apache.parquet.hadoop.util.ConfigurationUtil.getClassFromConfig(ConfigurationUtil.java:35) ~[na:na]
      at org.apache.parquet.hadoop.ParquetInputFormat.getReadSupportClass(ParquetInputFormat.java:182) ~[na:na]
      at org.apache.parquet.hadoop.ParquetInputFormat.getReadSupport(ParquetInputFormat.java:257) ~[na:na]
      at org.apache.parquet.hadoop.ParquetInputFormat.createRecordReader(ParquetInputFormat.java:245) ~[na:na]
      at org.apache.spark.rdd.SqlNewHadoopRDD$$anon$1.<init>(SqlNewHadoopRDD.scala:178) ~[na:na]
      at org.apache.spark.rdd.SqlNewHadoopRDD.compute(SqlNewHadoopRDD.scala:126) ~[na:na]
      at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:306) ~[na:na]
      at org.apache.spark.rdd.RDD.iterator(RDD.scala:270) ~[na:na]
      at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:38) ~[na:na]
      at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:306) ~[na:na]
      at org.apache.spark.rdd.RDD.iterator(RDD.scala:270) ~[na:na]
      at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:38) ~[na:na]
      at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:306) ~[na:na]
      at org.apache.spark.rdd.RDD.iterator(RDD.scala:270) ~[na:na]
      at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:66) ~[na:na]
      at org.apache.spark.scheduler.Task.run(Task.scala:89) ~[na:na]
      at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:214) ~[na:na]
      at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) ~[na:1.7.0_13]
      at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) ~[na:1.7.0_13]
      at java.lang.Thread.run(Thread.java:722) [na:1.7.0_13]

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                vinisha Vinisha Shah
                Reporter:
                Soliman Ghada
              • Votes:
                0 Vote for this issue
                Watchers:
                5 Start watching this issue

                Dates

                • Created:
                  Updated: