Uploaded image for project: 'CDAP'
  1. CDAP
  2. CDAP-6227

SpamClassifier example should also support prediction on cdap stream's data

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.4.0
    • Fix Version/s: None
    • Component/s: CDAP Examples, Spark
    • Labels:
      None
    • Rank:
      1|hzzek7:

      Description

      Currently, SpamClassifier example reads training data from a stream to build a NaiveBayesModel and then uses this model to predict kafka topic messages as spam/ham. Although, this is a more realistic and futuristic, it hinders user who is interested in trying Spark Streaming in CDAP as the user needs to create topic in kafka, then add messages to it and also pass all this info to program as arguments.

      We should have the example configurable in to support reading prediction data from kafka or cdap streams. Users who just wants to try spark streaming can then easily use this example without any kafka settings.

        Attachments

          Activity

            People

            • Assignee:
              rsinha Rohit Sinha
              Reporter:
              rsinha Rohit Sinha
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated: