Uploaded image for project: 'CDAP'
  1. CDAP
  2. CDAP-11029

Better way to compute records in and out of spark stages

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.1.0
    • Component/s: Pipelines
    • Labels:
    • Rank:
      1|hzzifz:

      Description

      For plugin types that don't operate on a record by record basis (like SparkCompute or SparkSink), we have to do a count in order to get the records in and out of that stage, which may be costly. We should see if there is a better way to do it.

        Attachments

          Activity

            People

            • Assignee:
              ashau Albert Shau
              Reporter:
              ashau Albert Shau
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: