CDAP-8467

Move the logic of tracking external dataset to the Hydrator app

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Pipelines
    • Labels:

      Description

      Currently, in order to track a source or sink as an external dataset for metadata and lineage, users must implement a specific API that is not part of the Hydrator API:

      co.cask.hydrator.common.ReferenceBatchSource,
      co.cask.hydrator.common.ReferenceStreamingSource,
      co.cask.hydrator.common.ReferenceBatchSink, and
      co.cask.hydrator.common.ReferenceStreamingSink, instead of the standard ETL APIs for the source and sink.

      This has a major disadvantage: users can easily extend the wrong API and lose the tracking functionality. Detection of external datasets should be part of the app and should not depend on users extending the right APIs.
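
      The proposal could be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual Hydrator code: `Stage`, `PlainStage`, `ExternalDatasetRegistry`, and `deploy` are hypothetical names, and the use of a `referenceName` property as the marker for an external dataset is assumed for the example. The point is that the plugin extends nothing special; the app inspects each stage and registers the external dataset itself.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Map;

public class ExternalDatasetTrackingSketch {

    /** Hypothetical stand-in for a standard ETL source or sink; not a real CDAP interface. */
    public interface Stage {
        String name();
        Map<String, String> properties();
    }

    /** Hypothetical plugin that implements only the standard interface, with no tracking base class. */
    public static final class PlainStage implements Stage {
        private final String name;
        private final Map<String, String> properties;

        public PlainStage(String name, Map<String, String> properties) {
            this.name = name;
            this.properties = properties;
        }

        @Override
        public String name() {
            return name;
        }

        @Override
        public Map<String, String> properties() {
            return properties;
        }
    }

    /** Hypothetical app-side registry standing in for metadata/lineage tracking. */
    public static final class ExternalDatasetRegistry {
        private final List<String> tracked = new ArrayList<>();

        void register(String referenceName) {
            tracked.add(referenceName);
        }

        public List<String> tracked() {
            return Collections.unmodifiableList(new ArrayList<>(tracked));
        }
    }

    /**
     * The app, not the plugin, decides what gets tracked: every stage that
     * carries a non-empty "referenceName" property (an assumed convention)
     * is registered as an external dataset.
     */
    public static ExternalDatasetRegistry deploy(List<Stage> stages) {
        ExternalDatasetRegistry registry = new ExternalDatasetRegistry();
        for (Stage stage : stages) {
            String referenceName = stage.properties().get("referenceName");
            if (referenceName != null && !referenceName.isEmpty()) {
                registry.register(referenceName);
            }
        }
        return registry;
    }

    public static void main(String[] args) {
        List<Stage> stages = Arrays.asList(
            new PlainStage("dbSource", Collections.singletonMap("referenceName", "customers_db")),
            new PlainStage("transform", Collections.emptyMap()));
        // Only the stage that declares a reference name is tracked.
        System.out.println(deploy(stages).tracked());
    }
}
```

      With this shape, a plugin author cannot lose tracking by extending the wrong class, because tracking no longer depends on the class hierarchy at all.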


              People

              • Assignee:
                ashau Albert Shau
                Reporter:
                sree Sreevatsan Raman
              • Votes:
                0
                Watchers:
                1

                Dates

                • Created:
                  Updated:
                  Resolved: