CDAP / CDAP-11144

Need capabilities to join two different data sources in Hydrator pipeline


    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: None
    • Fix Version/s: 3.5.0
    • Component/s: Pipelines
    • Labels:
    • Release Notes:
      Adds support for joining data from multiple sources in Hydrator.

      Description

      One of the most common use cases is joining data from two different sources. Sample use case:

      Data cleansing and preparation
      Data scientists at a telco need to prepare data for model building. They need account data from a database, and CDR and IVR data from S3. They would like an easy way to bring all of the data into a single place, cleanse it, and prepare it for annotation. All of the data is keyed by account ID.
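
      To make the requested behavior concrete, here is a minimal sketch of an inner join across several sources on a shared key. It is plain Python, not Hydrator or CDAP code, and it does not describe how the feature was implemented. The function names, the `account_id` field name, and the sample records are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import product


def index_by(records, key):
    """Group a list of dict records by the value of `key`."""
    index = defaultdict(list)
    for record in records:
        index[record[key]].append(record)
    return index


def inner_join(key, *sources):
    """Inner-join any number of record lists on `key` (hypothetical helper).

    Only keys present in every source are kept. When a source has several
    records for one key, one output row is produced per combination.
    """
    indexes = [index_by(source, key) for source in sources]
    if not indexes:
        return []
    common = set(indexes[0])
    for index in indexes[1:]:
        common &= set(index)
    joined = []
    for k in sorted(common):
        for combo in product(*(index[k] for index in indexes)):
            row = {}
            for record in combo:
                row.update(record)  # later sources win on field-name clashes
            joined.append(row)
    return joined


# Made-up sample data standing in for the database, CDR, and IVR sources.
accounts = [
    {"account_id": 1, "plan": "prepaid"},
    {"account_id": 2, "plan": "postpaid"},
]
cdr = [
    {"account_id": 1, "call_seconds": 120},
    {"account_id": 1, "call_seconds": 30},
]
ivr = [
    {"account_id": 1, "ivr_option": "billing"},
    {"account_id": 3, "ivr_option": "support"},
]

rows = inner_join("account_id", accounts, cdr, ivr)
```

      In this sketch only account 1 appears in all three sources, so `rows` holds one row per CDR record for that account. An outer join would instead keep accounts that are missing from one or more sources.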


              People

              • Assignee:
                vinisha Vinisha Shah
                Reporter:
                sree Sreevatsan Raman
