Details

      Description

      Need capabilities to sample incoming data. - https://wiki.cask.co/display/CE/Sampling+Transform

      Use case:
      To build a machine learning model a data scientists wants to use 60% of annotated data for computing the model
      Ability to support a systematic sample; ie: first record selected at random followed by selection of everything nth record thereafter.

        Attachments

          Activity

            People

            • Assignee:
              romy Romy Khetan
              Reporter:
              sree Sreevatsan Raman
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: