Uploaded image for project: 'CDAP'
  1. CDAP
  2. CDAP-7302

If both Zookeeper and HBase are down, CDAP terminates

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.5.0
    • Fix Version/s: 4.1.0
    • Component/s: Master
    • Labels:
    • Release Notes:
      Fixed an issue that caused the CDAP Master to die if HBase was down when a follower became the leader.
    • Rank:
      1|hzzm3b:

      Description

      Suppose Zookeeper goes down for a short time. That will have (at least) two effects:

      • HBase may go down as it relies on ZK
      • CDAP Master will lose its ZK connection and shutdown (becomes follower)

      After Zookeeper comes back, the CDAP Master will become leader again and attempts to start up its services. However, if at that time HBase is still down, DatasetService will fail to start, which will terminate the start up sequence in the master and master will give up and exit.

      This means CDAP can in some cases not recover from a ZK failure and will have to be restarted manually.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                ashau Albert Shau
                Reporter:
                andreas Andreas Neumann
              • Votes:
                0 Vote for this issue
                Watchers:
                5 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: