Distributed Scheduling Heuristics - Heuristic Approaches

4.2 Heuristic Approaches

4.2.2 Distributed Scheduling Heuristics

Scheduling algorithms for tasks with communication usually comprise two different phases [29]. At First a task selection phase, also called the prioritization phase, takes place and determines which task should be scheduled next. The second phase, a processor selection phase, determines the processor on which the task should be executed. In our scenario, the processor selection is fixed a priori, which means we focus on the task ordering mechanism of each heuristic.

4.2. Heuristic Approaches 65

TheHeterogeneous Earliest Finish Time(HEFT) [108] andDominant Sequence Clus-tering (DSC) [121] heuristics are examples for list-scheduling algorithms that main-tain a fixed priority list of tasks which is calculated once. Both heuristics use the length of the longest path (in terms of WCET and communication time) from a taskt, including the communication times, to a sink of the DAG for their task pri-oritization. We will use exit(t) to denote this path. HEFT simply ranks tasks by increasing exit(t) (c.f. Algorithm 2). The DSC heuristic uses the ERT of t plus exit(t) as the priority of t as shown inAlgorithm 3.

Algorithm 2 Heterogeneous Earliest Finish Time (HEFT) heuristic

1: functionScheduleAfter(t, releaset, T^sched)

2: blockedTime ←0

3: for alltsched ∈ T^sched do

4: if µ_t(t) =µ_t(t_sched)∧t_sched.C > release_tthen

5: blockedTime ←max(blockedTime, t_sched.C−release_t)

6: t.S =releaset+blockedTime

7: return {T^sched∪t}

8: functionHEFT(T)

9: T^sched ← ∅

10: ∀t∈ T calculateexit(t) and t.ERT

11: for allt∈ T sorted by decreasingexit(t)do

12: T^sched← ScheduleAfter(t, t.ERT, T^sched)

Algorithm 3 Dominant Sequence Clustering (DSC) heuristic

1: functionDSC(T)

2: T^sched ← ∅

3: ∀t∈ T : calculatet.ERT

4: ∀t∈ T :blevel(t) =exit(t)

5: while T \ T^sched 6=∅ do

6: ∀t∈ T :tlevel(t) =t.ERT

7: t←arg max(tlevel(t) +blevel(t)) overt∈ T \ T^sched

8: T^sched← ScheduleAfter(t, t.ERT, T^sched)

9: ∀t∈ T \T^sched calc t.ERT using tsched.S ast.ERT for all tsched∈ T^sched

66 4. From Specification to Solutions

The Mobility Directed (MD) [120] heuristic chooses tasks based on their mobility, defined as the difference between a task’s LFT and its earliest start time (EST), divided by the task’s WCET. Although the EST is similar to the ERT, it is recal-culated after each task selection and takes into account the scheduling time of the other workflow tasks. The EST is calculated in line 7 of Algorithm 4.

Algorithm 4 Mobility Directed (MD) heuristic

1: function MD(T)

2: T^sched← ∅

3: ∀t∈ T : calculate t.ERT and t.LF T

4: whileT \ T^sched6=∅do

5: t←arg max(^{t.LF T}_|t|^−t.ERT) over t∈ T \ T^sched

6: T^sched ← ScheduleAfter(t, t.ERT, T^sched)

7: ∀t∈ T \T^sched calc t.ERT using tsched.S ast.ERT for alltsched ∈ T^sched

Earliest Task First (ETF) [45] picks a task among the ready tasks, meaning tasks whose predecessors have already been scheduled, by choosing the task with the min-imum EST. Ties are broken by the task with the smallest LFT minus WCET as shown in Algorithm 5.

Algorithm 5 Earliest Task First (ETF) heuristic

1: function etf(T)

2: T^sched← ∅

3: ∀t∈ T : calculate t.ERT and t.LF T

4: whileT \ T^sched6=∅do

5: ReadyTasks ←args min(t.ERT) over t∈ T \ T^sched

6: t←arg min(t.LF T − |t|) over t∈ReadyTasks

7: T^sched ← ScheduleAfter(t, t.ERT, T^sched)

8: ∀t∈ T \T^sched calc t.ERT using t_sched.S ast.ERT for allt_sched ∈ T^sched We propose two additional scheduling heuristics: an adapted version of Potts’ heuris-tic [93] that works in a distributed environment with fixed task-placement, and the Least Delay heuristic (LD). Our proposed LD heuristic tries to determine the im-plications of scheduling each task in the ready set. The heuristic schedules each of the tasks in the ready set in a “what-if” manner and determines the EST of all tasks in the DAG based on this speculative scheduling. The resulting EST of each sink task is compared to its EST before the speculative scheduling, yielding a value for the expected delayt of the sink taskt. The maximumdelayt yields the delay for the entire workflow. The heuristic now chooses the task from the ready set that resulted in the minimum workflow delay. This heuristic has a high run-time complexity, but our evaluation (Section 4.3.1) shows it can generate solutions in many cases where the other heuristics failed to do so. The pseudocode is given in Algorithm 6.

4.2. Heuristic Approaches 67

The adapted Potts’ heuristic is set up by picking the task with the minimum LFT from the available ready set, meaning the current time on the task’s machine is equal or greater than the task’s EST. This initial step is also known as Schrage’s heuristic [93]. In most cases, Schrage’s heuristic yields valid schedules, meaning the DAG sinks do not violate the workflow deadline in the schedules. If the first pass of Schrage’s heuristic was unsuccessful, Potts’ heuristic then analyzes the resulting schedules and looks for a task A that violates its LFT.A is called the critical task.

This means there could be another task B with a smaller EST than A but with a larger latest starting time (LST) scheduled before A because A was not ready at that moment. If such a taskB, also called the interference task, exists on the same machine as A, we introduce an additional edge in the DAG from A toB to ensure that A will be scheduled before B. If no such task exists on the same machine as the critical task, we look for an interference task B⁰ on a different machine. We then take B⁰ as the new critical task and try to locate an interference task C on the same machine as B⁰. If C exists, we introduce an additional edge from C to B⁰ and continue as previously described. The modified workflow is then rescheduled with Schrage’s heuristic. Pseudocode for the modified Potts’ heuristic is shown in Algorithm 7.

Algorithm 6 Least Delay heuristic (LD)

1: functionLeastDelay(T,G)

2: T^sched ← ∅

3: ∀t∈ T : calculatet.ERT andexit(t)

4: while T \ T^sched 6=∅ do

5: T^tmp← T^sched

6: minDelay← ∞

7: tminDelay← ∅

8: for alltroot∈ T \ T^sched:P rec(troot) =∅ do . Unscheduled roots

9: T^tmp ← ScheduleAfter(t_root, t_root.ERT, T^tmp)

10: Tmp ← ∀t∈ T \ T^tmp calc. t.ERT using t.S ast.ERT fort∈ T^tmp

11: delay ← max(Tmp(t_leaf) − t.ERT) for t_leaf ∈ T \ T^sched : Succ(t_leaf) =∅

12: if delay < minDelay∨(delay =minDelay∧maxLen < exit(troot)) then. Scheduling current root leads to a smaller delay in the leafs than before

13: t_minDelay←t_root

14: minDelay←delay

15: maxLen←exit(t_root)

16: T^sched← ScheduleAfter(tminDelay, tminDelay.ERT, T^sched)

17: ∀t∈ T \T^sched calc t.ERT using t_sched.S ast.ERT for all t_sched∈ T^sched

68 4. From Specification to Solutions

Algorithm 7 Modified Potts’ heuristic for distributed systems

1: function Potts(T)

2: AbsoluteEarliest ← ∀t∈ T : calculate t.EST

3: fori≤number of tasks inT do

4: T^Schrage ← SchragesHeuristic(T)

5: if T^Schrage is a valid schedulethen. Valid if t.C of all DAG leafs≤DW 6: return T^Schrage

7: else

8: interference← IdentifyInterference(T^Schrage)

9: if interference 6=∅then

10: add new precedence relation crit≺interference toG

11: else

12: return Infeasible with Potts’ heuristic

13: return Infeasible with Potts’ heuristic

14: function SchragesHeuristic(T)

15: T^sched← ∅

16: ∀t∈ T : calculate t.LF T

17: whileT \ T^sched6=∅do

18: for all t_i ∈ T \ T^sched ordered by increasing t.LF T do

19: machineClock ←max(tj.C) over tj ∈ T^sched∧µt(ti) =µt(tj)

20: if ti.ERT ≥machineClock then

21: t←t_i

22: break for-loop

23: T^sched ← ScheduleAfter(t, t.ERT, T^sched)

24: ∀t∈ T \T^sched calc t.ERT using tsched.S ast.ERT for alltsched ∈ T^sched

25: return T^sched

26: function IdentifyInterference(T)

27: interference← ∅

28: for allt∈ T do .Identify critical task

29: if (t.C > t.LF T ∧t.S >AbsoluteEarliest(t))∨t.C > D_Wthen

30: crit ←t . tviolates its own deadline or the workflow deadline

31: break for-loop

32: for allt∈ T do .Identify interference task delaying the critical task

33: if µt(t) =µt(crit)∧crit.LF T < t.LF T ∧t⊀≺crit then

34: interference←t . tviolates local or workflow deadline

35: break for-loop

36: if interference6=∅then returninterference

37: crit ←arg max(t.ERT −AbsoluteEarliest(t)) over t∈P rec(crit)

38: for allt∈ T do

39: if µ_t(t) =µ_t(crit)∧crit.LF T < t.LF T ∧t⊀≺crit then

40: interference←t

41: break for-loop

42: return interference

Im Dokument A Distributed Data Processing Perspective on Industrial Real-Time Systems (Seite 78-83)