Inductive miner

Inductive miner belongs to a class of algorithms used in process discovery. Various algorithms proposed previously give process models of slightly different type from the same input. The quality of the output model depends on the soundness of the model. A number of techniques such as alpha miner, genetic miner, work on the basis of converting an event log into a workflow model, however, they do not produce models that are sound all the time. Inductive miner relies on building a directly follows graph from event log and using this graph to detect various process relations.

Definitions
A directly follows graph is a directed graph that connects an activity A to another activity B if and only if activity B occurs chronologically right after activity A for any given case in the respective event log. A directly follows graph is represented mathematically by:

$$G(L) = (A_L, \rightarrow_L, A^s_L, A^e_L) $$

Where

$$A_L = Nodes $$ (activities in the log)

$$\rightarrow_L = edges $$ (directly follows relation)

$$A^s_L = Start node $$

$$A^e_L = End node $$

The inductive miner technique relies on the detection of various cuts on the directly follows graph created using the event log. The core idea behind inductive miner lies in the unique methodology of discovering various divisions of the arcs in the directly follows graph, and using the smaller components after division to represent the execution sequence of the activities.The inductive miner algorithm uses the directly follows graph to detect one of the following cuts.


 * Exclusive OR cut: The exclusive OR cut groups the activities such that the activities belonging to different groups have no relations between them.

'''$$(\times, A_1, ..., A_n) $$''' is an exclusive OR cut iff: $$\forall i,j \in \{1, .. , n\} (\forall a \in A_i, b \in A_j (i \neq j \Rightarrow \neg(a \rightarrow_L b) ))

$$


 * Sequence cut: The sequence cut groups activities such that activities between them have a directly follows relation from previous group to next group but not the other way around.

'''$$(\rightarrow_L, A_1, .., A_n)

$$''' is a sequence cut iff: $$\forall i,j \in \{1, .. , n\} (\forall a \in A_i, b \in A_j (a \rightarrow_L b \land \neg(b \rightarrow_L a)) ) $$


 * Parallel cut: Parallel cut groups activities such that each group has a start activity and an end activity, and the activities between the groups have directly follows relations between them.

$$(\land, A_1, .., A_n)

$$ is a parallel cut iff:

- $$\forall i \in \{1, .., n\} ( A_i \cap A^s_L \neq \emptyset \land A_i \cap A^e_L \neq \emptyset )

$$

- $$\forall i,j \in \{1, .. , n\} (\forall a \in A_i, b \in A_j (i \neq j \Rightarrow a \rightarrow_L b ))

$$


 * Redo loop cut: A loop cut groups elements into two parts. A do part and a redo part. The activities in the event log from a redo loop cut can start and end only with the activities from the do part.

$$(\circlearrowleft, A_1, .., A_n)

$$ is a redo loop cut iff:

- $$n \geq 2

$$

- $$A_1 \supseteq A^s_L \cup A^e_L

$$

- $$\forall i,j \in \{2, .. , n\} (\forall a \in A_i, b \in A_j (i \neq j \Rightarrow \neg(a \rightarrow_L b) ))

$$

- $$\forall i \in \{2, .. , n\} (\forall b \in A_i (\exists a \in A^e_L (a \rightarrow_L b)) \Rightarrow (\forall a \in A^e_L (a \rightarrow_L b) ))

$$

- $$\forall i \in \{2, .. , n\} (\forall b \in A_i (\exists a \in A^s_L (b \rightarrow_L a)) \Rightarrow (\forall a \in A^s_L (b \rightarrow_L a) ))

$$

Types

 * 1) Inductive miner with fall through: The complex event log sometimes would make it impossible to detect any cuts using the above techniques. In that case, there are additional fall throughs that can be applied to obtain better representation of process tree instead of a flower model.
 * 2) Inductive miner frequency-based: The less frequent relations in the event log sometimes creates problems in detecting any type of cuts. In that case, the directly follows relations below a certain threshold are removed from the directly follows graph and the resultant graph is used for detecting the cuts.
 * 3) Inductive miner for big data: This includes an improvement on the existing inductive miner to handle big data sets.