This document summarizes research into mining "partially-ordered sequential rules" (POSR) from multiple sequences to address limitations in existing sequential rule mining algorithms. The researchers propose the RuleGrowth algorithm to efficiently mine POSR, where the items in a rule's antecedent and consequent are unordered. An extension called TRuleGrowth accepts a sliding window constraint to find rules occurring within a maximum time frame. Experiments on four real-life datasets show RuleGrowth and TRuleGrowth have good performance and scalability compared to baselines. Applying a sliding window constraint reduced the number of rules discovered by orders of magnitude. POSR also provided higher prediction accuracy than regular sequential rules in a sequence prediction application.
1 of 1
Download to read offline
More Related Content
Mining partially ordered sequential rules common to multiple sequences (2)
1. 2020 2021
#13/ 19, 1st Floor, Municipal Colony, Kangayanellore Road, Gandhi Nagar, Vellore 6.
Off: 0416-2247353 Mo: +91 9500218218 / +91 8220150373
Website: www.shakastech.com, Email - id: shakastech@gmail.com, info@shakastech.com
Mining Partially-Ordered Sequential Rules Common to Multiple Sequences
Abstract:
Sequential rule mining is an important data mining problem with multiple
applications. An important limitation of algorithms for mining sequential rules
common to multiple sequences is that rules are very specific and therefore many
similar rules may represent the same situation. This can cause three major
problems: (1) similar rules can be rated quite differently, (2) rules may not be found
because they are individually considered uninteresting, and (3) rules that are too
specific are less likely to be used for making predictions. To address these issues,
we explore the idea of mining partially-ordered sequential rules (POSR), a more
general form of sequential rules such that items in the antecedent and the
consequent of each rule are unordered. To mine POSR, we propose the RuleGrowth
algorithm, which is efficient and easily extendable. In particular, we present an
extension (TRuleGrowth) that accepts a sliding-window constraint to find rules
occurring within a maximum amount of time. A performance study with four real-life
datasets show that RuleGrowth and TRuleGrowth have excellent performance and
scalability compared to baseline algorithms and that the number of rules discovered
can be several orders of magnitude smaller when the sliding-window constraint is
applied. Furthermore, we also report results from a real application showing that
POSR can provide a much higher prediction accuracy than regular sequential rules
for sequence prediction.