際際滷

際際滷Share a Scribd company logo
2020  2021
#13/ 19, 1st Floor, Municipal Colony, Kangayanellore Road, Gandhi Nagar, Vellore  6.
Off: 0416-2247353 Mo: +91 9500218218 / +91 8220150373
Website: www.shakastech.com, Email - id: shakastech@gmail.com, info@shakastech.com
Mining Partially-Ordered Sequential Rules Common to Multiple Sequences
Abstract:
Sequential rule mining is an important data mining problem with multiple
applications. An important limitation of algorithms for mining sequential rules
common to multiple sequences is that rules are very specific and therefore many
similar rules may represent the same situation. This can cause three major
problems: (1) similar rules can be rated quite differently, (2) rules may not be found
because they are individually considered uninteresting, and (3) rules that are too
specific are less likely to be used for making predictions. To address these issues,
we explore the idea of mining partially-ordered sequential rules (POSR), a more
general form of sequential rules such that items in the antecedent and the
consequent of each rule are unordered. To mine POSR, we propose the RuleGrowth
algorithm, which is efficient and easily extendable. In particular, we present an
extension (TRuleGrowth) that accepts a sliding-window constraint to find rules
occurring within a maximum amount of time. A performance study with four real-life
datasets show that RuleGrowth and TRuleGrowth have excellent performance and
scalability compared to baseline algorithms and that the number of rules discovered
can be several orders of magnitude smaller when the sliding-window constraint is
applied. Furthermore, we also report results from a real application showing that
POSR can provide a much higher prediction accuracy than regular sequential rules
for sequence prediction.

More Related Content

Mining partially ordered sequential rules common to multiple sequences (2)

  • 1. 2020 2021 #13/ 19, 1st Floor, Municipal Colony, Kangayanellore Road, Gandhi Nagar, Vellore 6. Off: 0416-2247353 Mo: +91 9500218218 / +91 8220150373 Website: www.shakastech.com, Email - id: shakastech@gmail.com, info@shakastech.com Mining Partially-Ordered Sequential Rules Common to Multiple Sequences Abstract: Sequential rule mining is an important data mining problem with multiple applications. An important limitation of algorithms for mining sequential rules common to multiple sequences is that rules are very specific and therefore many similar rules may represent the same situation. This can cause three major problems: (1) similar rules can be rated quite differently, (2) rules may not be found because they are individually considered uninteresting, and (3) rules that are too specific are less likely to be used for making predictions. To address these issues, we explore the idea of mining partially-ordered sequential rules (POSR), a more general form of sequential rules such that items in the antecedent and the consequent of each rule are unordered. To mine POSR, we propose the RuleGrowth algorithm, which is efficient and easily extendable. In particular, we present an extension (TRuleGrowth) that accepts a sliding-window constraint to find rules occurring within a maximum amount of time. A performance study with four real-life datasets show that RuleGrowth and TRuleGrowth have excellent performance and scalability compared to baseline algorithms and that the number of rules discovered can be several orders of magnitude smaller when the sliding-window constraint is applied. Furthermore, we also report results from a real application showing that POSR can provide a much higher prediction accuracy than regular sequential rules for sequence prediction.