User: Guest  Login
Title:

Fast Frequent String Mining Using Suffix Arrays

Document type:
Technical Report
Author(s):
Johannes Fischer; Volker Heun; Stefan Kramer
Abstract:
Mining frequent strings in databases has many interesting applications,\\ e.g., in computational biology. We focus on a special kind of constraint-based\\ frequent string mining, namely computing all strings that are frequent in one\\ database and infrequent in another. We present a method to find such strings\\ by using the suffix- and lcp-arrays, which can be computed extremely fast and\\ space efficiently, and further exhibit a good locality behavior. We test our\\ method on several biologica...     »
Keywords:
Data Mining; Pattern Discovery; String Mining; Sequence Mining; Constraint-Based Data Mining; Suffix Array
Year:
2005
Year / month:
2005-11-01 00:00:00
Pages:
14
 BibTeX