User: Guest  Login
Document type:
Technical Report 
Author(s):
Johannes Fischer; Volker Heun; Stefan Kramer 
Title:
Fast Frequent String Mining Using Suffix Arrays 
Abstract:
Mining frequent strings in databases has many interesting applications,\\ e.g., in computational biology. We focus on a special kind of constraint-based\\ frequent string mining, namely computing all strings that are frequent in one\\ database and infrequent in another. We present a method to find such strings\\ by using the suffix- and lcp-arrays, which can be computed extremely fast and\\ space efficiently, and further exhibit a good locality behavior. We test our\\ method on several biologica...    »
 
Keywords:
Data Mining; Pattern Discovery; String Mining; Sequence Mining; Constraint-Based Data Mining; Suffix Array 
Year:
2005 
Year / month:
2005-11-01 00:00:00 
Pages:
14