Chair of Computer Science 9 (Lehrstuhl für Informatik 9)
Data Management and Exploration
Univ.-Prof. Dr. rer. nat. Thomas Seidl
RWTH-Aachen

Sequence Similarity Search

The continuous growth of sensor data and other temporal data increases the importance of retrieval and similarity search in time series data. Analyzing such data typically requires searching a database for similar time series, and for interactive applications the efficiency of this search is essential.

 

Existing multidimensional indexes such as the R-tree provide efficient querying only for relatively few dimensions. Time series are typically long, which corresponds to extremely high-dimensional data in a multidimensional index. Because of massive overlap between index descriptors, multidimensional indexes degenerate in high dimensions and end up accessing the entire data set via random I/O, so the efficiency benefits of indexing are lost. We therefore develop new index structures for efficient time series retrieval and similarity search. For example, by exploiting inherent properties of time series, the TS-tree indexes high-dimensional data in an overlap-free manner. During query processing, pruning based on quantized separators and metadata information greatly reduces the number of pages that have to be accessed, resulting in a substantial speed-up.
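
The general idea of pruning with a cheap summary before touching the full data can be illustrated with a small filter-and-refine sketch. This is not the TS-tree: its quantized separators and metadata are not specified on this page. As a stand-in, the sketch uses piecewise aggregate approximation (PAA), a well-known reduced representation whose scaled distance never exceeds the true Euclidean distance. All function names (`paa`, `lower_bound`, `range_query`) are hypothetical and chosen for illustration only.

```python
import math


def paa(series, segments):
    """Piecewise aggregate approximation: the mean of each equal-length segment."""
    n = len(series)
    if n % segments != 0:
        raise ValueError("series length must be a multiple of the segment count")
    size = n // segments
    return [sum(series[k * size:(k + 1) * size]) / size for k in range(segments)]


def euclidean(a, b):
    """Plain Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def lower_bound(paa_q, paa_c, size):
    """Scaled PAA distance; it never exceeds the Euclidean distance of the full series."""
    return math.sqrt(size) * euclidean(paa_q, paa_c)


def range_query(query, database, eps, segments):
    """Return indices of all series within eps of the query, plus the number refined.

    A real index would store the PAA summaries; they are recomputed here
    only to keep the sketch short.
    """
    size = len(query) // segments
    paa_q = paa(query, segments)
    hits, refined = [], 0
    for idx, series in enumerate(database):
        if lower_bound(paa_q, paa(series, segments), size) > eps:
            continue  # pruned: the exact distance cannot be within eps
        refined += 1  # the full series has to be read and compared
        if euclidean(query, series) <= eps:
            hits.append(idx)
    return hits, refined
```

Because the filter is a lower bound, pruning never discards a true result; it only reduces how many full series have to be fetched and compared exactly.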

 

Dynamic Time Warping (DTW) is a widely used, high-quality similarity measure for time series. Because DTW is computationally expensive, efficient algorithms for fast DTW computation are crucial. Scalability to long time series, wide DTW bands, and a large number of attributes remains challenging. We proposed a novel technique that exploits inherent properties of multivariate DTW to substantially reduce the number of calculations required to compare a query time series with the time series in a database during multistep retrieval. The resulting performance gains scale well to long multivariate time series with wide DTW bands. The technique is highly flexible and can be combined with existing index structures and DTW filters.
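
To make the cost of DTW concrete, here is a minimal sketch of the standard band-constrained (Sakoe-Chiba) DTW for multivariate series. It is the textbook dynamic program, not the anticipatory technique described above. The function name `dtw_band`, the squared-difference cost, and the tuple-per-time-step layout are assumptions made for illustration.

```python
import math


def dtw_band(q, c, band):
    """DTW distance between two multivariate series under a Sakoe-Chiba band.

    q, c: sequences of equal-width tuples, one tuple of attribute values per time step.
    band: maximum allowed offset |i - j| between aligned time steps.
    """
    n, m = len(q), len(c)
    if abs(n - m) > band:
        raise ValueError("band too narrow to align series of these lengths")
    inf = math.inf
    prev = [inf] * (m + 1)
    prev[0] = 0.0  # empty prefixes align at zero cost
    for i in range(1, n + 1):
        curr = [inf] * (m + 1)
        # Only cells inside the band are filled; everything else stays infinite.
        for j in range(max(1, i - band), min(m, i + band) + 1):
            cost = sum((a - b) ** 2 for a, b in zip(q[i - 1], c[j - 1]))
            curr[j] = cost + min(prev[j], prev[j - 1], curr[j - 1])
        prev = curr
    return math.sqrt(prev[m])
```

Each comparison fills roughly n × (2 · band + 1) cells, and every cell sums over all attributes, which is why long series, wide bands, and many attributes all drive up the cost. With `band=0` and equal lengths the result reduces to the Euclidean distance.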

Staff Involved

Assent I., Krieger R., Kremer H.

Publications

  1. Assent I., Wichterich M., Krieger R., Kremer H., Seidl T.: (2009)
    Anticipatory DTW for Efficient Similarity Search in Time Series Databases
    Proc. 35th International Conference on Very Large Data Bases (VLDB 2009), Lyon, France, PVLDB Journal, Vol. 2, No. 1, 826-837 (Core Database Technology track, acceptance rate 16.7%)
    [VLDB 2009]

  2. Assent I., Kremer H.: (2009)
    Robust Adaptable Video Copy Detection
    Proc. 11th International Symposium on Spatial and Temporal Databases (SSTD 2009), Aalborg, Denmark.
    [SSTD 2009]

  3. Assent I., Krieger R., Afschari F., Seidl T.: (2008)
    The TS-Tree: Efficient Time Series Search and Retrieval
    Proc. 11th International Conference on Extending Database Technology (EDBT 2008), Nantes, France, 252-263
    [EDBT 2008]

Diploma/Master's Theses

Indexunterstützung für Warping Distance-basierte Ähnlichkeitssuche in Sequenzendatenbanken (Index Support for Warping-Distance-Based Similarity Search in Sequence Databases)
Student: Farzad Afschari
Supervisors: Ira Assent, Ralph Krieger
