Data Mining

Imagine having access to large sets of engineered data with known characteristics chosen by your team, to provide metrics to your performance testing.
Data mining is the process of extracting patterns from data. Data mining is becoming an increasingly important in transforming raw data into useful information. It is commonly used in a wide range of profiling practices, such as marketing, surveillance, fraud detection and scientific discovery.
Performance testing of data mining applications requires very large realistic and representative data sets. It is also necessary that the contents of data be known, in other words, that for benchmarking and qualifying results, that the correct and therefore desired results be known.
ExactData’s Dynamic Data Generator automatically creates the large data “haystacks” of realistic and representative data sets. “Needles” or anomalies are then automatically and intentionally introduced into these test data sets that mimic the naturally occurring patterns which the data mining application is designed to discover. Along with tagging metadata information, creating the ability to specifically measure and improve your data mining applications performance.
