ExactData

The Data Blog

Synthetic Data Generation Market Update

3/4/2022

Driven by privacy laws and restrictions, the synthetic data generation market is shifting. A large base of companies still generates test data with legacy methods that modify an existing database using Extract, Transform, Load (ETL) technologies, but the market is moving toward fully synthetic generation, which does not modify an existing database. Fully synthetic technologies either use algorithms to generate the data or use AI/ML to analyze a production database and reproduce a facsimile. The complexity of fully synthetic data, and its fitness for use with the system under test, varies widely. At one end is nonsensical, randomly generated data from free tools. At the other are premium solutions that model highly complex system-of-systems databases and can generate statistically significant data for building confusion matrices and measuring system error rates. The fully synthetic data generation market is migrating toward higher complexity, driven by the opportunity for high-revenue, high-profit enterprise-level sales and by clear benefits: better test objects that reduce system error rates while dramatically shortening software development cycles at lower cost than traditional methods.
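
To make the link between fully synthetic data and error-rate measurement concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not a description of any vendor's product: the fraud scenario, the field names, the amount distributions, and the `system_under_test` rule are all hypothetical. The point it tries to show is that when records are generated algorithmically, the ground truth for every record is known by construction, so a confusion matrix and an error rate can be computed directly.

```python
import random
from collections import Counter


def generate_records(n, fraud_rate=0.1, seed=0):
    """Algorithmically generate records with a known ground-truth label.

    Everything here (fields, distributions, fraud_rate) is a hypothetical
    example, not a real schema.
    """
    rng = random.Random(seed)  # seeded so the data set is reproducible
    records = []
    for i in range(n):
        is_fraud = rng.random() < fraud_rate
        # Assumed rule: fraudulent transactions tend to be larger.
        mean, spread = (900.0, 300.0) if is_fraud else (120.0, 80.0)
        amount = max(1.0, rng.gauss(mean, spread))
        records.append({"id": i, "amount": round(amount, 2), "is_fraud": is_fraud})
    return records


def system_under_test(record, threshold=500.0):
    """Stand-in for the real system: flags any large transaction."""
    return record["amount"] > threshold


def confusion_matrix(records, predict):
    """Count (ground truth, prediction) pairs over the synthetic records."""
    counts = Counter((r["is_fraud"], predict(r)) for r in records)
    return {
        "tp": counts[(True, True)],
        "fn": counts[(True, False)],
        "fp": counts[(False, True)],
        "tn": counts[(False, False)],
    }


def error_rate(cm):
    """Fraction of records the system classified incorrectly."""
    return (cm["fp"] + cm["fn"]) / sum(cm.values())


records = generate_records(10_000)
cm = confusion_matrix(records, system_under_test)
print(cm, f"error rate = {error_rate(cm):.3%}")
```

Because the generator is seeded, the same test data can be regenerated on demand, and the sample size `n` can be raised until the measured error rate is stable enough for the purpose at hand.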

A recent internet search found 43 companies participating in the test data generation market. The majority relied on traditional ETL methods to generate the data, but there has been impressive growth in the number of new companies generating fully synthetic data. Many of these new companies combine AI/ML techniques with traditional ETL or lower-complexity algorithmic solutions. One example is Tonic, which entered the market within the last few years and has raised an impressive $35M in Series B venture funding. ExactData appears to remain the only company participating in the premium fully synthetic data generation market.