Hello, what is 'synthetic data'? Specifically, how can synthetic data be sufficient for creating models that run on real-world data? Would it not replicate the narrowness of the small real dataset it was generated from?

Does anyone on here know which niches synthetic data actually helps in, and where its use is being pushed beyond the point at which it is effective?

Tagging in the hope of reaching someone who has worked on this: #machinelearning

@missmythreyi I guess it kind of makes sense for testing new models/algorithms against some desired behaviour when you have some understanding of the underlying physical/real-world model? I've seen that used a lot for genomics data, for instance.

@nicolaromano Yes, I was wondering the same. Especially because I am noticing synthetic data being generated without involving experts on the data set and the phenomena it represents, and I was wondering how effective or error-prone that can be.
